10,000 Matching Annotations
  1. Oct 2025
    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Review

      GENERAL QUESTIONS:

      (1) For many enveloped viruses, the attachment factors - paradoxically - are also surface glycoproteins, often complexed with a distinct fusion protein. The authors note here that the glycoportiens do not inhibit the initial binding, but only limit the stability of the adhesive interface needed for subsequent membrane fusion and viral uptake. How these antagonistic tendencies might play out should be discussed.

      When the surface density of receptor molecules for a virus with glycans increases, the density of free glycans not bound to the virus increases along with the amount of virus adsorbed. However, if the total amount of glycans is considered to be a function of the receptor density, the reaction may become more complicated. This complication may also be affected by the prolonged infection. If the receptor density on the cell surface is high, the infection inhibitory effect of glycans may not be obtained in a system in which a high concentration of virus is supplied from the outside world for a long time. This is because once viruses have entered the cell, they accumulate inside the cell, and viral infection is affected by the total accumulated amount, which is the integration of the number of viruses that have entered over time. This distinction indicates that the virus entry reaction and the total amount of infection in the cell must be considered separately. This is an important point, but it was not clearly mentioned in the original manuscript.

      Our experiments were conducted under conditions that clearly allowed us to detect the virusinhibiting function of glycans without being affected by the above points. In order to clarify these points, we will revise this article as follows, referring to an experiment that is somewhat related to this discussion (the Adenovirus infection experiment into HEK293T cells shown in Figure S1F)..

      (Page-3, Introduction)

      While there are known examples of glycans that function as viral receptors (Thompson et al., 2019), these results demonstrate that a variety of glycoproteins negatively regulate viral infection in a wide range of systems. All of these results suggest that bulky membrane glycoproteins nonspecifically inhibit viral infection.

      (Page 20, Discussion)

      When the virus receptor is a glycoprotein or glycan itself, the inhibition of virus infection by glycans becomes more complex because the total amount of glycans is also a function of the receptor density. It is also important to note that the total amount of infection into a cell is the time integral of virus entry. Even if the probability of virus entry is significantly reduced by glycans, the cumulative number of virus entries may increase if high concentrations of virus continue to be supplied from outside the cell for a long period of time. In the case of Adenovirus, which continues to amplify in HEK293T cells after infection, we showed that MUC1 on the cell surface has an inhibitory effect on long-term cumulative infection (Supplementary Figure 1F). However, such an accumulation effect may be caseby-case depending on the virus cell system, and may be more pronounced when the cell surface density of virus receptor molecules is high. As a result, if the virus receptor molecule is a glycan or glycoprotein and infection continues for a long period of time, the infection inhibition effect may not be observed despite an apparent increase in the total amount of glycans in the cell. In any case, our results clarified the factor of virus entry inhibition dependent on the total amount of glycans because appropriate conditions were set.

      (2) Unlike polymers tethered to solid surface undergoing mushroom-to-brush transition in densitydependent manner, the glycoproteins at the cell surface are of course mobile (presumably in a density-dependent manner). They can thus redistribute in spatial patterns, which serve to minimize the free energy. I suggest the authors explicitly address how these considerations influence the in vitro reconstitution assays seeking to assess the glycosylation-dependent protein packing.

      We performed additional experiments using lipid bilayers that had lost fluidity, and found that there is no significant difference in protein binding between fluid and nonfluid bilayers. The redistribution of molecules due to molecular fluidity may play some roles but not in our experimental systems. It suggests that glycoproteins can generate intermolecular repulsion even in fluid conditions such as cell membranes, just as they do on the solid phase. This experiment was also very useful because it allowed us to compare our results in the fluid bilayer with solid-state measurements of saturation molecular density and the brush transition. This comparison gave us confidence that in the reconstituted membrane system, even at saturation density, the membrane proteins are not as stretched as they are in the condensed brush state. We have therefore added a new paragraph and a new figure (Supplementary Fig. 5B) to discuss this issue, as follows:

      The molecular structural state of these proteins needs to be further discussed to estimate the contribution of f<sub>el</sub>, which represents resistance to molecular elongation. Our results suggest that these densely packed nonglycosylated molecules are no longer in a free mushroom state. However, their saturation density was several times lower than previously reported brush transition densities, such as 65000 µm<sup>-2</sup> for 17 kDa polyacrylamide (R<sub>F</sub> ~ 15 nm) on a solid surface (Wu et al., 2002). To compare our data on fluid bilayers with previously reported data on solid surfaces, we performed additional experiments with lipid bilayers that lost fluidity. No significant changes in protein binding between fluid and nonfluid bilayers were observed for both b-MUC1 and g-MUC1 molecules (Supplementary Figure 5B). This result suggests that membrane fluidity does not affect the average intermolecular distance or other relevant parameters that control molecular binding in the reconstituted system. Based on these, we speculate that the saturated protein density observed in our experiments is lower than or at most comparable to the actual brush transition density. Thus, although these crowded proteins may be restricted from free random motion, they are not significantly extended as in the condensed brush state, in which the contribution of resistance to molecular extension f<sub>el</sub> is expected to be small relative to the overall free energy of the system.

      (3) The discussion of the role of excluded volume in steric repulsion between glycoprotein needs clarification. As presented, it's unclear what the role of "excluded volume" effects is in driving steric repulsion? Do the authors imply depletion forces? Or the volume unavailable due to stochastic configurations of gaussian chains? How does the formalism apply to branched membrane glycoproteins is not immediately obvious.

      Regarding the excluded volume due to steric repulsion between glycoproteins, we considered the volume that cannot be used by glycans as Gaussian chains branching from the main chain. We would like to expand on this point by adding several papers that make similar arguments. I'm glad you brought this up because we hadn't considered depletion forces - the excluded volume between glycoproteins should generate a depletion force, but in this case we believe this force will not have a significant effect on viruses that are larger than the glycoproteins. We also attempted to clarify the discussion in this section by focusing on intermolecular repulsion, and restructured paragraphs, which are also related to General Question 2 and Specific Question 2. The relevant part has been revised as follows. (page 15~page16):

      To compare the packing of proteins with different molecular weights and R<sub>F</sub>, These were smaller than the coverage of molecules at hexagonal close packing that is ~90.7%. In contrast, the coverage of b-CD43 and b-MUC1 at saturated binding was estimated to be greater than 100% under this normalization standard, indicating that the mean projected sizes of these molecules in surface direction were smaller than those expected from their R<sub>F</sub> Thus, it is clear that glycosylation reduces the saturation density of membrane proteins, regardless of molecular size.

      Highly glycosylated proteins resisted densification, indicating that some intermolecular repulsion is occurring. In the framework of polymer brush theory, the intermolecular repulsion of densely packed highly glycosylated proteins is due to an increase in either f<sub>el</sub>, f<sub>int</sub> (d<R<sub>F</sub>), or both (Hansen et al., 2003; Wu et al., 2002). The term of intermolecular interaction, f<sub>int</sub>, is regulated by intermolecular steric repulsion, which occurs when neighboring molecules cannot approach the excluded volume created by the stochastic configuration of the polymer chain (Attili et al., 2012; Faivre et al., 2018; Kreussling and Ullman, 1954; Kuo et al., 2018; Paturej et al., 2016). The magnitude of this steric repulsion depends largely on R<sub>F</sub> in dilute solutions, but the molecular structure may also affect it when molecules are densified on a surface. In other words, the glycans protruding between molecules can cause steric inhibition between neighboring proteins (Figure 5D). Such intermolecular repulsion due to branched side chains occurs only when the molecules are in close proximity and sterically interact on a twodimensional surface, but not in dilute solution, and does not occur in unbranched polymers such as underglycosylated proteins (Figure 5D). Based on the above, we propose the following model for membrane proteins: Only when the membrane proteins are glycosylated does strong steric repulsion occur between neighboring molecules during the densification process, suppressing densification.

      The molecular structural state of these proteins needs to be further discussed to estimate the contribution of f<sub>el</sub>, which represents resistance to molecular elongation. Our results suggest that these densely packed nonglycosylated molecules are no longer in a free mushroom state. However, their saturation density was several times lower than previously reported brush transition densities, such as 65000 µm<sup>-2</sup> for 17 kDa polyacrylamide (R<sub>F</sub> ~ 15 nm) on a solid surface (Wu et al., 2002). To compare our data on fluid bilayers with previously reported data on solid surfaces, we performed additional experiments with lipid bilayers that lost fluidity. No significant changes in protein binding between fluid and nonfluid bilayers were observed for both b-MUC1 and g-MUC1 molecules (Supplementary Figure 5B). This result suggests that membrane fluidity does not affect the average intermolecular distance or other relevant parameters that control molecular binding in the reconstituted system. Based on these, we speculate that the saturated protein density observed in our experiments is lower than or at most comparable to the actual brush transition density. Thus, although these crowded proteins may be restricted from free random motion, they are not significantly extended as in the condensed brush state, in which the contribution of resistance to molecular extension f<sub>el</sub>, is expected to be small relative to the overall free energy of the system.

      Note that this does not mean that glycoproteins cannot form condensed brush structures: in fact, highly glycosylated molecules (e.g., MUC1) can form brush structures in cells when such proteins are expressed at very high densities. (Shurer et al., 2019). In these cells, ………. Such membrane deformation results in the increase of total surface area to reduce the density of glycoproteins, indicating that there is strong intermolecular repulsion between glycoproteins. In any case, the free energy of the system is determined by the balance between protein binding and insertion into the membrane, protein deformation, and repulsive forces between proteins, which determine the density of proteins depending on the configuration of the system. Thus, although strong intermolecular repulsions were prominently observed in our simplified system, this may not be the case in other systems. ……

      (4) The authors showed that glycoprotein expression inversely correlated with viral infection and link viral entry inhibition to steric hindrance caused by the glycoprotein. Alternative explanations would be that the glycoprotein expression (a) reroutes endocytosed viral particles or (b) lowers cellular endocytic rates and via either mechanism reduce viral infection. The authors should provide evidence that these alternatives are not occurring in their system. They could for example experimentally test whether non-specific endocytosis is still operational at similar levels, measured with fluid-phase markers such as 10kDa dextrans.

      The results of the experiment suggested by the reviewer are shown in the new Supplementary Figure 3B. (This results in generation of a new Supplementary Figure 3, and previous Supplementary Figures 4-5 are now renumbered as Supplementary Figures 5-6). Endocytosis of 10KDa dextran was attenuated by the expression of several large-sized molecules, but was not affected by the expression of many other glycoproteins that have the ability to inhibit infection. These results were clearly different from the results in which virus infection was inhibited more by the amount of glycan than by molecular weight. Therefore, it was found that many glycoproteins inhibit virus infection through processes other than endocytosis. Based on the above, we have added the following to the manuscript: (p9 New paragraph:)

      We also investigated the effect of membrane glycoproteins on membrane trafficking, another process involved in viral infection. Expression of MUC1 with higher number of tandem repeats reduced the dextran transport in the fluid phase, while expression of multiple membrane glycoproteins that have infection inhibitory effects, including truncated MUC1 molecules, showed no effect on fluid phase endocytosis, indicating a molecular weight-dependent effect (Supplementary Figure 3B). The molecular weight-dependent inhibition of endocytosis may be due to factors such as steric inhibition of the approach of dextran molecules or a reduction in the transportable volume within the endosome. In any case, it is clear that many low molecular weight glycoproteins inhibit infection by disturbing processes other than endocytosis. Based on the above, we focus on the effect of glycoproteins on the formation of the interface between the virus and the cell membrane.

      (5) The authors approach their system with the goal of generalizing the cell membrane (the cumulative effect of all cell membrane molecules on viral entry), but what about the inverse? How does the nature of the molecule seeking entry affect the interface? For example, a lipid nanoparticle vs a virus with a short virus-cell distance vs a virus with a large virus-cell distance?

      Thank you for your interesting comment. If the molecular size of the ligand is large, it should affect virus adsorption and molecular exclusion from the interface. In lipid nanoparticle applications, controlling this parameter may contribute to efficiency. In addition, a related discussion is the influence of virus shell molecules that are not bound to the receptor. I will revise the text based on the above.

      Discussion (as a new paragraph after the paragraph added in Q1):

      In this study, we attempted to generalize the surface structure on the cell side, but the surface structure on the virus side may also have an effect. The efficiency of virus adsorption and the efficiency of cell membrane protein exclusion from the interface will change depending on the molecular length of the receptor-ligand, although receptor priming also has an effect. In addition, free ligands of the viral envelope or other coexisting glycoproteins may also have an effect as they are also required for exclusion from the virus-cell interface. In fact, there are reports that expression of CD43 and PSGL-1 on the virus surface reduces virus infection efficiency (Murakami et al., 2020). Such interface structure may be one of the factors that determine the infection efficiency that differs depending on the virus strain. More generally, modification of the surface structure may be effective for designing materials such as lipid nanoparticles that construct the interface with cell.

      SPECIFIC QUESTIONS:

      (1) The proposed mechanism indicates that glycosylation status does not produce an effect in the "trapping" of virus, but in later stages of the formation of the virus/membrane interface due to the high energetic costs of displacing highly glycosylated molecules at the vicinity of the virus/membrane interface. It is suggested to present a correlation between the levels of glycans in the Calu-3 cell monolayers and the number of viral particles bound to cell surface at different pulse times. Results may be quantified following the same method as shown in Figure 2 for the correlation between glycosylation levels and viral infection (in this case the resulting output could be number of viral particles bound as a function of glycan content).

      The results of this experiment are now shown as Supplementary Figure 2F and 2G. We compared the amount of virus bound after incubation for 10 minutes or for 3 hours as in the infection experiment, but no negative correlation was found between the total amount of glycans on the surface of the Calu3 monolayer and the amount of virus bound. Interestingly, there was a sight positive correlation was detected, which may be due to concentrated virus receptor expressions in glycan-enriched cells. This result shows that glycoproteins do not strongly inhibit virus binding. We will amend the text as follows (see also Q6).

      (Page 10)

      Glycans could be one of the biochemical substances ……We found that a large number of SARS-CoV2-PP can still bind to cells even when cells expressed sufficient amounts of the glycoprotein that could account for the majority of glycans within these cells and inhibit viral infection (Figure 3A). Similarly, on the two-dimensional culture surface of Calu-3 cells, no negative correlation was observed between the number of viruses bound and the total amount of glycans on the cell surface (Supplementary Figure 2F-G). The slight positive correlation between bound virus and glycans may be due to higher expression levels of viral receptors in glycan-rich cells. ….

      (2) The use of the purified glycosylated and non-glycosylated ectodomains of MUC1 and CD-43 to establish a relationship between glycosylation and protein density into lipid bilayers on silica beads is an elegant approach. An assessment of the impact of glycosylation in the structural conformation of both proteins, for instance determining the Flory radius of the glycosylated and non-glycosylated ectodomains by the FRET-FLIM approach used in Figure 4 would serve to further support the hypothesis of the article.

      Unfortunately, the proposed experiment did not provide a strong enough FRET signal for analysis. This was due in part to the difficulty in constructing a bead-coated bilayer incorporating PlasMem Bright Red, which established a good FRET pair in cell experiments. We also tried other fluorescent molecules, but were unable to obtain a strong and stable FRET signal. Another reason may be that the curvature of the beads is larger than that of the cells, making it difficult to obtain a sufficient cumulative FRET effect from multiple membrane dyes. We plan to improve the experimental system in the future.

      On the other hand, we also found that in this system, the signal changes were very subtle, making it difficult to detect molecular conformational changes using FRET. After reconsidering general questions (2) and (3), we speculated that the molecular density in the experiment, even at saturation binding, was below or at most equivalent to the brush transition point. In other words, proteins on the bead-coated bilayer may not be significantly extended in the vertical direction. Therefore, the conformational changes of these proteins may not be large enough to be detected by the FRET assay. We updated Figure 3C and Figure 5D (model description) to better reflect the above discussion and introduced the following discussion in the manuscript.

      (page11)

      We introduced the framework of conventional polymer brush theory to study the structure of viruscell interfaces containing proteins……. Numerous experimental measurements of the formation of polymer brushes have also been reported (Overney et al., 1996; Wu et al., 2002; Zhao and Brittain, 2000). In these measurements, the transition to a brush typically occurs at a density higher than that required to pack a surface with hemispherical polymers of diameter R<sub>F</sub>. This is the point at which the energy loss due to repulsive forces between adjacent molecules (f<sub>int</sub>) exceeds the energy required to stretch the polymer perpendicularly into a brush (f<sub>el</sub>).

      (page16)

      The molecular structural state of these proteins needs to be further discussed to estimate the contribution of f<sub>el</sub>, which represents resistance to molecular elongation. Our results suggest that these densely packed nonglycosylated molecules are no longer in a free mushroom state. However, their saturation density was several times lower than previously reported brush transition densities, such as 65000 µm<sup>-2</sup> for 17 kDa polyacrylamide (R<sub>F</sub> ~ 15 nm) on a solid surface (Wu et al., 2002). To compare our data on fluid bilayers with previously reported data on solid surfaces, we performed additional experiments with lipid bilayers that lost fluidity. No significant changes in protein binding between fluid and nonfluid bilayers were observed for both b-MUC1 and g-MUC1 molecules (Supplementary Figure 5B). This result suggests that membrane fluidity does not affect the average intermolecular distance or other relevant parameters that control molecular binding in the reconstituted system. Based on these, we speculate that the saturated protein density observed in our experiments is lower than or at most comparable to the actual brush transition density. Thus, although these crowded proteins may be restricted from free random motion, they are not significantly extended as in the condensed brush state, in which the contribution of resistance to molecular extension f<sub>el</sub> is expected to be small relative to the overall free energy of the system.

      Note that this does not mean that glycoproteins cannot form condensed brush structures: in fact, highly glycosylated molecules (e.g., MUC1) can form brush structures in cells when such proteins are expressed at very high densities. (Shurer et al., 2019). In these cells, ………. Such membrane deformation results in the increase of total surface area to reduce the density of glycoproteins, indicating that there is strong intermolecular repulsion between glycoproteins. In any case, the free energy of the system is determined by the balance between protein binding and insertion into the membrane, protein deformation, and repulsive forces between proteins, which determine the density of proteins depending on the configuration of the system. Thus, although strong intermolecular repulsions were prominently observed in our simplified system, this may not be the case in other systems. ……

      (3) The MUC1 glycoprotein is reported to have a dramatic effect in reducing viral infection shown in Fig 1F. On the contrary, in a different experiment shown in Fig2D and Fig2H MUC1 has almost no effect in reducing viral infection. It is not clear how these two findings can be compatible.

      The immunostaining results show that the density of MUC1 molecules is very low in the experimental system in Figure 2 (Figure 2C), which is supported by the SC-RNASeq data (as shown in Supplementary Figure 2A, MUC1 is not listed as a top molecule). In other words, the MUC1 expression level in this experiment is too low to affect virus infection inhibition. On the other hand, the Pearson correlation function represents the strength of the linear relationship between two variables, so it is not the most appropriate indicator for seeing the correlation with the MUC1 expression level, which has little change (Figure 2D, 2F). In fact, even TOS analysis, which can see the correlation by focusing on the cells with the highest expression level, cannot detect the correlation (Figure 2H).Therefore, the MUC1 data in Figure 2DFH will be annotated and corrected in the figure legend.

      Figure2 Legend:

      MUC1 has a small mean expression level and variance, and is more affected by measurement noise than other molecules when calculating the Pearson correlation function (Figure 2C-2F). In addition, the number of cells in which expression can be detected is small, so no significant correlation was detected by TOS analysis (Figure 2H).

      (4) Why is there a shift in the use of the glycan marker? How does this affect the conclusions? For the infection correlation relating protein expression with glycan content the PNA-lectin was used together with flow cytometry. For imaging the infection and correlating with glycan content the SSA-lectin is used.

      For each cell line, we selected the lectin that could be measured over the widest dynamic range. This lectin is thought to recognize the predominant glycan species in the cell line (Fig. S1C, Fig. 2D). In our model, we believe that viral infection inhibition is not specific to the type of sugar, but is highly dependent on the total amount of glycans. If this hypothesis is correct, the reason we used different lectins in each experiment is simply to select the lectin that recognizes the most predominant glycan species that is most convenient for predicting the total amount of glycans in cells. This hypothesis is consistent with our observations, where the total amount of glycans estimated by different lectins could explain the infection inhibition in a similar way in the experiments in Figures 1 and 2, and the TOS analysis in Figure 2 showed that minor glycans also have an infection inhibitory effect. On the other hand, it is of course possible to predict the total amount of glycans more accurately by obtaining as much information on glycans as possible (related to Q5). Based on the above discussion, the manuscript will be revised as follows.

      Page5

      Using HEK293T cell lines exogenously expressing genes of these proteins tagged with fluorescent markers, their glycosylation was measured by binding of a lectin from Arachis hypogaea (PNA), and the number of these proteins in the cells was measured simultaneously. PNA was used for the measurement because it has a wider dynamic range than other lectins (Supplementary Figure 1C). This suggests that GalNAc recognized by PNA is predominantly present on glycans of HEK293T cells, especially on the termini of glycans that are amenable to lectin binding, compared to other saccharides.. …

      page9  

      Our findings suggest that membrane glycoproteins nonspecifically inhibit viral infection, and we hypothesize that their inhibitory function is also nonspecific depending on the type of glycan. Our hypothesis is consistent with the observations in the TOS analysis. Although minor saccharide species in the system (such as GlcNAc and GalNAc recognized by DSA, WGA, or PNA) showed anticolocalization with infection, their scores were much lower than those of major saccharide species. This suggests that all major and minor saccharide species have an infection inhibitory effect, but cells enriched with minor type glycans are only partially present in the system, and the contribution of these cells to virus inhibition is also partial. It is also consistent with the observation that the amount of GalNAc recognized by PNA determines the virus infection inhibition in HEK 293T cells (Figure 1). Therefore, we believe that our assay using a single type of predominantly expressed lectin is still useful for estimating the total glycan content. Nevertheless, the virus infection rate may show a better correlation with a more accurately estimated total glycan in each cell. For example, the use of multiple lectins with appropriate calibration to integrate multiple signals to simultaneously detect a wider range of saccharide species would allow for more accurate estimation. It should be noted that the amount of bound lectin does not necessarily measure the overall glycan composition but likely reflects the sugar population at the free end of the glycan chain to which the lectin binds most.

      (5) The authors in several instances comment on the relevance and importance of the total glycan content. Nevertheless, these conclusions are often drawn when using only one glycan-binding lectin. In fact, the anti-correlation with viral infection is distinct for the various lectins (Fig 2D and Fig 2H). Would it make more sense to use a combination of lectins to get a full glycan spectrum?

      As stated in the answer to Q4, we believe that we were able to detect the infection-suppressing effect of the total glycan amount by using the measurement value of the major component glycan as an approximation. However, as you pointed out, if we could accurately measure the minor glycan components and add up their values, we believe that we could measure the total glycan amount more accurately. In order to measure multiple glycans simultaneously and with high accuracy, some kind of biochemical calibration may be necessary to compare the measurements of lectin-glycan pairs with different binding constants. We believe that these are very useful techniques, and would like to consider them as a future challenge. The corrections listed in Q4 are shown below.

      (Page 9)

      Nevertheless, the virus infection rate may show a better correlation with a more accurately estimated total glycan in each cell. For example, the use of multiple lectins with appropriate calibration to integrate multiple signals to simultaneously detect a wider range of glycans would allow for more accurate estimation. …….

      (6) Fig 3A shows virus binding to HEK cells upon MUC1 expression. Please provide the surface expression of the MUC1 so that the data can be compared to Fig 1F. Nevertheless, it is not clear why the authors used MUC expression as a parameter to assess virus binding. Alternatively, more conclusive data supporting the hypothesis would be the absence of a correlation between total glycan content and virus binding capacity.

      The relationship between the expression level of MUC1 in each cell and the amount of virus binding is shown in Supplementary Figure 3A. There is no correlation between the two. In HEK293T cells, many glycans are modified with MUC1, so MUC1 was used as the indicator for analysis (Supplementary Figure 1C). As you pointed out, it is better to use the amount of glycan as an indicator, so we analyzed the relationship between the amount of bound virus and the amount of glycan on the surface on the Calu-3 monolayer (Supplementary Figure 2F, 2G, introduced in the answer to Specific (Q1)). In any case, no correlation was found between virus binding and surface glycans. I will correct the manuscript as follows.

      (page 9)

      Glycans could be one of the biochemical substances that link the intracellular molecular composition and macroscopic steric forces at the cell surface. To clarify this connection, we further investigated the mechanism by which membrane glycoproteins inhibit viral infection. First, we measured viral binding to cells to determine which step of infection is inhibited. We found that a large number of SARS-CoV2-PP can still bind to cells even when cells expressed sufficient amounts of the glycoprotein that could account for the majority of glycans within these cells and inhibit viral infection (Figure 3A). Similarly, on the two-dimensional culture surface of Calu-3 cells, no correlation was observed between the number of viruses bound and the total amount of glycans on the cell surface (Supplementary Figure 2F-G). These results indicate that glycoproteins do not inhibit virus binding to cells, but rather inhibit the steps required for subsequent virus internalization.

      (7) While the use of the Flory model could provide a simplification for a (disordered) flexible structure such as MUC1, where the number of amino acids equals N in the Flory model, this generalisation will not hold for all the proteins. Because folding will dramatically change the effective polypeptide chain-length and reduce available positioning of the amino acids, something the authors clearly measured (Fig 4G), this generalisation is not correct. In fact, the generalisation does not seem to be required because the authors provide an estimation for the effective Flory radius using their FRET approach

      Current theories generalizing the Flory model to proteins are incomplete, and it is certainly not possible to accurately estimate the size of individual molecules undergoing different folding. However, we found such a generalized model to be useful in understanding the overall properties of membrane proteins. In our experiments, we were indeed able to obtain the R<sub>F</sub>s of some individual molecules by FRET measurements. However, this modeling made it possible to estimate the distribution range of the RFs, including for larger proteins that cannot be measured by FRET. For example, from our results, we can estimate that the upper limit of the RFs of the longest membrane proteins is about 10.5 nm, assuming that the proteins follow the Flory model in all respects except for the shortening of the effective length due to folding. These analyses are useful for physical modeling of nonspecific phenomena, as in our case.

      In order to discuss the balance between such theoretical validity and the convenience of practical handling, we revise the manuscript as follows.

      (page 13) 

      This shift in ν indicates that glycosylation increases the size of the protein at equilibrium, but the change in R<sub>F</sub> is slight, e.g., a 1.3-fold increase for one of the longest ectodomains with N = 4000 when these values of ν are applied. This calculation also gives a rough estimate of the upper limit of the R<sub>F</sub> of the extracellular domains of all membrane proteins in the human genome (approximately 10.5 nm). Physically, this change in ν by glycosylation may be caused by the increased intramolecular exclusion induced sterically between glycan chains. This estimated ν are much smaller than that of 0.6 for polymers in good solvents, possibly due to protein folding or anchoring effects on the membrane. In fact, the ν of an intrinsically disordered protein in solution has been reported to be close to 0.6 (Riback et al., 2019; Tesei et al., 2024). Overall, these analyses using the Flory model provide information on the size distribution of membrane proteins and the influence of glycans, although the model cannot predict the exact size of each protein due to its specific folding.

      MINOR COMMENTS/EDITS:

      (1) In Figures 2A and 2C, as well as Supplemental Figure 2C, the fluorescent images indicate that GFP expression differs among the various groups. Ideally, these should be at the same GFP expression level, as the glycan and antibody staining occurred post-viral infection. For instance, ACE2 is a well-known positive control and should enhance SARS-CoV-2 infection. Yet, based on the findings presented in Supplemental Figure 2C, ACE2 appears to correlate with the lowest infection rate. The relationship between the infection rate and key glycoproteins needs clearer quantification.

      We measured the virus inhibition effect specific to each molecule using a cell line expressing low levels of viral receptors and glycoproteins (Fig. 1). On the other hand, the system in Fig. 2 contains diverse viral receptors and glycoproteins and has not been genetically manipulated. (We apologize that there was a typo in our description of experiment, which will be corrected, as shown below). The variation in infection rate between samples was caused by multiple factors but was not related to the molecule for which the correlation was measured. The receptor-based normalization used in the experiment in Fig. 1 cannot be applied in this system in Fig.2 due to the complexity of the gene expression profile. Therefore, instead of such parameter-based normalization, we applied Pearson correlation and TOS analysis. In the calculation of Pearson correlation, intensities are normalized. TOS analysis allows the analysis of colocalization between the groups with the highest fluorescence intensity. Therefore, in both cases of variation in overall infection rate and variation in the distribution of infected populations, samples with large variations can be reasonably compared by Pearson correlation and TOS analysis, respectively. We extend the discussion on statistics and revise the manuscript as follows.

      (page 8-9)

      To test this hypothesis, we infected a monolayer of epithelial cells endogenously expressing highly heterogeneous populations of glycoproteins with SARS-CoV-2-PP, and measured viral infection from cell to cell visually by microscope imaging. …

      Pearson correlation is effective for comparing samples with varying scales of data because it normalizes the data values by the mean and variance. However, as observed in our experiments, this may not be the case when the distribution of data within a sample varies between samples. In addition, as has already been reported, the distribution of infected cells often deviates significantly from the normal distribution of data that is the premise of Pearson correlation (Russell et al., 2018) (Figure 2B). To further analyze data in such nonlinear situations, we applied the threshold overlap score (TOS) analysis (Figure 2G-H, Supplementary Figure 2E). This is one statistical method for analyzing nonlinear correlations, and is specialized for colocalization analysis in dual color images (Sheng et al., 2016). TOS analysis involves segmentation of the data based on signal intensity, as in other nonlinear statistics (Reshef et al., 2011). The computed TOS matrix indicates whether the number of objects classified in each region is higher or lower than expected for uniformly distributed data, which reflects co-localization or anti-localization in dual-color imaging data. For example, calculated TOS matrices show strong anti-localization for infection and glycosylation when both signals are high (Figure 2GH). This confirms that high infection is very unlikely to occur in cells that express high levels of glycans. The TOS analysis also yielded better anti-localization scores for some of the individual membrane proteins, especially those that are heterogeneously distributed across cells (Figure 2H). This suggests that TOS analysis can highlight the inhibitory function of molecules that are sparsely expressed among cells, reaffirming that high expression of a single type of glycoprotein can create an infection-protective surface in a single cell and that such infection inhibition is not protein-specific. In contrast, for more uniformly distributed proteins such as the viral receptor ACE2, TOS analysis and Pearson correlation showed similar trends, although the two are mathematically different (Figure 2D, 2H). Because glycoprotein expression levels and virus-derived GFP levels were treated symmetrically in these statistical calculations, the same logic can be applied when considering the heterogeneity of infection levels among cells. Therefore, it is expected that TOS analysis can reasonably compare samples with different virus infection level distributions by focusing on cells with high infection levels in all samples.

      (2) For clarity, the authors should consider separating introductory and interpretive remarks from the presentation of results. These seem to get mixed up. The introduction section could be expanded to include more details about glycoproteins, their relevance to viral infection, and explanations of N- and O-glycosylation.

      Following the suggestion, (1) we added an explanation of the relationship between glycoproteins and viral infection, and N-glycosylation and O-glycosylation to the Introduction section, and (2) moved the introductory parts in the Results section to the Introduction section, as follows.

      (1; page3)

      While there are known examples of glycans that function as viral receptors (Thompson et al., 2019), these results demonstrate that a variety of glycoproteins negatively regulate viral infection in a wide range of systems. These glycoprotein groups have no common amino acid sequences or domains. The glycans modified by these proteins include both the N-type, which binds to asparagine, and the O-type, which binds to serine and threonine. Furthermore, there have been no reports of infection-suppressing effects according to the specific monosaccharide type in the glycan. All of these results suggest that bulky membrane glycoproteins nonspecifically inhibit viral infection.

      (2 : Page 4-5)

      To confirm that glycans are a general chemical factor of steric repulsion, an extensive list of glycoproteins on the cell membrane surface would be useful. The wider the range of proteins to be measured, the better. Therefore, we collect information on glycoproteins on the genome and compile them into a list that is easy to use for various purposes. Then, by analyzing sample molecules selected from this list, it may be possible to infer the effect of the entire glycoprotein population on the steric inhibition of virus infection, despite the complexity and diversity of the Glycome (Dworkin et al., 2022; Huang et al., 2021; Moremen et al., 2012; Rademacher et al., 1988). Elucidation of the mechanism of how glycans regulate steric repulsion will also be useful to quantitatively discuss the relationship between steric repulsion and intracellular molecular composition. For this purpose, we apply the theories of polymer physics and interface chemistry.

      Results

      List of membrane glycoproteins in human genome and their inhibitory effect on virus infection

      To test the hypothesis that glycans contribute to steric repulsion at the cell surface, we first generate a list of glycoproteins in the human genome and then measure the glycan content and inhibitory effect on viral infection of test proteins selected from the list (Figure 1A). To compile the list of glycoproteins, we ….

      (3) In the sentence, "glycoproteins expressed lower than CD44 or other membrane proteins including ERBB2 did not exhibit any such correlation, although ERBB2 expressed ~4 folds higher amount than CD44 and shared ~7% among all membrane proteins," it is unclear which protein has a higher expression level: CD44 or ERBB2? Furthermore, the use of the word "although" needs clarification.

      Corrected as follows:

      (page 8)

      ……showed a weak inverse correlation with viral infection; even such a weak correlation was not observed with other proteins, including ERBB2, which is approximately four-fold more highly expressed than CD44

      (4) In Supplementary Figure 5, please provide an explanation of the data in the figure legend, particularly what the green and red signals represent.

      Corrected as follows:

      STORM images of all analyzed cells, expressing designated proteins. The detected spots of SNAPsurface Alexa 647 bound to each membrane protein are shown in red, and the spots of CF568conjugated anti-mouse IgG secondary antibody that recognizes Spike on SARS-CoV2-PP are shown in green. For cells, a pair of two-color composite images and a CF658-only image are shown. Numbers on axes are coordinates in nanometer.

      (5) It would be good to see a comprehensive demonstration of the exact method for estimation of membrane protein density (in the SI), since this is an integral part of many of the analyses in this paper. The method is detailed in the Methods section in text and is generally acceptable, but this methodology can vary quite widely and would be more convincing with calibration data provided.

      We added flow cytometry and fluorometer data for calibration (Supplementary Figure 1L,M) and introduced a sentence explaining the procedure for obtaining the values used for calibration as follows:

      (page 54)

      …….Liposome standards containing fluorescent molecules (0.01– 0.75 mol% perylene (Sigma), 0.1– 1.25 mol% Bodipy FL (Thermo), and 0.005– 0.1% DiD) as well as DOPC (Avanti polar lipids) were measured in flow cytometry (Supplmentary Figure 1L). Meanwhile, by fluorimeter, fluorescence signals of these liposomes and known concentrations of recombinant mTagBFP2, AcGFP and TagRFP-657 proteins and SNAP-Surface 488 and Alexa 647 dyes (New England Biolabs) were measured in the same excitation and emission ranges as in flow cytometry assays (Supplementary Figure 1M). Ratios between the integral of fluorescent intensities in this range between two dyes of interest are used for converting the signals measured in flow cytometry. Additional information needed for calibration is the size difference between liposomes and cells. The average diameter of liposomes is measured to be 130 nm, and the diameter of HEK 293T cells is estimated to be 13 µm (Furlan et al., 2014; Kaizuka et al., 2021b; Ushiyama et al., 2015). From these data, the signal from cells acquired by flow cytometry can be calibrated to molecular surface density. For example, the Alexa 647 signal acquired by flow cytometry can be converted to the signal of the same concentration of DID dye using fluorometer data, but the density of the dye is unknown at this point. This converted DID signal can then be calibrated to the density on liposomes rather than cells using liposome flow cytometry data. Finally, adjusted for the size difference between liposomes and cells, the surface molecular density on cells is determined. By going through one cycle of these procedures, we could obtain calibration unit, such as 1 flow cytometry signal for a cell in the designated illumination and detection setting = 0.0272 mTagBFP2 µm<sup>-2</sup> on cell surface.

      (Figure legend, Supporting Figure 1: )

      … L. Flow cytometry measurements for liposomes containing serially diluted dye-conjugated lipids and fluorescent membrane incorporating molecules (Bodipy-FL, peryelene, and DID) with indicated mol%. Linear fitting shown was used for calibration.  M. Fluorescence emission spectrum for equimolar molecules (50µM for green and far-red channels, and 100µM for blue channel), excited at 405 nm, 488 nm, and 638 nm, respectively. Membrane dyes were measured as incorporated in liposomes. Purified recombinant mTagBFP2 was used.

      (6) Fig 2A: The figure legend should describe the microscopy method for a quick and easy reference.

      Corrected as follows:

      (Figure legend, Figure 2)

      A. Maximum projection of Z-stack images at 1 µm intervals taken with a confocal microscope. SARSCoV2-pp-infected, air-liquid interface (ALI)-cultured Calu-3 cell monolayers were chemically fixed and imaged by binding of Alexa Fluor 647-labeled Neu5AC-specific lectin from Sambucus sieboldiana (SSA) and GFP expression from the infecting virus.

      (7) Fig 2B: what is the color bar supposed to represent? Is it the pixel density per a particular value? Units and additional description are required. In addition, these are "arbitrary units" of fluorescence, but you should tell us if they've been normalized and, if so, how. They must have been normalized, since the values are between 0 and 1, but then why does the scale bar for SSA only go to 0.5?

      The color bar shows the number of pixels for each dot, resulting in the scale for density scatter plot. The scale on the X-axis was incorrect. All these issues have been fixed in this revision, in the figure and in the legend as follows.

      (Figure legend, Figure 2)

      B. Density scatter plot of normalized fluorescence intensities in all pixels in Figure 2A in both GFP and SSA channels. Color indicates the pixel density.  

      (8) Fig 3D has a typo: this should most likely be "grafted polymer."

      (9) Fig 3E has a suspected typo: in the text, the author uses the word "exclusion" instead of "extrusion." The former makes more sense in this context.

      (10) Fig 5A has a typo: "Suppoorted" instead of Supported Lipid Bilayer.

      (11) Fig 7E-F has a suspected typo: Again, this should most likely be the word "exclusion" instead of "extrusion."

      Thank you so much for pointing out these mistakes, I have corrected them all as suggested.

      (12) Which other molecules are referred to, on page 6 (middle), that do not have an inhibitory effect? Please specify.

      We specified the molecules that have inhibitory effects, and revised as follows: 

      These proteins include those previously reported (MUC1, CD43) as well as those not yet reported (CD44, SDC1, CD164, F174B, CD24, PODXL) (Delaveris et al., 2020; Murakami et al., 2020). In contrast, other molecules (VCAM-1, EPHB1, TMEM123, etc.) showed little inhibitory effect on infection within the density range we used.

      (13) Fig 2 B: the color LUT is not labelled nor explained.

      Corrected as described in (7)

      (14) Please provide the scale bars for figures Fig 2A, C, E and Suppl Fig 2C, D.

      Corrected. 

      (15) Please provide the name for the example of a 200 aa protein that is meant to inhibit viral infection but is not bigger than ACE2. Also providing the densities in Fig 3A would help to correlate the data to Fig 1F.

      Corrected as follows: 

      (page 10)

      We found that a large number of SARS-CoV2-PP can still bind to cells even when cells expressed sufficient amounts of the glycoprotein (mean density ~50 µm<sup>-2</sup>) that could account for the majority of glycans within these cells and inhibit viral infection (Figure 3A). …..

      In our measurements, a protein with extracellular domain of ~200 amino acids (e.g. CD164 (138aa)) at a density of ~100 μm-2 showed significant inhibition in viral infection. This molecule is shorter than the receptor ACE2 (722 aa),

      (16) In the experiments conducted in HeK cells expressing the different glycoproteins studies it is mentioned that results of infection were normalised by the amount ACE2 expression. Is the expression of receptor homogenous in the experiments conducted in Figure 2? Clarify in the methods if the expression of receptor has been quantified and somehow used to correct the intensity values of GFP used to determine infection.

      As also explained for Q1, the system in Fig. 2 contains diverse viral receptors and glycoproteins, and the receptor-based normalization used in the experiment in Fig. 1 cannot be applied. Instead, we applied Pearson correlation and TOS analysis. In the calculation of Pearson correlation, intensities are normalized. TOS analysis allows the analysis of colocalization between the groups with the highest fluorescence intensity. Therefore, in both cases of variation in overall infection rate and variation in the distribution of infected populations, samples with large variations can be reasonably compared by Pearson correlation and TOS analysis, respectively. We extend the discussion on statistics and revise the manuscript as follows.

      (page 8-9)

      Pearson correlation is effective for comparing samples with varying scales of data because it normalizes the data values by the mean and variance. However, as observed in our experiments, this may not be the case when the distribution of data within a sample varies between samples. In addition, as has already been reported, the distribution of infected cells often deviates significantly from the normal distribution of data that is the premise of Pearson correlation (Russell et al., 2018) (Figure 2B). To further analyze data in such nonlinear situations, we applied the threshold overlap score (TOS) analysis (Figure 2G-H, Supplementary Figure 2E). This is one statistical method for analyzing nonlinear correlations, and is specialized for colocalization analysis in dual color images (Sheng et al., 2016). TOS analysis involves segmentation of the data based on signal intensity, as in other nonlinear statistics (Reshef et al., 2011). The computed TOS matrix indicates whether the number of objects classified in each region is higher or lower than expected for uniformly distributed data, which reflects co-localization or anti-localization in dual-color imaging data. For example, calculated TOS matrices show strong anti-localization for infection and glycosylation when both signals are high (Figure 2GH). This confirms that high infection is very unlikely to occur in cells that express high levels of glycans. The TOS analysis also yielded better anti-localization scores for some of the individual membrane proteins, especially those that are heterogeneously distributed across cells (Figure 2H). This suggests that TOS analysis can highlight the inhibitory function of molecules that are sparsely expressed among cells, reaffirming that high expression of a single type of glycoprotein can create an infection-protective surface in a single cell and that such infection inhibition is not protein-specific. In contrast, for more uniformly distributed proteins such as the viral receptor ACE2, TOS analysis and Pearson correlation showed similar trends, although the two are mathematically different (Figure 2D, 2H). Because glycoprotein expression levels and virus-derived GFP levels were treated symmetrically in these statistical calculations, the same logic can be applied when considering the heterogeneity of infection levels among cells. Therefore, it is expected that TOS analysis can reasonably compare samples with different virus infection level distributions by focusing on cells with high infection levels in all samples.

      (17) Can you provide additional details about the method of thresholding to eliminate "background" localisations in STORM?

      Method section was corrected as follows: 

      (page 59)

      …Viral protein spots not close to cell membranes were eliminated by thresholding with nearby spot density for cell protein. Specifically, the entire image was pixelated with a 0.5µm square box and all viral protein signals within the box that had no membrane protein signals were removed. Also, viral protein spots only sparsely located were eliminated by thresholding with nearby spot density for viral protein. This thresholding process removed any detected viral protein spot that did not have more than 100 other viral protein spots within 1µm.

      (18) The article says "It was shown that the number of bound lectins correlated with the amount of glycans, not with number of proteins (Figure 1E)". Figure 1E correlates experimental PNA/mol with predicted glycosylation sites, not with the number of expressed proteins. Correct sentence with the right Figure reference.

      As you pointed out, the meaning of this sentence was not clear. We have amended it as follows to clarify our intention:

      (page 8)

      Since a wide range of glycoproteins inhibit viral infection, it is possible that all types of glycoproteins have an additive effect for this function. ……. In this cell line, this inverse correlation was most pronounced when quantifying N-acetylneuraminic acid (Neu5AC, recognized by lectins SSA and MAL) compared to the various types of glycans, while some other glycans also showed weak correlations (Supplementary Figure 2C). These results showed that the amount of virus infection in cell anticorrelated with the amount of total glycans on the cell surface. As amount of glycans is determined by the total population of glycocalyx, infection inhibitory effect can be additive by glycoprotein populations as we hypothesized.

      If the inhibitory effect is nonspecific and additive, the contribution of each protein is likely to be less significant. To confirm this, we also measured the correlation between the density of each glycoprotein and viral infection. CD44, which was shown to…….. Our results demonstrate that total glycan content is a superior indicator than individual glycoprotein expression for assessing infection inhibition effect generated by cell membrane glycocalyx. These results are consistent with our hypothesis regarding the additive nature of the nonspecific inhibitory effects of each glycoprotein.

    1. Reviewer #1 (Public review):

      Summary:

      The manuscript analyses the effects of deleting the TgfbR1 and TgfbR2 receptors from endothelial cells at postnatal stages on vascular development and blood-retina barrier maturation in the retina. The authors find that deletion of these receptors affects vascular development in the retina but importantly it affects the infiltration of immune cells across the vessels in the retina. The findings demonstrate that Tgf-beta signaling through TgfbR1/R2 heterodimers regulates primarily the immune phenotypes of endothelial cells in addition to regulating vascular development, but has minor effects on the BRB maturation. The data provided by the authors provides a solid support for their conclusions.

      Strengths:

      (1) The manuscript uses a variety of elegant genetic studies in mice to analyze the role of TgfbR1 and TgfbR2 receptors in endothelial cells at postnatal stages of vascular development and blood-retina barrier maturation in the retina.

      (2) The authors provide a nice comparison of the vascular phenotypes in endothelial-specific knockout of TgfbR1 and TgfbR2 in the retina (and to a lesser degree in the brain) with those from Npd KO mice (loss of Ndp/Fzd4 signaling) or loss of VEGF-A signaling to dissect the specific roles of Tgf-beta signaling for vascular development in the retina.

      (3) The snRNAseq data of vessel segments from the brains of WT versus TgfbR1 -iECKO mice provides a nice analysis of pathways and transcripts that are regulated by Tgf-beta signaling in endothelial cells.

      Weaknesses (Original Submission):

      (1) The authors claim that choroidal neovascular tuft phenotypes are similar in TgfbrR1 KO and TgfbrR2 KO mice. However, the phenotypes look more severe in the TgfbrR1 KO rather than TgfbrR2 KO mice. Can the authors show a quantitative comparison of the number of choroidal neovascular tufts per whole eye cross-section in both genotypes?

      (2) In the analysis of Sulfo-NHS-Biotin leakage in the retina to assess blood-retina barrier maturation, the authors claim that there is increased vascular leakage in the TgfbR1 KO mice. However, there does not seem like Sulfo-NHS-biotin is leaking outside the vessels. Therefore, it cannot be increased vascular permeability. Can the authors provide a detailed quantification of the leakage phenotype?

      (3) The immune cell phenotyping by snRNAseq seems premature as the number of cells is very small. The authors should sort for CD45+ cells and perform single cell RNA sequencing.

      (4) The analysis of BBB leakage phenotype in TgfbR1 KO mice needs to be more detailed and include some tracers in addition to serum IgG leakage.

      (5) A previous study (Zarkada et al., 2021, Developmental Cell) showed that EC-deletion of Alk5 affects the D tip cells. The phenotypes of those mice look very similar to those shown for TgfbrR1 KO mice. Are D tip cells lost in these mutants by snRNAseq?

      Comments on revisions:

      The authors have addressed the major weaknesses that I raised with the original submission adequately in the revised manuscript.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Weaknesses: 

      (1) The authors claim that choroidal neovascular tuft phenotypes are similar in TgfbrR1 KO and TgfbrR2 KO mice. However, the phenotypes look more severe in the TgfbrR1 KO rather than TgfbrR2 KO mice. Can the authors show a quantitative comparison of the number of choroidal neovascular tufts per whole eye cross-section in both genotypes? 

      Thank you for asking about this.  Each VE-cad-CreER;TGFBR1 CKO/- and VE-cad-CreER;TGFBR2 CKO/- retina exhibits multiple zones of choroidal neovascularization.  The examples in Figures 1 and Figure 1 – Figure supplements 1 and 2 are mostly from retinas with loss of TGFBR1, but we could have chosen similar examples from retinas with loss of TGFBR2.  The quantification in the original version of Figure 1- Figure supplement 1 panel C had a labeling error.  It actually showed the quantification choroidal neovascularization (CNV) in the sum of both VE-cad-CreER;TGFBR1 CKO/- and VE-cad-CreER;TGFBR2 CKO/- retinas, not only in VE-cad-CreER;TGFBR1 CKO/- retinas as originally labeled.  The point that it made is that CNV is seen with loss of TGF-beta signaling but not in control retinas or retinas with loss of Norrin signaling.  We have now updated that plot by separating the data points for VE-cad-CreER;TGFBR1 CKO/- and VE-cad-CreER;TGFBR2 CKO/- retinas, so that they can be compared to each other.   The result shows ~2.5-fold more CNV in VE-cad-CreER;TGFBR2 CKO/- retinas compared to VE-cad-CreER;TGFBR1 CKO/-.  We think it likely that a more extensive sampling would show little or no difference between these two genotypes – but the data is what it is. This is now described in the Results section. 

      We have also added a panel D to Figure 1- Figure supplement 1, which shows a retina flatmount analysis of CNV.  This is done by mounting the retina with the photoreceptor side up so that the outer retina can be optimally imaged. 

      (2) In the analysis of Sulfo-NHS-Biotin leakage in the retina to assess blood-retina barrier maturation. The authors claim that there is increased vascular leakage in the TgfbR1 KO mice. However, it does not seem like Sulfo-NHS-biotin is leaking outside the vessels. Therefore, it cannot be increased vascular permeability. Can the authors provide a detailed quantification of the leakage phenotype? 

      Thank you for raising this point.  Your comment prompted us to look at this question in greater depth with more experiments.  We have expanded Figure 2 to show and quantify a comparison between control (i.e. phenotypically WT), NdpKO, and TGFBR1 endothelial KO and we have expanded the associated part of the Results section (Figure 2C and D).  In a nutshell, control retinas show little Sulfo-NHS-biotin accumulation in or around the vasculature or in the parenchyma; NdpKO retinas show Sulfo-NHS-biotin accumulation in the vasculature and in the parenchyma (i.e., the area between the vessels); and VEcadCreER;Tgfbr1CKO/- retinas show Sulfo-NHS-biotin accumulation in the vascular tufts with minimal accumulation in the non-tuft vasculature and minimal leakage into the parenchyma.   The conclusion is that the bulk of the retinal vasculature in TGFBR1 endothelial KO mice is minimally or not at all leaky – very different from the situation with loss of Norrin/Frizzled4 signaling.

      (3) The immune cell phenotyping by snRNAseq is premature, as the number of cells is very small. The authors should sort for CD45+ cells and perform single-cell RNA sequencing. 

      Thank you for raising this point.  For the revised manuscript, we have performed additional snRNAseq analyses using the same tissue processing protocol as for our original snRNAseq data.  We have opted to homogenize the tissue and prepare nuclei (our original method) rather than dissociate the tissue and FACS sorting for CD45+ cells because the nuclear isolation approach is unbiased – we assume that nuclei from all cell types are present after tissue homogenization.  By contrast, we cannot be certain that CD45 FACS will capture the full range of immune cells since some cells may not express CD45, may express CD45 at low level, or may be tightly adherent to other cells, such as vascular endothelial cell.  Additionally, by following the original protocol, we can combine the original snRNAseq dataset and the new snRNAseq dataset.  In the revised manuscript we present the snRNAseq data from the combination of the original and the more recent snRNAseq datasets (revised Figure 4; N=628 immune cell nuclei).  The new analysis comes to the same conclusions as the original analysis: the immune cell infiltrate in the mutant retinas is composed of a wide variety of immune cells.

      (4) The analysis of BBB leakage phenotype in TgfbR1 KO mice needs to be more detailed and include tracers as well as serum IgG leakage. 

      As described in our response to query 2, we have conducted additional experiments to look at vascular leakage in control, VE-cad-CreER;TGFBR1 CKO/-, and NdpKO retinas.  We have also looked at Sulfo-NHS-biotin leakage in the VE-cadCreER;TGFBR1 CKO/- brain, and it is indistinguishable from WT controls.  Since Sulfo-NHS-biotin is a low MW tracer (<1,000 kDa), this implies that loss of TGF-beta signaling does not increase non-specific diffusion of either low or high MW molecules.  Therefore, the elevated levels of IgG in the brain parenchyma in young VE-cad-CreER;TGFBR1 CKO/- mice (Figure 8A) likely represents specific transport of IgG across the BBB.  Such transport is known to occur via Fc receptors expressed on vascular endothelial cells, although it is normally greater in the brain-to-blood direction than in the blood-to-brain direction.  For example, see Lafrance-Vanasse et al (2025) Leveraging neonatal Fc receptor (FcRn) to enhance antibody transport across the blood brain barrier.  Nat Commun. 16:4143.  This is now described in greater detail in the Results section.

      (5) A previous study (Zarkada et al., 2021, Developmental Cell) showed that EC-deletion of Alk5 affects the D tip cells. The phenotypes of those mice look very similar to those shown for TgfbrR1 KO mice. Are D-tip cells lost in these mutants by snRNAseq? 

      Please note: Alk5 is another name for TGFBR1.  This is noted in the second sentence of paragraph 4 of the Introduction.  The reviewer is correct: there are a lot of similarities because these are exactly the same KO mice.  Also, Zarkada and we used the same VEcadCreER to recombine the CKO allele.  The proposed snRNAseq analysis would serve as an independent check on the diving (D) tip vs stalk cell analyses published in Zarkada et al (2021) Specialized endothelial tip cells guide neuroretina vascularization and blood-retina-barrier formation. Dev Cell 56:2237-2251.  We have not gone in this direction because the question of tip vs. stalk cells and of subtypes of tip cells in WT vs. mutant retinas is beyond our focus on choroidal neovascularization and the role of immune cells and vascular inflammation.  The proposed snRNAseq analysis would also require a major effort since tip cells are rare and must be harvested from large numbers of early postnatal retinas followed by FACS enrichment for vascular endothelial cells.  Finally, we have no reason to doubt the results of Zarkada et al.

      Reviewer #2 (Public review): 

      Summary:

      The authors meticulously characterized EC-specific Tgfbr1, Tgfbr2, or double knockout in the retina, demonstrating through convincing immunostaining data that loss of TGF-β signaling disrupts retinal angiogenesis and choroidal neovascularization. Compared to other genetic models (Fzd4 KO, Ndp KO, VEGF KO), the Tgfbr1/2 KO retina exhibits the most severe immune cell infiltration. The authors proposed that TGF-β signaling loss triggers vascular inflammation, attracting immune cells - a phenotype specific to CNS vasculature, as non-CNS organs remain unaffected. 

      Strengths: 

      The immunostaining results presented are clear and robust. The authors performed well-controlled analyses against relevant mouse models. snRNA-seq corroborates immune cell leakage in the retina and vascular inflammation in the brain. 

      Weaknesses: 

      The causal link between TGF-β loss, vascular inflammation, and immune infiltration remains unresolved. The authors' model posits that EC-specific TGF-β loss directly causes inflammation, which recruits immune cells. However, an alternative explanation is plausible: Tgfbr1/2 KO-induced developmental defects (e.g., leaky vessels) permit immune extravasation, subsequently triggering inflammation. The observations that vein-specific upregulation of ICAM1 staining and the lack of immune infiltration phenotypes in the non-CNS tissues support the alternative model. Late-stage induction of Tgfbr1/2 KO (avoiding developmental confounders) could clarify TGF-β's role in retinal angiogenesis versus anti-inflammation. 

      Thank you for raising this point.  Your comment prompted us to look at this question in greater depth with more experiments.  We have expanded Figure 2 to show and quantify a comparison between control (i.e. phenotypically WT), NdpKO, and TGFBR1 endothelial KO and we have expanded the associated part of the Results section (Figure 2C and D).  In a nutshell, control retinas show little Sulfo-NHS-biotin accumulation in or around the vasculature or in the parenchyma; NdpKO retinas show Sulfo-NHS-biotin accumulation in the vasculature and in the parenchyma (i.e., the area between the vessels); and VEcadCreER;Tgfbr1CKO/- retinas show Sulfo-NHS-biotin accumulation in the vascular tufts with minimal accumulation in the non-tuft vasculature and minimal leakage into the parenchyma.   The conclusion is that the bulk of the retinal vasculature in TGFBR1 endothelial KO mice is minimally or not at all leaky – very different from the situation with loss of Norrin/Frizzled4 signaling.

      In the revised manuscript, we have expanded the Discussion section to address the two alternative hypotheses raised by the reviewer.  Here are the relevant data in a nutshell: (1) vascular leakage into the parenchyma, as measured with sulfo-NHSbiotin, in TGFBR1 endothelial CKO retinas is far less than in NdpKO retinas, where nearly all ECs convert to a fenestration+ (PLVAP+) phenotype and there is leakage of sulfo-NHS-biotin, (2) ICAM1 in ECs in TGFBR1 endothelial CKO retinas increases several-fold more than in NdpKO or Frizzled4KO retinas, (3) TGFBR1 endothelial CKO retinas have more infiltrating immune cells than NdpKO or Frizzled4KO retinas, and (4) in TGFBR1 endothelial CKO retinas large numbers of immune cells are observed within and adjacent to blood vessels.  We think that the simplest explanation for these data is that loss of TGFbeta signaling in ECs causes an endothelial inflammatory state with enhanced immune cell extravasation.  That said, the case for this model is not water-tight, and there could be less direct mechanisms at play.  In particular, this model does not explain why the inflammatory phenotype is limited to CNS (and especially retinal) vasculature.

      Regarding the last sentence of the reviewer’s comment (“Late stage induction…”), we have tried activating CreER recombination at different ages and we observe a large reduction in the inflammatory phenotype when recombination is initiated after vascular development is complete.   This observation suggests that the vascular developmental/anatomic defect – and perhaps the resulting retinal hypoxia response – is required for the inflammatory phenotype.  In the revised manuscript we have expanded the Results and Discussion sections to describe this observation.

      Reviewer #1 (Recommendations for the authors): 

      Suggestions for experiments: 

      (1) The authors need to show a quantitative comparison of the number of choroidal neovascular tufts per whole eye crosssection in both genotypes (TgfbR1 and TgfbR2 KO mice). 

      Thank you for raising this point.  The quantification in the original version of Figure 1- Figure supplement 1 panel C was mis-labeled.  It quantifies choroidal neovascularization (CNV) in both VE-cad-CreER;TGFBR1 CKO/- and VE-cadCreER;TGFBR2 CKO/- retinas, not VE-cad-CreER;TGFBR1 CKO/- retinas only as originally labeled.  The point it makes is that CNV is seen with loss of TGF-beta signaling but not in control retinas or retinas with loss of Norrin signaling.  We have now corrected that plot by separating the data points for VE-cad-CreER;TGFBR1 CKO/- and VE-cad-CreER;TGFBR2 CKO/- retinas, so that they can be compared to each other.   The result shows ~2.5-fold more CNV in VE-cad-CreER;TGFBR2 CKO/- retinas compared to VE-cad-CreER;TGFBR1 CKO/-.  This is now described in the Results section. 

      (2) In the analysis of Sulfo-NHS-Biotin leakage in the retina to assess blood-retina barrier maturation. The authors should provide a detailed quantification of the leakage phenotype outside the vessels into the CNS parenchyma, both in the retina and brain, in TgfbR1 KO mice. 

      Thank you for raising this point.  There is no detectable Sulfo-NHS-biotin leakage into the brain parenchyma in VE-cadCreER;TGFBR1 CKO/- mice.  We have expanded Figure 2 to show and quantify the data for retinal vascular leakage (Figure 2C and D).  The data show that in VE-cad-CreER;TGFBR1 CKO/- mice there is accumulation of Sulfo-NHS-biotin in the vascular tufts but minimal accumulation elsewhere in the retinal vasculature and minimal leakage of Sulfo-NHS-biotin into the retinal parenchyma.

      (3) The immune cell phenotyping by snRNAseq is premature, as the number of cells is very small. The authors should sort for CD45+ cells and perform single-cell RNA sequencing to ascertain these preliminary data. 

      Thank you for raising this point.  We have performed additional snRNAseq analyses using the same tissue processing protocol as for our original snRNAseq data to increase the numbers of cells.  We have opted to homogenize the tissue and prepare nuclei (our original method) rather than dissociating the cells and FACS sorting for CD45+ cells because the nuclear isolation approach is unbiased – we assume that nuclei from all cell types are present.  By contrast, we cannot be certain that CD45 FACS will capture the full range of immune cells, since some cells may not express CD45, may express CD45 at low level, or may be tightly adherent to other cells, such as vascular endothelial cell.  Additionally, by following the original protocol, we can combine the original snRNAseq dataset of and the new snRNAseq dataset.  In the revised manuscript we present the snRNAseq data from the combination of the original and the more recent snRNAseq datasets (revised Figure 4; N=628 immune cell nuclei).  The new analysis comes to the same conclusion as in the original submission, namely that the immune cell infiltrate in the mutant retinas is composed of a wide variety of immune cells.  The Results section has been expanded to describe this new data and analysis.    

      (4) The analysis of BBB leakage phenotype in TgfbR1 KO mice needs to be more detailed and include tracers as well as serum IgG leakage. 

      Sulfo-NHS biotin leakage in the VE-cad-CreER;TGFBR1 CKO/- brain is minimal, and it is indistinguishable from WT controls.  Since Sulfo-NHS biotin is a low MW tracer (<1,000 kDa), this implies that loss of TGF-beta signaling does not increase non-specific diffusion of either low or high MW molecules.  Therefore, the elevated levels of IgG in the brain parenchyma in young VE-cad-CreER;TGFBR1 CKO/- mice (Figure 8A) likely represents specific transport of IgG across the BBB.  Such transport is known to occur via Fc receptors expressed on vascular endothelial cells, although it is normally greater in the brain-to-blood direction than in the blood-to-brain direction.  For example, see Lafrance-Vanasse et al (2025) Leveraging neonatal Fc receptor (FcRn) to enhance antibody transport across the blood brain barrier.  Nat Commun. 16:4143.  This is now described in greater detail in the Results section.

      (5) The authors should perform a more detailed RNAseq analysis of tip and stack (stalk) cells in TgfbrR1 KO mice to determine whether D tip cells are lost in these mutants by snRNAseq. 

      The proposed snRNAseq analysis would serve as an independent check on the diving (D) tip vs stalk cell analyses published by Zarkada et al, who analyzed the same VE-cad-CreER;TGFBR1 CKO/- mutant mice, although they refer to the TGFBR1 gene by its alternate name ALK5 [Zarkada et al (2021) Specialized endothelial tip cells guide neuroretina vascularization and blood-retina-barrier formation. Dev Cell 56:2237-2251].  We have not gone in this direction because the question of tip vs. stalk cells and of subtypes of tip cells in WT vs. mutant retinas is beyond our focus on choroidal neovascularization and the role of immune cells and vascular inflammation.  The proposed snRNAseq analysis would also require a major effort since tip cells are rare and must be harvested from large numbers of early postnatal retinas followed by FACS enrichment for vascular endothelial cells.

      Suggestions for improving the manuscript:  

      (6) The statement that ECs acquire properties of immune cells (Page 2, Line 90) is incorrect. Endothelial cells may acquire characteristics of antigen presenting cells. 

      Thank you for that correction.  Based on the review from Amersfoort et al (2022) (Amersfoort J, Eelen G, Carmeliet P. (2022) Immunomodulation by endothelial cells - partnering up with the immune system? Nat Rev Immunol 22:576-588) and the articles cited in it, we have changed the sentence to “Although vascular endothelial cells (ECs) are not generally considered to be part of the immune system, in some locations and under some conditions they acquire properties characteristic of immune cells, including secretion of cytokines, surface display of co-stimulatory or co-inhibitory receptors, and antigen presentation in association with MHC class II proteins (Pober and Sessa, 2014; Amersfoort et al., 2022).”  

      (7) The statement in Page 3, Line 100-101 [In CNS ECs, quiescence is maintained in part by the actions of astrocyte-derived Sonic Hedgehog, with the result that few immune cells other than resident microglia are found within the CNS (Alvarez et al., 2011).] is incomplete. Wnt signaling also suppresses the expression of leukocyte adhesion molecules from endothelial cells and therefore helps with immune cell quiescence. 

      Thank you for raising that point.  We have expanded that sentence to include Wnt signaling in CNS endothelial cells, as described in the following reference: Lengfeld JE, Lutz SE, Smith JR, Diaconu C, Scott C, Kofman SB, Choi C, Walsh CM, Raine CS, Agalliu I, Agalliu D. (2017) Endothelial Wnt/beta-catenin signaling reduces immune cell infiltration in multiple sclerosis. Proc Natl Acad Sci USA 114:E1168-E1177.

      (8) It may be beneficial for the reader to separate the results of the vascular phenotypes related to choroidal neovascularization compared to retinal vascular development. 

      Thank you for this suggestion.  The two topics are partly overlapping: choroidal neovascularization is described in Figure 1, and retinal development is described in Figures 1 and 2.  The challenge is that some of same images illustrate both phenotypes as in Figure 1, so the topics cannot be easily separated.

      (9) In addition to comparing the phenotypes in Tgfb signaling mutant mice with Wnt signaling and VEGF-A signaling mutants, the authors should compare and contrast their data with those found in Alk5 KO mice, as there are a lot of similarities. 

      The reviewer has alerted us to a nomenclature challenge which we will try to resolve in the introduction: Alk5 is just another name for TGFBR1.  The reviewer is correct: there are a lot of similarities between the present study and that of Zarkada et al (2021) because both use the same TGFBR1(=Alk5) CKO mice.

      Reviewer #2 (Recommendations for the authors): 

      Figure 2 

      For 2B, the authors should clarify whether the two regions shown in the Tgfbr1 KO retina (P14) represent central vs. peripheral areas, as phenotype severity varies. 

      For 2C, does the uneven biotin accumulation reflect developmental gradients (e.g., central-peripheral maturation timing)? 

      Thank you for raising these points.  Regarding Figure 2B, these images are all from the mid-peripheral retina, where the phenotype is moderately severe.  This is now noted in the figure legend.

      Regarding Figure 2C, the reviewer is correct that the pattern of Sulfo-NHS-biotin is uneven in VEcadCreER;Tgfbr1CKO/- retinas – it accumulates only in the tufts.  We have expanded Figure 2C to show a comparison between control (i.e.

      phenotypically WT), NdpKO, and TGFBR1 endothelial KO retinas, and we have expanded the associated part of the Results section.  In a nutshell, control retinas show little Sulfo-NHS-biotin accumulation in the vasculature or in the parenchyma; NdpKO retinas show Sulfo-NHS-biotin accumulation in the vasculature and in the parenchyma (i.e., the area between the vessels); and VEcadCreER;Tgfbr1CKO/- retinas show Sulfo-NHS-biotin accumulation in the vascular tufts with minimal accumulation in the non-tuft vasculature and minimal leakage into the parenchyma.   The conclusion is that the bulk of the retinal vasculature in TGFBR1 endothelial KO mice is not leaky – very different from the situation with loss of Norrin/Frizzled4 signaling.

      Figure 6 

      The claim that PECAM1+ rings on veins reflect EC-immune cell binding is uncertain, as PECAM1 is also known to be expressed by immune cells. The complete correlation of PECAM1 and CD45 staining signals suggests that a subset of immune cells upregulates PECAM1. The VEcadCreER;Tgfbr1 flox/-; SUN1:GFP reporter would be helpful to delineate ECimmune cell proximity. Super-resolution imaging with Z-stacks could also resolve spatial relationships (luminal vs. abluminal immune cell adhesion). 

      Thank you for this comment.  The reviewer is correct that, at the resolution of these images, we cannot determine whether the PECAM1 immunostaining signal is derived from ECs, from leukocytes, or from both.  This is now stated in the Results section.  The PECAM1-rich endothelial ring structure associated with leukocyte extravasation has been characterized in various publications, for example in (1) Carman CV, Springer TA. (2004) A transmigratory cup in leukocyte diapedesis both through individual vascular endothelial cells and between them. J Cell Biol 167:377-388 and (2) Mamdouh Z, Mikhailov A, Muller WA. (2009) Transcellular migration of leukocytes is mediated by the endothelial lateral border recycling compartment. J Exp Med 206:2795-2808.  The ring structures visualized in Figure 6D by PECAM1 immunostaining conform to the ring structures described in these and other papers.  In showing these structures, our point is simply that they likely represent sites of leukocyte extravasation.  This is now clarified in the text.  We have also added some additional references on leukocyte extravasation and the ring structures.

      Figure 7 

      A time-course analysis of ICAM1 would strengthen the mechanistic model. Does ICAM1 upregulation precede immune infiltration (supporting inflammation as the primary defect)? Given that immune cells appear by P14 (per snRNA-seq), is ICAM1 elevated earlier? 

      This is an interesting idea, but based on what is known about leukocyte adhesion and extravasation we predict that there will not be a clean temporal separation between ICAM1 induction and leukocyte adhesion/infiltration.  That is, if the proinflammatory state causes an increase in the number of leukocytes, then as ICAM1 levels increase, leukocyte adhesion would also increase.  Similarly, if the presence of leukocytes increases the pro-inflammatory state, then as the number of leukocytes increases, the levels of ICAM1 would be predicted to increase.  Thus, we think that a time course analysis is unlikely to provide a definitive conclusion.

      Figure 8-SF1 

      In brain slices, a transient pan-IgG accumulation suggests a self-resolving defect in the BBB. However, this BBB impairment appears to be spatiotemporally distinct from ICAM1 upregulation. ICAM1 staining is restricted to the lesion site, aligning with immune cell-driven inflammation. 

      Thank you for raising these points.  The reviewer is correct that these observations don’t fit together in a clear way.  There does not appear to be a general increase in brain vascular permeability in VE-cad-CreER;TGFBR1 CKO/- mice, as shown by sulfo-NHS-biotin.  However, there is a large and transient increase in IgG in the brain parenchyma, suggestive of a general vascular alteration, and – as the reviewer correctly notes – it is not accompanied by a generalized increase in ICAM1 vascular immunostaining.  At this point, we don’t have any real insight into the mechanistic basis of the transient IgG increase.

      Thank you for handling this manuscript.

    1. Reviewer #1 (Public review):

      Summary:

      Zhang et al. addressed the question of whether advantageous and disadvantageous inequality aversion can be vicariously learned and generalized. Using an adapted version of the ultimatum game (UG), in three phases, participants first gave their own preference (baseline phase), then interacted with a "teacher" to learn their preference (learning phase), and finally were tested again on their own (transfer phase). The key measure is whether participants exhibited similar choice preference (i.e., rejection rate and fairness rating) influenced by the learning phase, by contrasting their transfer phase and baseline phase. Through a series of statistical modeling and computational modeling, the authors reported that both advantageous and disadvantageous inequality aversion can indeed be learned (Study 1), and even be generalised (Study 2).

      Strengths:

      This study is very interesting, that directly adapted the lab's previous work on the observational learning effect on disadvantageous inequality aversion, to test both advantageous and disadvantageous inequality aversion in the current study. Social transmission of action, emotion, and attitude have started to be looked at recently, hence this research is timely. The use of computational modeling is mostly appropriate and motivated. Study 2 that examined the vicarious inequality aversion on conditions where feedback was never provided is interesting and important to strengthen the reported effects. Both studies have proper justifications to determine the sample size.

      Weaknesses:

      Despite the strengths, a few conceptual aspects and analytical decisions have to be explained, justified, or clarified.

      INTRODUCTION/CONCEPTUALIZATION

      (1) Two terms seem to be interchangeable, which should not, in this work: vicarious/observational learning vs preference learning. For vicarious learning, individuals observe others' actions (and optionally also the corresponding consequence resulted directly by their own actions), whereas, for preference learning, individuals predict, or act on behalf of, the others' actions, and then receive feedback if that prediction is correct or not. For the current work, it seems that the experiment is more about preference learning and prediction, and less so about vicarious learning. But the intro and set are heavily around vicarious learning, and late the use of vicarious learning and preference learning is rather mixed in the text. I think either tone down the focus on vicarious learning, or discuss how they are different. Some of the references here may be helpful: Charpentier et al., Neuron, 2020; Olsson et al., Nature Reviews Neuroscience, 2020; Zhang & Glascher, Science Advances, 2020

      EXPERIMENTAL DESIGN

      (2) For each offer type, the experiment "added a uniformly distributed noise in the range of (-10 ,10)". I wonder how this looks like? With only integers such as 25:75, or even with decimal points? More importantly, is it possible to have either 70:30 or 90:10 option, after adding the noise, to have generated an 80:20 split shown to the participants? If so, for the analyses later, when participants saw the 80:20 split, which condition did this trial belong to? 70:30 or 90:10? And is such noise added only to the learning phase, or also to the baseline/transfer phases? This requires some clarification.

      (3) For the offer conditions (90:10, 70:30, 50:50, 30:70, 10:90) - are they randomized? If so, how is it done? Is it randomized within each participants, and/or also across participants (such that each participant experienced different trial sequences)? This is important, as the order especially for the leanring phase can largely impact on the preference learning of the participants.

      STATISTICAL ANALYSIS & COMPUTATIONAL MODELING

      (4) In Study 1 DI offer types (90:10, 70:30), the rejection rate for DI-AI averse looks consistently higher than that for DI averse (ie, blue line is above the yellow line). Is this significant? If so, how come? Since this is a between-subject design, I would not anticipate such a result (especially for the baseline). Also, for the LME results (eg, Table S3), only interactions were reported but not the main results.

      (5) I do not particularly find this analysis appealing: "we examined whether participants' changes in rejection rates between Transfer and Baseline, could be explained by the degree to which they vicariously learned, defined as the change in punishment rates between the first and last 5 trials of the Learning phase." Naturally, participants' behavior in the first 5 trials in the learning phase will be similar to those in the baseline; and their behavior in the last 5 trials in the learning phase would echo those at the transfer phase. I think it would be stronger to link the preference learning results to the chance between baseline and transfer phase, eg, by looking at the difference between alpha (beta) at the end of the learning phase and the initial alpha (beta).

      (6) I wonder if data from the baseline and transfer phases can also be modeled, using a simple Fehr-Schimdt model? This way, the change in alpha/beta can also be examined between the baseline and transfer phase.

      (7) I quite liked Study 2 that tests the generalization effect, and I expected to see an adapted computational modeling to directly reflect this idea. Indeed, the authors wrote "[...] given that this model [...] assumes the sort of generalization of preferences between offer types [...]". But where exactly did the preference learning model assumed the generalization? In the methods, the modeling seems to be only about Study 1; did the authors advise their model to accommodate Study 2? The authors also ran simulation for the learning phase in Study 2 (Figure 6), and how did the preference updated (if at all) for offers (90:10 and 10:90) where feedback was not given? Extending/Unpacking the computational modeling results for Study2 will be very helpful for the paper.

      Comments on revisions:

      I kept my original public review, so that future readers can see the progress and development of the manuscript.

      The authors have largely addressed my original questions/concerns, and I have two outstanding comments.

      (a) Related to my original comment #6, where I suggested to apply the F-S model also to the baseline and transfer phase. The authors were inclined not to do it, but in fact later in comment #7 and in the manuscript they opted to use a more complex F-S-based model to their learning phase. I agree that the rejection rate is indeed a clear indication, but for completeness, it'd be more consistent and compelling if the paper follows a model-free (model-agnostic) and model-based approach in all phases of the experiment.

      (b) Related to my original comment #4, I appreciate that the authors have provided more details of their LMM models. But I don't think it is accurate regardless. First, all offer levels (50:50, 30:70, 10:90), should not be coded as pure categorical levels. In fact, they have an ordinal meaning, a single ordinal predictor with three levels should be used. This also avoids the excessive number of interactions the authors have pointed out.

      Second, running a model with only interactions without main effects is flawed. All textbooks on stats emphasize that without the presence of the main effects, the interpretation of interaction only is biased.

      So these LMMs needs to be revised before the manuscript eventually gets to a version of record.

    2. Reviewer #2 (Public review):

      Summary:

      This study investigates whether individuals can learn to adopt egalitarian norms that incur a personal monetary cost, such as rejecting offers that benefit them more than the giver (advantageous inequitable offers). While these behaviors are uncommon, two experiments aim to demonstrate that individuals can learn to reject such offers by observing a "teacher" who follows these norms. The authors use computational modelling to argue that learners adopt these norms through a sophisticated process, inferring the latent structure of the teacher's preferences, akin to theory of mind.

      Strengths:

      This paper is well-written and tackles an important topic relevant to social norms, morality, and justice. The findings are promising (though further control conditions are necessary to support the conclusions). The study is well-situated in the literature, with a clever experimental design and a computational approach that may offer insights into latent cognitive processes. In the revision, the authors clarified some questions related to the initial submission.

      Weaknesses:

      Despite these strengths, I remain unconvinced that the current evidence supports the paper's central claims. Below, I outline several issues that, in my view, limit the strength of the conclusions.

      (1) Experimental Design and Missing Control Condition:

      The authors set out to test whether observing a "teacher" who is averse to advantageous inequity (Adv-I) will affect observers' own rejection of Adv-I offers. However, I think the design of the task lacks an important control condition needed to address this question. At present, participants are assigned to one of two teachers: DIS or DIS+ADV. Behavioral differences between these groups can only reveal relative differences in influence; they cannot establish whether (and how) either teacher independently affects participants' own behavior. For example, a significant difference between conditions can emerge even if participants are only affected by the DIS teacher and are not affected at all by the DIS+ADV teacher. What is crucially missing here is a no-teacher control condition, which can then be compared with each teacher condition separately. This control condition would also control for pure temporal effects unrelated to teacher influence (e.g., increasing Adv-I rejections due to guilt build-up).

      While this criticism applies to both experiments, it is especially apparent in Experiment 2. As shown in Figure 4, the interaction for 10:90 offers reflects a decrease in rejection rates following the DIS teacher, with no significant change following the DIS+ADV teacher. Ignoring temporal effects, this pattern suggests that participants may be learning NOT to reject from the DIS teacher, rather than learning to reject from the DIS+ADV teacher. On this basis, I do not see convincing evidence that participants' own choices were shaped by observing Adv-I rejections.

      In the Discussion, the authors write that "We found that participants' own Adv-I-averse preferences shifted towards the preferences of the Teacher they just observed, and the strength of these contagion effects related to the degree of behavior change participants exhibited on behalf of the Teachers, suggesting that they internalized, at least somewhat, these inequity preferences." However, there is no evidence that directly links the degree of behaviour change (on the teacher's behalf) to contagion effects (own behavioural change). I think there was a relevant analysis in the original version, but it was removed from the current version.

      (2) Modelling Efforts: The modelling approach is underdeveloped. The identification of the "best model" lacks transparency, as no model-recovery results are provided. Additionally, behavioural fits for the losing models are not shown, leaving readers in the dark about where these models fail. Readers would benefit from seeing qualitative/behavioural patterns that favour the winning model. Moreover, the reinforcement learning (RL) models used are overly simplistic, treating actions as independent when they are likely inversely related. For example, the feedback that the teacher would have rejected an offer provides evidence that rejection is "correct" but also that acceptance is "an error," and the latter is not incorporated into the modelling. In other words, offers are modelled as two-armed bandits (where separate values are learned for reject and accept actions), but the situation is effectively a one-armed bandit (if one action is correct, the other is mistaken). It is unclear to what extent this limitation affects the current RL formulations. Can the authors justify/explain their reasoning for including these specific variants? The manuscript only states Q-values for reject actions, but what are the Q-values for accept actions? This is unclear.

      In Experiment 2, only the preferred model is capable of generalization, so it is perhaps unsurprising that this model "wins." However, this does not strongly support the proposed learning mechanism, lacking a comparison with simpler generalizing mechanisms (see following comments).

      (3) Conceptual Leap in Modelling Interpretation: The distinction between simple RL models and preference-inference models seems to hinge on the ability to generalize learning from one offer to another. Whereas in the RL models, learning occurs independently for each offer (hence no cross-offer generalization), preference inference allows for generalization between different offers. However, the paper does not explore "model-free" RL models that allow generalization based on the similarity of features of the offers (e.g., payment for the receiver, payment for the offer-giver, who benefits more). Such models are more parsimonious and could explain the results without invoking a theory of mind or any modelling of the teacher. In such model versions, a learner acquires a functional form that allows prediction of the teacher's feedback based on offer features (e.g., linear or quadratic weighting). Because feedback for an offer modulates the parameters of this function (feature weights), generalization occurs without necessarily evoking any sophisticated model of the other person. This leaves open the possibility that RL models could perform just as well or even outperform the preference learning model, casting doubt on the authors' conclusions.

      Of note: even the behaviourists knew that when Little Albert was taught to fear rats, this fear generalized to rabbits. This could occur simply because rabbits are somewhat similar to rats. But this doesn't mean Little Albert had a sophisticated model of animals that he used to infer how they behave.

      In their rebuttal letter, the authors acknowledge these possibilities, but the manuscript still does not explore or address alternative mechanisms.

      (4) Limitations of the Preference-Inference Model: The preference-inference model struggles to capture key aspects of the data, such as the increase in rejection rates for 70:30 DI offers during the learning phase (e.g., Fig. 3A, AI+DI blue group). This is puzzling. Thinking about this, I realized the model makes quite strong, unintuitive predictions which are not examined. For example, if a subject begins the learning phase rejecting the 70:30 offer more than 50% of the time (meaning the starting guilt parameter is higher than 1.5), then, over learning, the tendency to reject will decrease to below 50% (the guilt parameter will be pulled down below 1.5). This is despite the fact that the teacher rejects 75% of the offers. In other words, as learning continues, learners will diverge from the teacher. On the other hand, if a participant begins learning by tending to accept this offer (guilt < 1.5), then during learning, they can increase their rejection rate but never above 50%. Thus, one can never fully converge on the teacher. I think this relates to the model's failure in accounting for the pattern mentioned above. I wonder if individuals actually abide by these strict predictions. In any case, these issues raise questions about the validity of the model as a representation of how individuals learn to align with a teacher's preferences (given that the model doesn't really allow for such an alignment).

      In their rebuttal letter, the authors acknowledged these anomalies and stated that they were able to build a better model (where anomalies are mitigated, though not fully eliminated). But they still report the current model and do not develop/discuss alternatives. A more principled model may be a Bayesian model where participants learn a belief distribution (rather than point estimates) regarding the teacher's parameters.

      (5) Statistical Analysis: The authors state in their rebuttal letter that they used the most flexible random effect structure in mixed-effects models. But this seems not to be the case in the model reported in Table SI3 (the very same model was used for other analyses too). Indeed, here it seems only intercepts are random effects. This left me confused about which models were used.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review): 

      Most human traits and common diseases are polygenic, influenced by numerous genetic variants across the genome. These variants are typically non-coding and likely function through gene regulatory mechanisms. To identify their target genes, one strategy is to examine if these variants are also found among genetic variants with detectable effects on gene expression levels, known as eQTLs. Surprisingly, this strategy has had limited success, and most disease variants are not identified as eQTLs, a puzzling observation recently referred to as "missing regulation". 

      In this work, Jeong and Bulyk aimed to better understand the reasons behind the gap between disease-associated variants and eQTLs. They focused on immune-related diseases and used lymphoblastoid cell lines (LCLs) as a surrogate for the cell types mediating the genetic effects. Their main hypothesis is that some variants without eQTL evidence might be identifiable by studying other molecular intermediates along the path from genotype to phenotype. They specifically focused on variants that affect chromatin accessibility, known as caQTLs, as a potential marker of regulatory activity. 

      The authors present data analyses supporting this hypothesis: several disease-associated variants are explained by caQTLs but not eQTLs. They further show that although caQTLs and eQTLs likely have largely overlapping underlying genetic variants, some variants are discovered only through one of these mapping strategies. Notably, they demonstrate that eQTL mapping is underpowered for gene-distal variants with small effects on gene expression, whereas caQTL mapping is not dependent on the distance to genes. Additionally, for some disease variants with caQTLs but no corresponding eQTLs in LCLs, they identify eQTLs in other cell types. 

      Altogether, Jeong and Bulyk convincingly demonstrate that for immune-related diseases, discovering the missing disease-eQTLs requires both larger eQTL studies and a broader range of cell types in expression assays. It remains to be seen what fractions of the missing diseaseeQTLs will be discovered with either strategy and whether these results can be extended to other diseases or traits. 

      We thank the reviewer for their accurate summary of our study and positive review of our findings for immune-related diseases.

      It should be noted that the problem of "missing regulation" has been investigated and discussed in several recent papers, notably Umans et al., Trends in Genetics 2021; Connally et al., eLife 2022; Mostafavi et al., Nat. Genet. 2023. The results reported by Jeong and Bulyk are not unexpected in light of this previous work (all of which they cite), but they add valuable empirical evidence that mostly aligns with the model and discussions presented in Mostafavi et al. 

      We thank the reviewer for their positive review of our results and manuscript. As Reviewer #1 noted, whether our and others' observation extends to other diseases or traits is an open question. For instance, Figure 2b in Mostafavi et al., Nat. Genet. (2023) demonstrated that there was a spectrum of depletion of eQTLs and enrichment of GWAS signals in constrained genes across various tissues and traits, respectively. Therefore, gene expression constraint may play a larger or smaller role in different diseases or traits. That immune cell types and cell states are extremely diverse (Schmiedel et al., Cell (2018) and Calderon et al., Nat. Genet. (2019), just to name a few) likely adds to the complexity of gene regulation that contributes to immune-mediated disease.

      Reviewer #2 (Public Review): 

      Summary: 

      eQTLs have emerged as a method for interpreting GWAS signals. However, some GWAS signals are difficult to explain with eQTLs. In this paper, the authors demonstrated that caQTLs can explain these signals. This suggests that for GWAS signals to actually lead to disease phenotypes, they must be accessible in the chromatin. This implies that for GWAS signals to translate into disease phenotypes, they need to be accessible within the chromatin. 

      However, fundamentally, caQTLs, like GWAS, have the limitation of not being able to determine which genes mediate the influence on disease phenotypes. This limitation is consistent with the constraints observed in this study. 

      We thank the reviewer for their accurate summary of our results.

      (1) For reproducibility, details are necessary in the method section.

      Details about adding YRI samples in ATAC-seq: For example, how many samples are there, and what is used among public data? There is LCL-derived iPSC and differentiated iPSC (cardiomyocytes) data, not LCL itself. How does this differ from LCL, and what is the rationale for including this data despite the differences?

      Banovich et al., Genome Research (2018) (PMID: 29208628), who generated data using LCLderived iPSCs and differentiated iPSCs (cardiomyocytes), also generated ATAC-seq data from 20 YRI LCL samples. We analyzed those data to identify open chromatin regions (i.e., ATACseq peaks) in LCLs and merged the regions with open chromatin regions identified with 100 GBR LCL samples from two studies by Kumasaka et al. (Nature Genetics (2016)

      PMID: 26656845 and Nature Genetics (2019) PMID: 30478436). However, we restricted the caQTL analysis to only the 100 GBR samples because of possible ancestry effects and batch effects. We attempted caQTL analysis with the 20 YRI samples as well, but the result was noisy, likely due to smaller sample size and lower read depth of the ATAC-seq data.

      caQTL is described as having better power than eQTL despite having fewer samples. How does the number of ATAC peaks used in caQTL compare to the number of gene expressions used in eQTL?

      The number of ATAC peaks used in caQTL (99,320) is ~6.7 times greater than the number of genes (14,872) used in the eQTL analysis. Therefore, there is a higher chance of detecting a significant caQTL signal and a significant colocalization signal than there is for eQTLs. However, we reasoned that since distal eQTLs are more easily detected as caQTLs and since increasing the sample size of eQTLs through meta-analysis uncovered additional eQTL colocalization at loci with caQTL colocalization only, colocalized caQTLs are likely capturing disease-relevant regulatory effects.

      Details about RNA expression data: In the method section, it states that raw data (ERP001942) was accessed, and in data availability, processed data (E-GEUV-1) was used. These need to be consistent.

      Thank you for pointing this out. We used the processed data from Expression Atlas (https://www.ebi.ac.uk/gxa/experiments/E-GEUV-1/Results), and that's what we meant by "We downloaded RNA expression level data of the LCL samples from the Expression Atlas." We have revised the “RNA expression data preparation” section in our manuscript to make the text clearer.

      How many samples were used (the text states 373, but how was it reduced from the original 465, and the total genotype is said to be 493 samples while ATAC has n=100; what are the 20 others?), and it mentions European samples, but does this exclude YRI?

      We thank the reviewer for pointing out these points of confusion. Our reported count of 493 samples included YRI samples with RNA-seq data or ATAC-seq data that we ultimately did not use for QTL analyses. There were 373 European samples with RNA-seq data that we used for eQTL analysis, and 100 GBR samples (including some that overlap with the 373 European samples) that we used for caQTL analysis. We have revised the text to clarify these points.

      (2) Experimental results determining which TFs might bind to the representative signals of caQTL are required.

      We agree that caQTL colocalization is just the start of elucidating the regulatory mechanism of a GWAS locus. Determining which TFs are bound and which TFs' binding is altered would be necessary to describe the causal regulatory mechanism. For this, we utilized the Cistrome database to search for TFs whose binding overlaps the colocalized caQTL peaks. We present the results of this analysis in Supplementary Table 3 and Supplementary Figure 4, both of which we have added in our revised manuscript. Overall, protein factors associated with active transcription, such as POL2RA, and several immune cell TFs, including RUNX3, SPI1, and RELA, were frequently detected in those peaks. Detecting these factors in most peaks supports the likelihood that the colocalized caQTL peaks are active cis-regulatory elements. These results are consistent with our observation of enriched caQTL-mediated heritability in regions with active histone marks (Figure 1).

      (3) It is stated that caQTL is less tissue-specific compared to eQTL; would caQTL performed with ATAC-seq results from different cell types, yield similar results?

      We thank the reviewer for the question. Calderon et al. (PMID: 31570894) observed that "most effects on allelic imbalance (of ATAC-seq) were shared regardless of lineage or condition". Yet, there were regions where a different cell type or state would show inaccessibility (Figure 4d in Calderon et al.). Thus, we expect that ATAC-seq results from different cell types (e.g., T cells, B cells, monocytes, etc.) would lead to additional caQTLs showing colocalization at cell-typespecific open chromatin. However, if a region is accessible in both cell types, caQTL may be detected in both. Moreover, Alasoo et al., Nature Genetics (2018) (PMID: 29379200) observed that “many disease-risk variants affect chromatin structure in a broad range of cellular states, but their effects on expression are highly context specific.” In both studies, the authors investigated immune cell types, and there could be different observations in non-immune cell types and other diseases and traits.

      Reviewer #1 (Recommendations For The Authors): 

      I think it would strengthen the paper to explore gene-level differences in the discovery of caQTLs and eQTLs. For example, complex disease-relevant genes, on average, have more/longer regulatory domains (as shown by Wang and Goldstein, AJHG 2020; Mostafavi et al., Nat. Genet. 2023). Therefore, it is plausible that for such genes, caQTLs are much more easily discoverable than eQTLs due to (i) a larger mutational target size for caQTLs, and (ii) dispersion of expression heritability across multiple domains, which hampers the discovery of eQTLs but not caQTLs, which are studied independently of other domains in the region. In other words, discovered caQTLs and eQTLs likely vary in terms of their distance to genes (as the authors report), as well as their target genes.

      We thank the reviewer for the suggestion to explore gene-level differences. We expect that the effects of complex disease-relevant genes having more / longer regulatory domains, on average, to explain our observations. We agree on both of your points that there are many more regulatory elements that are captured as accessible regions than expressed genes and that genes often have multiple independent eQTLs leading to dispersion of heritability. The genelevel trend that we described was the distance of the regulatory element from the genes. Additional analyses would be a relevant future direction.

      Also considering gene-level analysis, Mostafavi et al. show that the types of biases they report for eQTLs also apply to other molecular QTLs. It would be valuable to compare GWAS hits with versus without caQTL colocalization. Similarly, it would be insightful to compare GWAS hits with both colocalized caQTLs and eQTLs to GWAS hits with colocalized caQTLs but no eQTLs in any of the cell types. 

      We thank the reviewer for the comment. Investigating for potential biases in the colocalized caQTL would be useful, but we considered it beyond the scope of this work. In terms of biological factors, we demonstrated through mediated heritability analyses that more accessible chromatin (based on ATAC-seq read coverage) and regions with active histone marks were enriched for autoimmune disease associations (Figure 1). Furthermore, as greater distance of the regulatory variant from the transcription start site significantly reduced the cis-heritability, we would expect that distance would play a major role, similar to Mostafavi et al.’s conclusions.

      I don't think the argument for the role of natural selection contributing to the "missing regulation" is presented accurately. Specifically, large eQTLs acting on top trait-relevant genes are under stronger selection and thus, on average, segregate at lower frequencies. This makes them difficult to discover in eQTL assays. However, if not lost, they contribute as much, if not more, to trait heritability than weaker eQTLs at the same gene because their larger effects compensate for their lower frequency. At the most extreme, selection should have a "flattening" effect (e.g., see Simons et al., PLOS Biol 2018; O'Connor et al., AJHG 2019): weak and strong eQTLs at the same gene are expected to contribute equally to heritability. Therefore, the statement "Consequently, only weak eQTL variants, often in regions distal to the gene's promoter, may remain and affect traits" is not correct. If this turns out to be empirically true, other models, such as pleiotropic selection, need to explain it. 

      We thank the reviewer for the correction. We agree with the comment and have revised the sentences in the introduction accordingly.

      It is worth speculating why caQTLs may be more consistent across cell types than cis-eQTLs. Additionally, readers may infer from the paper that the focus should shift from eQTLs to caQTLs, which may not be the authors' intention. Perhaps these approaches are complementary: caQTLs can help with TSS-distal disease variants, while finding the target gene and regulatory context is more straightforward with eQTL colocalization. Addressing these points in the discussion will be helpful.

      We appreciate the reviewer's suggestion to clarify the advantages of incorporating cis-eQTLs and caQTLs. Our argument is exactly as you put it, and we added a paragraph on this in the Discussion.

      I believe the authors could do more to contextualize their findings within the existing literature on the subject, particularly Umans et al., Trends in Genetics 2021; Connally et al., eLife 2022; and Mostafavi et al., Nat. Genet. 2023. For instance, Umans et al. suggest that "if most standard eQTLs are generally benign, increasing sample size and adding more tissue types in an effort to identify even more standard eQTLs may not help us to explain many more disease risk mutations". Conversely, Mostafavi et al. argue for a multipronged approach, which appears more aligned with the authors' conclusions.

      We followed the reviewer’s suggestion to place our work in the context of existing literature on this topic. Moreover, we clarified what our recommendations for future data generation are.

      I thought Figures 1C-D were unclear. 

      We added a sentence in the figure legend describing that stronger and more significant enrichment indicate that mediated heritability is concentrated in that subset.

      Reviewer #2 (Recommendations For The Authors): 

      Complete workflow figures for caQTL calling and eQTL calling are required. 

      To improve clarity of the caQTL and eQTL calling workflow, we added Supplementary Figure 1.

    1. Reviewer #2 (Public review):

      Summary:

      This manuscript reports high-resolution functional MRI data and MEG data revealing additional mechanistic information about an established paradigm studying how foveal regions of primary visual cortex (V1) are involved in processing peripheral visual stimuli. Because of the retinotopic organization of V1, peripheral stimuli should not evoke responses in the regions of V1 that represent stimuli in the center of the visual field (the fovea). However, functional MRI responses in foveal regions do reflect the characteristics of peripheral visual stimuli - this is a surprising finding first reported in 2008. The present study uses fMRI data with sub-millimeter resolution to study the how responses at different depths in the foveal gray matter do or don't reflect peripheral object characteristics during 2 different tasks: one in which observers needed to make detailed judgments about object identity, and one in which observers needed to make more coarse judgments about object orientation. FMRI results reveal interesting and informative patterns in these two conditions. A follow-on MEG study yields information about the timing of these responses. Put together, the findings settle some questions in the field and add new information about the nature of visual feedback to V1.

      Strengths:

      (1) Rigorous and appropriate use of "laminar fMRI" techniques.

      (2) The introduction does an excellent job of contextualizing the work.

      (3) Control experiments and analyses are designed and implemented well

      Weaknesses:

      (1) The use of the term "low order" to describe object orientation is potentially confusing. During review, the authors considered this issue and responded that they would continue with the use of the term low-order to describe object orientation because a low-pass spatial frequency filter would provide object orientation information. This is certainly a reasonable perspective; nonetheless, this reviewer thinks spatial frequencies that low are not readily represented by neurons in early visual cortex and it is common to use "low-order" to refer to features extracted in early visual areas, so I think this causes confusion.

      (2) The methods contain a nice description of the methods for "correcting the vascular-related signals". I'm guessing this is the method that removed, e.g., 22% of foveal voxels (previous paragraph), but it's not entirely clear whether the voxel selection methods described in the "correcting the vascular-related signals" are describing the same processing step referred to in the previous paragraph as "a portion of voxels was removed based on large vein distribution".

      (3) It is quite difficult to perform laminar analyses across multiple visual areas because distortion compensation is not perfect and registration of functional to anatomical data will always be a bit better in some places and a bit worse in others. An ideal manuscript would include some images showing registration quality in V1, LOC, and IPS regions for a few different participants, or include some kind of quality metric indicating the confidence in depth assignments in different regions.

      (4) For the decoding analysis, it would be helpful to have more information about how samples were defined for each condition -- were the beta values for entire blocks used as samples for each condition, or were separate timepoints during a block used in the SVM as repeated samples for each condition?

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      (1.1) The authors argue that low-level features in a feedback format could be decoded only from deep layers of V1 (and not superficial layers) during a perceptual categorization task. However, previous studies (Bergman et al., 2024; Iamshchinina et al., 2021) demonstrated that low-level features in the form of feedback can be decoded from both superficial and deep layers. While this result could be due to perceptual task or highly predictable orientation feature (orientation was kept the same throughout the experimental block), an alternative explanation is a weaker representation of orientation in the feedback (even before splitting by layers there is only a trend towards significance; also granger causality for orientation information in MEG part is lower than that for category in peripheral categorization task), because it is orthogonal to the task demand. It would be helpful if the authors added a statistical comparison of the strength of category and orientation representations in each layer and across the layers.

      We agree that the strength of feedback information is related to task demand. Specifically, we would like to highlight the relationship between task demand and feedback information in the superficial layer. Previous studies have shown that foveal feedback information is observed only when the task requires the identity information of the peripheral objects (Williams et al., 2008; Fan et al., 2016; Yu and Shim, 2016). In this study, we found that the deep layer represented both orientation and categorical feedback information, while the superficial layer only represented categorical information. This suggests that feedback information in the superficial layer may be related to (or enhanced by) the task demands. In other words, if the experimental design required participants to discriminate orientation rather than object identity, we would expect stronger orientation information in foveal V1 and significant decoding performance of orientation feedback information in the superficial layer of foveal V1. This assumption is consistent with the anatomical connections of the superficial layer, which not only receives feedback connections but also sends outputs to higher-level regions for further processing. This is also consistent with Iamshchinina et al.’s observation that, when orientation information had to be mentally rotated and reported (i.e., task-relevant), it was observed in both the superficial and deep layers of V1. Bergmann et al. observed illusory color information in the superficial layer of V1, which may reflect a combination of lateral propagation and feedback mechanisms in the superficial layer that support visual filling-in phenomena. We have revised the discussion in the manuscript: In other words, if the experimental design required participants to discriminate orientation rather than object identity, we would expect stronger orientation information in foveal V1 and significant decoding performance of orientation feedback information in the superficial layer of foveal V1. Recent studies (Iamshchinina et al., 2021; Bergman et al., 2024) have also highlighted the relationship between feedback information and neural representations in V1 superficial layer.

      To further demonstrate the laminar profiles of low- and high-order information, we have re-analyzed the data and added more fine-scale laminar profiles with statistical comparisons in the revised manuscript. The results again showed significant neural decoding performances in the deep layer of both category and orientation information, and only significant decoding performances of category information in the superficial layer.

      (1.2) The authors argue that category feedback is not driven by low-level confounding features embedded in the stimuli. They demonstrate the ability to decode orientations, particularly well represented by V1, in the absence of category discrimination. However, the orientation is not a category-discriminating feature in this task. It could be that the category-discriminating features cannot be as well decoded from V1 activity patterns as orientations. Also, there are a number of these category discriminating features and it is unclear if it is a variation in their representational strength or merely the absence of the task-driven enhancement that preempts category decoding in V1 during the foveal task. In other words, I am not sure whether, if orientation was a category-specific feature (sharpies are always horizontal and smoothies are vertical), there would still be no category decoding.

      The low-order features mentioned in the manuscript refer to visual information encoded intrinsically in V1, independent of task demands. In the foveal experiment, the task is to discriminate the color of fixation, which is unrelated to the category or orientation of the object stimuli. The results showed that only orientation information could be decoded from foveal V1. This indicates that low-order information, such as orientation, is strongly and automatically encoded in V1, even when it is irrelevant to the task. Meanwhile, category information could not be decoded, indicating that category information relies on feedback signals driven by attention or the task to the objects, both of which are absent in the fixation task. Other evidence indicates that category feedback is not driven by low-level features intrinsically encoded in V1. First, the laminar profiles of these two types of feedback information differ considerably (see response to 1.1). Second, only category feedback information was correlated with behavioral performance (MEG experiment). These findings demonstrate that category feedback information is task-driven and differs from the automatically encoded low-order information in foveal V1. The reviewer expressed some uncertainty that, whether “if orientation was a category-specific feature (sharpies are always horizontal and smoothies are vertical), there would still be no category decoding”. Our data showed that orientation could be automatically decoded in V1, regardless of task demand. Thus, if orientation was a category-specific feature in the foveal task (i.e., sharpies are always horizontal and smoothies are always vertical), category decoding would be successful in V1. However, in this scenario, the orientation and other shape features are not independent, thus preventing us to find out whether non-orientation shape features could be decoded in V1.  

      Reviewer #2 (Public review):

      (2.1) While not necessarily a weakness, I do not fully agree with the description of the 2 kinds of feedback information as "low-order" and "high-order". I understand the motivation to do this - orientation is typically considered a low-level visual feature. But when it's the orientation of an entire object, not a single edge, orientation can only be defined after the elements of the object are grouped. Also, the discrimination between spikies and smoothies requires detecting the orientations of particular edges that form the identifying features. To my mind, it would make more sense to refer to discrimination of object orientation as "coarse" feature discrimination, and orientation of object identity as "fine" feature discrimination. Thus, the sentence on line 83, for example, would read "Interestingly, feedback with fine and coarse feature information exhibits different laminar profiles.".

      We agree that the object orientation (invariant to object category or identity) is defined on a larger spatial scale than the local orientation features such as local edges, however, in this sense, the object orientation is a coarse feature. In contrast, the category-defining information is mainly contributed by the local shape information (i.e., little cubes vs. bumps), which is more fine-scale information. One way to look at this difference is that the object orientation information is mainly carried by low-spatial frequency information and will survive low-pass filtering, hence “coarse”; while the object category information would largely be lost if the objects underwent low-pass spatial filtering.

      We believe the labeling words “low-order” and “high-order” are consistent with the typical use of these terms in the literature, referring to features intrinsically encoded in early visual cortex vs. in high level object sensitive cortical regions. The more important aspects of our results are in their differential engagement in feedforward vs. feedback processing, with low-order features automatically represented in the early visual cortex during feedforward processing while high-order features represented due to feedback processing. Results from the foveal fMRI experiment (Exp. 2) strongly support this assumption that, when objects were presented at the fovea and the task was a fixation color task irrelevant to object information, foveal V1 could only represent orientation information, not category information. Notably, there was a dramatic difference in decoding performance in foveal V1 between Exp.1 and Exp.2, which ruled out the argument that both orientation and category information were driven by local edge information represented in V1.

      (2.2) Figure 2 and text on lines 185, and 186: it is difficult to interpret/understand the findings in foveal ROIs for the foveal control task without knowing how big the ROI was. Foveal regions of V1 are grossly expanded by cortical magnification, such that the central half-degree can occupy several centimeters across the cortical surface. Without information on the spatial extent of the foveal ROI compared to the object size, we can't know whether the ROI included voxels whose population receptive fields were expected to include the edges of the objects.

      The ROI of foveal V1 was defined using data from independent localizer runs. In each localizer run, flashing checkerboards of the same size as the objects in the task runs were presented at the fovea or in the periphery. The ROI of foveal V1 was identified as the voxels responsive to the foveal checkerboards. In other words, The ROI of foveal V1 included the voxels whose population receptive fields covered the entire object in the foveal visual field.

      We included a figure in the revised manuscript comparing the activation maps induced by the foveal object stimulus in the task runs with the ROI coverage defined by the localizer runs. 

      (2.3) Line 143 and ROI section of the methods: in order for the reader to understand how robust the responses and analyses are, voxel counts should be provided for the ROIs that were defined, as well as for the number (fraction) of voxels excluded due to either high beta weights or low signal intensity (lines 505-511).

      In the revised manuscript, we have included the number of voxels in each ROI and the criteria for voxel selection:

      For each ROI, the number of voxels depended on the size of the activated region, as estimated from the localizer data. The numbers are as follows: foveal V1, 2185 ± 389; peripheral V1, 1294± 215; LOC, 3451 ± 863; and pIPS, 5154 ± 1517. To avoid the signals of large vessels, a portion of voxels was removed based on the distribution of large vessels: V1 foveal, 22.5% ± 6.6%; V1 peripheral, 6.8% ± 3.9%; LOC, 16.1% ± 8.1% ; and pIPS, 5.1% ± 3.2%. For the decoding analysis, the top 500 responsive voxels in each ROI were selected to balance the voxel numbers across different ROIs for training and testing the decoder.

      (2.4) I wasn't able to find mention of how multiple-comparisons corrections were performed for either the MEG or fMRI data (except for one Holm-Bonferonni correction in Figure S1), so it's unclear whether the reported p-values are corrected.

      For the fMRI results, there is strong evidence showing that feedback information is sent to the foveal V1 during a peripheral object task (Williams et al., 2008; Fan et al., 2016; Yu and Shim, 2016). In addition, anatomical and functional evidence shows that the superficial and deep layers of V1 receive feedback information during visual processing. Therefore, in the current study, we specifically examined two types of feedback information in the superficial and deep layers of foveal V1, and did not apply multiple-comparison correction to the decoding results.

      Regarding the MEG results, since we did not have a strong prior about when feedback information would arrive in the foveal V1, a cluster-based permutation method was used to correct for multiple comparisons in each time course. Specifically, for each time point, the sign of the effect for each participant was randomly flipped 50000 times to obtain the null hypothesis distribution for each time point. Clusters were defined as continuous significant time points in the real and flipped time series, and the effects in each cluster were summed to create a cluster-based effect. The most significant cluster-based effect in each flipped time series was then used to generate the corrected null hypothesis distribution.

      We included these clarifications in Significance testing part of the revised manuscript.

      Reviewer #1 (Recommendations for the authors):

      It would be helpful if the authors could elaborate more on the fMRI decoding results in higher-order visual areas in the Discussion (there are recent studies also investigating higher-order visual areas (Carricarte et al., 2024) and associative areas (Degutis et al., 2024)) and relate it to the MEG information transmission results between the areas overlapping with the regions recorded in the fMRI part of the study.

      We have discussed the fMRI decoding results in the LOC and IPS in the revised manuscript: 

      In the current study, fMRI signals from early visual cortex and two high-level brain regions (LOC and pIPS) were recorded. Neural dynamics of these regions were extracted from MEG signals. Decoding analyses based on fMRI and MEG signals consistently showed that object category information could be decoded from both regions. These findings raise an important question:  Further Granger causality analysis indicates that the feedback information in foveal V1 was mainly driven by signals from the LOC. Layer-specific analysis showed that category information could be decoded in the middle and superficial layers of the LOC. A reasonable interpretation of this result is that feedforward information from the early visual cortex was received by the LOC’s middle layer, then the category information was generated and fed back to foveal V1 through the LOC’s superficial layer. A recent study (Carricarte et al., 2024) found that, in object selective regions in temporal cortex, the deep layer showed the strongest fMRI responses during an imagery task. Together, the results suggest that the deep and superficial layers correspond to different feedback mechanisms. It is worth noting that other cortical regions may also generate feedback signals to the early visual cortex. The current study did not have simultaneously recorded fMRI signals from the prefrontal cortex, but it has been shown that feedback signals can be traced back to the prefrontal cortex during complex cognitive tasks, such as working memory (Finn et al., 2019; Degutis et al., 2024). Further fMRI studies with submillimeter resolution and whole-brain coverage are needed to test other potential feedback pathways during object processing.

      The behavioral performance seems quite low (67%), could authors explain the reasons for it?

      We designed the object stimuli to be difficult to distinguish on purpose. Some of our pilot data showed that the more involved the participants were in the peripheral object task, the easier the foveal feedback information was to decoded. It is reasonable to assume that if the peripheral objects were easily distinguishable, the feedback mechanism may not be fully recruited during object processing. Furthermore, since we were decoding category and orientation information rather than identity information, the difficulty of distinguishing two objects from the same category and with the same orientation would not affect the decoding of category and orientation information in the neural signals.

      Reviewer #2 (Recommendations for the authors):

      (1) Line 52: the meaning of the sentence starting with "However, ..." is not entirely clear. Maybe the word "while" is missing after the first comma?

      (2) Line 224. If I'm understanding the rationale for the MEG analysis correctly, it was not possible to localize foveal regions, but the cross-location decoding analysis was used to approximate the strength and timing of feedback information. If this is the case, "neural representations in the foveal region" were not extracted.

      (3) Figure 4. The key information is too small to see. The lines indicating where decoding performance was significant are quite thin but very important, and the text next to them indicating onset times of significant decoding is in such a small font size I needed to zoom in to 300% to read it (yes, my eyes are getting old and tired). Increasing the font size used to represent key information would be nice.

      (4) Figure 4 caption. Line 270 describes the line color in the plots as yellow, but that color is decidedly orange to my eye.

      (5) Line 340/341: Papers that define and describe feedback-receptive fields seem important to cite here:

      Keller, A. J., Roth, M. M., & Scanziani, M. (2020). Feedback generates a second receptive field in neurons of the visual cortex. Nature, 582(7813), 545-549.

      Kirchberger, L., Mukherjee, S., Self, M. W., & Roelfsema, P. R. (2023). Contextual drive of neuronal responses in mouse V1 in the absence of feedforward input. Science advances, 9(3), eadd2498.

      (6) Lines 346-350: this sentence seems to have some missing or misused words, because the syntax isn't intact.

      (7) Line 367: supports should be support.

      We thank the reviewers for the comments and have corrected them in the manuscript.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Wang, Junxiu et al. investigated the underlying molecular mechanisms of the insecticidal activity of betulin against the peach aphid, Myzus persicae. There are two important findings described in this manuscript: (a) betulin inhibits the gene expression of GABA receptor in the aphid, and (b) betulin binds to the GABA receptor protein, acting as an inhibitor. The first finding is supported by RNA-Seq and RNAi, and the second one is convinced with MST and electrophysiological assays. Further investigations on the betulin binding site on the receptor protein provided a fundamental discovery that T228 is the key amino acid residue for its affinity, thereby acting as an inhibitor, backed up by site-directed mutagenesis of the heterologously-expressed receptor in E. coli and by CRISPR-genome editing in Drosophila.

      Although the manuscript does have strengths in principle, the weaknesses do exist: the manuscript would benefit from more comprehensive analyses to fully support its key claims in the manuscript. In particular:

      (1) The Western blotting results in Figure 5A & B appear to support the claim that betulin inhibits GABR gene expression (L26), as a decrease in target protein levels is often indicative of suppressed gene expression. The result description for Figure 5A & B is found in L312-L316, within Section 3.6 ("Responses of MpGABR to betulin"), where MST and voltage-clamp assays are also presented. It seems the observed decrease in MpGABR protein content is due to gene downregulation, rather than a direct receptor protein-betulin interaction. However, this interpretation lacks discussion or analysis in either the corresponding results section or the Discussion. In contrast, Figures 5C-F are specifically designed to illustrate protein-betulin interactions. Presenting Figure 5A & B alongside these panels might lead to confusion, as they support distinct claims (gene expression vs. protein binding/inhibition). Therefore, I recommend moving Figure 5A & B either to the end of Figure 3 or to a separate figure altogether to improve clarity and logical flow. A minor point in the Western blotting experiment is that although GAPDH was used as a reference protein, there is no explanation in the corresponding M&M section.

      We thank the reviewer for the concise and accurate summary and appreciate the constructive feedback on the article’s strengths and weaknesses.

      (A) According to your suggestion, the original Figure 5A and B have been inserted into Figure 3, following Figure 3D. The original Figure 3E-I has been saved as a new figure, to illustrate the RNAi assay.

      (b) “GAPDH was used as a reference protein” has been supplied in the M&M section, see

      Line 209.

      (2) The description of the electrophysiological recording experiment is unclear regarding the use of GABA. I didn't realize that GABA, the true ligand of the GABA receptor, was used in this inhibition experiment until I reached the Results section (L321), which states, "In the presence of only GABA, a fast inward current was generated." Crucially, no details are provided on the experiment itself, including how GABA was applied (e.g., concentration, duration, whether GABA was treated, followed by betulin, or vice versa). This information is essential for reproducibility. Please ensure these details are thoroughly described in the corresponding M&M section.

      We thank the reviewer for the valuable comments.

      (a) Detailed information on how to apply GABA has been added to the corresponding M&M section (Lines 260-263): After 3 days of incubation, the oocytes were used for electrophysiological recording. GABA was dissolved in 1 × Ringer's solution to prepare 100 µM GABA solution. Subsequently, the 100 µM GABA solutions containing different concentrations of betulin (0, 5, 10, 20, 40, 80, 160, 320 µM) were used to perfuse the oocytes.

      (b) Additionally, we also checked other contents of M&M section to ensure that sufficient detail has been supplied.

      (3) The phylogenetic analysis, particularly concerning Figures 4 and 6B, needs significant attention for clarity and representativeness. First, your claim that MpGABR is only closely related to CAI6365831.1 (L305-L310) is inconsistent with the provided phylogenetic tree, which shows MpGABR as equally close to Metopolophium dirhodum (XP_060864885.1) and Acyrthosiphon pisum (XP_008183008.2). Therefore, singling out only Macrosiphum euphorbiae (CAI6365831.1) is not supported by the data. Second, the representation of various insect orders is insufficient. All 11 sequences in the Hemiptera category (in both Figure 4 and Figure 6B) are exclusively from the Aphididae family. This small subset cannot represent the highly diverse Order Hemiptera. Consequently, statements like "only THR228 was conserved in Hemiptera" (L338), "The results of the sequence alignment revealed that only THR228 was conserved in Hemiptera" (L430), or "THR228... is highly conserved in Hemiptera" (L486) are not adequately supported. Third, similar concerns apply to the Diptera order, which includes 10 Drosophila and 2 mosquito samples (not diverse or representative enough), and likely to other orders as well. Thereby, the Figure 6B alignment should be revised accordingly to reflect a more accurate representation or to clarify the scope of the analysis. Fourth, there's a discrepancy in the phylogenetic method used: the M&M section (L156) states that MEGA7, ClustalW, and the neighbor-joining method were used, while the Figure 4 caption mentions that MEGA X, MUSCLE, and the Maximum likelihood method were employed. This inconsistency needs to be clarified and made consistent throughout the manuscript. Fifth, I have significant concerns about the phylogenetic tree itself (Figure 4). A small glitch was observed at the Danaus plexippus node, which raises suspicion regarding potential manipulation after tree construction. More critically, the tree, especially within Coleoptera, does not appear to be clearly resolved. I am highly concerned about whether all included sequences are true GABR orthologs or if the dataset includes partial or related sequences that could distort the phylogeny. Finally, for Figure 6B, both protein (XP_) and nucleotide (XM_) sequences were mix used. I recommend using the protein sequences instead of nucleotide sequences in this figure panel, as protein sequences are more directly informative.

      We thank the reviewer for the careful reading and valuable comments.

      (a) Firstly, according to your comments, phylogenetic analysis has been re-performed with more represent species from each Order (Fig. 5 and Fig. 7B). The results revealed that only THR228 was conserved across 11 species in the Aphididae family of Hemiptera. Therefore, the expressions like "only THR228 was conserved in Hemiptera" have been revised to “among the four residues, only THR228 was conserved across 11 species in the Aphididae family of Hemiptera” (Line 106, Line 369, Line 477, and Lines 563-564).

      (b) We have modified the description of Fig. 5 (the original Fig. 4): MpGABR  (XP_022173711.1) was found to be genetically closely related to CAI6365831.1 from Macrosiphum euphorbiae, XP 008183008.2 from Acyrthosiphon pisum, and XP 060864885.1 from Metopolophium dirhodum (Fig. 5 and Table S6). See Lines 342-346.

      (c) Phylogenetic analysis was performed using MEGA7 with multiple amino acid sequence alignment (ClustalW) and the neighbor-joining method. We have revised the Fig. 5 (the original Fig. 4) caption to make it accurate and consistent throughout the manuscript.

      (d) We are sorry about the small glitch at the Danaus plexippus node. Actually, after the phylogenetic tree was constructed, it was imported in Adobe Illustration for coloring and classification annotation. There may have been operational errors during the process of resizing the image, resulting in the occurrence of the small glitch. Besides, the unclear clustering of Coleoptera may be due to improper regulation of distance (pixels) of branch from nodes. Again, thanks for your careful reading. We have rebuilt the phylogenetic tree.

      (e) Based on your suggestion, the sequence IDs have been unified as the protein sequence IDs (Fig. 5, Fig. 7B and Table S6)

      (4) The Discussion section requires significant revision to provide a more insightful and interpretative analysis of the results. Currently, much of the section primarily restates findings rather than offering deeper discussion. For instance, L409-L419 restate the results, followed by the short sentence "Collectively, these results suggest that betulin may have insecticidal effects on aphids by inhibiting MpGABR expression". It could be further expanded to make it beneficial to elaborate on proposed mechanisms by which gene expression might be suppressed, including any potential transcription factors involved. In contrast, while L422-L442 also initially summarize results, the subsequent paragraph (L445-L472) effectively discusses the potential mechanisms of inhibitory action and how mortality is triggered, which is a good model for other parts of the section. However, all the discussion ends up with a short statement, "implying that betulin acts as a CA of MpGABR" (L472), which appears to be a leap. The inference that betulin acts as a competitive antagonist (CA) is solely based on the location of its extracellular binding site, which does not exactly overlap with the GABA binding site. It needs stronger justification or actually requires further experimental validation. The authors should consider rephrasing this statement to acknowledge the need for additional studies to definitively confirm this mechanism of action.

      We appreciate the reviewer's careful reading and valuable feedback, which will certainly enhance the quality of our manuscript.

      (a) Possible reasons for the effect of betulin on MpGABR expression have been discussed in our manuscript (Lines 455-466): The regulation of gene expression is sophisticated and delicate (Pope and Medzhitov 2018). The regulatory network controlling GABR expression remains unclear. In adult rats, epileptic seizures has been reported to increase the levels of brain-derived neurotrophic factor (BDNF), which in turn prompted the transcription factors CREB and ICER to reduce the gene expression of the GABR α1 subunit (Lund et al. 2008). In Drosophila, it has been demonstrated that WIDE AWAKE, which regulated the onset of sleep, interacted with the GABR and upregulated its expression level (Liu et al. 2014). In Drosophila brain, circular RNA circ_sxc was found to inhibit the expression of miR-87-3p in the brain through sponge adsorption, thereby regulating the expression of neurotransmitter receptor ligand proteins, including GABR, and ensuring the normal function of synaptic signal transmission in brain neurons (Li et al. 2024). However, it remains unclear how betulin reduces the expression of MpGABR, and further research is needed.

      (b) In the Discussion section, we acknowledged the need for further research to ultimately confirm the mechanism by which betulin competes with GABA for binding to MpGABR (Lines 532-535): Although the mechanism by which betulin competes with GABA for binding to MpGABR requires further experimental validation, our work may have provided a novel target for developing insecticides.

      (c) Besides, we have added the discussion of the sensitivity of GABA receptor to betulin in Discussion section (Lines 491-501): Studies on key amino acids that are crucial for GABR function has primarily focused on transmembrane regions. For instance, based on the mutational research and Drosophila GABR modeling approach, multiple key amino acids were identified as insecticide targets in the transmembrane domain (Nakao and Banba 2021). Guo et al. proposed that amino acid substitutions in the transmembrane domain 2 contribute to terpenoid insensitivity during plant-insect coevolution (Guo et al. 2023). However, these studies have neglected the extracellular domain. Our study signified that betulin targets the THR228 site in the extracellular domain of MpGABR, which is conserved only in the Aphididae family. Therefore, betulin is speculated to be a specific insecticidal substance evolved by plants in response to aphid infestation. Besides, further verification is needed to determine whether betulin is toxic to other insect species.

      (d) Furthermore, the discussion of potential ecological risks of deploying betulin as a bioinsecticide has been elaborated in our manuscript (Lines 538-553): The development of bioinsecticides should not only focus on the toxic effects of active substance on target organisms, but also on their influence on the ecosystem (Haddi et al. 2020). Although our results indicate that betulin has specific toxicity to aphids, previous studies have reported that betulin and its derivatives had effects on Plutella xylostella L. (Huang et al. 2025), Aedes aegypti (de Almeida Teles et al. 2024), and Drosophila melanogaster (Lee and Min 2024). Therefore, further research is needed to determine whether there are other insecticidal mechanisms or off target effects of betulin. Additionally, betulin exhibits a wide range of pharmacological activities (Amiri et al. 2020), which have been used to treat various diseases, such as cancer (Lv 2023), glioblastoma (Li et al. 2022), inflammation (Szlasa et al. 2023) and hyperlipidemia (Tang et al. 2011). Before applying betulin in the field, it is necessary to fully verify and consider whether betulin has any impact on farmers' health. Furthermore, will betulin cause residue or diffusion in the process of field application? Will long-term application promote the evolution of resistance to aphids or other insects? These issues also need further experimental verification. In summary, before any field application, further research is needed on the environmental behavior, degradation process, and safety of betulin.

      Reviewer #2 (Public review):

      Summary:

      This important study shows that betulin from wild peach trees disrupts neural signaling in aphids by targeting a conserved site in the insect GABA receptor. The authors present a nicely integrated set of molecular, physiological, and genetic experiments to establish the compound's species-specific mode of action. While the mechanistic evidence is solid, the manuscript would benefit from a broader discussion of evolutionary conservation and

      potential off-target ecological effects.

      Strengths:

      The main strengths of the study lie in its mechanistic clarity and experimental rigor. The identification of a betulin-binding single threonine residue was supported by (1) site-directed mutagenesis and (2) functional assays. These experiments strongly support the specificity of action. Furthermore, the use of comparative analyses between aphids and fruit flies demonstrates an important effort to explore species specificity, and the integration of quantitative data further enhances the robustness of the conclusions.

      Weaknesses:

      There are several important limitations that need to be addressed. The manuscript does not explore whether the observed sensitivity to betulin reflects a broadly conserved feature of GABA receptors across animal lineages or a more lineage-specific adaptation. This evolutionary context is crucial for understanding the broader significance of the findings.

      In addition, while the compound's aphicidal effect is well established, the potential for off-target effects in non-target organisms - especially vertebrates - remains unaddressed, despite prior evidence that betulin interacts with mammalian GABAa receptors. There is little discussion on the ecological or environmental safety of exogenous betulin application, such as persistence, degradation, or exposure risks.

      We sincerely thank the reviewer for the time and effort dedicated to our manuscript's detailed review and assessment. The revision suggestions were constructive, and we have provided a point-by-point response to address them.

      (a) Briefly introduce the evolutionary conservation of GABA receptors has been added in the Introduction (Lines 90-98): Previous study has proposed that vertebrate and human GABR genes maintain a broad and conservative gene clustering pattern, while in invertebrates, this pattern is missing, indicating that these gene clusters formed early in vertebrate evolution and were established after diverging from invertebrates. Notably, invertebrates each possess a unique GABR gene pair, which are homologous with human GABR α and β subunits, suggesting that the existing GABR gene cluster evolved from an ancestral α - β subunit gene pair (Tsang et al. 2006). During the coevolution of plants and insects, the duplications and amino acid substitutions in GABR may be beneficial for the adaptation to insecticides and terpenoid compounds (Guo et al. 2023).

      (b) We have added the discussion of the sensitivity of GABA receptor to betulin in Discussion section (Lines 491-501): Studies on key amino acids that are crucial for GABR function has primarily focused on transmembrane regions. For instance, based on the mutational research and Drosophila GABR modeling approach, multiple key amino acids were identified as insecticide targets in the transmembrane domain (Nakao and Banba 2021). Guo et al. proposed that amino acid substitutions in the transmembrane domain 2 contribute to terpenoid insensitivity during plant-insect coevolution (Guo et al. 2023). However, these studies have neglected the extracellular domain. Our study signified that betulin targets the THR228 site in the extracellular domain of MpGABR, which is conserved only in the Aphididae family. Therefore, betulin is speculated to be a specific insecticidal substance evolved by plants in response to aphid infestation. Besides, further verification is needed to determine whether betulin is toxic to other insect species.

      (c) The discussion of potential ecological risks of deploying betulin as a bioinsecticide has been elaborated in our manuscript (Lines 538-553): The development of bioinsecticides should not only focus on the toxic effects of active substance on target organisms, but also on their influence on the ecosystem (Haddi et al. 2020). Although our results indicate that betulin has specific toxicity to aphids, previous studies have reported that betulin and its derivatives had effects on Plutella xylostella L. (Huang et al. 2025), Aedes aegypti (de Almeida Teles et al. 2024), and Drosophila melanogaster (Lee and Min 2024). Therefore, further research is needed to determine whether there are other insecticidal mechanisms or off target effects of betulin. Additionally, betulin exhibits a wide range of pharmacological activities (Amiri et al. 2020), which have been used to treat various diseases, such as cancer (Lv 2023), glioblastoma (Li et al. 2022), inflammation (Szlasa et al. 2023) and hyperlipidemia (Tang et al. 2011). Before applying betulin in the field, it is necessary to fully verify and consider whether betulin has any impact on farmers' health. Furthermore, will betulin cause residue or diffusion in the process of field application? Will long-term application promote the evolution of resistance to aphids or other insects? These issues also need further experimental verification. In summary, before any field application, further research is needed on the environmental behavior, degradation process, and safety of betulin.

      Reviewer #1 (Recommendations for the authors):

      (1) L28 Provide the full name of MST.

      Thanks for your suggestion. The full name of MST, microscale thermophoresis, has been supplied.

      (2) L87 in the Order Hemiptera.

      Thanks for your suggestion. Corrected.

      (3) L99 "Leaf bioassay" would be better to differentiate the greenhouse and field bioassays.

      Thanks for your suggestion. Corrected.

      (4) L104 It should be 7 doses, including the "0 mg/mL" control.

      Thanks for your suggestion. Corrected.

      (5) L104 Since the LC50 of pymetrozine is 1.0612 mg/mL, a wider range of doses should have been tested compared to the dose range of betulin.

      Thanks for your comment.

      (a) Firstly, seven doses (0, 0.0625, 0.125, 0.25, 0.5, 1, and 2 mgmL<sup>-1</sup>) were set to calculate the LC50 of betulin and pymetrozine. Since the LC50 values of betulin and pymetrozine are 0.1641 and 1.0612 mgmL<sup>–1</sup>, respectively, which are within the set range, indicating that the set dose range is reasonable and the LC50 values of betulin and pymetrozine are reliable.

      (b) To compare the control effects of betulin and pymetrozine against M. persicae, LC50 of betulin (0.1641 mgmL<sup>-1</sup>) and pymetrozine (1.0612 mgmL<sup>-1</sup>) were used to treat M. persicae.

      (6) L109 Greenhouse and field bioassays.

      Thanks for your suggestion. Corrected.

      (7) L112 Tween-80 and acetone in L103. Keep the order consistent throughout the manuscript.

      Thanks for your suggestion. Corrected.

      (8) L122 Mortality was recorded at 1, 5, 9, and 14 days after treatment. Revise the other similar mistakes throughout the manuscript (e.g. L250, L254, L255, L256, L259, etc.).

      Thanks for your suggestion. Corrected.

      (9) L126 apterous instead of wingless (keep a consistent expression).

      Thanks for your suggestion. Corrected.

      (10) L138 Primer Premier?

      Thanks for your comment. Corrected.

      (11) L141 Add RPS18 primers in Table S2.

      Thanks for your comment. Corrected.

      (12) L155 MEGA7 vs. MEGAX (as described in the Figure 4 caption).

      Thanks for your comment. Corrected.

      (13) L156 NJ method vs. ML method (as described in the Figure 4 caption).

      Thanks for your comment. Corrected.

      (14) L157 2.7. RNAi assay (Remove "In vitro" and re-number the following M&M sections accordingly).

      Thanks for your comment. Corrected.

      (15) L163 Add dsGFP primers in Table S2.

      Thanks for your comment. Corrected.

      (16) L166 apterous instead of wingless (keep a consistent expression).

      Thanks for your comment. Corrected.

      (17) L172 Add the source of pET-B2M vector.

      pET-B2M vector was obtained from BGI (Shenzhen, China), which has been added in our manuscript (Line 194).

      (18) L195 coding sequence instead of cDNA.

      Thanks for your comment. Corrected.

      (19) L198 the mutations of R224A ...

      Thanks for your comment. Corrected.

      (20) L199 TYR), or T228R ...

      Thanks for your comment. Corrected.

      (21) L211 and 90 ng.

      Thanks for your comment. Corrected.

      (22) L213 genomic DNA instead of gDNA, because gDNA may be confused in the context of sgRNA.

      Thanks for your suggestion. Corrected.

      (23) L253 (Fig. 1A-B).

      Thanks for your comment. Corrected.

      (24) L268 Explain why these 15 DEGs were selected for qRT-PCR.

      Thanks for your comment. These 15 DEGs were randomly selected and act as representative DEGs with different expression levels. The reason for selection of these 15 DEGs were added in the manuscript (Lines 295-296).

      (25) L287 What about GABRB? It has a TM domain.

      GABRB refers to “gamma-aminobutyric acid receptor subunit beta-like” annotated on NCBI. Theoretically, it should contain four transmembrane structural domains, while it has only one, indicating that it is incomplete.

      (26) L297 Add dsGFP as another control group.

      Thanks for your comment. Corrected.

      (27) L299 increased by 30.44% (Remove a comma).

      Thanks for your comment. Corrected.

      (28) L308 XM_022318019.1 (or protein accession number with XP_).

      Thanks for your comment. Corrected.

      (29) L338 that THR228 was conserved only in Hemiptera.

      Thanks for your comment. Since our original intention was to emphasize that THR228 is the only conserved among the four key amino acid residues, after careful consideration, we retained the expression "only THR228".

      (30) L342 or T228R.

      Thanks for your comment. Corrected.

      (31) L382 Is pyrhidone a general name for pymetrozine?

      Thanks for your comment. Corrected.

      (32) L450 Remove "and so on".

      Thanks for your comment. Corrected.

      (33) Figure 1D: Remove "Environment friendly". Replace the plant pot image on the right side with the one sprayed with pymetrozine, like the one in Figure 1F.

      Thanks for your comment. 

      (a) "Environment friendly" in Figure 1D has been removed.

      (b) We have attempted to modify the Figure 1D according to your suggestion. However, the modified Figure 1D is similar to Figure 1F and appears monotonous. Therefore, we have retained the original framework of Figure 1D.

      (34) Figure 2E 111036117 and 111041856 are in different IDs (XM_). I suggest keeping GeneID in Figure 2E and Table S2, as shown in Table S4.

      Thanks for your comment. Corrected.

      (35) Figure 2H: Add unit of the heatmap values. Or just add the title (e.g., expression level) on top of the bar.

      Thanks for your comment. Corrected.

      (36) Figure 3A: Add "aa" next to 700.

      Thanks for your comment. Corrected.

      (37) Figure 3E-G: Revise the tick marks on Y-axis: 0.0, 0.5, 1.0, and 1.5.

      Thanks for your comment. Corrected.

      (38) Figure 5C: Remove "1" and move "WT" up to the position where "1" was.

      Thanks for your comment. Corrected.

      (39) Figure 5D: Revise the tick marks on the Y-axis: 0.0, 0.5, 1.0, and 1.5.

      Thanks for your comment. Corrected.

      (40) Figure 5E: Remove the decimal. (e.g. 5 uM, 10 uM, 20 uM, etc.).

      Thanks for your comment. Corrected.

      (41) Figure 6B: What are the numbers next to the amino acid sequences? Provide the information in the figure caption.

      Thanks for your comment. The numbers next to the amino acid indicates the site of the last residue of the key amino acids, which was supplied in the figure caption.  

      (42) Figure 6D: Revise the tick marks on the Y-axis: 0.0, 0.5, 1.0, and 1.5. The X-axis title should be betulin (see Figure 5D). In the figure caption at the 5th row from the top, R244A should be R224A.

      Thanks for your comment. Corrected.

      (43) Figure 7E: R122T (not R1272T).

      Thanks for your comment. Corrected.

      (44) Supplementary Figure 1: It should be Figure S1. Add dsGFP in the figure caption.

      Thanks for your comment. Corrected.

      (45) Figure S2: What are the two pink bars and the other bars in brown or blue? Add an appropriate explanation in the figure caption.

      Thanks for your comment. Corrected.

      (46) Table S1: r square?

      Thanks for your comment. It is “r square” and corrected.

      (47) Table S2: (a) Add horizontal lines to separate qPCR, RNAi, cloning, and heterologous expression from each other (b) Replace XM_022318017.1 and XM_022318019.1 with their corresponding GeneIDs, as shown in Table S4. (c) AK340444.1 is a sequence from another aphid (Acyrthosiphon pisum)-Revise it. (d) In the cloning primers, place MpGABR first, followed by MpGABRAP and MpGABRB, as shown in the manuscript and Table S5. (e) Also, in the cloning primers, MpGABRB and MpGABRAP use reverse primers without stop codon, while MpGABR uses stop codon (TCA = TGA in reverse)-Revise it accordingly. Otherwise, provide the reason.

      Thanks for your comment. Corrected.

      (48) Table S3: (a) Add "Drosophila melanogaster" and the target sequence ID in the table caption. Is it KF881792.1, as shown in Table S6? (b) Align the sequences to the left side. 

      Thanks for your comment. 

      (a) The GenBank number of target sequence is KF881792.1 (Drosophila melanogaster). We have added this information in the Table S3 note.

      (b) It has been adjusted according to your suggestion.

      (49) Table S5: (a) Replace the accession numbers with GeneID, as shown in Table S4. K340444.1 is a sequence from another aphid (Acyrthosiphon pisum), (b) Coding sequences with stop codon are 2082, 357, and 753, respectively, while the sequences without stop codon are 2079, 354, and 750, respectively. The lengths of the deduced amino acids are 693, 118, and 250. Revise accordingly.

      Thanks for your comment. Corrected.

      (50) Table S6: (a) Use GenBank No for protein sequences. There is no Gene ID in this table. (b) Order (instead of Class). (c) See my comment on the phylogenetic analysis above.

      Thanks for your comment. Corrected.

      (51) Table S7 (a) Add unit under "Binding Energy". (b) There are two ALA226 [Alkyl] with two different distances. (c) PHE227 at the bottom should be THR228?

      Thanks for your comment.

      (a) The unit of "Binding Energy" was kcalmol<sup>–1</sup>, and it was added in the table caption.

      (b) Refer to Figure 6A, there were two Alkyl interaction between ALA226 and betulin. Therefore, there were two ALA226 [Alkyl] with two different distances.

      (c) Similarly, there were two Pi-Alkyl interactions between PHE227 and betulin. Thus, there were two rows of PHE227 in the table.

      (52) Table S9 (a) R117T should be R122T. (b) r square?

      Thanks for your comment. a and b Corrected.

      Reviewer #2 (Recommendations for the authors):

      (1) Introduction

      (a) It lacks a deeper biological and evolutionary framing of the GABA receptor system. As GABA receptors are highly conserved across animal taxa, the observed interaction between betulin and the aphid GABA receptor could have broader implications. This possibility is not addressed in the current version, which limits the reader's appreciation of the relevance of this mode of action.

      (b) Previous reports of betulin activity in mammalian systems are not mentioned in the introduction, even though they are directly relevant to concerns about off-target toxicity. Therefore, the introduction should be revised to (i) briefly introduce the evolutionary conservation of GABA receptors, and (ii) acknowledge that betulin may affect a broader range of organisms, which sets up the need for caution in its application.

      Thanks for your important suggestions.

      (a) Briefly introduce the evolutionary conservation of GABA receptors has been added in the Introduction (Lines 90-98): Previous study has proposed that vertebrate and human GABR genes maintain a broad and conservative gene clustering pattern, while in invertebrates, this pattern is missing, indicating that these gene clusters formed early in vertebrate evolution and were established after diverging from invertebrates. Notably, invertebrates each possess a unique GABR gene pair, which are homologous with human GABR α and β subunits, suggesting that the existing GABR gene cluster evolved from an ancestral α - β subunit gene pair (Tsang et al. 2006). During the coevolution of plants and insects, the duplications and amino acid substitutions in GABR may be beneficial for the adaptation to insecticides and terpenoid compounds (Guo et al. 2023).

      (b) The possible effects of betulin on a broader range of organisms have been acknowledged in the Introduction section (Lines 68-77): An immune stimulant, Ir-Bet, was prepared using iridium complex and betulin, which evoked ferritinophagy-enhanced ferroptosis, thereby activating anti-tumor immunity (Lv 2023). The anti-inflammatory effect of betulin has been reported in macrophages at lymphoma site in mice (Szlasa et al. 2023). Betulin has been found to improve hyperlipidemia and insulin resistance and decrease atherosclerotic plaques by inhibiting the maturation of sterol regulatory element-binding protein (Tang et al. 2011). Besides, betulin and its derivatives have been found to exhibit insecticidal activity against Plutella xylostella L. (Huang et al. 2025), Aedes aegypti (de Almeida Teles et al. 2024), and Drosophila melanogaster (Lee and Min 2024).

      (c) At the end of the introduction, we remind that betulin should be used with caution (Lines 111-112): However, given that betulin may affect a wider range of organisms, it should be used with caution.

      (2) Method

      Number of biological replicates in all assays and justification of thresholds used for significance in RNAi and survival experiments are not addressed in the manuscript.

      Thanks for your careful reading. We have checked Materials and Methods section and added corresponding number of biological replicates in all assays. Besides, the p-values for the corresponding significance analyses of RNAi and survival experiments have been added to our Manuscript.

      (2)  Discussion

      (a) Consistent with the comments on the Introduction, the absence of discussion on (i) the evolutionary conservation of GABA receptor sensitivity to betulin, (ii) potential off-target effects in non-target insects and vertebrates (if so, this cannot be use for "eco-friendly pesticide" as the authors stated in the manuscript), and (iii) ecological risks associated with the exogenous application of betulin limits both the interpretive depth and applied relevance of the study.

      (b) To strengthen the Discussion, the authors should consider addressing: (i) whether the observed sensitivity reflects a conserved pharmacological vulnerability across animal taxa or a lineage-specific adaptation; (ii) the potential ecological risks of deploying betulin as a bioinsecticide, and (iii) the need for future research into the environmental fate, degradation, and safety profile of betulin prior to any field-level application.

      Thank you for your valuable comments.

      (a) We have added the discussion of the sensitivity of GABA receptor to betulin in Discussion section (Lines 491-501): Studies on key amino acids that are crucial for GABR function has primarily focused on transmembrane regions. For instance, based on the mutational research and Drosophila GABR modeling approach, multiple key amino acids were identified as insecticide targets in the transmembrane domain (Nakao and Banba 2021). Guo et al. proposed that amino acid substitutions in the transmembrane domain 2 contribute to terpenoid insensitivity during plant-insect coevolution (Guo et al. 2023). However, these studies have neglected the extracellular domain. Our study signified that betulin targets the THR228 site in the extracellular domain of MpGABR, which is conserved only in the Aphididae family. Therefore, betulin is speculated to be a specific insecticidal substance evolved by plants in response to aphid infestation. Besides, further verification is needed to determine whether betulin is toxic to other insect species.

      (b) The discussion of potential ecological risks of deploying betulin as a bioinsecticide has been elaborated in our manuscript (Lines 538-551): The development of bioinsecticides should not only focus on the toxic effects of active substance on target organisms, but also on their influence on the ecosystem (Haddi et al. 2020). Although our results indicate that betulin had specific toxicity to aphids, previous studies have reported that betulin and its derivatives had effects on Plutella xylostella L. (Huang et al. 2025), Aedes aegypti (de Almeida Teles et al. 2024), and Drosophila melanogaster (Lee and Min 2024). Therefore, further research is needed to determine whether there are other insecticidal mechanisms or off target effects of betulin. Additionally, betulin exhibits a wide range of pharmacological activities (Amiri et al. 2020), which have been used to treat various diseases, such as cancer (Lv 2023), glioblastoma (Li et al. 2022), inflammation (Szlasa et al. 2023) and hyperlipidemia (Tang et al. 2011). Before applying betulin in the field, it is necessary to fully verify and consider whether betulin has any impact on farmers' health. Furthermore, will betulin cause residue or diffusion in the process of field application? Will long-term application promote the evolution of resistance to aphids or other insects? These issues also need further experimental verification. 

      (c) Additionally, at the end of the Discussion, we remind that more research is needed before any field application of betulin (Lines 551-553): In summary, before any field application, further research on the environmental behavior, degradation process, and safety of betulin is needed.

      Reference

      Amiri S, Dastghaib S, Ahmadi M, Mehrbod P, Khadem F, Behrouj H, Aghanoori M, Machaj F, Ghamsari M, Rosik J, Hudecki A, Afkhami A, Hashemi M, Los M, Mokarram P, Madrakian T, Ghavami S. 2020. Betulin and its derivatives as novel compounds with different pharmacological effects. Biotechnology Advances 38: 107409.

      de Almeida Teles AC, dos Santos BO, Santana EC, Durço AO, Conceição LSR, Roman Campos D, de Holanda Cavalcanti SC, de Souza Araujo AA, dos Santos MRV. 2024.

      Larvicidal activity of terpenes and their derivatives against Aedes aegypti: a systematic review and meta-analysis. Environmental Science and Pollution Research 31: 64703-64718.

      Guo L, Qiao X, Haji D, Zhou T, Liu Z, Whiteman NK, Huang J. 2023. Convergent resistance to GABA receptor neurotoxins through plant–insect coevolution. Nature Ecology & Evolution 7: 1444-1456.

      Haddi K, Turchen LM, Viteri Jumbo LO, Guedes RN, Pereira EJ, Aguiar RW, Oliveira EE. 2020. Rethinking biorational insecticides for pest management: unintended effects and consequences. Pest Management Science 76: 2286-2293.

      Huang X, Hao N, Shu L, Wei Z, Shi J, Tian Y, Chen G, Yang X, Che Z. 2025. Preparation and insecticidal activities of betulin-cinnamic acid-related hybrid compounds and insights into the stress response of Plutella xylostella L. Pest Management Science 81: 4243-4255.

      Lee HY, Min KJ. 2024. Betulinic acid increases the lifespan of Drosophila melanogaster via Sir2 and FoxO activation. Nutrients 16: 441.

      Li Q, Wang L, Tang C, Wang X, Yu Z, Ping X, Ding M, Zheng L. 2024. Adipose tissue exosome circ_sxc mediates the modulatory of adiposomes on brain aging by inhibiting brain dme-miR-87-3p. Molecular Neurobiology 61: 224-238.

      Li Y, Wang Y, Gao L, Tan Y, Cai J, Ye Z, Chen A, Xu Y, Zhao L, Tong S, Sun Q, Liu B, Zhang S, Tian D, Deng G, Zhou J, Chen Q. 2022. Betulinic acid self-assembled nanoparticles for effective treatment of glioblastoma. Journal of Nanobiotechnology 20: 39.

      Liu S, Lamaze A, Liu Q, Tabuchi M, Yang Y, Fowler M, Bharadwaj R, Zhang J, Bedont J,

      Blackshaw S, Lloyd Thomas E, Montell C, Sehgal A, Koh K, Wu Mark N. 2014. WIDE AWAKE mediates the circadian timing of sleep onset. Neuron 82: 151-166.

      Lund IV, Hu Y, Raol YH, Benham RS, Faris R, Russek SJ, Brooks Kayal AR. 2008. BDNF selectively regulates GABAA receptor transcription by activation of the JAK/STAT pathway. Science Signaling 1: ra9.

      Lv M, Zheng Y, Wu J, Shen Z, Guo B, Hu G, Huang Y, Zhao J, Qian Y, Su Z, Wu C, Xue X, Liu H, Mao Z. 2023. Evoking ferroptosis by synergistic enhancement of a cyclopentadienyl iridium-betulin immune agonist. Angewandte Chemie International Edition 62: e202312897.

      Nakao T, Banba S. 2021. Important amino acids for function of the insect Rdl GABA receptor. Pest Management Science 77: 3753-3762.

      Pope SD, Medzhitov R. 2018. Emerging principles of gene expression programs and their regulation. Molecular Cell 71: 389-397.

      Szlasa W, Ślusarczyk S, Nawrot Hadzik I, Abel R, Zalesińska A, Szewczyk A, Sauer N, Preissner R, Saczko J, Drąg M, Poręba M, Daczewska M, Kulbacka J, Drąg Zalesińska M. 2023. Betulin and its derivatives reduce inflammation and COX-2 cctivity in macrophages. Inflammation 46: 573-583.

      Tang JJ, Li JG, Qi W, Qiu WW, Li PS, Li BL, Song BL. 2011. Inhibition of SREBP by a small molecule, betulin, improves hyperlipidemia and insulin resistance and reduces atherosclerotic plaques. Cell Metabolism 13: 44-56.

      Tsang SY, Ng SK, Xu Z, Xue H. 2006. The evolution of GABAA receptor–like genes. Molecular Biology and Evolution 24: 599-610.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      The authors study the steady-state solutions of ODE models for molecular signaling involving ligand binding coupled to multi-site phosphorylation at saturating ligand concentrations. Although the results are in principle general, the work highlights the receptor tyrosine kinases (RTK) as model systems. After presenting previous ODE model solutions, the authors present their own "kinetic sorting" model, which is distinguished by ligand-induced phosphorylationdependent receptor degradation and the property that every phosphorylation state is signaling competent. The authors show that this model recovers the two types of non-monotonicity experimentally reported for RTKs: maximum activity for intermediate ligand affinity and maximum activity for intermediate kinase activity.

      The main contribution of the work is in demonstrating that their model can capture both types of non-monotonicity, whereas previous models could at most capture non-monotonicity of ligand binding.

      Strengths:

      The question of how energy-dissipating, and thus non-equilibrium, molecular systems can achieve steady-state solutions not accessible to equilibrium systems is of fundamental importance in biomolecular information processing and self-organization. Although the authors do not address the energy requirements of their non-equilibrium model, their comparative analysis of different alternative non-equilibrium models provides insight into the design choices necessary to achieve non-monotonic control, a property that is inaccessible at equilibrium.

      The paper is succinctly written and easy to follow, and the authors achieve their aims by providing convincing numerical solutions demonstrating non-monotonicity over the range of parameter values encompassing the biologically relevant regime.

      Weaknesses:

      (1) A key motivating framework for this work is the argument that the ability to tune to recognize intermediate ligand affinities provides a control knob for signal selection that is available to nonequilibrium systems. As such, this seems like a compelling type of ligand selectivity, which is a question of broad interest. However, as the authors note in the results, the previously published "limited signaling model" already achieves such non-monotonicity in ligand binding affinity. The introduction and abstract do not clearly delineate the new contributions of the model.

      We thank the reviewer for this comment. We apologize for any unclear language on our part. The purpose of our work was not to identify the unique reaction scheme to obtain nonmonotonic dependence of network activity on ligand affinity and kinase activity. Rather, we were interested in exploring how such a dependence could arise from the interplay between two ubiquitous network motifs (multisite phosphorylation and active receptor degradation). Notably, as the reviewer later points out, previous models that incorporate only multisite phosphorylation only capture the non-monotonic dependence of network activity on ligand affinity and not kinase/phosphatase activity. We have now clarified this in the abstract (lines 14-16) and the introduction (lines 55-59). 

      The novel benefit of the model introduced by the authors is that it also achieves a nonmonotonic response to kinase activity. Because such non-monotonicity is observed for RTK, this would make the authors' model a better fit for capturing RTK behavior. However, the broad significance of achieving non-monotonicity to kinase activity is not motivated or supported by empirical evidence in the paper. As such, the conceptual significance of the modified model presented by the authors is not clear.

      We thank the reviewer for this comment. We agree that the ability of our model to reproduce non-monotonic dependence on kinase/phosphatase activity was not sufficiently motivated in the original submission. We have now added a brief mention of the biological motivation for nonmonotonic kinase activity in the discussion (lines 229-247) to describe the potential biological significance of this behavior. In particular, non-monotonic kinase/phosphatase dependence may act as a safeguard, filtering out signaling cells with abnormally elevated kinase activity or suppressed phosphatase activity. In the presence of non-monotonic dependence on network activity, downstream signaling would remain contingent on extracellular cues, and cells with extreme kinase/phosphatase imbalances would fail to signal. This could prevent persistent, cueindependent activation, an especially important protective mechanism in pathways regulating metabolically taxing functions such as growth, proliferation, or mounting immune responses. Although direct experimental evidence for the widespread use of this mechanism is currently scarce, our motivation is supported both by the presence of similar regulatory behaviors of phosphatases which arise through distinct mechanisms (such as CD45 in T-cell receptor signaling, (Weiss, 2019)), but highlight the potential biological use of this strategy and by theoretical work on phosphorylation-dephosphorylation cycles, which demonstrates a similar effect in more general settings (Swain, 2013).

      (2) Whereas previous models used in the literature are schematized in Figure 1, the model proposed by the authors is missing (see line 97 of page 3). Without the schematic, the text description of the model is incomplete.

      We thank the reviewer for identifying this oversight, it has been corrected. See Figure 3 in the new text. 

      (3) The authors use the activity of the first phosphorylation site as the default measure of activity. This choice needs to be justified. Why not use the sum of the activities at all sites?

      We thank the reviewer for this comment. We in fact study all sites (Figure 5A in the resubmitted manuscript). Notably, as suggested by the reviewer, the concentration of the first site is indeed represented by the sum of concentrations of all phosphorylated species. The concentration of the 2<sup>nd</sup> site is represented by the sum of concentrations of all species except for the first one and so on (lines 153-155). 

      Reviewer #2 (Public review):

      Summary:

      In classical models of signaling networks, the signaling activity increases monotonically with the ligand affinity. However, certain receptors prefer ligands of intermediate affinity. In the paper, the authors present a new minimal model to derive generic conditions for ligand specificity. In brief, this requires multi-site phosphorylation and that high-anity complexes be more prone to degrade. This particular type of kinetic discrimination allows for overcoming equilibrium constraints.

      Strengths:

      The model is simple, and it adds only a few parameters to classical generic models. Moreover, the authors vary these additional parameters in ranges based on experimental observations. They explain how the introduction of these new parameters is essential to ligand specificity. Their model quantitatively reproduces the ligand specificity of a certain receptor. Finally, they provide a testable prediction.

      Weaknesses:

      The naming of certain variables may be confusing to readers.

      We apologize for the confusion due to unclear presentation. We have clarified our definitions throughout the manuscript. 

      Reviewer #1 (Recommendations for the authors):

      (1) The abstract and introduction present the problem as if this model is solving the fundamental problem of non-monotonic dependence on ligand affinity. However, as the authors noted in their results, this problem has already been solved by a previous phosphorylation model with N-state degradation. What the authors' new model achieves is the additional experimentally observed non-monotonicity of kinase activity dependence. The abstract and introduction should be changed to reflect the actual novel contributions and also to motivate the biological significance of non-montonic kinase activity dependence.

      We thank the reviewer for this comment. We apologize for any unclear language on our part. The purpose of our work was not to identify the unique reaction scheme to obtain nonmonotonic dependence of network activity on ligand affinity and kinase activity. Rather, we were interested in exploring how such a dependence could arise from two ubiquitous network motifs (multisite phosphorylation and active receptor degradation). Notably, as the reviewer later points out, previous models that incorporate only multisite phosphorylation only capture the nonmonotonic dependence of network activity on ligand affinity and not kinase/phosphatase activity. We have now clarified this in the abstract (lines 14-16) and the introduction (lines 55-59). We have also provided biological motivation behind nonmonotonic kinase activity dependance (lines 229-247). 

      (2) It is important to show (in the supplemental materials if needed) that the closest equilibrium analog to the model (for example, reversible rate constants from each of the activated states to an inactive state) does not achieve non-monotonicity with ligand affinity.

      We have added a model in the supplementary materials that represents a detailed balance Markov chain. In the model, we imagine that ligand bound receptors undergo a series of equilibrium transitions, all characterized by the same activation and inactivation rate. We show that at saturating ligand levels, the signaling output only depends on the ratio of the activation to the inactivation rate (i.e., the thermodynamic stability of the active site) (lines 466-488).

      (3) Schematics for earlier models are described in Figure 1. However, no schematic for the actual model proposed by the authors is shown. This should be added as a subpanel to Figure 1.

      We thank the reviewer for identifying our omission of our model schematic. We have included our model schematic as its own figure (Figure 3).

      (4) Minor: Figure 1 is referred to as Figure?? In line 97 of page 3.

      We thank the reviewer for identifying this error, it has been corrected. 

      Reviewer #2 (Recommendations for the authors):

      (1) There is an inconsistency between Figure 2(a) and Equation (1), it suggests that p_N is \omega^N/(\omega+\delta)^N. This makes more sense with the model defined in the supplementary material.

      We thank the reviewer for identifying this error. Equation (1) has been updated to reflect the correct relationship.

      (2) The figure presenting the model of the authors appears to be missing.

      We thank the reviewer for identifying this error, it has been corrected (Figure 3 in the new manuscript). 

      (3) The authors describe phosphorylation as irreversible in the intro, but then consider reversible phosphorylation in their model, which may be confusing to readers.

      We thank the reviewer for identifying this source of possible confusion. We have clarified that dephosphorylation is taken to be a distinct irreversible reaction, see lines 105 - 112.

      (4) The authors reuse similar names, e.g., network activity, kinase activity, signaling activity, activity. This is confusing.

      We apologize for the confusion. We note that, within the context of our model, there are important distinctions between signaling activity (the amount of signaling competent receptors) and kinase activity (value corresponding to the phosphorylation rate). We have attempted to use these different terms correctly and are happy to make clarifying corrections if there are any places where a term is misused.  

      (5) Several parameters are defined only in the captions of the figures, such as \beta and \rho.

      We thank the reviewer for identifying this omission, we have added the definitions of beta and rho to the main text (see line 129). 

      (6) The sentence at line 137 lacks some words: "Below, we kinetic...".

      We thank the reviewer for identifying this error, we have added the missing words (“Below, we show how kinetic…”).

      (7) The sentence at line 183 lacks some words: "When kinase activity...".

      We thank the reviewer for identifying this error. We have now corrected it. 

      (8) Figure 5 is very small.

      We will work with the production team to increase the size of this figure.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):  

      Summary:

      The manuscript by Cupollilo et al describes the development, characterization, and application of a novel activity labeling system; fast labelling of engram neurons (FLEN). Several such systems already exist but this study adds additional capability by leveraging an activity marker that is destabilized (and thus temporally active) as well as being driven by the full-length promoter of cFos. The authors demonstrate the activity-dependent induction and time course of expression, first in cultured neurons and then in vivo in hippocampal CA3 neurons after one trial of contextual fear conditioning. In a series of ex vivo experiments, the authors perform patch clamp analysis of labeled neurons to determine if these putative engram neurons differ from non-labelled neurons using both the FLEN system as well as the previously characterized RAM system. Interestingly the early labelled neurons at 3 h post CFC (FLEN+) demonstrated no differences in excitability whereas the RAMlabelled neurons at 24h after CFC had increased excitability. Examination of synaptic properties demonstrated an increase in sEPCS and mEPSC frequencies as well as those for sIPSCs and mIPSCs which was not due to a change in the mossy fiber input to these neurons.

      Strengths:

      Overall the data is of high quality and the study introduces a new tool while also reassessing some principles of circuit plasticity in the CA3 that have been the focus of prior studies.

      Weaknesses:

      No major weaknesses were noted.

      Reviewer #2 (Public review): 

      Summary: 

      Cupollilo et al. investigate the properties of hippocampal CA3 neurons that express the immediate early gene cFos in response to a single foot shock. They compare ex-vivo the electrophysiological properties of these "engram neurons" labeled with two different cFos promoter-driven green markers: Their new tool FLEN labels neurons 2-6 h after activity, while RAM contains additional enhancers and peaks considerably later (>24 h). Since the fraction of labeled CA3 cells is comparable with both constructs, it is assumed (but not tested) that they label the same population of activated neurons at different time points. Both FLEN+ and RAM+ neurons in CA3 receive more synaptic inputs compared to non-expressing control neurons, which could be a causal factor for cFos activation, or a very early consequence thereof. Frequency facilitation and E/I ratio of mossy fiber inputs were also tested, but are not different in both cFos+ groups of neurons. One day after foot shock, RAM+ neurons are more excitable than RAM- neurons, suggesting a slow increase in excitability as a major consequence of cFos activation.

      Strengths: 

      The study is conducted to high standards and contributes significantly to our understanding of memory formation and consolidation in the hippocampus. Modifications of intrinsic neuronal properties seem to be more salient than overall changes in the total number of (excitatory and inhibitory) inputs, although a switch in the source of the synaptic inputs would not have been detected by the methods employed in this study

      Weaknesses: 

      With regard to the new viral tool, a direct comparison between the new tool FLEN and existing cFos reporters is missing. 

      Reviewer #1 (Recommendations for the authors):

      I have only minor suggestions for the authors to consider. 

      (1) In the in vitro characterization, the percentage of labelled neurons seems very low after a powerful and prolonged activation. It was somewhat surprising and raised the question of how accurately the FLEN construct reflects endogenous cFOS activity. Could the authors speak to this?

      The reviewer is correct that the level of FLEN positive neurons, as compared to mCherry positive neurons, is low as compared to studies using viral infection with RAM vectors in neuronal cultures (Sorensen et al, 2016, Sun et al, 2020), which is around 70-80% following chemical stimulation. The authors do not provide evidence however for a comparison with endogenous c-Fos activity in cell cultures. The reason for a discrepancy in the effect of chemical stimulation of cultured neurons is not clear, but may depend on culture conditions which may vary between labs. 

      FLEN was constructed using a mouse c-Fos promoter (-355 to +109) (Cen et al, 2003). To answer the reviewer’s question we performed an additional experiment in cultured neurons in which we found that 77.1 % of FLEN positive neurons were also c-fos positive neurons (using immunocytochemistry).

      (2) The authors compare the two labelling strategies and interpret their data with the presumption that both label a similar set of active neurons. This is particularly relevant when they suggest there might be a progressive increase in the excitability of active neurons with time. This is certainly a possibility, but the authors should also consider other possibilities that the two markers might label different populations of neurons. For example, if they require different thresholds for activation, it is possible that one is more sensitive to activity than the other. As these are unknown variables the authors should temper the interpretation accordingly.

      Indeed, the reviewer is correct that this limitation should be discussed. We have added this as a point of discussion in the text (line 355-358). In the article describing the RAM strategy (Sorensen et al, 2016) the authors use RAM to label DG neurons activated during an experience in a context A (Figure 4). Exploiting the fact that engram cells are re-activated when the animal is re-exposed to the same environment of training (memory recall), they performed c-Fos staining 90 minutes following either context A or context B re-exposure. The RAM-c-Fos overlap percentage was higher in A-A rather than A-B (A-A was a bit more than 20%). This means that RAM has captured a group of cells during training that, at least in part, were re-activated during recall. This could in part support the assumption that RAM and c-Fos share a certain overlap. Of course, this was done in DG, while we worked in CA3. In addition, both strategies label in their great majority c-Fos+ neurons (see above answer to point #1). This can not completely rule out the possibility that FLEN and RAM label partly distinct population of activated cells. 

      (3) An increase in the frequency of synaptic events is observed in neurons labelled with both markers. The authors propose that this may be due to an increase in synaptic contacts based on prior studies. However, as this is the first functional assessment why not consider changes in release probability as a mechanism for this finding? 

      We have added this as a possibility in the text (line 362-363).

      (4) It would be useful to include plots of the average frequency of m/sEPSCs and m/sIPSCs in Figures 4 and 5. These figures could also be combined into a single figure.

      We agree with the reviewer that figure 4 and 5 could be merged into a single figure. In the revised version, figure 5A becomes panel C in figure 4. Text and figure descriptions were adjusted accordingly.

      Reviewer #2 (Recommendations for the authors): 

      (1) Abstract, line 24: "In contrast, FLEN+ CA3 neurons show an increased number of excitatory inputs." RAM+ neurons also show an increased number of excitatory inputs, so this is not "in contrast". Also, not just excitatory, but also inhibitory synaptic inputs are more numerous in cFos+ neurons. Please improve the summary of your findings.

      “In contrast” referred to the fact that FLEN+ neurons do not show differences in excitability as compared to FLEN- neurons, as mentioned in the previous sentence. We now provide a more explicit sentence to explain this point: “On the other hand, like RAM+ neurons, FLEN+ CA3 neurons show an increased number of excitatory inputs.”

      (2) Novel tool: Destabilized cFos reporters were introduced 23 years ago and are also part of the TetTag mouse. I am not sure that changing the green fluorescent protein to a different version merits a new acronym (FLEN). To convince the readers that this is more than a branding exercise, the authors should compare the properties (brightness, folding time, stability) of FLEN to e.g. the d2EGFP reporter introduced by Bi et al. 2002 (J Biotechnol. 93(3):231) and show significant improvements.

      We thank the reviewer for this comment which compelled us to evaluate the features of other tools used to label neurons activated following contextual fear conditioing. The key properties of FLEN as compared to other tools used to label engrams is that: (i) it is a viral tool, as opposed to transgenic mice, (ii) a c-fos promoter drives the expression of a brightly fluorescent protein allowing their identification ex vivo for functional analysis, (iii) the fluorescent protein is rapidly destabilized, providing the possibility to label neurons only a few hours after their activation by a behavioural task.

      We did not find any viral tools providing the possibility to label c-fos activated neurons for functional assesment. We have not been able to find references for the use of the d2EGFP reporter introduced by Bi et al. 2002 in a behavioural context. One of the major difference and improvement is certainly the brightness of ZsGreen. In cell cultures, ZsGreen1 showed a 8.6-fold increase in fluorescence intensity as compared with EGFP (Bell et al, 2007).

      Amongst tools with comparable properties, eSARE was developed based on a synthetic Arc promoter driving the expression of a destabilized GFP (dEGFP) (Kawashima et al 2013). We initially used ESARE–dGFP but unfortunately, in our experimental conditions we found that the signal to noise ratio was not satisfactory (number of cells label in the home cage vs. following contextual fear conditining).

      We developed a viral tool to avoid the use of transgenic reporter lines which require laborious breeding and is experimentally less flexible. Nevertheless, many transgenic mice based on the expression of fluorescent proteins under the control of IEG promoters have been developed and used. Some of these mice show a time course of expression of the transgene which is comparable to FLEN. For instance, in organotypic slices from Tet-Tag mice, the time course of expression of EGFP slices follows with a small delay endogenous cFOS expression, and starts decaying after 4 hours (Lamothe-Molina et al, 2022). However, the fluorescence was too weak to visualize neurons in the slice (Christine Gee, personal communication), and imaging is perfomed after immunocytochemistry against GFP. 

      Therefore, we feel that the name given to the FLEN strategy is legitimate. The features of the FLEN strategy were summarized in the discussion (Lines 318-322).

      (3) Line 214: "...FLEN+ CA3 PNs do not show differences in [...] patterns of bursting activity as compared to control neurons." It looks quite different to me (Figure 3E). Just because low n precludes meaningful statistical analysis, I would not conclude there is no difference.

      We agree with the reviewer that the data in Figure 3E are not conclusive due to small sample size, which limits the reliability of statistical comparison. Additionally, the classification of bursting neurons is highly dependent on the specific criteria used, which vary considerably across the literature. To avoid overinterpretation or misleading conclusions, we decided to remove the panel E of Figure 3 showing the fraction of bursting neurons. Nevertheless, we draw the attention to the more robust and interpretable results: RAM⁺ neurons exhibit an increase in firing frequency and a distinct action potential discharge pattern, data which we believe are informative of altered excitability.

      (4) Line 304: Remove the time stamp.

      This was done.

      (5) Line 334: "...results may be explained by an overall increased activity of CA1 neurons..." I don't understand - isn't CA1 downstream of CA3? 

      The reviewer is correct that the sentence was misleading. We removed the reference to CA1, as it was more of a general principle about neuronal activity.

      (6) Line 381: "resolutive", better use "sensitive". 

      This was changed.

      (7) Figure S3: Fear-conditioned animals were 3 days off Dox, controls only 2 days. As RAM expression accumulates over time off Dox, this is not a fair comparison.

      We thank the reviewer for pointing out the incorrect reporting of the experimental design in Figure S3 panel A (bottom), which could lead to misinterpretation of results. In fact, the two groups of mice (CFC vs. HC) underwent all experimental steps in parallel. Specifically, both groups were maintained on and off Doxycycline for the same duration and received viral injection on the same day. 48 hours after Dox withdrawal, the CFC group was trained for contextual conditioning, while the HC group remained in the home cage in the holding room. All animals were thus sacrificed 72 hours after Dox removal. We have corrected the figure to accurately reflect this timeline.

      (8) Please provide sequence information for c-cFos-ZsGreen1-DR. Which regulatory elements of the cFos promoter are included, is the 5' NTR included? This information is very important.

      The information is now provided in the Methods section.

      (9) Please provide the temperature during pharmacological treatments (TTX etc.) before fixation.

      The pharmacological treatment was performed in the incubator at 37°C, this is now indicated in the methods.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1(Public Review):

      Major comments:

      (1) Interpretation of key results and relationship between different parts of the manuscript. The manuscript begins with an information-transmission ansatz which is described as ”independent of the computational goal” (e.g. p. 17). While information theory indeed is not concerned with what quantity is being encoded (e.g. whether it is sensory periphery or hippocampus), the goal of the studied system is to *transmit* the largest amount of bits about the input in the presence of noise. In my view, this does not make the proposed framework ”independent of the computational goal”. Furthermore, the derived theory is then applied to a DDC model which proposes a very specific solution to inference problems. The relationship between information transmission and inference is deep and nuanced. Because the writing is very dense, it is quite hard to understand how the information transmission framework developed in the first part applies to the inference problem. How does the neural coding diagram in Figure 3 map onto the inference diagram in Figure 10? How does the problem of information transmission under constraints from the first part of the manuscript become an inference problem with DDCs? I am certain that authors have good answers to these questions - but they should be explained much better.

      We are very thankful to the reviewer for highlighting the potential confusion surrounding these issues, in particular the relationship between the two halves of the paper – which was previously exacerbated by the length of the paper. We have now added further explanations at different points within the manuscript to better disentangle these issues and clarify our key assumptions. We have also significantly cut the length of the paper by moving more technical discussions to the Methods or Appendices. We will summarise these changes here and also clarify the rationale for our approach and point out potential disagreements with the reviewer.

      Key to our approach is that we indeed do not assume the entire goal of the studied neural system (whether part of the sensory system or not) is to transmit the largest amount of information about the stimulus input (in the presence of noise). In fact, general computations, including the inference of latent causes of inputs, often require filtering out or ignoring some information in the sensory input. It is thus not plausible that tuning curves in general (i.e. in an arbitrary part of the nervous system) are optimised solely with regards to the criterion of information transmission. Accordingly we do not assume they are entirely optimised for that purpose. However, we do make a key assumption or hypothesis (which like any hypothesis might turn out to be partly or entirely wrong): that (1) a minimal feature of the tuning curve (its scale or gain) is entirely free to be optimised for the aim of information transmission (or more precisely the goal of combating the detrimental effect of neural noise on coding fidelity), (2) other aspects of the population tuning curve structure (i.e. the shape of individual tuning curves and their arrangement across the population) are determined by (other) computational goals beyond efficient coding. (Conceptually, this is akin to the modularization between indispensible error correction and general computations in a digital computer, and the need for the former to be performed in a manner that is agnostic as to the computations performed.) We have added two paragraphs in the manuscript which present the above rationale and our key hypothesis or assumption. The first of these was added to the (second paragraph of the) Introduction section, and the second is a new paragraph following Eq. 1 (which is about the gain-shape decomposition of the tuning curves, and the optimisation of the former based on efficient coding) of Results.

      Our paper can be divided into two parts. In the first part, we develop a general, computationally agnostic (in the above sense, just as in the digital computer example), efficient coding theory. In the second part, we apply that theory to a specific form of computation, namely the DDC framework for Bayesian inference. The latter theory now determines the tuning curve shapes. When combined with the results of the first part (which dictate the tuning curve scale or gain according to efficient coding theory), this “homeostatic DDC” model makes full predictions for the tuning curves (i.e., both scale and shape) and how they should adapt to stimulus statistics.

      So to summarise, it is not the case that the problem of information transmission (or rather mitigating the effect noise on coding fidelity under metabolic constraints), dealt with in the first part, has become a problem of Bayesian inference. But rather, the dictates of efficient coding for optimal gains for coding fidelity (under constraints) have been applied to and combined with a computational theory of inference.

      We have added new expository text before and after Eq. 17 in Sec. 2.7 (at the beginning of the second part of the paper on homeostatic DDCs) to again make the connection with the first part and the rationale for its combination with the original DDC framework more clear.

      With the changes outlined above, we believe and hope the connection between the two parts (which we agree with the reviewer, was indeed rather obscure previously) has been adequately clarified.

      (2) Clarity of writing for an interdisciplinary audience. I do not believe that in its current form, the manuscript is accessible to a broader, interdisciplinary audience such as eLife readers. The writing is very dense and technical, which I believe unnecessarily obscures the key results of this study.

      We thank the reviewer for this comment. We have taken several steps to improve the accessibility of this work for an interdisciplinary audience. Firstly, several sections containing dense, mathematical writing have now been moved into appendices or the Methods section, out from the main text; in their place we have made efforts to convey the core of the results, and to providing intuitions, without going into unnecessary technical detail. Secondly, we have added additional figures to help illustrate key concepts or assumptions (see Fig. 1B clarifying the conceptual approach to efficient coding and homeostatic adaptation, and Fig. 8A describing the clustered population). Lastly, we have made sure to refer back to the names of symbols more often, so as to make the analysis easier to follow for a reader with an experimental background.

      (3) Positioning within the context of the field and relationship to prior work. While the proposed theory is interesting and timely, the manuscript omits multiple closely related results which in my view should be discussed in relationship to the current work. In particular, a number of recent studies propose normative criteria for gain modulation in populations: • Duong, L., Simoncelli, E., Chklovskii, D. and Lipshutz, D., 2024. Adaptive whitening with fast gain modulation and slow synaptic plasticity. Advances in Neural Information Processing Systems

      Tring, E., Dipoppa, M. and Ringach, D.L., 2023. A power law describes the magnitude of adaptation in neural populations of primary visual cortex. Nature Communications, 14(1), p.8366.

      Ml ynarski, W. and Tkaˇcik, G., 2022. Efficient coding theory of dynamic attentional modulation. PLoS Biology

      Haimerl, C., Ruff, D.A., Cohen, M.R., Savin, C. and Simoncelli, E.P., 2023. Targeted V1 co-modulation supports task-adaptive sensory decisions. Nature Communications • The Ganguli and Simoncelli framework has been extended to a multivariate case and analyzed for a generalized class of error measures:

      Yerxa, T.E., Kee, E., DeWeese, M.R. and Cooper, E.A., 2020. Efficient sensory coding of multidimensional stimuli. PLoS Computational Biology

      Wang, Z., Stocker, A.A. and Lee, D.D., 2016. Efficient neural codes that minimize LP reconstruction error. Neural Computation, 28(12),

      We thank the reviewer again for bringing these works to our attention. For each, we explain whether we chose to include them in our Discussion section, and why.

      (1) Duong et al. (2024): We decided not to discuss this manuscript, as our assessment is that it is very relevant to our work. That study starts with the assumption that the goal of the sensory system under study is to whiten the signal covariance matrix, which is not the assumption we start with. A mechanistic ingredient (but not the only one) in their approach is gain modulation. However, in their case it is the gains of computationally auxiliary inhibitory neurons that is modulated and not (as in our case) the gain the (excitatory) coding neurons (i.e. those which encode information about the stimulus and whose response covariance is whitened). These key distinction make the connection with our work quite loose and we did not discuss this work.

      (2) Tring et al. (2023): We have added a discussion of the results of this paper and its relationship to the results of our work and that of Benucci et al. This appears in the 7th paragraph of the Discussion. This study is indeed highly relevant to our paper, as it essentially replicates the Benucci et al. experiment, this time in awake mice (rather than anesthetised cats). However, in contrast to the resul‘ts of Benucci et al., Tring et al. do not find firing rate homeostasis in mouse V1. A second, remarkable finding of Tring et al. is that adaptation mainly changes the scale of the population response vector, and only minimally affects its direction. While Tring et al. do not portray it as such, this behaviour amounts to pure stimulus-specific adaptation without the neuron-specific factor found in the Benucci et al. results (see Eq. 24 of our manuscript). As we discuss in our manuscript, when our homeostatic DDC model is based on an ideal-observer generative model, it also displays pure stimulus-specific adaptation with no neuronal factor. Our final model for Benucci’s data did contain a neural factor, because we used a non-ideal observer DDC (in particular, we assumed a smoother prior distribution over orientations compared to the distribution used in the experiment - which has a very sharp peak – as it is more natural given the inductive biases we expect in the brain). The resultant neural factor suppresses the tuning curves tuned to the adaptor stimulus. Interestingly, when gain adaptation is incomplete, and happens to a weaker degree compared to what is necessary for firing rate homeostasis, an additional neural factor emerges that is greater than one for neurons tuned to the adaptor stimulus. These two multiplicative neural factors can approximately cancel each other; such a theory would thus predict both deviation from homeostasis and approximately pure stimulus-specific adaptation. We plan to explore this possibility in future work.

      (3) Ml ynarski and Tkaˇcik (2022): We are now citing and discussing this work in the Discussion (penultimate paragraph), in the context of a possible future direction, namely extending our framework to cover the dynamics of adaptation (via a dynamic efficient gain modulation and dynamic inference). We have noted there that Mlynarski have used such a framework (which while similar has key technical differences with our approach) based on a task-dependent efficient coding objective to model top-down attentional modulation. By contrast, we have studied bottom-up and task-independent adaptation, and it would be interesting to extend our framework and develop a model to make predictions for the temporal dynamics of such adaptation.

      (4) Haimerl et al. (2023): We have elected not to include this work within our discussion either, as we do not believe it is sufficiently relevant to our work to warrant inclusion. Although this paper also considers gain modulation of neural activity, the setting and the aims of the theoretical work and the empirical phenomena it is applied to are very different from our case in various ways. Most importantly, this paper is not offering a normative account of gain modulation; rather, gain modulation is used as a mechanism for enabling fast adaptive readouts of task relevant information.

      (5) Yerxa et al. (2020): We have now included a discussion of this paper in our Discussion section. Note that, even though this study generalises the Ganguli and Simoncelli framework to higher diemsnions, just like that paper it still places strict requirements (which are arguably even more stringent in higher dimensions) on the form of the tuning curves in the population, viz. that there exists a differentiable transform of the stimulus space which renders these unimodal curves completely homogeneous (i.e., of the same shape, and placed regularly and with uniform density).

      (6) Wang et al. (2016): We have included this paper in our discussion as well. As above, this paper does not consider general tuning curves, and places the same constraint on their shape and arrangement as in Ganguli and Simoncelli paper.

      More detailed comments and feedback:

      (1) I believe that this work offers the possibility to address an important question about novelty responses in the cortex (e.g. Homann et al, 2021 PNAS). Are they encoding novelty per-se, or are they inefficient responses of a not-yet-adapted population? Perhaps it’s worth speculating about.

      We are not sure why the relatively large responses to “novel” or odd-ball stimuli should be considered inefficient or unadapted: in the context in which those stimuli are infrequent odd-balls (and thus novel or surprising when occurring), efficient coding theory would indeed typically predict a large response compared to the (relatively suppressed) responses to frequently occurring stimuli. Of course, if the statistics change and the odd-ball stimulus now becomes frequent, adaptation should occur and would be expected to suppress responses to this stimulus. As to the question of whether (large) responses to infrequent stimuli can or should be characterised as novelty responses: this is partly an interpretational or semantic issue – unless it is grounded in knowledge of how downstream populations use this type of coding in V1, which could then provide a basis for solidly linking them to detection of novelty per se. In short, our theory, could be applied to Homann et al.’s data, but we consider that beyond the scope of the current paper.

      (2) Clustering in populations - typically in efficient coding studies, tuning curve distributions are a consequence of input statistics, constraints, and optimality criteria. Here the authors introduce randomly perturbed curves for each cluster - how to interpret that in light of the efficient coding theory? This links to a more general aspect of this work - it does not specify how to find optimal tuning curves, just how to modulate them (already addressed in the discussion).

      We begin by addressing the reviewer’s more general concern regarding the fact that our theory does not address the problem of finding optimal tuning curves, only that of modulating them optimally. As we expound within the updated version of the paper (see the newly expanded 3rd paragraph in Sec. 2.1 and the expanded 2nd paragraph in Introduction), it is not plausible that the sole function of sensory systems, and neural circuits more generally, is the transmission of information. There are many other computational tasks which must be performed by the system, such as the inference of the latent causes of sensory inputs. For many such tasks, it is not even desirable to have complete transmission of information about the external stimulus, since a substantial portion of that information is not important for the task at hand, and must be discarded. For example, such discarding of information is the basis of invariant representations that occur, e.g., in higher visual areas. So we recognise that tuning curve shapes are in general dictated and shaped by computational goals beyond transmission of information or error correction. As such, we have remained agnostic as to the computational goals of neural systems and therefore the shape of the tuning curve. We have made the assumption and adopted the postulate that those computational goals determine the shape of the tuning curves, leaving the gains to be adjuted freely for the purpose of mitigating the effect noise on coding fidelity (this is similar to how error correction is done in computers independendently of the computations performed). by assuming that those computational goals are captured adequately by the shape of tuning curves, this leaves us free to optimise the gains of those curves for purely information theoretic objectives. Finally, we note that the case where the tuning curve shapes are additionally optimised for information transmission is a special case of our more general approach. For further discussion, see the updated version of our introduction.

      We now turn to our choice to model clusters using random perturbations. This is, of course, a toy model for clustering tuning curves within a population. With this toy model we are attempting to capture the important aspects of tuning curve clusters within the population while not over-complicating the simulations. Within any neural population, there will be tuning curves that are similar; however, such curves will inevitably be heterogeneous, as opposed to completely identical. Thus, when we cluster together similar curves there will be an “average” cluster tuning curve (found by, e.g., normalising all individual curves and taking the average), which all other tuning curves within the cluster are deviations from. The random perturbations we apply are our attempt to capture these deviations. However, note that the perturbations are not fully random, but instead have an “effective dimensionality” which we vary over. By giving the perturbations an effective dimensionality, we aim to capture the fact that deviations from the average cluster tuning curve may not be fully random, and may display some structure.

      (3) Figure 8 - where do Hz come from as physical units? As I understand there are no physical units in simulations.

      We have clarified this within the figure caption. The within-cluster optimisation problem requires maximising a quadratic program subject to a constraint on the total mean spike count of the cluster. The objective for the quadratic program is however mathematically homogeneous. So we can scale the variables and parameters in a consistent to be in units of Hz – i.e., turn them into mean firing rates, instead of mean spike counts, with an assumption on the length of the coding time interval. We fix this cluster firing rate to be k × 5 Hz, so that the average single-neuron firing rate is 5 Hz (based on empirical estimates – see our Sec. 2.5). This agrees with our choice of µ in our simulations (i.e., µ = 10) if we assume a coding interval of 0.1 seconds.

      (4) Inference with DDCs in changing environments. To perform efficient inference in a dynamically changing environment (as considered here), an ideal observer needs some form of posterior-prior updating. Where does that enter here?

      A shortcoming of our theory, in its current form, is that it applies only to the system in “steady-state”, without specifying the dynamics of how adaptation temporlly evolves (we assume the enrivonment has periods of relative stability that are of relatively long duration compared to the dynamical timescales of adaptation, and consider the properties of the well-adapted steady state population). Thus our efficient coding theory (which predicts homeostatic adaptation under the outlined conditions) is silent on the time-course over which homeostasis occurs. Likewise, the DDC theory (in its original formulation in Vertes & Sahani) is silent on dynamic updating of posteriors and considers only static inference with a fixed internal model. We have now discuss a new future directoin in the Discussion (where we cite the work of Mlynarski and Tkacik) to point out that our theory can in principle be extended (based on dynamic inference and efficient coding) to account for the dynamics of attention, but this is beyond the scope of the current work.

      (5) Page 6 - ”We did this in such a way that, for all , the correlation matrices, (), were derived from covariance matrices with a 1/n power-law eigenspectrum (i.e., the ranked eigenvalues of the covariance matrix fall off inversely with their rank), in line with the findings of Stringer et al. (2019) in the primary visual cortex.” This is a very specific assumption, taken from a study of a specific brain region - how does it relate to the generality of the approach?

      Our efficient coding framework has been formulated without relying on any specific assumptions about the form of the (signal or noise) correlation matrices in cortex. The homeostatic solution to this efficient coding problem, however, emerges under certain conditions. But, as we demonstrate in our discussion of the analytic solutions to our efficient coding objective and the conditions necessary for the validity of the homeostatic solution, we expect homeostasis to arise whenever the signal geometry is sufficiently high-dimensional (among other conditions). By this we mean that the fall-off of the eigenvalues of the signal correlation matrix must be sufficiently slow. Thus, a fall-off in the eigenvalue spectrum slower than 1/n would favor homeostasis even more than our results. If the fall off was faster, then whether or not (and to what degree) firing rate homeostasis becomes suboptimal depends on factors such as the fastness of the fall-off and also the size of the population. Thus (1) rate homeostasis does not require the specific 1/n spectrum, but that spectrum is consistent with the conditions for optimality of rate homeostasis, (2) in our simulations we had to make a specific choice, and relying on empirical observations in V1 was of course a well-justified choice (moreover, as far as we are aware, there have been no other studies that have characterised the spectrum of the signal covariance matrix in response to natural stimuli, based on large population recordings).

      Reviewer #2 (Public Review):

      Strengths:

      The problem of efficient coding is a long-standing and important one. This manuscript contributes to that field by proposing a theory of efficient coding through gain adjustments, independent of the computational goals of the system. The main result is a normative explanation for firing rate homeostasis at the level of neural clusters (groups of neurons that perform a similar computation) with firing rate heterogeneity within each cluster. Both phenomena are widely observed, and reconciling them under one theory is important.

      The mathematical derivations are thorough as far as I can tell. Although the model of neural activity is artificial, the authors make sure to include many aspects of cortical physiology, while also keeping the models quite general.

      Section 2.5 derives the conditions in which homeostasis would be near-optimal in the cortex, which appear to be consistent with many empirical observations in V1. This indicates that homeostasis in V1 might be indeed close to the optimal solution to code efficiently in the face of noise.

      The application to the data of Benucci et al 2013 is the first to offer a normative explanation of stimulus-specific and neuron-specific adaptation in V1.

      We thank the reviewer for these assessments.

      Weaknesses:

      The novelty and significance of the work are not presented clearly. The relation to other theoretical work, particularly Ganguli and Simoncelli and other efficient coding theories, is explained in the Discussion but perhaps would be better placed in the Introduction, to motivate some of the many choices of the mathematical models used here.

      We thank the reviewer for this comment; we have updated our introduction to make clearer the relationship between this work and previous works within efficient coding theory. Please see the expanded 2nd paragraph of Introduction which gives a short account of previous efficient coding theories and now situates our work and differentiates it more clearly from past work.

      The manuscript is very hard to read as is, it almost feels like this could be two different papers. The first half seems like a standalone document, detailing the general theory with interesting results on homeostasis and optimal coding. The second half, from Section 2.7 on, presents a series of specific applications that appear somewhat disconnected, are not very clearly motivated nor pursued in-depth, and require ad-hoc assumptions.

      We thank the reviewer for this suggestion. The reviewer is right to note that our paper contains both the exposition of a general efficient coding theory framework in addition to applications of that framework. Following your advice we have implemented the following changes. (1) significantly shortened or entirely moved some of the less central results in the second half of Results, to the Methods or appendices (this includes the entire former section 2.7 and significant shortening of the section on implementation of Bayes ratio coding by divisive normalisation). (2) We have added a new figure (Fig 1B) and two long pieces of text to the (2nd paragraph of) Introduction, after Eq. (1), and in Sec. 2.7 (introducing homeostatic DDCs) to more clearly explain and clarify the assumptions underlying our efficient coding theory, and its connection with the second half of the Results (i.e. application to DDC theory of Bayesian inference), and better motivate why we consider the homeostatic DDC.

      For instance, it is unclear if the main significant finding is the role of homeostasis in the general theory or the demonstration that homeostatic DDC with Bayes Ratio coding captures V1 adaptation phenomena. It would be helpful to clarify if this is being proposed as a new/better computational model of V1 compared to other existing models.

      We see the central contribution of our work as not just that homeostasis arises as a result of an efficient coding objective, but also that this homeostasis is sufficient to explain V1 adaptation phenomena - in particular, stimulus specific adaptation (SSA) - when paired with an existing theory of neural representation, the DDC (itself applied to orientation coding in V1). Homeostatic adaptation alone does not explain SSA; nor do DDCs. However, when the two are combined they provide an explanation for SSA. This finding is significant, as it unifies two forms of adaptation (SSA and homeostatic adaptation) whose relationship was not previously appreciated. Our field does not currently have a standard model of V1, and we do not claim to have provided one either; rather, different models have captured different phenomena in V1, and we have done so for homeostatic SSA in V1.

      Early on in the manuscript (Section 2.1), the theory is presented as general in terms of the stimulus dimensionality and brain area, but then it is only demonstrated for orientation coding in V1.

      The efficient coding theory developed in Section 2 is indeed general throughout, we make no assumptions regarding the shape of the tuning curves or the dimensionality of the stimulus. Further, our demonstrations of the efficient coding theory through numerical simulations - make assumptions only about the form of the signal and noise covariance matrices. When we later turn our attention away from the general case, our choice to focus on orientation coding in V1 was motivated by empirical results demonstrating a co-occurrence of neural homeostasis and stimulus specific adaptation in V1.

      The manuscript relies on a specific response noise model, with arbitrary tuning curves. Using a population model with arbitrary tuning curves and noise covariance matrix, as the basis for a study of coding optimality, is problematic because not all combinations of tuning curves and covariances are achievable by neural circuits (e.g. https://pubmed.ncbi.nlm.nih.gov/27145916/ )

      First, to clarify, our theory allows for complete generality of neural tuning curve shapes, and assumes a broad family of noise models (which, while not completely arbitrary, includes cases of biological relevance and/or models commonly used in the theoretical literature). Within this class of noise covariance models, we have shown numerical results for different values for different parameters of the noise covariance model, but more importantly, have analytically outlined the general properties and requirements on noise strength and structure (and its relationship to tuning curves and signal structure) under which homeostatic adaptation would be optimal. Regarding the point that not all combinations of tuning curves and noise covariances occur in biology or are achievable by neural circuits: (1) If we are guessing correctly the specific point of the reviewer’s reference to the review paper by Kohn et al. 2016, we have in fact prominently discussed the case of information limiting noise which corresponds to a specific relationship between signal structure (as determined by tuning curves) and noise structure (as specified by the noise covariance matrix). Our family of noise models include that biologically relevant case and we have indeed paid it particular attention in our simulations and discussions (see discussion of Fig. 7 in Sec. 2.3, and that of aligned noise in Sec. 2.5). (2) As for the more general or abstract point that not all combinations of noise covariance and tuning curve structures are achievable by neural circuits, we can make the following comments. First, in lieu of a full theoretical or empirical understanding of the achievable combinations (which does not exist), we have outlined conditions for homeostatic adaptations under a broad class of noise models and arbitrary tuning curves. If some combinations within this class are not realised in biology, that does not invalidate the theoretical results, as the latter have been derived under more general conditions, which nevertheless include combinations that do occur in biology and are achievable by neural circuits (which, as pointed out, include the important case of aligned noise and signal structure – as reviewed in Kohn et al.– to which we have paid particular attention).

      The paper Benucci et al 2013 shows that homeostasis holds for some stimulus distributions, but not others i.e. when the ’adapter’ is present too often. This manuscript, like the Benucci paper, discards those datasets. But from a theoretical standpoint, it seems important to consider why that would be the case, and if it can be predicted by the theory proposed here.

      The theory we provide predicts that, under certain (specified) conditions, we ought to see deviation from exact homeostatic results; indeed, we provide a first order approximation to the optimal gains in this case which quantifies such deviations when they are small. However, unfortunately the form of this deviation depends on a precise choice of stimulus statistics (e.g. the signal correlation matrix, the noise correlation matrix averaged over all stimulus space, and other stimulus statistics), in contrasts to the universality of the homeostatic solution, when it is a valid approximation. In our model of Benucci et al.’s experiment, we restrict to a simple one-dimensional stimulus space (corresponding to orientated gratings), without specifying neural responses to all stimuli; as such, we are not immediately able to make predictions about whether the homeostatic failure can be predicted using the specific form of deviation from homeostasis. However, we acknowledge that this is a weakness of our analysis, and that a more complete investigation would address this question. For reasons of space, we elected not to pursue this further. We have added a paragraph to our Discussion (8th paragraph) explaining this.

      Reviewer#1 (Recommendations for the authors):

      (1) To make the article more accessible I would suggest the following:

      (a) Include a few more illustrations or diagrams that demonstrate key concepts: adaptationof an entire population, clustering within a population, different sources of noise, inference with homeostatic DDCs, etc.

      We thank the reviewer for this suggestion - we have added an additional figure in (Figure 8, Panel A) to explain the concept of clustering within a population. We also added a new panel to Figure 1 (Figure 1B) which we hope will clarify the conceptual postulate underlying our efficient coding framework and its link to the second half of the paper.

      (b) Within the text refer to names of quantities much more often, rather than relying onlyon mathematical symbols (e.g. w,r,Ω, etc).

      We thank the reviewer for the suggestion; we have updated the text accordingly and believe this has improved the clarity of the exposition.

      (2) It is hard to distill which components of the considered theory are crucial to reproducing the experimental observations in Figure 12. Is it the homeostatic modulation, efficient coding, DDCs, or any combination of those or all of them necessary to reproduce the experiment? I believe this could be explained much better, also with an audience of experimentalists in mind.

      We have updated the text to provide additional clarity on this matter (see the pointers to these changes and additions in the revised manuscript, given above in response to your first comment). In particular, reproducing the experimental results requires combining DDCs with homeostatic modulation – with the latter a consequence of our efficient coding theory, and not an independent ingredient or assumption.

      (3) It would be good to comment on how sensitive the results are to the assumptions made, parameter values, etc. For example: do conclusions depend on statistics of neural responses in simulated environments? Do they generalize for different values of the constraint µ? This could be addressed in the discussion / supplementary material.

      This issue is already discussed extensively within the text - see Sec. 2.4, Analytical insight on the optimality of homeostasis, and Sec. 2.5, Conditions for the validity of the homeostatic solution to hold in cortex. In these sections, we outline that - provided a certain parameter combination is small - we expect the homeostatic result to hold. Accordingly, we anticipate that our numerical results will generalise to any settings in which that parameter combination remains small.

      (4) How many neurons/units were used for simulations?

      We apologies for omitting this detail; we used 10,000 units for our simulations. We have edited both the main text and the methods section to reflect this.

      (5) Typos etc: a) Figure 5 caption - the order of panels B and C is switched. b) Figure 6A - I suggest adding a colorbar.

      Thank you. We have relabelled the panels B and C in the appropriate figures so that the ordering in the figure caption is correct. We feel that a colourbar in figure 6A would be unnecessary, since we are only trying to convey the concept of uniform correlations, rather than any particular value for the correlations; as such we have elected not to add a colourbar. We have, however, added a more explicit explanation of this cartoon matrix in the figure caption, by referring to the colors of diagonal vs off-diagonal elements.

      Reviewer#2 (Recommendations for the authors):

      The text on page 10, with the perturbation analysis, could be moved to a supplement, leaving here only the intuition.

      We thank the reviewer for this suggestion; we have moved much of the argument into the appendix so as to not distract the reader with unnecessary technical details.

      Text before eq. 12 “...in cluster a maximize the objective...” should be ‘minimize’?

      The cluster objective as written is indeed maximised, as stated in the text. Note that, in the revised manuscript, this argument has been moved to an appendix to reduce the density of mathematics in the main text.

      Top of page 25 “S<sub>0</sub> and S<sub>0</sub>” should be “S<sub>0</sub> and S<sub>1</sub>”?

      Thank you, we have corrected the manuscript accordingly.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      In this manuscript, Chen et al. investigate the role of the membrane estrogen receptor GPR30 in spinal mechanisms of neuropathic pain. Using a wide variety of techniques, they first provide convincing evidence that GPR30 expression is restricted to neurons within the spinal cord, and that GPR30 neurons are well-positioned to receive descending input from the primary sensory cortex (S1). In addition, the authors put their findings in the context of the previous knowledge in the field, presenting evidence demonstrating that GRP30 is expressed in the majority of CCK-expressing spinal neurons. Overall, this manuscript furthers our understanding of neural circuity that underlies neuropathic pain and will be of broad interest to neuroscientists, especially those interested in somatosensation. Nevertheless, the manuscript would be strengthened by additional analyses and clarification of data that is currently presented. 

      Strengths: 

      The authors present convincing evidence for the expression of GPR30 in the spinal cord that is specific to spinal neurons. Similarly, complementary approaches including pharmacological inhibition and knockdown of GPR30 are used to demonstrate the role of the receptor in driving nerve injury-induced pain in rodent models. 

      Weaknesses: 

      Although steps were taken to put their data into the broader context of what is already known about the spinal circuitry of pain, more considerations and analyses would help the authors better achieve their goal. For instance, to determine whether GPR30 is expressed in excitatory or inhibitory neurons, more selective markers for these subtypes should be used over CamK2. Moreover, quantitative analysis of the extent of overlap between GPR30+ and CCK+ spinal neurons is needed to understand the potential heterogeneity of the GPR30 spinal neuron population, and to interpret experiments characterizing descending SI inputs onto GPR30 and CCK spinal neurons. Filling these gaps in knowledge would make their findings more solid. 

      Thank you very much for your constructive feedback.

      In response to your suggestion, we have used more specific markers to distinguish excitatory (VGLUT2) and inhibitory (VGAT) neurons via in situ hybridization. These analyses revealed that GPR30 is predominantly expressed in excitatory neurons of the superficial dorsal horn (SDH), as presented in the Results section (lines 117-120) and in Figure 2A-B.

      Additionally, we performed a quantitative analysis to determine the extent of co-localization between GPR30+ and CCK+ neurons. The data were included in the Results (lines 131–132) and Figure 2G.

      Reviewer #2 (Public review):

      Using a variety of experimental manipulations, the authors show that the membrane estrogen receptor G protein-coupled estrogen receptor (GPER/GPR30) expressed in CCK+ excitatory spinal interneurons plays a major role in the pain symptoms observed in the chronic constriction injury (CCI) model of neuropathic pain. Intrathecal application of selective GPR30 agonist G-1 induced mechanical allodynia and thermal hyperalgesia in male and female mice. Downregulation of GPR30 in CCK+ interneurons prevented the development of mechanical and thermal hypersensitivity during CCI. They also show the up modulation of AMPA receptor expression by GPR30. 

      Generally, the conclusions are supported by the experimental results. I also would like to see significant improvements in the writing and the description of results. 

      Methodological details for some of the techniques are rather sparse. For example, when examining the co-localization of various markers, the authors do not indicate the number of animals/sections examined. Similarly, when examining the effect of shGper1, it is unclear how many cells/sections/animals were counted and analyzed. 

      In other sections, there is no description of the concentration of drugs used (for example, Figure 4H). In Figures 4C-E, there is no indication of the duration of the recordings, the ionic conditions, the effect of glutamate receptor blockers, etc 

      Some results appear anecdotal in the way they are described. For example, in Figure 5, it is unclear how many times this experiment was repeated. 

      We sincerely appreciate your valuable feedback and thoughtful recommendations.

      To address your concerns regarding methodological transparency, we have added the following details to the revised manuscript:

      The number of animals and sections analyzed in co-localization studies.

      The number of cells/sections/animals used in each quantification following shGper1 treatment.

      The concentrations of drugs administered (e.g., in Figure 4H).

      Detailed recording conditions, including duration, ionic composition, and pharmacological conditions (Figures 4C-E).

      In addition, we have thoroughly revised the writing throughout the manuscript to enhance clarity and precision in the description of our findings.

      Reviewer #3 (Public review): 

      Summary: 

      The authors convincingly demonstrate that a population of CCK+ spinal neurons in the deep dorsal horn express the G protein-coupled estrogen receptor GPR30 to modulate pain sensitivity in the chronic constriction injury (CCI) model of neuropathic pain in mice. Using complementary pharmacological and genetic knockdown experiments they convincingly show that GPR30 inhibition or knockdown reverses mechanical, tactile, and thermal hypersensitivity, conditioned place aversion, and c-fos staining in the spinal dorsal horn after CCI. They propose that GPR30 mediates an increase in postsynaptic AMPA receptors after CCI using slice electrophysiology which may underlie the increased behavioral sensitivity. They then use anterograde tracing approaches to show that CCK and GPR30 positive neurons in the deep dorsal horn may receive direct connections from the primary somatosensory cortex. Chemogenetic activation of these dorsal horn neurons proposed to be connected to S1 increased nociceptive sensitivity in a GPR30-dependent manner. Overall, the data are very convincing and the experiments are well conducted and adequately controlled. However, the proposed model of descending corticospinal facilitation of nociceptive sensitivity through GPR30 in a population of CCK+ neurons in the dorsal horn is not fully supported. 

      Strengths: 

      The experiments are very well executed and adequately controlled throughout the manuscript. The data are nicely presented and supportive of a role for GPR30 signaling in the spinal dorsal horn influencing nociceptive sensitivity following CCI. The authors also did an excellent job of using complementary approaches to rigorously test their hypothesis. 

      Weaknesses: 

      The primary weakness in this manuscript involves overextending the interpretations of the data to propose a direct link between corticospinal projections signaling through GPR30 on this CCK+ population of spinal dorsal horn neurons. For example, even in the cropped images presented, GPR30 is present in many other CCK-negative neurons. Only about a quarter of the cells labeled by the anterograde viral tracing experiment from S1 are CCK+. Since no direct evidence is provided for S1 signaling through GPR30, this conclusion should be revised. 

      Thank you for your encouraging comments and critical insights.

      We fully acknowledge the concern regarding the proposed direct involvement of corticospinal projections in modulating nociceptive behavior via GPR30 in CCK+ neurons. While our anterograde tracing experiments suggest anatomical overlap, we agree that definitive evidence of functional connectivity is lacking.

      Accordingly, we have revised the Abstract, Discussion, and Graphical Abstract to present our findings more cautiously. We now describe our observations as indicating that S1 projections potentially interact with GPR30<sup>+</sup> spinal neurons, rather than asserting a definitive functional link.

      To support this revised interpretation, we performed additional quantitative analyses examining the co-localization among S1 projections, CCK+, and GPR30+ neurons. Furthermore, we clarified that the chemogenetic activation studies targeted a mixed neuronal population and did not exclusively manipulate CCK+ neurons.

      These changes aim to better align our conclusions with the presented data and provide a more nuanced framework for future investigations.

      Reviewer #1 (Recommendations for the authors): 

      Major corrections 

      (1) Figure 2: The authors conclude that GPR30 is mainly expressed in excitatory spinal neurons because they are labeled by a virus with a Camk2 promoter. While there is evidence that Camk2 is specific to excitatory neurons in the brain, based on RNAseq datasets (e.g. Linnarsson Lab, http://mousebrain.org/adolescent/genesearch.html ) this is less clear cut within the spinal cord. A more direct way to assess the relative expression of GPR30 in excitatory versus inhibitory neurons would be to perform immunohistochemistry or FISH with GPR30/Vglut2/Vgat. 

      Alternatively, if this observation is not crucial for the overall arch of the story, I recommend the authors eliminate these data, as they do not support the idea that GPR30 is mainly in excitatory neurons. 

      We thank the reviewer for highlighting this important limitation. To strengthen our conclusion regarding the neuronal identity of GPR30-expressing cells, we performed fluorescent in situ hybridization (FISH) using vGluT2 (marker for excitatory neurons) and VGAT (marker for inhibitory neurons). The results confirmed that GPR30 is predominantly expressed in vGluT2-positive excitatory neurons within the spinal cord. These new data are presented in the revised manuscript (lines 117-120) and shown in Figure 2A-B.

      (2) (2a) Figure 2: The authors also report that GPR30 is expressed in most CCK+ spinal neurons. A more rigorous way to present the data would be to perform quantification and report the % of CCK neurons that are GPR30. 

      (2b) More importantly, it is unclear what % of GPR30 neurons are CCK+. These types of quantifications would provide useful insights into the heterogeneity of CCK and GPR30 neuron populations, and help align findings of experiments using the behavioral pharmacology using GRP antagonists to the knockdown of Gper1 in CCK spinal neurons - for instance, does a population of GRP30+/CCK- neurons exist? If so, it would be worth discussing what role (if any) that population might play in nerve injury-induced mechanical allodynia. 

      Understanding the breakdown of GPR30 populations becomes even more relevant when the authors characterize which cell types are targeted by descending projections from S1. It is clear that the vast majority of CCK+ neurons that receive descending input from S1 neurons are GPR30+, but there are many other GPR30+ neurons that do not receive input from SI neurons presented in 5M. Is this simply because only a small fraction of CCK+/GPR30+ neurons are targeted by descending S1 projections, or could they represent a distinct population of GPR30 neurons? 

      (2a) We appreciate the suggestion. Quantification showed that approximately 90% of CCK⁺ neurons express GPR30, and about 50% of GPR30⁺ neurons co-express CCK. These data are now provided in the revised Results (lines 131-132) and in Figure 2F-G.

      (2b) Indeed, our data reveal that a substantial portion of GPR30⁺ neurons do not co-express CCK. While this study focuses on GPR30 function in CCK⁺ neurons, we recognize the potential relevance of GPR30⁺/CCK⁻ populations. We have addressed this point in the Discussion (lines 303-306):

      “However, it should be noted that half of GPR30⁺ neurons are not co-localized with CCK⁺ neurons, and further studies are needed to explore the function of these GPR30⁺/CCK⁻ neurons in neuropathic pain.”

      Regarding descending input, our data in Figure 5 show that S1 projections selectively innervate a subset (~30%) of CCK⁺ neurons, most of which co-express GPR30. This suggests that S1-targeted CCK⁺/GPR30⁺ neurons may represent a functionally distinct population. We have added clarification to the revised manuscript, while acknowledging that further studies are needed to elucidate the roles of non-targeted GPR30⁺ neurons.

      (3) Throughout the manuscript both male and female mice were used in experiments. Rather than referring to male and female mice as different genders, it would be more appropriate to describe them as different sexes. 

      As suggested, we have replaced all instances of “gender” with “sex” throughout the revised manuscript.

      (4) Figure 5: To increase the ease of interpreting the figure, in panels 5J and 5N, it would be helpful to indicate directly on the figure panel which another marker was assessed in double-labeling analyses.

      We have revised Figures 5J and 5N to include clear labels identifying the markers used in double-labeling analyses, to improve interpretability.

      Minor corrections: 

      (1) Line 36, I believe the authors mean to say "GPER/GPR30 in spinal neurons", rather than just "spinal". 

      Corrected as suggested. The sentence now reads (line 34):

      “Here we showed that the membrane estrogen receptor G-protein coupled estrogen receptor (GPER/GPR30) in spinal neurons was significantly upregulated in chronic constriction injury (CCI) mice…”

      (2) There are minor grammatical errors throughout the manuscript that interfere with comprehension. Proofreading/editing of the English language use may be beneficial. 

      We have thoroughly revised the manuscript for clarity and corrected grammatical and syntactic errors to improve readability.

      (3) Line 169-170, reads "Known that EPSCs are mediated by glutamatergic receptors like AMPA receptors and several studies have been reported the relationship between GPR30 and AMPA receptor25,29". Rewriting the sentence such that it better describes what the known relationship is between GPR30 and AMPA would be helpful in setting up the rationale of the experiment in Figure 4. 

      We have rewritten this section to better clarify the rationale behind the electrophysiological experiments (lines 161-164):

      “Given that EPSCs are primarily mediated through glutamatergic receptors such as AMPA receptors, and emerging evidence suggesting that GPR30 enhances excitatory transmission by promoting clustering of glutamatergic receptor subunits, we examined whether GPR30 modulates EPSCs via AMPA receptor-dependent mechanisms.”

      (4) Line 198-199 "Then we explored the possible connections among GPR30, S1-SDH projections and CCK+ neuron." In the context of spinal circuitry, "connections" may raise the expectation that synaptic connectivity will be evaluated. What I think best describes what the authors investigated in Figure 5 is the "relationship" between GPR30, S1-SDH projections, and CCK+ neurons. 

      We have revised the sentence accordingly (lines 184-186):

      “Building on previous findings suggesting a functional interaction between S1-SDH projections and spinal CCK⁺ neurons, our current study aimed to further elucidate the structural relationship among GPR30, S1-SDH projections, and CCK⁺ neurons.”

      (5) Figure 5: To increase the ease of interpreting the figure, in panels 5J and FN, it would be helpful to indicate directly on the figure panel which other marker was assessed in double-labeling analyses. 

      We have added direct labels to figure panels to clarify double-labeled analyses in the revised Figure 5J and 5N.

      Reviewer #2 (Recommendations for the authors): 

      (1) Can the authors provide more detail about the distribution of CCK+ cells in the spinal cord and, in particular, the localization of double-stained (CCK/cfos) neurons? 

      We thank the reviewer for this suggestion. To better characterize the distribution of CCK⁺ neurons within the spinal dorsal horn (SDH), we performed immunostaining in CCK-tdTomato mice using lamina-specific markers: CGRP (lamina I), IB4 (lamina II), and NF200 (lamina III–V). Our results demonstrate that CCK⁺ neurons are primarily localized in the deeper laminae of the SDH. These findings are now described in the revised Results (lines 126–129) and shown in Figure 2E.

      In addition, we conducted c-Fos immunostaining in CCK-Ai14 mice and found increased activation of CCK⁺ neurons following CCI. This supports the involvement of CCK⁺ neurons in neuropathic pain. These data are included in the Results (lines 129–131) and Supplementary Figure S4.

      (2) Figure 2A. There is no formal quantification of the percentage of TdTomato+ neurons that are also CCK+. The description of these results is insufficient. 

      We appreciate this point and have revised the description of Figure 2A accordingly. To strengthen our analysis, we conducted additional FISH experiments with vGluT2 and VGAT probes. Quantification revealed that GPR30 is predominantly expressed in excitatory neurons (approximately 60%). These data are shown in the revised Results (lines 117-119) and Figures 2A-B and S3. This supports our conclusion that GPR30 is largely localized to excitatory spinal interneurons.

      (3) Figure 4H. What is the evidence that these are AMPA-mediated currents? This is not explained in the text. 

      Thank you for raising this point. We now provide detailed experimental procedures to clarify that the recorded EPSCs are AMPA receptor–mediated. Specifically, spinal slices from CCK-Cre mice were used, and excitatory postsynaptic currents were recorded in the presence of APV (100 μM, NMDA receptor blocker), bicuculline (20 μM, GABA_A receptor blocker), and strychnine (0.5 μM, glycine receptor blocker), ensuring that the observed currents were AMPA-dependent. These methodological details are now clearly described in the revised Results (lines 165–173) and supported by prior literature (Zhang et al., J Biol Chem 2012; Hughes et al., J Neurosci 2010).

      (1) Yan Zhang, Xiao Xiao, Xiao-Meng Zhang, Zhi-Qi Zhao, Yu-Qiu Zhang (2012). Estrogen facilitates spinal cord synaptic transmission via membrane-bound estrogen receptors: implications for pain hypersensitivity. J Biol Chem. Sep 28;287(40):33268-81.

      (2) Ethan G Hughes, Xiaoyu Peng, Amy J Gleichman, Meizan Lai, Lei Zhou, Ryan Tsou, Thomas D Parsons, David R Lynch, Josep Dalmau, Rita J Balice-Gordon (2010). Cellular and synaptic mechanisms of anti-NMDA receptor encephalitis. J Neurosci. 2010 Apr 28;30(17):5866-75.

      (4) What is the signaling mechanism leading to a larger amplitude of currents after G-1 infusion? 

      We thank the reviewer for this important question. G-1 is a selective agonist for GPR30. Based on previous studies by Luo et al. (2016), we speculate that activation of GPR30 may increase the clustering of glutamatergic receptor subunits at postsynaptic sites, thereby enhancing AMPA receptor-mediated currents. While our current study did not directly address the intracellular signaling cascade, we have incorporated this mechanistic speculation in the Discussion.

      Jie Luo, X.H., Yali Li, Yang Li, Xueqin Xu, Yan Gao, Ruoshi Shi, Wanjun Yao, Juying Liu, Changbin Ke (2016). GPR30 disrupts the balance of GABAergic and glutamatergic transmission in the spinal cord driving to the development of bone cancer pain. Oncotarget 7, 73462-73472. 10.18632/oncotarget.11867.

      (5) Figure 4I. Please include error bars. 

      We have revised Figure 4I to include error bars, as requested.

      (6) Line 198. What is the evidence that AAV2/1 EF1α FLP is an antegrade trans monosynaptic marker? 

      We thank you for this request. AAV2/1 has been widely used for anterograde monosynaptic tracing based on its properties (Wang et al., Nat Neurosci 2024; Wu et al., Neurosci Bull 2021): (1) it infects neurons at the injection site and undergoes active anterograde transport; (2) newly assembled viral particles are released at synapses and infect postsynaptic partners; (3) in the absence of helper viruses, the spread halts at the first synapse, ensuring monosynaptic restriction. We have elaborated on this in the revised manuscript (line 198), citing Wang et al. (Nat Neurosci 2024) and Wu et al. (Neurosci Bull 2021).

      (1) Hao Wang, Qin Wang, Liuzhe Cui, Xiaoyang Feng, Ping Dong, Liheng Tan, Lin Lin, Hong Lian, Shuxia Cao, Huiqian Huang, Peng Cao, Xiao-Ming Li (2024). A molecularly defined amygdalaindependent tetra-synaptic forebrain-tohindbrain pathway for odor-driven innate fear and anxiety. Nat Neurosci. 2024 Mar;27(3):514-526.

      (2) Zi-Han Wu, Han-Yu Shao, Yuan-Yuan Fu, Xiao-Bo Wu, De-Li Cao, Sheng-Xiang Yan, Wei-Lin Sha, Yong-Jing Gao, Zhi-Jun Zhang (2021). Descending Modulation of Spinal Itch Transmission by Primary Somatosensory Cortex. Neurosci Bull. 2021 Sep;37(9):1345-1350.

      (7) Figure 5G. I do not understand the logic of this experiment. A Cre AAV is injected in the S1 cortex. Why should this lead to the expression of tdTomato on a downstream (postsynaptic?) neuron? The authors should quote the literature that supports this anterograde transsynaptic transport.

      We appreciate this question. As described in previous studies (e.g., Wu et al., Neurosci Bull 2021), AAV2/1-Cre injected into the S1 cortex leads to Cre expression in projection targets due to transsynaptic anterograde transport. Subsequent injection of a Cre-dependent AAV (AAV2/9-DIO-mCherry) into the spinal cord enables specific labeling of postsynaptic neurons that receive input from S1. We have clarified this mechanism in line 206 and provided the appropriate citation.

      Zi-Han Wu, Han-Yu Shao, Yuan-Yuan Fu, Xiao-Bo Wu, De-Li Cao, Sheng-Xiang Yan, Wei-Lin Sha, Yong-Jing Gao, Zhi-Jun Zhang (2021). Descending Modulation of Spinal Itch Transmission by Primary Somatosensory Cortex. Neurosci Bull. 2021 Sep;37(9):1345-1350.

      (8) The same question arises when interpreting the results obtained in Figure 6.

      We thank the reviewer for the question, and we have addressed it in point (7).

      (9) Line 257. How do the authors envision that estrogen would change its modulation of GPR30 under basal and neuropathic conditions? Is there any evidence for this speculation? 

      We thank the reviewer for raising this thoughtful question. In the current study, we focused on pharmacologically manipulating GPR30 activity via its selective agonist and antagonist. We did not directly investigate how endogenous estrogen regulates GPR30 under physiological and neuropathic states. We have recognized this limitation and highlighted the need for future research to investigate this regulatory mechanism.

      (10-20) In my opinion, the entire manuscript needs a careful revision of the English language. While one can follow the text, it contains numerous grammatical and syntactic errors that make the reading far from enjoyable. I am highlighting just a few of the many errors. 

      We appreciate the reviewer’s honest assessment. The manuscript has undergone thorough language editing by a native English speaker to correct grammatical errors, improve clarity, and enhance overall readability. We also restructured several sections, particularly the Discussion, to improve logical flow.

      (21) The discussion of results is a bit disorganized, with disconnected sentences and statements, and somewhat repetitive. For example, lines 303 to 306 lack adequate flow. It is also quite long and includes general statements that add little to the discussion of the new findings (lines 326-333). 

      We agree and have revised the Discussion extensively. Disconnected or repetitive sentences (e.g., lines 303-306, 326-333) have been removed or rewritten. For instance, we added a new transitional paragraph (lines 307-311) to improve flow:

      “Abnormal activation of neurons in the SDH is a key contributor to hyperalgesia, and enhanced excitatory synaptic transmission is a major mechanism driving increased neuronal excitability. Therefore, we evaluated excitatory postsynaptic currents (EPSCs) and observed increased amplitudes in CCK⁺ neurons following CCI, suggesting elevated excitability in these neurons.”

      We also removed redundant generalizations to maintain a focused discussion of our novel findings.

      Reviewer #3 (Recommendations for the authors): 

      (1) What is the distribution of GPR30 throughout the spinal cord and DRG? The authors demonstrate that this can overlap with a CCK+ population, but there are many GPR30+ and CCK negative neurons, even in the cropped images presented. It would be helpful to quantify the colocalization with CCK. 

      We thank the reviewer for this important point. As shown in the revised manuscript, GPR30 is expressed in both the spinal cord and dorsal root ganglia (DRG). However, our updated data (Figure 1B) demonstrate that Gper1 mRNA levels in the DRG are not significantly altered after CCI, suggesting a limited involvement of DRG GPR30 in neuropathic pain. These results are described in the revised Results (line 94).

      Regarding spinal co-expression, we performed a detailed quantification. Approximately 90% of CCK⁺ neurons express GPR30, while about 50% of GPR30⁺ neurons are CCK⁺. These co-localization results are now included in the revised Results and presented in Figure 2G.

      (2) It is clear that CCI and GPR30 influence excitatory synaptic transmission in CCK+ neurons. However, these experiments do not fully support the authors' claims of a postsynaptic upregulation of AMPARs. Comparing amplitudes and frequencies of spontaneous EPSCs cannot necessarily distinguish a pre- vs postsynaptic change since some of these EPSCs can arise from spontaneous action potential firing. I suggest revising this conclusion. 

      We appreciate these insightful comments. We fully agree that our data from spontaneous EPSC recordings (sEPSCs) in CCK⁺ neurons are not sufficient to distinguish between pre- and postsynaptic mechanisms, as sEPSCs may include spontaneous presynaptic activity. Therefore, we have revised the text throughout the manuscript to avoid overstating conclusions related to postsynaptic AMPA receptor upregulation.

      (3) What is the rationale for the evoked EPSC experiments from electrical stimulation in "the deep laminae of SDH?" I do not think that this experiment can rule out a presynaptic contribution of GPR30 to the evoked responses, particularly if these are Gs-coupled at presynaptic terminals. Paired-pulse stimulations could help answer this question, otherwise, alternative interpretations, also related to the point above, should be provided. 

      We thank the reviewer for this thoughtful critique. Indeed, electrical stimulation of the deep SDH laminae does not exclude presynaptic involvement, especially considering that GPR30 is a G protein–coupled receptor (GPCR) and could act presynaptically. We agree that paired-pulse ratio (PPR) analysis would be more informative in distinguishing pre- from postsynaptic effects, but this was not performed due to technical limitations in our current experimental setup.

      Accordingly, we have revised our interpretations in both the Results and Discussion to acknowledge that our data do not rule out presynaptic contributions. We now state that GPR30 activation enhances EPSCs in CCK⁺ neurons, while further studies are needed to dissect the precise site of action.

      (4) I appreciate the challenging nature of the trans-synaptic viral labeling approaches, but the chemogenetic and Gper knockdown experiments do not selectively target this CCK+ population of deep dorsal horn neurons. The data are clear that each of these components (descending corticospinal projections, CCK neurons, and GPR30) can modulate nociceptive hypersensitivity, but I do not agree with the overall conclusion that each of are directly linked as the authors propose. I recommend revising the overall conclusion and title to reflect the convincing data presented. 

      We thank the reviewer for this critical observation. We agree that while our data show functional roles for descending cortical input, CCK⁺ neurons, and GPR30 in modulating pain hypersensitivity, the evidence does not establish a definitive direct circuit integrating all three components.

      In response, we have revised our conclusions to reflect this limitation. Specifically, we avoided claiming a direct functional link among S1 projections, CCK⁺ neurons, and GPR30. Instead, we now propose that GPR30 modulates neuropathic pain primarily through its action in CCK⁺ spinal neurons, with potential involvement of descending facilitation from the somatosensory cortex.

      Additionally, we have revised the manuscript title to better reflect our mechanistic focus:<br /> “GPR30 in spinal CCK-positive neurons modulates neuropathic pain.”

      Minor Corrections

      (1) The authors should refer to mice by sex, not gender. 

      Corrected throughout the manuscript.

      (2) Page 9, line 195: "significantly" is used to refer to co-localization of 28.1%. What is this significant to? 

      We have revised the sentence to accurately describe the observed percentage, without implying statistical significance:

      “Our co-staining results revealed that a high proportion of CCK⁺ S1-SDH postsynaptic neurons expressed GPR30” (line 198-199).

      (3) I recommend modifying some of the transition phrases like "by the way," "what's more," and "besides". 

      All informal expressions have been replaced with academic alternatives including “Furthermore,” “Additionally,” and “Moreover.”

      (4) Additional guides to mark specific laminae in the dorsal horn would be useful. 

      We added immunostaining with laminar markers (CGRP for lamina I and NF200 for lamina III–V), and these data are now shown in Figure 2E and described in the Results (lines 126-129).

      (5) Page 5, line 115: immunochemistry should be immunohistochemistry. 

      Corrected as suggested.

      (6) Page 6, line 136: "Confirming the structural connnections" was not demonstrated here. Perhaps co-localization between GPR30 and CCK+. 

      The text was revised to “To functionally interrogate GPR30 and CCK⁺ neurons in neuropathic pain...” (line 133).

      (7) Page 8, line 166: unsure what "took and important role" means. 

      This phrasing was corrected for clarity and replaced with an accurate scientific description.

      (8) Page 8, line 168: "IPSCs of spinal CCK+ neurons" implies that they are sending inhibitory inputs. 

      We revised the term to “EPSCs” to correctly reflect excitatory synaptic currents in CCK⁺ neurons.

      (9) Page 8, line 169: "Known that EPSCs" is missing an introductory phrase. 

      The sentence was rewritten to include an appropriate introductory clause (lines 161–164):

      “Given that EPSCs are primarily mediated through glutamatergic receptors such as AMPA receptors...”

      (10) Page 10, line 227 and 228: "adequately" and "sufficiently" should be adequate and sufficient. 

      We corrected these terms to the proper adjective forms: “adequate” and “sufficient” (lines 224-225).

    1. Reviewer #2 (Public review):

      Summary:

      This is a very interesting study from Vandendoren and colleagues examining the role of PVN oxytocin neurons during thermoregulatory behaviors, in particular during thermoregulatory huddling. The findings are important and compelling, and have implications for the thermoregulation field as well as the social/naturalistic behavior field.

      Strengths:

      The study is very creative and tackles a challenging task to examine how natural and social behavior influences neural circuits for a homeostatic system such as thermoregulation. The authors use a combination of state-of-the-art tools (photometry, optogenetics, automated behavior tracking, thermal imaging, and core body temperature measurement), often in combination with each other, to produce a rigorous and high-dimensional dataset. Carrying out tightly temperature-controlled experiments and examining natural behavior, neural activity, and body physiology simultaneously is quite a feat. I applaud the authors for taking this on in a rigorous and detailed manner. This paper will be valuable for both the thermoregulation field as well as for researchers interested in naturalistic social behaviors. The conclusions are supported by the data.

      Weaknesses:

      I have a number of questions and suggestions for clarification that would help improve the interpretation of the findings.

      (1) Figure 1D-F: It would be helpful to include representative images of cFos expression in the PVN, LS, and DMH during both quiescent and solo huddling conditions, to better illustrate the reported differences.

      (2) Figure 1C: The data suggest a general suppression of neural activity during sleep-associated quiescent huddling, which somewhat complicates the interpretation of what specifically the active huddling cells are responding to. A more informative control might have been a comparison between huddling and a more generic form of social engagement (e.g., dyadic sniffing) to assess whether huddling-responsive neurons are broadly tuned to social stimuli. While it may not be feasible to add this experimentally at this time, a brief discussion of this limitation in the main text would be valuable.

      (3) Figure 2H-J vs. Figure 1: The fiber photometry data suggest increased PVN activity during quiescent huddling vs active huddling, which appears to contrast with the cFos results from Figure 1. It would be helpful for the authors to comment on possible reasons for this discrepancy-e.g., methodological differences, temporal resolution, or cell-type specificity.

      (4) Figure 2O: A comparable linear regression for active huddling would be informative to assess whether the observed relationships extend across behavioral states.

      (5) Temperature manipulation: The use of floor temperature changes presents a distinct physiological and sensory experience from, for example, manipulation of ambient temperature. A discussion of how this choice may affect neural circuit engagement or interpretation of thermoregulatory responses would be beneficial.

      (6) Correlations with behavior: Across the manuscript, it would be informative to see correlations between huddle duration and neural activity (e.g., cFos expression, calcium signal magnitude). Similarly, do longer huddles produce greater thermogenic effects?

      (7) Lactating vs. virgin mothers: The inclusion of maternal data is intriguing but feels somewhat disconnected from the central huddling-thermoregulation narrative. If these experiments are to remain, additional explanation of their rationale and how they fit into the broader story would help clarify their relevance.

      (8) Optogenetic manipulation: Have the authors tested the effect of PVN OT neuron stimulation or inhibition during huddling? Even a negative result would be of interest to the field. If these data exist (main or supplementary), I apologize for missing them. If not, the authors might consider including them or commenting briefly on any attempts or challenges in carrying out these experiments.

    2. Reviewer #3 (Public review):

      Summary:

      The authors aimed to elucidate the relationship between physiological state (i.e., behavioral status and thermogenic sympathetic activity) and the activity of hypothalamic paraventricular oxytocin (PVNOT) neurons in female mice. They studied this by combining automated classification of mouse behavior via video-based analysis with calcium imaging of PVNOT neuron activity. Sympathetic thermogenesis was inferred from surface temperature changes captured by infrared thermography, and the authors provided their custom analysis scripts in the manuscript. Notably, they found that a strong, pulsatile activation of PVNOT neurons was "occasionally" observed immediately before the animals transitioned from a resting to an active state. This pulsatile activity was observed in both pair-housed and individually housed animals. While PVNOT neurons are often associated with social behaviors, this finding suggests that the oxytocinergic system is also engaged during naturalistic behaviors, even in the absence of social interactions. If experiments were more convincingly performed and presented, the results would point to a broader physiological role of central oxytocin, including in the regulation of fundamental brain states and homeostatic processes, and offer a new perspective on the functional significance of central oxytocin signaling.

      Strengths:

      The oxytocinergic neural system is believed to subserve a wide range of physiological functions, and elucidating these roles requires monitoring PVNOT neuronal activity under various behavioral contexts, as well as manipulating this activity to establish causal links. In the present study, the authors show a technically sound experimental framework that integrates behavioral tracking in both individually and group-housed mice with the observation and manipulation of PVNOT neuron activity. This experimental setup represents a valuable methodological resource for researchers investigating the physiological functions of oxytocin.

      Weaknesses:

      While this study successfully established a new experimental setup for simultaneous analyses of behavior and PVNOT neuronal activity, there are several concerns regarding the interpretation of the results and the robustness of the conclusions, which should be more thoroughly addressed.

      (1) The study relies on the assumption that calcium imaging and optogenetic manipulation were restricted only to PVNOT neurons. However, the specificity of AAV-mediated gene expression was not verified quantitatively. A fair number of cell bodies in the PVN expressed GCaMP8s, but not OT, indicating potential off-target expression (see Figure S2A, B). The lack of quantitative validation weakens confidence in the causal interpretation of the results.

      (2) The study focuses on the transition from rest to active states following pulsatile activity of PVNOT neurons. However, the physiological significance of this pulsatile activity remains unclear. According to the authors, pulsatile activity occurred with an approximately 20% probability within 100 seconds prior to the end of the resting state. This implies that, in the remaining 80% of rest-to-active transitions, pulsatile PVNOT activity did not occur, suggesting that it is not essential for initiating the transition. A comparative analysis of behavioral and thermogenic changes between transitions with and without pulsatile PVNOT activity would help to further clarify the functional relevance of this phenomenon and strengthen the authors' interpretation of the findings.

      (3) The study identifies a correlation between pulsatile activity of PVNOT neurons and rest-to-active transitions, and tests for a causal relationship using optogenetic stimulation. However, since PVNOT neurons are known to co-release other neurotransmitters such as glutamate, it remains unclear whether the observed effects are mediated specifically through oxytocin receptor signaling. To address this question, functional intervention experiments using oxytocin receptor antagonists or receptor knockout mice are necessary.

      (4) The authors attempted to detect BAT thermogenesis and skin vasomotion using infrared thermography. This technique measures only skin hair temperatures (since the skin was not shaved), but does not measure "BAT temperature" or "vasomotor tone". As seen in Figure 5E, the temperatures of the body surface areas ("BAT", "Rump", and "Dorsal surface") mostly changed in parallel, indicating that these temperatures are strongly affected by body core temperature. Therefore, the thermographic measurements in this study did not provide convincing information on BAT thermogenesis or skin vasomotion. To avoid misleading reports, the authors need to use other techniques to directly measure temperatures, such as telemetry.

      (5) Photostimulation of PVNOT neurons increased Tb after 400 sec (6.6 min) (Figure 5). This latency is too long to conclude that the neuronal stimulation elicited BAT thermogenesis. A more reasonable explanation is that the increase in Tb was caused by the induction of physical activity (Figure S4C), which slowly generates heat and contributes to the elevation of Tb. However, this view contradicts the authors' claim. To address this concern, the authors should directly measure BAT thermogenesis and compare it with the rate of Tb elevation. If BAT thermogenesis occurs, the rate at which the BAT temperature increases must exceed the rate at which Tb rises.

    1. Reviewer #2 (Public review):

      Summary:

      This study investigates the role of the enzyme Alcohol Dehydrogenase 5 (ADH5) in brown adipose tissue (BAT) during aging. BAT is crucial for thermogenesis and energy balance, but its function and mass diminish with age, contributing to metabolic dysfunction and age-related diseases. ADH5, also known as S-nitrosoglutathione reductase, regulates nitric oxide (NO) signaling by damaging S-nitrosylation modifications from proteins. The authors show that aging in mice leads to increased protein S-nitrosylation but reduced ADH5 expression in BAT, resulting in impaired metabolic and cognitive functions. Deletion of ADH5 in BAT accelerates tissue senescence and systemic metabolic decline.

      Mechanisticaremoving lly, aging suppresses ADH5 via downregulation of heat shock factor 1 (HSF1), a master regulator of protein homeostasis. Importantly, pharmacologically boosting HSF1 improves BAT function and mitigates both metabolic and cognitive declines in aged mice. The findings highlight a critical HSF1-ADH5 pathway in BAT that protects against aging-related dysfunction, suggesting that targeting this pathway may offer new therapeutic strategies for improving metabolic health and cognition during aging.

      Strengths:

      This research provides insight into the interplay between redox biology, proteostasis, and metabolic decline in aging. By identifying a specific enzyme that controls SNO status in BAT and further developing a therapy to target ADH5 in BAT to prevent age-related decline, the authors have identified a putative mechanism to combat age-related decline in BAT function.

      Weaknesses:

      (1) Sex needs to be considered as a biological variable, at a minimum in the reporting of the phenotypes observed in this manuscript, but also potentially by further experimentation. The only mention of sex I could find is that the authors reported the general protein SNO status in BAT is increased with age in male C57Bl/6J mice. Is this also true in female mice? For all of the ADH5 knockout mouse data, are these also male mice? Do female ADH5 knockout mice have a consistent phenotype, or are the sex differences?

      (2) It would be helpful to know the extent of ADH5 loss in the adipose tissue of knockout mice, either by mRNA or by immunoblotting for ADH5. It could also be helpful to know if ADH5 is deleted from the inguinal adipose tissue of these mice, especially since they seem to accumulate fat mass as they age (Figure 2B).

      (3) For Figure 4D, the ChiP, it would be better to show the IgG control pulldowns. Also, there's an unexpected thing where all the values for the Adh5 flox mice are exactly the same - how is this possible? Finally, it's not clear how these BAT samples were treated with HSF1A - was this done in vivo or ex vivo?

      (4) I didn't understand what was on the y-axis in Figure 5A, nor how it was measured. I assume it's HSF1A, and maybe it's the part in the methods with the Metabolomic Analysis, but this wasn't clear. It would also help if release from the NC-Vehicle formulation could be included as a negative control.

      (5) What happens to BAT protein S-nitrosylation in HSF1A-treated mice?

      (6) Figure 1B: What is the age of the positive (ADH5BKO) and negative (Adh5 fl) mice?

      (7) Figure 1F: Can you clarify what I'm looking at in the P16ink4a panels? The red staining? Is the blue staining DAPI? This is also a problem in Figures 3C, 3D and 5G, and 5I. Figure 4B looks great - maybe this could be used as an example?

      (8) Figure 3B looks a bit odd since 7 of the 12 total mice seem to have an IL-beat level of exactly 5. I was a bit unclear about why arbitrary units were used for IL-1β levels since it says an ELISA was used to quantify IL-1β; however, in the methods the authors describe a Bio-Rad Laboratories Bio-plex Pro Mouse Cytokine 23-Plex approach, which I don't think is an ELISA. Can the approach to measuring IL-1β be clarified, and could the authors explain why they can't show units of mass for IL-1β levels?

      (9) Figure 2C and 2D: I don't really understand why the Heat or VO2 need to be expressed as fold changes. Can't these just be expressed with absolute units? It's also confusing why the heat fold change is 1.0 in the light and the dark for the floxed animal. I bet this is because the knockout is normalized to the floxed animal for light and then normalized again for the dark period, but since both are on the same graph, readers could be confused into thinking there is no difference in the heat production or VO2 between light and dark, which would be surprising. This could all just be solved if absolute units were used.

    2. Author response:

      Reviewer #1 (Public review):

      The topic is appealing given the rise in the aging population and the unclear role of BAT function in this process. Overall, the study uses several techniques, is easy to follow, and addresses several physiological and molecular manifestations of aging.  However, the study lacks an appropriate statistical analysis, which severely affects the conclusions of the work. Therefore, interpretation of the findings is limited and must be done with caution. 

      We greatly appreciate the reviewer’s encouragement. Our team is fully committed to maintaining clarity and rigor in the design, execution, and reporting of this study. We are grateful to the reviewers for bringing these issues to our attention. We also acknowledge and are working on that several statistical analyses could be reperformed to better emphasize our focus on the genetic effect of ADH5 deletion in mice of the same age.

      Reviewer #2 (Public review):

      Strengths: 

      This research provides insight into the interplay between redox biology, proteostasis, and metabolic decline in aging. By identifying a specific enzyme that controls SNO status in BAT and further developing a therapy to target ADH5 in BAT to prevent age-related decline, the authors have identified a putative mechanism to combat age-related decline in BAT function. 

      We greatly appreciate the reviewer’s encouragement. 

      Weaknesses: 

      (1) Sex needs to be considered as a biological variable, at a minimum in the reporting of the phenotypes observed in this manuscript, but also potentially by further experimentation. 

      We thank the reviewer for the insightful remark, and we agree with the reviewer that sex needs to be considered as a biological variable. We will assess ADH5 expression in aged female mice.

      (2)  It would be helpful to know the extent of ADH5 loss in the adipose tissue of knockout mice, either by mRNA or by immunoblotting for ADH5. It could also be helpful to know if ADH5 is deleted from the inguinal adipose tissue of these mice, especially since they seem to accumulate fat mass as they age (Figure 2B). 

      We thank the reviewer for the comment/suggestion. Indeed, we have measured the ADH5 expression in both brown adipose tissue (BAT) and inguinal adipose tissue (iWAT). We regret that we did not include our results in the first submission and will provide these results in the revised manuscript.

      (3)  For Figure 4D, the ChiP, it would be better to show the IgG control pulldowns. Finally, it's not clear how these BAT samples were treated with HSF1A - was this done in vivo or ex vivo? 

      We thank the reviewer for their thoughtful comment and will provide detailed information in the revised manuscript.

      (4) I didn't understand what was on the y-axis in Figure 5A, nor how it was measured.

      We apologize for not making these critical points clearer in the first submission. In the revised manuscript we will include, in detail, the logistics of the experiments in the materials and methods section, figure annotation and figure legends.  

      (5) What happens to BAT protein S-nitrosylation in HSF1A-treated mice? 

      We thank the reviewer for the insightful remark, and we will measure general protein Snitrosylation status in the BAT of HSF1A-treated mice. 

      (6) Figure 1B: What is the age of the positive (ADH5BKO) and negative (Adh5 fl) mice? 

      We regret that we did not describe our results clearly in the first submission and will provide detailed information in the revised manuscript.

      (7) Figure 1F: Can you clarify what I'm looking at in the P16ink4a panels? The red staining? Is the blue staining DAPI? This is also a problem in Figures 3C, 3D and 5G, and 5I. Figure 4B looks great - maybe this could be used as an example?  

      We regret that we did not present results clearly in the first submission and will provide detailed information in the revised manuscript.

      (8) Figure 3B looks a bit odd. Can the approach to measuring IL-1β be clarified, and could the authors explain why they can't show units of mass for IL-1β levels? 

      We will provide detailed information in the revised manuscript.

      (9) Figure 2C and 2D: I don't really understand why the Heat or VO2 need to be expressed as fold changes. Can't these just be expressed with absolute units? 

      We thank the reviewer for the insightful comment. We will present these results as suggested in the revised manuscript.

    1. Author response:

      (1) Stable annual dynamics vs. episodic outbreaks

      We agree that RVF is classically described as producing periodic epidemics interspersed with long inter-epidemic periods, often linked to extreme rainfall events. Our model predicts more regular seasonal dynamics, which reflects the endemic transmission patterns we have observed in The Gambia through serological surveys. In the revision, we will:

      Clarify that while epidemics occur in other parts of sub-Saharan Africa, our results may indicate a different epidemiological narrative in The Gambia, with sustained but low-level circulation (hyperendemicity).

      Discuss how model assumptions (e.g. seasonality, homogenous mixing) may bias results toward stable dynamics.

      Highlight the implications of this for interpretation and for public health decision-making.

      (2) Use of network analysis

      We acknowledge the reviewer’s concern. The network analysis was conducted descriptively to characterize cattle movement patterns and the structure of herd connections, but it was not formally incorporated into the model. In revisions we will:

      Clarify this distinction in the manuscript to avoid overinterpretation.

      Emphasize the need for future modelling work using finer-scale movement data, which could support more realistic herd metapopulation dynamics and better capture heterogeneity in transmission.

      (3) RVFV reproductive impacts

      While RVF outbreaks are known to cause abortions and neonatal deaths, these occur during relatively rare epidemics. In the Gambian context, where we’re not observing such large episodic outbreaks but rather low-level circulation, the annual impact of RVF infection on births is likely modest compared to baseline herd turnover. Moreover, cattle demography is partly managed, with replacement and movement buffering birth rates against short-term losses.

      Our model includes birth as a constant demographic process, it’s reasonable to assume stable population since we are not explicitly modelling outbreak-scale reproductive losses. This is consistent with other RVF transmission models that adopt a similar simplifying assumption. However, we will acknowledge this simplification as a limitation in the revised manuscript.

      (4) Missing ODEs for M herds in the dry season

      We thank the reviewer for identifying this omission. The ODEs for M herds in the dry season were not included in the appendix due to an oversight, though demographic turnover was incorporated in the model code. We will add the missing equations to the appendix.

      (5) Role of immunity loss and model structure (SIR vs. SIRS)

      We acknowledge that the decline of detectable antibodies over time (seropositivity decay/seroreversion) is an important consideration in RVFV serology, but whether this reflects true loss of protective immunity after natural infection remains unknown. Biologically, it is plausible that infected cattle develop long-lasting protection, as suggested by studies in humans, but there is an absence of longitudinal field data. From a modelling perspective, our aim was to predict age-seroprevalence curve dependent on FOI estimates and assess its ability to reproduce observed cross-sectional seroprevalence patterns. We therefore adopted a parsimonious SIR framework, treating loss of seropositivity as a potential explanation for the observed age disparity rather than modelling it as loss of immunity. In revisions we will:

      Clarify this rationale, emphasising that there is no direct evidence for waning immunity following natural RVFV infection in cattle, although evidence of seropositivity decay has been suggested in human.

      Further discuss the seropositivity decay rates predicted in our survey and their possible relation to test sensitivity.

      Highlight that while a SIRS structure could generate different long-term dynamics, evaluating this requires stronger evidence for true immunity loss; we consider this an important future modelling direction.

      (6) RVFV induced mortality in serocatalytic model

      We thank the reviewer for this comment. Disease-induced mortality was included in the serocatalytic model through the mortality parameter (γ), but we recognise that this might not have been sufficiently clear in the text. In revisions we will clarify in the Methods and Appendix.

      (7) Clarifying previous vs. current study components

      We will revise the Methods and Appendix to make clearer distinctions between our previous work (e.g. household survey data collection, seroprevalence estimates) and the analyses undertaken for this manuscript (e.g. model development and fitting).

      (8) Limitations paragraph

      We will expand the limitations section to specifically identify the assumptions contributing most to uncertainty. We will then outline how these may bias transmission dynamics and intervention estimates.

      (9) Movement ban simulations & suitability of model for vaccination interventions

      We appreciate the reviewer’s concerns regarding the movement ban simulation. On reassessment, we agree that our model structure might not be ideally suited to exploring them. In the revised manuscript, we will remove this analysis and emphasize how our modelling framework is more suited to exploring cattle vaccination scenarios, including targeting of specific herd types (e.g. T vs. M vs. L). We note that we are currently developing separate work focused on vaccination strategies in cattle, where this model structure might be more directly applicable, and will reserve a deeper investigation of vaccination interventions for that forthcoming publication.

    1. Reviewer #2 (Public review):

      Summary:

      The intracellular pathogen Toxoplasma gondii scavenges metal ions such as iron and zinc to support its replication; however, mechanistic studies of iron and zinc uptake are limited. This study investigates the function of a putative iron and zinc transporter, ZFT. In this paper, the authors provide evidence that ZFT mediates iron and zinc uptake by examining the regulation of ZFT expression by iron and zinc levels, the impact of altered ZFT expression on iron sensitivity, and the effects of ZFT depletion on intracellular iron and zinc levels in the parasite. The effects of ZFT depletion on parasite growth are also investigated, showing the importance of ZFT function for the parasite.

      Strengths:

      A key strength of the study is the use of multiple complementary approaches to demonstrate that ZFT is involved in iron and zinc uptake. Additionally, the authors build on their finding that loss of ZFT impairs parasite growth by showing that ZFT depletion induces stage conversion and leads to defects in both the apicoplast and mitochondrion.

      Weaknesses:

      (1) Excess zinc was shown not to alter ZFT expression, but a cation chelator (TPEN) did lead to decreased expression. While TPEN is often used to reduce zinc levels, does it have any effect on iron levels? Could the reduction in ZFT after TPEN treatment be due to a reduction in the level of iron or another cation?

      (2) ZFT expression was found to be dynamic depending on the size of the vacuole, based on mean fluorescence intensity measurements. Looking at protein levels by Western blot at different times during infection would strengthen this finding.

      (3) ZFT localization remained at the parasite periphery under low iron conditions. However, in the images shown in Figure S1c, larger vacuoles (containing 4-8 parasites) are shown for the untreated conditions, and single parasite-containing vacuoles are shown for the low iron condition. As ZFT localization is predominantly at the basal end of the parasite in larger PV and at the parasite periphery for smaller vacuoles, it would be better to compare vacuoles of similar size between the untreated and low-iron conditions.

    1. To get list of combos: ``` letters = ['a', 'b', 'c', 'd', 'e', 'f', 'g', 'h', 'i', 'j', 'k', 'l', 'm', 'n', 'o', 'p', 'q', 'r', 's', 't', 'u', 'v', 'w', 'x', 'y', 'z'] numbers = [1, 2, 3, 4, 5,6 ,7, 8, 9, 0] combinations_letters = itertools.combinations(letters, 3) combinations_numbers = itertools.combinations(numbers, 3) combined_results = itertools.product(combinations_letters, combinations_numbers) # 6 character license

      combinations_numbers_2 = itertools.combinations(numbers, 4) combined_results_2 = itertools.product(combinations_letters, combinations_numbers_2) # 7 character license for combo in combined_results: print(combo)

      for combo in combined_results_2: print(combo) ``` The number of estimated license plates with the format LLLNNNN is ((26)^3)((10)^4)), which is 175,760,000. For LLLNNN, it is ((26)^3)((10)^4)), which 17,576,000. The total number of combos is 193,336,000.

    2. not complete: letters = ['A', 'B', 'C', 'D', 'E', 'F', 'G', 'H', 'I', 'J', 'K', 'L', 'M', 'N', 'O', 'P', 'Q', 'R', 'S', 'T', 'U', 'V', 'W', 'X', 'Y', 'Z'] number = ['0', '1', '2', '3', '4', '5', '6', '7', '8', '9']

      final = []

      for word in map("".join, product("K", "8", letters,

    1. When lovely woman stoops to folly and Paces about her room again, alone,

      This line is extracted from Goldsmith's "The Vicar of Wakefield", referring to a young woman, Olivia, who was seduced by the wicked Squire Thornhill. Tricked into a fraudulent marriage by the notorious womanizer, she is left disgraced in the eyes of society, and the stigma of her seduction taints her family's reputation by extension. Olivia thus sings a ballad of her own lament: "When lovely woman stoops to folly, / And finds, too late, that men betray, / ... / The only art her guilt to cover, / To hide her shame from ev'ry eye / To give repentance to her lover, / And wring is bosom, is-- to die" (133-4). This line encapsulates the rigid moral standards to which women are held in society: purity and virtue are held above all, thus determining their absolute value as a person. However, although Olivia is framed as a victim of male seduction, she is simultaneously blamed for her own disgrace by "stooping to folly", with the entirety of punishment ultimately falling upon the woman. Furthermore, another important message is emphasized by Olivia: "the only art" left to woman who have lost their chastity is death. Dying emerges as the only feasible resolution to the issue; it acts not only as an expression of despair, but as a socially endorsed means of restoring dignity, banishing guilt, and even receiving remorse from others.

      On the other hand, Eliot echoes a similar notion of female disgrace "The Fire Sermon". The typist, passive and disengaged during her forced sexual encounter with the clerk, offers no resistance but equally no desire to his intimate advancements. As a result, her moral reputability is irrevocably tarnished; while she was once "lovely", she is now reduced to a hollow emblem of sexual exploitation and impurity. Additionally, she condemns herself to emotional isolation, "pac[ing] about her room again, alone" (line 253). In this moment, Eliot not only depicts the moral disrepute of this singular woman in the modern wasteland, but gestures towards the broader commentary on the tainted condition of womanhood, stripped of its agency and dignity.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review):

      Shigella flexneri is a bacterial pathogen that is an important globally significant cause of diarrhea. Shigella pathogenesis remains poorly understood. In their manuscript, Saavedra-Sanchez et al report their discovery that a secreted E3 ligase effector of Shigella, called IpaH1.4, mediates the degradation of a host E3 ligase called RNF213. RNF213 was previously described to mediate ubiquitylation of intracellular bacteria, an initial step in their targeting of xenophagosomes. Thus, Shigella IpaH1.4 appears to be an important factor in permitting evasion of RNF213-mediated host defense.

      Strengths:

      The work is focused, convincing, well-performed, and important. The manuscript is well-written.

      We would like to thank the reviewer for their time evaluating our manuscript and the positive assessment of the novelty and importance of our study. We provide a comprehensive response to each of the reviewer’s specific recommendations below and highlight any changes made to the manuscript in response to those recommendations.

      Reviewer #1 (Recommendations for the authors):

      (1) In the abstract (and similarly on p.10), the authors claim to have shown "IpaH1.4 protein as a direct inhibitor of mammalian RNF213". However, they do not show the interaction is direct. This, in my opinion, would require demonstrating an interaction between purified recombinant proteins. I presume that the authors are relying on their UBAIT data to support the direct interaction, but this is a fairly artificial scenario that might be prone to indirect substrates. I would therefore prefer that the 'direct' statement be modified (or better supported with additional data). Similarly, on p.7, the section heading states "S. flexneri virulence factors IpaH1.4 and IpaH2.5 are sufficient to induce RNF213 degradation". The corresponding experiment is to show sufficiency in a 293T cell, but this leaves open the participation of additional 293T-expressed factors. So I would remove "are sufficient to", or alternatively add "...in 293T cells".

      We agree with the reviewer and made the recommended changes to the text in the abstract, in the results section on page 7, and in the Discussion on page 11. During the revision of our manuscript two additional studies were published that provide convincing biochemical evidence for the direct interaction between IpaH1.4 and RNF213 (PMID: 40205224; PMID: 40164614). These studies address the reviewer’s concern extensively and are now briefly discussed and cited in our revised MS.

      (2) In the abstract the authors state "Linear (M1-) and lysine-linked ubiquitin is conjugated to bacteria by RNF213 independent of the linear ubiquitin chain assembly complex (LUBAC)." However, it is not shown that RNF213 is able to directly perform M1-ubiquitylation. It is shown that RNF213 is required for M1-linked ubiquitylation in IpaH1.4 or MxiE mutants, this is different than showing conjugation is done by RNF213 itself. This should be reworded.

      We agree and edited the text accordingly

      (3) Introduction: one of the main points of the paper is that RNF213 conjugates linear ubiquitin to the surface of bacteria in a manner independent of the previously characterized linear ubiquitin conjugation (LUBAC) complex. This is indeed an interesting result, but the introduction does not put this discovery in much context. I would suggest adding some discussion of what was known, if anything, about the type of Ub chain formed by RNF213, and specifically whether linear Ub had previously been observed or not.

      We now provide context in the Introduction on page 3 and briefly discuss previous work that had implicated LUBAC in the ubiquitylation of cytosolic bacteria. We emphasize that LUBAC specifically generates linear (M1-linked) ubiquitin chains, while the types of ubiquitin linkages deposited on bacteria through RNF213-dependent pathways had remained unidentified.

      (4) Figure 3C: is the difference in 7KR-Ub between WT and HOIP KO cells significant? If so, the authors may wish to acknowledge the possibility that HOIP partially contributes to M1-Ub of MxiE mutant Shigella

      The frequencies at which bacteria are decorated with 7KR-Ub is not statistically different between WT and HOIP KO cells. We have included this information in the panel description of Figure 3.

      (5) On page 11, the authors state that "...we observed that LUBAC is dispensable for M1-linked ubiquitylation of cytosolic S. flexneri ∆ipaH1.4. We found that lysine-less internally tagged ubiquitin or an M1-specific antibody bound to S. flexneri ∆ipaH1.4 in cells lacking LUBAC (HOIL-1KO or HOIPKO) but failed to bind bacteria in RNF213-deficient cells". In fact, what is shown is that M1-ubiquitylation in ∆ipaH1.4 infection is RNF213-dependent (5E), but the work with lysine mutants, HOIP or HOIL-1 KOs are all with ∆mxiE, not ∆ipaH1.4 (3B) in this version of the manuscript. Ideally, the data with ∆ipaH1.4 could be added, but alternatively, the conclusion could be re-worded.

      We now include the data demonstrating that staining of ∆ipaH1.4 with an M1-specific antibody is unchanged from WT cells in HOIL-1 KO and HOIP KO cells. These data are shown in supplementary data (Fig. S3E) and referred to on page 9 of the revised manuscript.

      (6) The UBAIT experiment should be explained in a bit more detail in the text. The approach is not necessarily familiar to all readers, and the rationale for using Salmonella-infected ceca/colons is not well explained (and seems odd). Some appropriate caution about interpreting these data might also be welcome. Did HOIP or HOIL show up in the UBAIT? This perhaps also deserves some discussion.

      As expected, HOIP (listed under its official gene name Rnf31 in the table of Fig.S2B) was identified as a candidate IpaH1.4 interaction partner as the third most abundant hit from the UBAIT screen. Remarkably, Rnf213 was the hit with the highest abundance in the IpaH1.4 UBAIT screen. To address the reviewer’s comments, we now explain the UBAIT approach in more detail and provide the rational for using intestinal protein lysates from Salmonella infected mice. The text on page 8 reads as follows: “To investigate potential physical interactions between IpaH1.4 and IpaH2.5, we reanalyzed a previously generated dataset that employed a method known as ubiquitin-activated interaction traps (UBAITs) (32). As shown in Fig. S2A, the human ubiquitin gene was fused to the 3′ end of IpaH2.5, producing a C-terminal IpaH2.5-ubiquitin fusion protein. When incubated with ATP, ubiquitin-activating enzyme E1, and ubiquitin-conjugating enzyme E2, the IpaH2.5-ubiquitin "bait" protein is capable of binding to and ubiquitylating target substrates. This ubiquitylation creates an iso-peptide bond between the IpaH2.5 bait and its substrate, thereby enabling purification via a Strep affinity tag incorporated into the fusion construct (32). IpaH2.5-ubiquitin bait and IpaH3-ubiquitin control proteins were incubated with lysates from murine intestinal tissue. To detect interaction partners in a physiologically relevant setting, we used intestinal lysates derived from mice infected with Salmonella, which in contrast to Shigella causes pronounced inflammation in WT mice and therefore better simulates human Shigellosis in an animal model. Using UBAIT we identified HOIP (Rnf31) as a likely IpaH2.5 binding partner (Fig. S2B), thus confirming previous observations (28) and validating the effectiveness our approach. Strikingly, we identified mouse Rnf213 as the most abundant interaction partner of the IpaH2.5-ubiquitin bait protein (Fig. S2B). Collectively, our data and concurrent reports showing direct interactions between IpaH1.4 and human RNF213 (36, 37) indicate that the virulence factors IpaH1.4 and IpaH2.5 directly bind and degrade mouse as well as human RNF213.”

      (7) It would be helpful if the authors discussed their results in the context of the prior work showing IpaH1.4/2.5 mediate the degradation of HOIP. Do the authors see HOIP degradation? If indeed HOIP and RNF213 are both degraded by IpaH1.4 and IpaH2.5, are there conserved domains between RNF213 and HOIP being targeted? Or is only one the direct target? A HOIP-RNF213 interaction has previously been shown (https://doi.org/10.1038/s41467-024-47289-2). Since they interact, is it possible one is degraded indirectly? To help clarify this, a simple experiment would be to test if RNF213 degraded in HOIP KO cells (or vice-versa)?

      We appreciate the reviewer’s suggestions. We conducted the proposed experiments and found that WT S. flexneri infections result in RNF213 degradation in both WT and HOIP KO cells. Similarly, we found that HOIP degradation was independent of RNF213. We have included these data in Figs. 5A and S3B of our revised submission. A study published during revisions of our paper demonstrates that the LRR of IpaH1.4 binds to the RING domains of both RNF213 and LUBAC (PMID: 40205224). We refer to this work in our revised manuscript.

      Reviewer #2 (Public review):

      Summary:

      The authors find that the bacterial pathogen Shigella flexneri uses the T3SS effector IpaH1.4 to induce degradation of the IFNg-induced protein RNF213. They show that in the absence of IpaH1.4, cytosolic Shigella is bound by RNF213. Furthermore, RNF213 conjugates linear and lysine-linked ubiquitin to Shigella independently of LUBAC. Intriguingly, they find that Shigella lacking ipaH1.4 or mxiE, which regulates the expression of some T3SS effectors, are not killed even when ubiquitylated by RNF213 and that these mutants are still able to replicate within the cytosol, suggesting that Shigella encodes additional effectors to escape from host defenses mediated by RNF213-driven ubiquitylation.

      Strengths:

      The authors take a variety of approaches, including host and bacterial genetics, gain-of-function and loss-of-function assays, cell biology, and biochemistry. Overall, the experiments are elegantly designed, rigorous, and convincing.

      Weaknesses:

      The authors find that ipaH1.4 mutant S. flexneri no longer degrades RNF213 and recruits RNF213 to the bacterial surface. The authors should perform genetic complementation of this mutant with WT ipaH1.4 and the catalytically inactive ipaH1.4 to confirm that ipaH1.4 catalytic activity is indeed responsible for the observed phenotype.

      We would like to thank the reviewer for their time evaluating our manuscript and the positive assessment of our work, especially its scientific rigor. We conducted the experiment suggested by the reviewer and included the new data in the revised manuscript. As expected, complementation of the ∆ipaH1.4 with WT IpaH1.4 but not with the catalytically dead C338S mutant restored the ability of Shigella to efficiently escape from recognition by RNF213 (Figs. 5C-D).

      Reviewer #2 (Recommendations for the authors):

      The authors should perform genetic complementation of the ipaH1.4 mutant with WT ipaH1.4 and the catalytically inactive ipaH1.4 to confirm that ipaH1.4 catalytic activity is indeed responsible for the observed phenotype.

      We performed the suggested experiment and show in Figs. 5C-D that complementation of the ∆ipaH1.4 mutant with WT IpaH1.4 but not with the catalytically dead C338S mutant restored the ability of Shigella to efficiently escape from recognition by RNF213. These data demonstrate that the catalytic activity of IpaH1.4 is required for evasion of RNF213 binding to the bacteria.

      Reviewer #3 (Public review):

      Summary:

      In this study, the authors set out to investigate whether and how Shigella avoids cell-autonomous immunity initiated through M1-linked ubiquitin and the immune sensor and E3 ligase RNF213. The key findings are that the Shigella flexneri T3SS effector, IpaH1.4 induces degradation of RNF213. Without IpaH1.4, the bacteria are marked with RNF213 and ubiquitin following stimulation with IFNg. Interestingly, this is not sufficient to initiate the destruction of the bacteria, leading the authors to conclude that Shigella deploys additional virulence factors to avoid this host immune response. The second key finding of this paper is the suggestion that M1 chains decorate the mxiE/ipaH Shigella mutant independent of LUBAC, which is, by and large, considered the only enzyme capable of generating M1-linked ubiquitin chains.

      Strengths:

      The data is for the most part well controlled and clearly presented with appropriate methodology. The authors convincingly demonstrate that IpaH1.4 is the effector responsible for the degradation of RNF213 via the proteasome, although the site of modification is not identified.

      Weaknesses:

      (1)The work builds on prior work from the same laboratory that suggests that M1 ubiquitin chains can be formed independently of LUBAC (in the prior publication this related to Chlamydia inclusions). In this study, two pieces of evidence support this statement -fluorescence microscopy-based images and accompanying quantification in Hoip and Hoil knockout cells for association of M1-ub, using an antibody, to Shigella mutants and the use of an internally tagged Ub-K7R mutant, which is unable to be incorporated into ubiquitin chains via its lysine residues. Given that clones of the M1-specific antibody are not always specific for M1 chains, and because it remains formally possible that the Int-K7R Ub can be added to the end of the chain as a chain terminator or as mono-ub, the authors should strengthen these findings relating to the claim that another E3 ligase can generate M1 chains de novo.

      (2) The main weakness relating to the infection work is that no bacterial protein loading control is assayed in the western blots of infected cells, leaving the reader unable to determine if changes in RNF213 protein levels are the result of the absent bacterial protein (e.g. IpaH1.4) or altered infection levels.

      (3)The importance of IFNgamma priming for RNF213 association to the mxiE or ipaH1.4 strain could have been investigated further as it is unclear if RNF213 coating is enhanced due to increased protein expression of RNF213 or another factor. This is of interest as IFNgamma priming does not seem to be needed for RNF213 to detect and coat cytosolic Salmonella.<br /> Overall, the findings are important for the host-pathogen field, cell-autonomous/innate immune signaling fields, and microbial pathogenesis fields. If further evidence for LUBAC independent M1 ubiquitylation is achieved this would represent a significant finding.

      We would like to thank the reviewer for their time evaluating our manuscript and the positive assessment of our work and its significance. We provide a comprehensive response to the main three critiques listed under ‘weaknesses’ and also have responded to each of the reviewer’s specific recommendations below. We highlight any changes made to the manuscript in response to those recommendations.

      (1) As the reviewer correctly pointed out, 7KR ubiquitin cannot only be used for linear ubiquitylation but can also function as a donor ubiquitin and can be attached as mono-ubiquitin to a substrate or to an existing ubiquitin chain as a chain terminator. To distinguish between 7KR INT-Ub signals originating from linear versus mono-ubiquitylation, we followed the reviewer’s advice and generated a N-terminally tagged 7KR INT-Ub variant. The N-terminal tag prevents linear ubiquitylation but still allows 7KR INT-Ub to be attached as a mono-ubiquitin. We found that the addition of this N-terminal tag significantly reduced but not completely abolished the number of Δ_mxiE_ bacteria decorated with 7KR INT-Ub. These data are shown in a new Fig. S1 and indicate that 7KR lacking the N-terminal tag is attached to bacteria both in the form of linear (M1-linked) ubiquitin and as donor ubiquitin, possibly as a chain terminator. While we cannot rule out that the anti-M1 antibodies used here cross-react with other ubiquitin linkages, we reason that the 7KR data strongly argues that linear ubiquitin is part of the ubiquitin coat encasing IpaH1.4-deficient cytosolic Shigella. Collectively, our data show that both linear and lysine-linked (especially K27 and K63) ubiquitin chains are part of the RNF213-dependent ubiquitin coat on the surface of IpaH1.4 mutants. And furthermore, our data strongly indicate that this ubiquitylation of IpaH1.4 mutants is independent of LUBAC.

      (2) We used GFP-expressing strains of S. flexneri for our infection studies and were therefore able to use GFP expression as a loading control. We have incorporated these data into our revised figures. These new data (Figs. 4A, 5A, and S3B) show that bacterial infection levels were comparable between WT and mutant infections and that therefore the degradation of RNF213 (or HOIP – see new data in Fig. S3B) is not due to differences in infection efficiency.

      (3) We agree with the reviewer that the mechanism by which RNF213 binds to bacteria is an important unanswered question. Similarly, whether other ISGs have auxiliary functions in this process or whether binding efficiencies vary between different bacterial species are important questions in the field. However, these questions go far beyond the scope of this study and were therefore not addressed in our revisions.

      Reviewer #3 (Recommendations for the authors):

      (1) An N-terminally tagged K7R-ub should be used as a control to test whether the signal found around the mutant shigella is being added via the N terminal Met into chains. As it is known that certain batches of the M1-specific antibodies are in fact not specific and able to detect other chain types, the authors should test the specificity of the antibody used in this study (eg against different di-Ub linkage types) and include this data in the manuscript.

      We agree with the reviewer in principle. The anti-linear ubiquitin (anti-M1) monoclonal antibody, clone 1E3, prominently used in this study was tested by the manufacturer (Sigma) by Western blotting analysis and according to the manufacturer “this antibody detected ubiquitin in linear Ub, but not Ub K11, Ub K48, Ub K63.” However, this analysis did not include all possible Ub linkage types and thus the reviewer is correct that the anti-M1 antibody could theoretically also detect some other linkage types. To address this concern, we added new data during revisions demonstrating that 7KR INT-Ub targeting to S. flexneri is largely dependent on the N-terminus (M1) of ubiquitin. Our combined observations therefore overwhelmingly support the conclusion that linear (M1-linked) as well as K-linked ubiquitin is being attached to the surface of IpH1.4 S. flexneri bacteria in an RNF213-dependent and LUBAC-independent manner.

      (2) The M1 signal detected on bacteria with the antibody is still present in either Hoip or Hoil KO’s but due to the potential non-specificity of the antibody, the authors should test whether K7R ub is detected on bacteria in the Hoil ko (in addition to Hoip KO). This would strengthen the authors’ data on LUBAC-independent M1 and is important because Hoil can catalyse non-canonical ubiquitylation.

      The specific linear ubiquitin-ligating activity of LUBAC is enacted by HOIP. We show that linear ubiquitylation of susceptible S. flexneri mutants as assessed by anti-M1 ubiquitin staining or 7KR INT-Ub recruitment occurs in HOIPKO cells at WT levels (Figs. 3B, 3C, S3E [new data]). In our view , these data unequivocally show that the observed linear ubiquitylation of cytosolic S. flexneri ipaH1.4 and mxiE mutants is independent of LUBAC.

      (3) For Figure 4A, do mxiE bacteria show similar invasion - authors should include a bacterial protein control to show levels of bacteria in WT and mxiE infected conditions. A similar control should be included in Figure 5A.

      We used GFP-expressing strains of S. flexneri for our infection studies and were therefore able to use GFP expression as a loading control. We have incorporated these data into our revised figures. These new data (Figs. 4A, 5A, and S3B) show that bacterial infection levels were comparable between WT and mutant infections and that therefore the degradation of RNF213 (or HOIP – see new data in Fig. S3B) is not due to differences in infection efficiency.

      (4) Can the authors speculate why IFNg priming is needed for the coating of Shigella mxiE mutant but not in the case of Salmonella or Burkholderia? Is this just amounts of RNF213 or something else?

      In our studies we did not directly compare ubiquitylation rates of cytosolic Shigella, Burkholderia, and Salmonella bacteria with each other under the same experimental conditions. However, such a direct comparison is needed to determine whether IFNgamma priming is required for RNF213-dependent bacterial ubiquitylation of some but not other pathogens. Two papers published during the revisions of our manuscript (PMID: 40164614, PMID: 40205224) reports robust RNF213 targeting to IpaH1.4 Shigella mutants in unprimed cells HeLa cells (whereas we used A549 and HT29 cells). Therefore, differences in reagents, cell lines, and/or other experimental conditions may determine whether IFNgamma priming is necessary to observe substantial RNF213 translocation to cytosolic bacteria.

      (5) Typos - there are several, but this is hard to annotate with line numbers so the authors should proofread again carefully.

      We proofread the manuscript and corrected the small number of typos we identified

    1. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      Wang and Colleagues present a study aimed at demonstrating the feasibility of repeated ultrasound localization microscopy (ULM) recording sessions on mice chronically implanted with a cranial window transparent to US. They provided quantitative information on their protocol, such as the required number of Contrast enhancing microbubbles (MBs) to get a clear image of the vasculature of a brain coronal section. Also, they quantified the co-registration quality over time-distant sessions and the vasodilator effect of isoflurane.

      Strengths:

      The study showed a remarkable performance in recording precisely the same brain coronal section over repeated imaging sessions. In addition, it sheds light on the vasodilator effect of isoflurane (an anesthetic whose effects are not fully understood) on the different brain vasculature compartments, although, as the Authors stated, some insights in this aspect have already been published with other imaging techniques. The experimental setting and protocol are very well described.

      Wang and co-authors submitted a revised version of their study, which shows improvements in the clarity of the data description.

      However, the flaws and limitations of this study are substantially unchanged.

      The main issues are:

      Statistics are still inadequate. The TOST test proposed in this revised version is not equivalent to an ANOVA. Indeed, multivariate analyses should be the most appropriate, given that some quantifications were probably made on multiple vessels from different mice. The 3 reviewers mentioned the flaws in statistics as the primary concern.

      Response 01: We thank the reviewer for raising this important point. We fully acknowledge the limitations of our current statistical analysis. We would like to clarify that the TOST procedure was applied exclusively to the measurements taken from the same vessel segment in the same animal across different time points, with the purpose of evaluating the consistency of vessel diameter measurements. We recognize that the statistical analysis in this study remains limited, which we have acknowledged as a key limitation in the manuscript. This constraint arises primarily from the limited number of animals, and our analysis should be interpreted as a representative case study rather than a generalized statistical conclusion. We have revised the manuscript to clarify these points and to more explicitly acknowledge the statistical limitations.

      (Line 329) “Our current study primarily focused on demonstrating the feasibility of longitudinal ULM imaging in awake animals, instead of conducting a systematic investigation of how isoflurane anesthesia alters cerebral blood flow. Due to the limited number of animals used, the analyses presented in this work should be interpreted as example case studies. While the trends observed across animals were consistent, the small sample size restricts the scope of statistical inference. For future work, it would be valuable to design more rigorous control experiments with larger sample sizes to systematically compare the effects of isoflurane anesthesia, awake states, and other anesthetics that do not induce vasodilation on cerebral blood flow.”

      No new data has been added, such as testing other anesthetics.

      Response 02: We acknowledge that the current study does not include data involving other anesthetics, and we have also discussed this point in our initial response. In fact, we did attempt to use other anesthetics such as ketamine. However, we found it difficult to draw reliable conclusions due to experimental limitations such as variable anesthesia recovery profiles and injection timing, as elaborated in the following paragraphs. Therefore, we decided not to include these data in the current study to avoid potential misinterpretation.

      One major limitation of our experimental setup is that imaging in the awake state is necessarily conducted after a brief period of isoflurane-anesthesia. This brief anesthesia allows for the intravenous injection of microbubbles via the tail vein. Isoflurane is particularly suited for this purpose due to its rapid onset and offset. Mice can recover quickly once the gas is withdrawn, which enables relatively consistent post-anesthesia imaging in the awake state.

      In contrast, other anesthetic agents present challenges. Their recovery profiles are slower, more variable, and less controllable. Reversal drugs can be administered to awaken the animals, but they add another variability. These may lead to greater fluctuations in cerebral hemodynamics and factors introduce uncertainty in the timing of bolus microbubble injection. As such, our current setup is not ideal for systematically comparing different anesthetics and could yield misleading results.

      A more appropriate strategy for comparing awake ULM imaging with different anesthetics would be performing awake imaging first, followed by imaging under anesthesia. This would ensure that the awake condition is free from residual anesthetic effects. However, this method raises higher requirement in bubble delivery, as no anesthesia can be used for the intravenous injection.

      To address this, we are actively exploring another solution using indwelling jugular vein catheterization. By surgically implanting a catheter into the jugular vein prior to imaging, we can establish a stable and reproducible route for microbubble delivery in fully awake animals without any anesthesia induction. This method has the potential to enable direct and reliable comparisons across different physiological states. However, the implementation of this technique and the associated experimental findings go beyond the scope of the current study and will be presented in a future manuscript.

      In the present work, we have emphasized the methodological limitations of our approach and clarified that our primary goal is to highlight the necessity and feasibility of awake-state ULM imaging. The focus is not to comprehensively characterize the effects of different anesthetic agents on microvascular brain flow. We appreciate your understanding and interest in this important future direction. 

      Based the responses and previous revision, we have further refined the discussion of the relevant limitations:

      (Line 324) “Although isoflurane is widely used in ultrasound imaging because it provides long-lasting and stable anesthetic effects, it is important to note that the vasodilation observed with isoflurane is not representative of all anesthetics. Some anesthesia protocols, such as ketamine combined with medetomidine, do not produce significant vasodilation and are therefore preferred in experiments where vascular stability is essential, such as functional ultrasound imaging. Our current study primarily focused on demonstrating the feasibility of longitudinal ULM imaging in awake animals, instead of conducting a systematic investigation of how isoflurane anesthesia alters cerebral blood flow. Due to the limited number of animals used, the analyses presented in this work should be interpreted as example case studies. While the trends observed across animals were consistent, the small sample size restricts the scope of statistical inference. For future work, it would be valuable to design more rigorous control experiments with larger sample sizes to systematically compare the effects of isoflurane anesthesia, awake states, and other anesthetics that do not induce vasodilation on cerebral blood flow.”

      (Line 347) “Another limitation of this study is the potential residual vasodilatory effect of isoflurane anesthesia on awake imaging sessions and the short imaging window available after bolus injection. The awake imaging sessions were conducted shortly after the mice had emerged from isoflurane anesthesia, required for the MB bolus injections. The lasting vasodilatory effects of isoflurane may have influenced vascular responses, potentially contributing to an underestimation of differences in vascular dynamics between anesthetized and awake state. In addition, since microbubbles are rapidly cleared from circulation, the duration of effective imaging is limited to only a few minutes, which also overlaps with the anesthesia recovery period, constraining the usable awake-state imaging window. Future improvement on microbubble infusion using an indwelling jugular vein catheter presents a promising alternative to address these limitations. This method allows for stable microbubble infusion without the need for anesthesia induction, ensuring that the awake imaging condition is free from residual anesthetic effects. Moreover, it has the potential to extend the duration of imaging sessions, offering a longer and more stable time window for data acquisition. Furthermore, by performing ULM imaging in the awake state first, instead of starting with anesthetized imaging, researchers can achieve a more rigorous comparison of how various anesthetics influence cerebral microvascular dynamics relative to the awake baseline.”

      The Authors still insist on using the term Vascularity which they define as: 'proportion of the pixel count occupied by blood vessels within each ROI, obtained by binarizing the ULM vessel density maps and calculating the percentage of the pixels with MB signal.'. Why not use apparent cerebral blood volume or just CBV? Introducing an unnecessary and redundant term is not scientifically acceptable. In this revised version, vascularity is also used to indicate a higher vascular density (Line 275), which does not make sense: blood vessels do not generate from the isoflurane to the awake condition in a few minutes. Rev2 also raised this point.

      Response 03: Thank you for revisiting this important point. We acknowledge that the term vascularity is difficult to interpret for readers, and we also recognize that we did not sufficiently justify its use in the earlier version.

      Based on your suggestion, we have now replaced all instances of “vascularity” with “fractional vessel area”. While the underlying definition remains the same, fractional vessel area offers a more intuitive description. The term “fractional” denotes that the vessel area is normalized to the total area of the selected ROI. This normalization is essential for fair comparisons across ROIs of different sizes, such as Figures 4i–k to evaluate various brain regions. We would also like to clarify that this was not introduced as an unnecessary or redundant term, but rather as a more suitable metric for longitudinal ULM analysis. We did consider using apparent cerebral blood volume (CBV), estimated from microbubble counts. However, we found that it was less robust and meaningful in the context of longitudinal ULM comparisons. Below we provide further justification for using the vessel area instead:

      (1) Using the vessel area is more robust:

      In longitudinal ULM comparisons, normalization across time points is essential to enable fair and meaningful comparisons. In our study, we normalized the data based on a cumulative 5 million microbubbles (e.g., Fig. 2). Other normalization strategies could also be adopted, as long as the resulting vascular maps reach a sufficiently saturated state. However, even with normalization, it remains important to use a quantitative metric that is minimally biased and invariant to experimental fluctuations across time points. Vessel area, derived from binarized vessel maps, is less sensitive to variations in acquisition time and microbubble concentration. This is because repeated microbubble trajectories through the same location are not counted multiple times. In contrast, apparent CBV, calculated from the microbubble counts, is more susceptible to different concentration conditions. Since repeated detections in the same location accumulate, the metric can be dependent on injection efficiency and imaging duration. While CBV may still be valid under well-controlled, steady-state conditions, we found the vessel area to be a more robust and reliable metric for longitudinal analysis under our current bolus-injection protocol.

      (2) Using the vessel area is more meaningful:

      Compared to CBV, the vessel area provides a more direct representation of structural characteristics such as vessel diameter. Anesthesia-induced vasodilation leads to an increase in vessel diameter. Although local diameter changes can be assessed by manually selecting vessel segments, this approach is labor-intensive and prone to selection bias. To enable a more comprehensive and objective assessment of such morphological changes, fractional vessel area provides a more informative alternative to CBV, as it captures diameter-related variations at a global or regional scale, and avoids potential biases associated with manually selecting specific vessels or regions.

      In response to: vascularity is also used to indicate a higher vascular density (Line 275), which does not make sense: blood vessels do not generate from the isoflurane to the awake condition in a few minutes.

      We agree that blood vessels cannot be generated in a few minutes. Vascularity (now fractional vessel area) should be interpreted as apparent vessel density, which reflects a probabilistic estimate of vessel density based on the detectable microbubble. 

      Both apparent vessel density and apparent CBV are indirect, sampling-based approximations of vascular features, and both are fundamentally limited by microbubble detection sensitivity. Low microbubble concentrations lead to underestimation of both CBV and vessel area. A change from zero to non-zero in these metrics does not imply the physical appearance or disappearance of vessels, but rather reflects a change in the likelihood of detecting flow in each region.

      In summary, while neither fractional vessel area (vascularity in previous versions) nor apparent CBV is a perfect metric due to the inherent limitations of ULM, we believe the vessel area provides a more robust and meaningful parameter for our longitudinal comparisons. We have revised the main text to include this explanation and acknowledge the limitations and interpretation of fractional vessel area more explicitly.

      Revision in Results:

      (Line 181) “To validate the broader applicability of our findings, we conducted ROI-based analyses using fractional vessel area and mean velocity as primary metrics. These metrics extended the analysis of vessel diameter and flow velocity to entire brain regions or selected ROIs, which provides a more objective assessment of cerebral blood flow changes at a global scale and reduces the bias associated with manually selecting vessel segments. For vessel area measurements, the term fractional denotes that the vessel area is normalized to the total area of the selected ROI. This normalization is essential for fair comparisons across ROIs of different sizes.”

      Revision in Methods: definition of vascularity

      (Line 571) “In ROI-based analysis, we focused on two primary parameters: fractional vessel area and mean velocity. Fractional vessel area was defined as the proportion of the pixel count occupied by blood vessels within each ROI, obtained by binarizing the ULM vessel density maps and calculating the percentage of the pixels with MB signal. Mean velocity was calculated by averaging all non-zero pixel of velocity estimates within the ROI. The velocity distribution within each ROI was also visualized using violin plots, as shown in Fig. 2, 4 and 6, to illustrate the range and density of flow velocity estimates across different acquisition. In this study, we focused on these two metrics because they represent the most straightforward extension of single-vessel analysis to brain-wide vascular changes.”

      We put our ROI analysis code on GitHub and added a “Code availability” section. We hope it can serve as a foundation for users to explore different quantitative metrics in their own longitudinal ULM studies. We hope to provide an example to inspire further exploration.

      (Line 578) “Code availability

      To support quantitative longitudinal analysis of ULM data, we developed an open-source MATLAB application (https://github.com/ekerwang/ULMQuantitativeAnalysis). This tool is designed to facilitate ROI-based analysis of ULM images for longitudinal comparisons. It supports multiple quantification metrics, including but not limited to vessel area and mean velocity used in this study. Users can select and adapt different metrics based on their specific applications, as a wide range of ULM-based quantification metrics have been developed for different pathological and pharmacological studies.”

      The long-term recordings mentioned by the Authors refer to the 3-week time frame analyzed in this study. However, within each acquisition, the time available from imaging is only a few minutes (< 10', referring to most of the plots showing time courses) after the animals' arousal from isoflurane and before bubbles disappear. This limitation should be acknowledged.

      Response 04: Thank you for this comment. We agree that the current imaging sessions are constrained by the short time window available after the animal’s arousal from isoflurane and before bubbles disappear. This limitation indeed restricts the duration of usable awake-state imaging in our current bolus injection protocol. As discussed earlier, we are actively exploring the use of a jugular vein catheterization approach to address this limitation. This approach has the potential to extend the imaging session duration and provide a longer, more stable time window. We have now acknowledged this limitation more explicitly in the revised Discussion section.

      (Line 347) “Another limitation of this study is the potential residual vasodilatory effect of isoflurane anesthesia on awake imaging sessions and the short imaging window available after bolus injection. The awake imaging sessions were conducted shortly after the mice had emerged from isoflurane anesthesia, required for the MB bolus injections. The lasting vasodilatory effects of isoflurane may have influenced vascular responses, potentially contributing to an underestimation of differences in vascular dynamics between anesthetized and awake state. In addition, since microbubbles are rapidly cleared from circulation, the duration of effective imaging is limited to only a few minutes, which also overlaps with the anesthesia recovery period, constraining the usable awake-state imaging window. Future improvement on microbubble infusion using an indwelling jugular vein catheter presents a promising alternative to address these limitations. This method allows for stable microbubble infusion without the need for anesthesia induction, ensuring that the awake imaging condition is free from residual anesthetic effects. Moreover, it has the potential to extend the duration of imaging sessions, offering a longer and more stable time window for data acquisition. Furthermore, by performing ULM imaging in the awake state first, instead of starting with anesthetized imaging, researchers can achieve a more rigorous comparison of how various anesthetics influence cerebral microvascular dynamics relative to the awake baseline.”

      The more precise description of the number of mice and blood vessels analyzed in Figure 6 makes it apparent the limited number of independent samples used to support the findings of this work. A limitation that should be acknowledged. The newly provided information added as Supplementary Figure 1 should be moved to the main text, eventually in the figure legends. The limited data in support of the findings was also highlighted by Rev2 and, indirectly, by Rev3.

      Response 05: We acknowledge the limited number of independent samples used in this study. In the revised manuscript, we have explicitly emphasized this limitation in the Discussion section. Specifically, we added the following statement:

      (Line 329) “Our current study primarily focused on demonstrating the feasibility of longitudinal ULM imaging in awake animals, instead of conducting a systematic investigation of how isoflurane anesthesia alters cerebral blood flow. Due to the limited number of animals used, the analyses presented in this work should be interpreted as example case studies. While the trends observed across animals were consistent, the small sample size restricts the scope of statistical inference. For future work, it would be valuable to design more rigorous control experiments with larger sample sizes to systematically compare the effects of isoflurane anesthesia, awake states, and other anesthetics that do not induce vasodilation on cerebral blood flow.”

      Following your suggestion, we have also moved the newly provided information (the table in Supplementary Figure 1) into figure captions. In addition, we have modified in the Methods section to ensure that this information is clear.

      (Line 406) “Eight healthy female C57 mice (8-12 weeks) were used for this study, numbered as Mouse 1 to Mouse 8. Three mice (Mouse 1–3) were used to compare imaging results between awake and anesthetized states (Fig. 3 and 4). Three additional mice (Mouse 4–6) underwent longitudinal imaging over a three-week period (Fig. 5 and 6). Among them, Mouse 4 was also used as an example to demonstrate the overall system schematic and saturation conditions (Fig. 1 and 2). Several mice (Mouse 2, 6, 7, and 8) exhibited suboptimal cranial window quality or image artifacts and were included to illustrate common surgical or imaging issues (Supplementary Fig. 1). The specific usage of each animal is also annotated in the corresponding figure captions.”

      Reviewer #2 (Public Review):

      The authors present a very interesting collection of methods and results using brain ultrasound localization microscopy (ULM) in awake mice. They emphasize the effect of the level of anesthesia on the quantifiable elements assessable with this technique (i.e. vessel diameter, flow speed, in veins and arteries, area perfused, in capillaries) and demonstrate the possibility of achieving longitudinal cerebrovascular assessment in one animal during several weeks with their protocol.

      The authors made a good rewriting of the article based on the reviewers' comments. One of the message of the first version of the manuscript was that variability in measurements (vessel diameter, flow velocity, vascularity) were much more pronounced under changes of anesthesia than when considering longitudinal imaging across several weeks. This message is now not quite mitigated, as longitudinal imaging seems to show a certain variability close to the order of magnitude observed under anesthesia. In that sense, the review process was useful in avoiding hasty conclusion and calls for further caution in ULM awake longitudinal imaging, in particular regarding precision of positioning and cancellation of tissue motion.

      Strengths:

      Even if the methods elements considered separately are not new (brain ULM in rodents, setup for longitudinal awake imaging similar to those used in fUS imaging, quantification of vessel diameters/bubble flow/vessel area), when masterfully combined as it is done in this paper, they answer two questions that have been longrunning in the community: what is the impact of anesthesia on the parameters measured by ULM (and indirectly in fUS and other techniques)? Is it possible to achieve ULM in awake rodents for longitudinal imaging? The manuscript is well constructed, well written, and graphics are appealing.

      The manuscript has been much strengthened by the round of review, with more animals for the longitudinal imaging study.

      Weaknesses:

      Some weaknesses remain, not hindering the quality of the work, that the authors might want to answer or explain.

      When considering fig 4e and fig 4j together: it seems that in fig 4e the vascularity reduction in the cortical ROI is around 30% for downward flow, and around 55% for upward flow; but when grouping both cortical flows in fig 4j, the reduction is much smaller (~5%), even at the individual level (only mouse 1 is used in fig 4e). Can you comment on that?

      Response 06: Thank you for carefully pointing this out. This discrepancy arises primarily from differences in ROI selections.

      The vascularity metric (now we changed the term into fractional vessel area, based on Reviewer 1’s comments) is calculated as the proportion of vessel-occupied pixels relative to the total ROI area. As such, it is best suited for longitudinal comparisons within the same ROI rather than across-ROI comparisons, particularly when the size and vessel composition of the ROIs differ.

      In Fig. 4e, the cortical ROI includes mostly the penetrating vessels, which are selected due to their clear distinction between upward (venous) and downward (arterial) flow directions. Pial vessels were intentionally excluded because flow direction alone does not reliably distinguish arteries from veins in these surface vessels. Thus, the goal of this analysis was to indicate arteriovenous differences, rather than to represent the full cortical vascular changes.

      In contrast, the ROIs used in Fig. 4j aim to provide a more comprehensive view of cortical vascular responses without distinguishing flow direction. That’s why both penetrating and pial vessels are included. Since pial vessels showed relatively smaller vascularity changes within the coronal cross-sections analyzed in our study, their inclusion in the cortical ROI likely contributed to the smaller overall reduction in vascularity observed in Figure 4j.

      To address this potential confusion, we have added further clarification in the Results section of the revised manuscript.

      (Line 209) “It is worth noting that prior analyses (Fig. 4d–h) aimed to illustrate arteriovenous differences. Since pial vessels are difficult to distinguish as arteries or veins based on flow direction in coronal plane imaging, they were excluded from the ROI selection in those analyses. In the current whole-brain comparisons (Fig. 4i-k), the cortical ROIs no longer exclude pial vessels, since distinguishing between arteries and veins is not required. This aims to provide a more comprehensive representation of cortical vasculature.”

      When considering fig 4e, fig 4j, fig 6e and fig 6i altogether, it seems that vascularity can be highly variable, whether it be under anesthesia or vascular imaging, with changes between 5 to 40%. Is this vascularity quantification worth it (namely, reliable for example to quantify changes in a pathological model requiring longitudinal imaging)?

      Response 07: Thank you for raising this important point. We found that imaging in the awake state is inherently more variable than under anesthesia. In contrast, anesthetized imaging offers a more controlled and stable physiological condition, as anesthesia suppresses many sources of variation. For pathological studies, if the vascular or hemodynamic changes induced by anesthesia do not interfere with the scientific question being addressed, imaging under anesthesia can still be a practical and effective approach, due to its experimental simplicity and better physiological consistency.

      The higher variability observed in awake imaging arises from both physiological fluctuations in animals and unavoidable experimental inconsistencies, such as small misalignment on the imaging plane across sessions. If the research question aims to avoid the confounding effects of anesthesia, then instead of suppressing variation through anesthesia, it is important to acknowledge the natural baseline variation in the awake state. However, efforts should be made to minimize technical sources of variation. We have added a brief discussion of this issue at the end of the manuscript to reflect this consideration.

      (Line 396) “However, it is also important to note that although longitudinal awake imaging presents promise to avoid the confounding effects of anesthetics, imaging under anesthesia remains more convenient and controllable in many cases. For applications where the physiological question of interest is not sensitive to anesthesia-induced vascular effects, anesthetized imaging still offers a simpler and more stable approach. Awake imaging inherently exhibits greater physiological variability. However, care must be taken at the experimental level to minimize confounding sources of variation, such as stress level of the animal or handling inconsistencies, to ensure that the measurements are physiologically meaningful.”

      Regarding whether fractional vessel area (formerly referred to as vascularity) is a worthwhile metric for longitudinal quantification: based on our experience and comparisons, we found vessel area to be relatively robust and informative (see also Response 02 to Reviewer 1 for details). However, we acknowledge that other quantitative metrics—such as microbubble count, tortuosity, or flow directionality—may be more suitable depending on the specific pathological model or research question. How these metrics perform in awake imaging and longitudinal disease models is indeed an open and important question. We hope our work can serve as a foundation to inspire further investigation in this direction. To facilitate such exploration, we have developed and open-sourced a MATLAB-based analysis tool that supports multiple quantitative ULM metrics for longitudinal comparison. We encourage users to adapt and extend this framework to evaluate different quantitative metrics.

      (Line 578) “Code availability

      To support quantitative longitudinal analysis of ULM data, we developed an open-source MATLAB application (https://github.com/ekerwang/ULMQuantitativeAnalysis). This tool is designed to facilitate ROI-based analysis of ULM images for longitudinal comparisons. It supports multiple quantification metrics, including but not limited to vessel area and mean velocity used in this study. Users can select and adapt different metrics based on their specific applications, as a wide range of ULM-based quantification metrics have been developed for different pathological and pharmacological studies.”

      Reviewer #2 (Recommendations For The Authors):

      Images in figure 4 lack color bars.

      Response 08: Thank you for pointing this out. The color bars for the images in Figure 4 are the same as those used in the corresponding images in Figure 3. We have now added the explanation of color bars to the revised version of Figure 4 caption.

      Fig 4d: upward and downward are probably swapped.

      Response 09: Thank you for pointing this out, and we apologize for the oversight. They were mistakenly swapped. We have corrected this error in the revised figure.

      No quantitative conclusions are drawn regarding the changes in vessel diameter under anesthesia? Is it not significant? If it is not then why bring changes in diameter to our attention in fig 3 (white arrows) and figure 4b?

      Response 10: Our intention in highlighting diameter changes in Figure 3 (white arrows) and Figure 4b was to provide an illustrative example of isoflurane-induced diameter changes at the single-vessel level. These examples are meant to serve as case studies, not as the basis for broad statistical conclusions.

      In the initial version of the manuscript, we attempted to draw quantitative conclusions by measuring vessel diameters from ten manually selected vessel segments at each location. However, based on feedback from other reviewers, we decided to remove this analysis in the revised version. Manual selection of vessel segments is highly subjective and prone to bias, limiting its reliability for quantitative interpretation.

      Instead, we focused on ROI-based analysis using fractional vessel area (formerly referred to as vascularity), which reflects widespread changes in vessel diameter across regions. It is a more generalizable and less biased metric for quantifying vascular diameter changes.

      We further explained this in the Results section:

      (Line 181) “To validate the broader applicability of our findings, we conducted ROI-based analyses using fractional vessel area and mean velocity as primary metrics. These metrics extended the analysis of vessel diameter and flow velocity to entire brain regions or selected ROIs, which provides a more objective assessment of cerebral blood flow changes at a global scale and reduces the bias associated with manually selecting vessel segments. For vessel area measurements, the term fractional denotes that the vessel area is normalized to the total area of the selected ROI. This normalization is essential for fair comparisons across ROIs of different sizes.”

      Line 210 "In summary, statistical analysis revealed a decrease in individual vessel diameter" this does not seem to be supported by this version of the manuscript as no analysis is done on a representative group of vessels for the diameter.

      Response 11: Thank you for pointing out this important issue. In line with our previous response (Response 10), we would like to clarify that the analysis of individual vessel diameter was intended to serve as an example study, rather than a statistically supported conclusion based on a group of vessels. To avoid confusion, we have removed the phrase “statistical analysis revealed a decrease in individual vessel diameter” from the manuscript. 

      The meaning of the *** in fig 6b and 6c should be clarified as: -it is not explicitly stated - the equivalence test interpretation is less usual than other tests.

      Response 12: We thank the reviewer for pointing out this important issue. We agree that the use of asterisks (***) in Fig. 6b and 6c may have led to confusion, as such markers are typically associated with statistical significance in difference testing. In our case, the analysis was based on the two one-sided test (TOST) procedure to assess statistical equivalence, which is indeed less commonly used and could be misinterpreted.

      To address this, we have replaced the asterisks *** in the figure with the label “equiv.”, which more clearly reflects the intended interpretation. Additionally, we have revised the figure caption and the main text to explicitly state that these markers denote statistical equivalence (not difference) as determined by TOST, with the equivalence margin defined as three times the standard deviation of one week.

      (Figure 6 Caption) “Statistical analysis was performed using the two one-sided test (TOST) to evaluate consistency of measurement. The label “equiv.” indicates statistically equivalent measurements (p < 0.001), defined as interweek differences smaller than three times the standard deviation of one week.”

      (Line 240) “Statistical testing of equivalence was conducted using the two one-sided test (TOST) procedure, which evaluates whether the difference between two time points falls within a predefined equivalence margin. Specifically, equivalence is defined as the inter-week difference being smaller than three times the standard deviation of one week. A statistically significant result in TOST (p < 0.001) supports the interpretation that the measurements are statistically equivalent, which is denoted as “equiv.” in the figures.”

      Line 237 and following: please consider rephrasing into "To further generalize these findings and examine longitudinal variation in ROI-based analysis, we used Mouse 4 as an example to show the consistency of blood flow density across different flow directions in the cortex (Fig. 6d) and extended the quantitative analysis to all three mice (Fig. 6e) (individual ULM upward and downward flow images for all three mice over the threeweek longitudinal study period can be found in Supplementary Fig. 4)." The paragraph will make much more sense.

      Response 13: We appreciate your helpful rephrasing. We have fully adopted your proposed revision to enhance the clarity and coherence of the text. The sentence now reads exactly as you recommended:

      (Line 250): “To further generalize these findings and examine longitudinal variation in ROI-based analysis, we used Mouse 4 as an example to show the consistency of blood flow density across different flow directions in the cortex (Fig. 6d) and extended the quantitative analysis to all three mice (Fig. 6e) (individual ULM upward and downward flow images for all three mice over the three-week longitudinal study period can be found in Supplementary Fig. 4).”

      Line 248: "While arterial and venous flow velocity distributions exhibit clear distinctions, their variations over the three weeks remained acceptable" the meaning of acceptable remains elusive.

      Response 14: Thank you for pointing out the ambiguity in the phrase “remained acceptable”. To improve clarity and precision, we have revised the sentence to provide a more informative description. The updated sentence now reads:

      (Line 261) “While arterial and venous flow velocity distributions exhibit clear distinctions, the distribution shapes remained relatively consistent across the three weeks. Specifically, variation in median velocity were within 1 mm/s. In contrast, anesthesia-induced changes can lead to velocity shifts exceeding 1 mm/s.”

      Line 253: consider rephrasing in "Despite subcortical regions showing the largest vascularity variability consecutive to anesthesia-induced changes, vascularity in those regions was relatively stable values in the longitudinal study" as otherwise the link between the 2 parts of the sentence feels odd.

      Response 15: Thank you for your constructive suggestion regarding the logical flow of the sentence. We fully agree with your point and have revised the sentence exactly as you proposed.

      (Line 268) “Despite subcortical regions showing the largest vascularity variability consecutive to anesthesia-induced changes, vascularity in those regions was relatively stable values in the longitudinal study.”

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Weir et al. investigate why the 13-lined ground squirrel (13LGS) retina is unusually rich in cone photoreceptors, the cells responsible for color and daylight vision. Most mammals, including humans, have rod-dominant retinas, making the 13LGS retina both an intriguing evolutionary divergence and a valuable model for uncovering novel mechanisms of cone generation. The developmental programs underlying this adaptation were previously unknown.

      Using an integrated approach that combines single-cell RNA sequencing (scRNAseq), scATACseq, and histology, the authors generate a comprehensive atlas of retinal neurogenesis in 13LGS. Notably, comparative analyses with mouse datasets reveal that in 13LGS, cones can arise from late-stage neurogenic progenitors, a striking contrast to mouse and primate retinas, where late progenitors typically generate rods and other late-born cell types but not cones. They further identify a shift in the timing (heterochrony) of expression of several transcription factors. Further, the authors show that these factors act through species-specific regulatory elements. And overall, functional experiments support a role for several of these candidates in cone production.

      Strengths:

      This study stands out for its rigorous and multi-layered methodology. The combination of transcriptomic, epigenomic, and histological data yields a detailed and coherent view of cone development in 13LGS. Cross-species comparisons are thoughtfully executed, lending strong evolutionary context to the findings. The conclusions are, in general, well supported by the evidence, and the datasets generated represent a substantial resource for the field. The work will be of high value to both evolutionary neurobiology and regenerative medicine, particularly in the design of strategies to replace lost cone photoreceptors in human disease.

      Weaknesses:

      (1) Overall, the conclusions are strongly supported by the data, but the paper would benefit from additional clarifications. In particular, some of the conclusions could be toned down slightly to reflect that the observed changes in candidate gene function, such as those for Zic3 by itself, are modest and may represent part of a more complex regulatory network.

      (2) Additional explanations about the cell composition of the 13LGS retina are needed. The ratios between cone and rod are clearly detailed, but do those lead to changes in other cell types?

      (3) Could the lack of a clear trajectory for rod differentiation be just an effect of low cell numbers for this population?

      (4) The immunohistochemistry and RNA hybridization experiments shown in Figure S2 would benefit from supporting controls to strengthen their interpretability. While it has to be recognized that performing immunostainings on non-conventional species is not a simple task, negative controls are necessary to establish the baseline background levels, especially in cases where there seems to be labeling around the cells. The text indicates that these experiments are both immunostainings and ISH, but the figure legend only says "immunohistochemistry". Clarifying these points would improve readers' confidence in the data.

      (5) Figure S3: The text claims that overexpression of Zic3 alone is sufficient to induce the cone-like photoreceptor precursor cells as well as horizontal cell-like precursors, but this is not clear in Figure S3A nor in any other figure. Similarly, the effects of Pou2f1 overexpression are different in Figure S3A and Figure S3B. In Figure S3B, the effects described (increased presence of cone-like and horizontal-like precursors) are very clear, whereas it is not in Figure S3A. How are these experiments different?

      (6) The analyses of Zic3 conditional mutants (Figure S4) reveal an increase in many cone, rod, and pan-photoreceptor genes with only a reduction in some cone genes. Thus, the overall conclusion that Zic3 is essential for cones while repressing rod genes doesn't seem to match this particular dataset.

      (7) Throughout the text, the authors used the term "evolved". To substantiate this claim, it would be important to include sequence analyses or to rephrase to a more neutral term that does not imply evolutionary inference.

    2. Reviewer #3 (Public review):

      Summary:

      The authors perform deep transcriptomic and epigenetic comparisons between mouse and 13-lined ground squirrel (13LGS) to identify mechanisms that drive rod vs cone-rich retina development. Through cross-species analysis, the authors find extended cone generation in 13LGS, gene expression within progenitor/photoreceptor precursor cells consistent with a lengthened cone window, and differential regulatory element usage. Two of the transcription factors, Mef2c and Zic3, were subsequently validated using OE and KO mouse lines to verify the role of these genes in regulating competence to generate cone photoreceptors.

      Strengths:

      Overall, this is an impactful manuscript with broad implications toward our understanding of retinal development, cell fate specification, and TF network dynamics across evolution and with the potential to influence our future ability to treat vision loss in human patients. The generation of this rich new dataset profiling the transcriptome and epigenome of the 13LGS is a tremendous addition to the field that assuredly will be useful for numerous other investigations and questions of a variety of interests. In this manuscript, the authors use this dataset and compare it to data they previously generated for mouse retinal development to identify 2 new regulators of cone generation and shed insights into their regulation and their integration into the network of regulatory elements within the 13LGS compared to mouse.

      Weaknesses:

      (1) The authors chose to omit several cell classes from analyses and visualizations that would have added to their interpretations. In particular, I worry that the omission of 13LGS rods, early RPCs, and early NG from Figures 2C, D, and F is notable and would have added to the understanding of gene expression dynamics. In other words, (a) are these genes of interest unique to late RPCs or maintained from early RPCs, and (b) are rod networks suppressed compared to the mouse?

      (2) The authors claim that the majority of cones are generated by late RPCs and that this is driven primarily by the enriched enhancer network around cone-promoting genes. With the temporal scRNA/ATACseq data at their disposal, the authors should compare early vs late born cones and RPCs to determine whether the same enhancers and genes are hyperactivated in early RPCs as well as in the 13LGS. This analysis will answer the important question of whether the enhancers activated/evolved to promote all cones, or are only and specifically activated within late RPCs to drive cone genesis at the expense of rods.

      (3) The authors repeatedly use the term 'evolved' to describe the increased number of local enhancer elements of genes that increase in expression in 13LGS late RPCs and cones. Evolution can act at multiple levels on the genome and its regulation. The authors should consider analysis of sequence level changes between mouse, 13LGS, and other species to test whether the enhancer sequences claimed to be novel in the 13LGS are, in fact, newly evolved sequence/binding sites or if the binding sites are present in mouse but only used in late RPCs of the 13LGS.

      (4) The authors state that 'Enhancer elements in 13LGS are predicted to be directly targeted by a considerably greater number of transcription factors than in mice'. This statement can easily be misread to suggest that all enhancers display this, when in fact, this is only the cone-promoting enhancers of late 13LGS RPCs. In a way, this is not surprising since these genes are largely less expressed in mouse vs 13LGS late RPCs, as shown in Figure 2. The manuscript is written to suggest this mechanism of enhancer number is specific to cone production in the 13LGS- it would help prove this point if the authors asked the opposite question and showed that mouse late RPCs do not have similar increased predicted binding of TFs near rod-promoting genes in C7-8.

    1. Reviewer #1 (Public review):

      In this important study, the authors develop a suite of machine vision tools to identify and align fluorescent neuronal recording images in space and time according to neuron identity and position. The authors provide compelling evidence for the speed and utility of these tools. While such tools have been developed in the past (including by the authors), the key advancement here is the speed and broad utility of these new tools. While prior approaches based on steepest descent worked, they required hundreds of hours of computational time, while the new approaches outlined here are >600-fold faster. The machine vision tools here should be immediately useful to readers specifically interested in whole-brain C. elegans data, but also for more general readers who may be interested in using BrainAlignNet for tracking fluorescent neuronal recordings from other systems.

      I really enjoyed reading this paper. The authors had several ground truth examples to quantify the accuracy of their algorithms and identified several small caveats users should consider when using these tools. These tools were primarily developed for C. elegans, an animal with stereotyped development, but whose neurons can be variably located due to internal motion of the body. The authors provide several examples of how BrainAlignNet reliably tracked these neurons over space and time. Neuron identity is also important to track, and the authors showed how AutoCellLoader can reliably identify neurons based on their fluorescence in the NeuroPAL background. A challenge with NeuroPAL though, is the high expression of several fluorophores, which compromises behavioral fidelity. The authors provide some possible avenues where this problem can be addressed by expressing fewer fluorophores. While using all four channels provided the best performance, only using the tagRFP and CyOFP channels was sufficient for performance that was close to full performance using all 4 NeuroPAL channels. This result indicates that the development of future lines with less fluorophore expression could be sufficient for reliable neuronal identification, which would decrease the genetic load on the animal, but also open other fluorescent channels that could be used for tracking other fluorescent tools/markers. Even though these tools were developed for C. elegans specifically, they showed BrainAlignNet can be applied to other organisms as well (in their case, the cnidarian C. hemisphaerica), which broadens the utility of their tools.

      Strengths:

      (1) The authors have a wealth of ground-truth training data to compare their algorithms against, and provide a variety of metrics to assess how well their new tools perform against hand annotation and/or prior algorithms.

      (2) For BrainAlignNet, the authors show how this tool can be applied to other organisms besides C. elegans.

      (3) The tools are publicly available on GitHub, which includes useful README files and installation guidance.

      Weaknesses:

      (1) Most of the utility of these algorithms is for C. elegans specifically. Testing their algorithms (specifically BrainAlignNet) on more challenging problems, such as whole-brain zebrafish, would have been interesting. This is a very, very minor weakness, though.

      (2) The tools are benchmarked against their own prior pipeline, but not against other algorithms written for the same purpose.

      (3) Considerable pre-processing was done before implementation. Expanding upon this would improve accessibility of these tools to a wider audience.

    2. Reviewer #3 (Public review):

      Context:

      Tracking cell trajectories in deformable organs, such as the head neurons of freely moving C. elegans, is a challenging task due to rapid, non-rigid cellular motion. Similarly, identifying neuron types in the worm brain is difficult because of high inter-individual variability in cell positions.

      Summary:

      In this study, the authors developed a deep learning-based approach for cell tracking and identification in deformable neuronal images. Several different CNN models were trained to: (1) register image pairs without severe deformation, and then track cells across continuous image sequences using multiple registration results combined with clustering strategies; (2) predict neuron IDs from multicolor-labeled images; and (3) perform clustering across multiple multicolor images to automatically generate neuron IDs.

      Strengths:

      Directly using raw images for registration and identification simplifies the analysis pipeline, but it is also a challenging task since CNN architectures often struggle to capture spatial relationships between distant cells. Surprisingly, the authors report very high accuracy across all tasks. For example, the tracking of head neurons in freely moving worms reportedly reached 99.6% accuracy, neuron identification achieved 98%, and automatic classification achieved 93% compared to human annotations.

      Weaknesses:

      (1) The deep networks proposed in this study for registration and neuron identification require dataset-specific training, due to variations in imaging conditions across different laboratories. This, in turn, demands a large amount of manually or semi-manually annotated training data, including cell centroid correspondences and cell identity labels, which reduces the overall practicality and scalability of the method.

      (2) The cell tracking accuracy was not rigorously validated, but rather estimated using a biased and coarse approach. Specifically, the accuracy was assessed based on the stability of GFP signals in the eat-4-labeled channel. A tracking error was assumed to occur when the GFP signal switched between eat-4-negative and eat-4-positive at a given time point. However, this estimation is imprecise and only captures a small subset of all potential errors. Although the authors introduced a correction factor to approximate the true error rate, the validity of this correction relies on the assumption that eat-4 neurons are uniformly distributed across the brain - a condition that is unlikely to hold.

      (3) Figure S1F demonstrates that the registration network, BrainAlignNet, alone is insufficient to accurately align arbitrary pairs of C. elegans head images. The high tracking accuracy reported is largely due to the use of a carefully designed registration sequence, matching only images with similar postures, and an effective clustering algorithm. Although the authors address this point in the Discussion section, the abstract may give the misleading impression that the network itself is solely responsible for the observed accuracy.

      (4) The reported accuracy for neuron identification and automatic classification may be misleading, as it was assessed only on a subset of neurons labeled as "high-confidence" by human annotators. Although the authors did not disclose the exact proportion, various descriptions (such as Figure 4f) imply that this subset comprises approximately 60% of all neurons. While excluding uncertain labels is justifiable, the authors highlight the high accuracy achieved on this subset without clearly clarifying that the reported performance pertains only to neurons that are relatively easy to identify. Furthermore, they do not report what fraction of the total neuron population can be accurately identified using their methods-an omission of critical importance for prospective users.

    3. Author response:

      Reviewer #1 (Public review):

      In this important study, the authors develop a suite of machine vision tools to identify and align fluorescent neuronal recording images in space and time according to neuron identity and position. The authors provide compelling evidence for the speed and utility of these tools. While such tools have been developed in the past (including by the authors), the key advancement here is the speed and broad utility of these new tools. While prior approaches based on steepest descent worked, they required hundreds of hours of computational time, while the new approaches outlined here are >600-fold faster. The machine vision tools here should be immediately useful to readers specifically interested in whole-brain C. elegans data, but also for more general readers who may be interested in using BrainAlignNet for tracking fluorescent neuronal recordings from other systems.

      I really enjoyed reading this paper. The authors had several ground truth examples to quantify the accuracy of their algorithms and identified several small caveats users should consider when using these tools. These tools were primarily developed for C. elegans, an animal with stereotyped development, but whose neurons can be variably located due to internal motion of the body. The authors provide several examples of how BrainAlignNet reliably tracked these neurons over space and time. Neuron identity is also important to track, and the authors showed how AutoCellLoader can reliably identify neurons based on their fluorescence in the NeuroPAL background. A challenge with NeuroPAL though, is the high expression of several fluorophores, which compromises behavioral fidelity. The authors provide some possible avenues where this problem can be addressed by expressing fewer fluorophores. While using all four channels provided the best performance, only using the tagRFP and CyOFP channels was sufficient for performance that was close to full performance using all 4 NeuroPAL channels. This result indicates that the development of future lines with less fluorophore expression could be sufficient for reliable neuronal identification, which would decrease the genetic load on the animal, but also open other fluorescent channels that could be used for tracking other fluorescent tools/markers. Even though these tools were developed for C. elegans specifically, they showed BrainAlignNet can be applied to other organisms as well (in their case, the cnidarian C. hemisphaerica), which broadens the utility of their tools.

      Strengths:

      (1) The authors have a wealth of ground-truth training data to compare their algorithms against, and provide a variety of metrics to assess how well their new tools perform against hand annotation and/or prior algorithms.

      (2) For BrainAlignNet, the authors show how this tool can be applied to other organisms besides C. elegans.

      (3) The tools are publicly available on GitHub, which includes useful README files and installation guidance.

      We thank the reviewer for noting these strengths of our study.

      Weaknesses:

      (1) Most of the utility of these algorithms is for C. elegans specifically. Testing their algorithms (specifically BrainAlignNet) on more challenging problems, such as whole-brain zebrafish, would have been interesting. This is a very, very minor weakness, though.

      We appreciate the reviewer’s point that expanding to additional animal models would be valuable. In the study, we have so far tested our approaches on C. elegans and Jellyfish. Given that this is considered a ‘very, very minor weakness’ and that it does not directly affect the results or analyses in the paper, we think this might be better to address in future work.

      (2) The tools are benchmarked against their own prior pipeline, but not against other algorithms written for the same purpose.

      We agree that it would be valuable to benchmark other labs’ software pipelines on our datasets. We note that most papers in this area, which describe those pipelines, provide the same performance metrics that we do (accuracy of neuron identification, tracking accuracy, etc), so a crude, first-order comparison can be obtained by comparing the numbers in the papers. But, we agree that a rigorous head-to-head comparison would require applying these different pipelines to a common dataset. We considered performing these analyses, but we were concerned that using other labs’ software ‘off the shelf’ on our data might not represent those pipelines in their best light when compared to our pipeline that was developed with our data in mind. Data from different microscopy platforms can be surprisingly different and we wouldn’t want to perform an analysis that had this bias. Therefore, we feel that this comparison would be best pursued by all of these labs collaboratively (so that they can each provide input on how to run their software optimally). Indeed, this is an important area for future study. In this spirit, we have been sharing our eat-4::GFP datasets (that permit quantification of tracking accuracy) with other labs looking for additional ways to benchmark their tracking software.

      We also note that there are not really any pipelines to directly compare against CellDiscoveryNet, as we are not aware of any other fully unsupervised approach for neuron identification in C. elegans.

      (3) Considerable pre-processing was done before implementation. Expanding upon this would improve accessibility of these tools to a wider audience.

      Indeed, some pre-processing was performed on images before registration and neuron identification -- understanding these nuances can be important. The pre-processing steps are described in the Results section and detailed in the Methods. They are also all available in our open-source software. For BrainAlignNet, the key steps were: (1) selecting image registration problems, (2) cropping, and (3) Euler alignment. Steps (1) and (3) were critically important and are extensively discussed in the Results and Discussion sections of our study (lines 142-144, 218-234, 318-323, 704-712). Step (2) is standard in image processing. For AutoCellLabeler and CellDiscoveryNet, the pre-processing was primarily to align the 4 NeuroPAL color channels to each other (i.e. make sure the blue/red/orange/etc channels for an animal are perfectly aligned). This is also just a standard image processing step to ensure channel alignment. Thus, the more “custom” pre-processing steps were extensively discussed in the study and the more “common” steps are still described in the Methods. The implementation of all steps is available in our open-source software.

      Reviewer #2 (Public review):

      Summary:

      The paper introduced the pipeline to analyze brain imaging of freely moving animals: registering deforming tissues and maintaining consistent cell identities over time. The pipeline consists of three neural networks that are built upon existing models: BrainAlignNet for non-rigid registration, AutoCellLabeler for supervised annotation of over 100 neuronal types, and CellDiscoveryNet for unsupervised discovery of cell identities. The ambition of the work is to enable high-throughput and largely automated pipelines for neuron tracking and labeling in deforming nervous systems.

      Strengths:

      (1) The paper tackles a timely and difficult problem, offering an end-to-end system rather than isolated modules.

      (2) The authors report high performance within their dataset, including single-pixel registration accuracy, nearly complete neuron linking over time, and annotation accuracy that exceeds individual human labelers.

      (3) Demonstrations across two organisms suggest the methods could be transferable, and the integration of supervised and unsupervised modules is of practical utility.

      We thank the reviewer for noting these strengths of our study.

      Weaknesses:

      (1) Lack of solid evaluation. Despite strong results on their own data, the work is not benchmarked against existing methods on community datasets, making it hard to evaluate relative performance or generality.

      We agree that it would be valuable to benchmark many labs’ software pipelines on some common datasets, ideally from several different research labs. We note that most papers in this area, which describe the other pipelines that have been developed, provide the same performance metrics that we do (accuracy of neuron identification, tracking accuracy, etc), so a crude, first-order comparison can be obtained by comparing the numbers in the papers. But, we agree that a rigorous head-to-head comparison would require applying these different pipelines to a common dataset. We considered performing these analyses, but we were concerned that using other labs’ software ‘off the shelf’ and comparing the results to our pipeline (where we have extensive expertise) might bias the performance metrics in favor of our software. Therefore, we feel that this comparison would be best pursued by all of these labs collaboratively (so that they can each provide input on how to run their software optimally). Indeed, this is an important area for future study. In this spirit, we have been sharing our eat-4::GFP datasets (that permit quantification of tracking accuracy) with other labs looking for additional ways to benchmark their tracking software.

      We also note that there are not really any pipelines to directly compare against CellDiscoveryNet, as we are not aware of any other fully unsupervised approach for neuron identification in C. elegans.

      (2) Lack of novelty. All three models do not incorporate state-of-the-art advances from the respective fields. BrainAlignNet does not learn from the latest optical flow literature, relying instead on relatively conventional architectures. AutoCellLabeler does not utilize the advanced medNeXt3D architectures for supervised semantic segmentation. CellDiscoveryNet is presented as unsupervised discovery but relies on standard clustering approaches, with limited evaluation on only a small test set.

      We appreciate that the machine learning field moves fast. Our goal was not to invent entirely novel machine learning tools, but rather to apply and optimize tools for a set of challenging, unsolved biological problems. We began with the somewhat simpler architectures described in our study and were largely satisfied with their performance. It is conceivable that newer approaches would perhaps lead to even greater accuracy, flexibility, and/or speed. But, oftentimes, simple or classical solutions can adequately resolve specific challenges in biological image processing.

      Regarding CellDiscoveryNet, our claim of unsupervised training is precise: CellDiscoveryNet is trained end-to-end only on raw images, with no human annotations, pseudo-labels, external classifiers, or metadata used for training, model selection, or early stopping. The loss is defined entirely from the input data (no label signal). By standard usage in machine learning, this constitutes unsupervised (often termed “self-supervised”) representation learning. Downstream clustering is likewise unsupervised, consuming only image pairs registered by CellDiscoveryNet and neuron segmentations produced by our previously-trained SegmentationNet (which provides no label information).

      (3) Lack of robustness. BrainAlignNet requires dataset-specific training and pre-alignment strategies, limiting its plug-and-play use. AutoCellLabeler depends heavily on raw intensity patterns of neurons, making it brittle to pose changes. By contrast, current state-of-the-art methods incorporate spatial deformation atlases or relative spatial relationships, which provide robustness across poses and imaging conditions. More broadly, the ANTSUN 2.0 system depends on numerous manually tuned weights and thresholds, which reduces reproducibility and generalizability beyond curated conditions.

      Regarding BrainAlignNet: we agree that we trained on each species’ own data (worm, jellyfish) and we would suggest other labs working on new organisms to do the same based on our current state of knowledge. It would be fantastic if there was an alignment approach that generalized to all possible cases of non-rigid-registration in all animals – an important area for future study. We also agree that pre-alignment was critical in worms and jellyfish, which we discuss extensively in our study (lines 142-144, 318-321, 704-712).

      Regarding AutoCellLabeler: the animals were not recorded in any standardized pose and were not aligned to each other beforehand – they were basically in a haphazard mix of poses and we used image augmentation to allow the network to generalize to other poses, as described in our study. It is still possible that AutoCellLabeler is somehow brittle to pose changes (e.g. perhaps extremely curved worms) – while we did not detect this in our analyses, we did not systematically evaluate performance across all possible poses. However, we do note that this network was able to label images taken from freely-moving worms, which by definition exhibit many poses (Figure 5D, lines 500-525); aggregating the network’s performance across freely-moving data points allowed it to nearly match its performance on high-SNR immobilized data. This suggests a degree of robustness of the AutoCellLabeler network to pose changes.

      Regarding ANTSUN 2.0: we agree that there are some hyperparameters (described in our study) that affect ANTSUN performance. We agree that it would be worthwhile to fully automate setting these in future iterations of the software.

      Evaluation:

      To make the evaluation more solid, it would be great for the authors to (1) apply the new method on existing datasets and (2) apply baseline methods on their own datasets. Otherwise, without comparison, it is unclear if the proposed method is better or not. The following papers have public challenging tracking data: https://elifesciences.org/articles/66410, https://elifesciences.org/articles/59187, https://www.nature.com/articles/s41592-023-02096-3.

      Please see our response to your point (1) under Weaknesses above.

      Methodology:

      (1) The model innovations appear incrementally novel relative to existing work. The authors should articulate what is fundamentally different (architectural choices, training objectives, inductive biases) and why those differences matter empirically. Ablations isolating each design choice would help.

      There are other efforts in the literature to solve the neuron tracking and neuron identification problems in C. elegans (please see paragraphs 4 and 5 of our Introduction, which are devoted to describing these). However, they are quite different in the approaches that they use, compared to our study. For example, for neuron tracking they use t->t+1 methods, or model neurons as point clouds, etc (a variety of approaches have been tried). For neuron identification, they work on extracted features from images, or use statistical approaches rather than deep neural networks, etc (a variety of approaches have been tried). Our assessment is that each of these diverse approaches has strengths and drawbacks; we agree that a meta-analysis of the design choices used across studies could be valuable.

      We also note that there are not really any pipelines to directly compare against CellDiscoveryNet, as we are not aware of any other fully unsupervised approach for neuron identification in C. elegans.

      (2) The pipeline currently depends on numerous manually set hyperparameters and dataset-specific preprocessing. Please provide principled guidelines (e.g., ranges, default settings, heuristics) and a robustness analysis (sweeps, sensitivity curves) to show how performance varies with these choices across datasets; wherever possible, learn weights from data or replace fixed thresholds with data-driven criteria.

      We agree that there are some ANTSUN 2.0 hyperparameters (described in our Methods section) that could affect the quality of neuron tracking. It would be worthwhile to fully automate setting these in future iterations of the software, ensuring that the hyperparameter settings are robust to variation in data/experiments.

      Appraisal:

      The authors partially achieve their aims. Within the scope of their dataset, the pipeline demonstrates impressive performance and clear practical value. However, the absence of comparisons with state-of-the-art algorithms such as ZephIR, fDNC, or WormID, combined with small-scale evaluation (e.g., ten test volumes), makes the strength of evidence incomplete. The results support the conclusion that the approach is useful for their lab's workflow, but they do not establish broader robustness or superiority over existing methods.

      We wish to remind the reviewer that we developed BrainAlignNet for use in worms and jellyfish. These two animals have different distributions of neurons and radically different anatomy and movement patterns. Data from the two organisms was collected in different labs (Flavell lab, Weissbourd lab) on different types of microscopes (spinning disk, epifluorescence). We believe that this is a good initial demonstration that the approach has robustness across different settings.

      Regarding comparisons to other labs’ C. elegans data processing pipelines, we agree that it will be extremely valuable to compare performance on common datasets, ideally collected in multiple different research labs. But we believe this should be performed collaboratively so that all software can be utilized in their best light with input from each lab, as described above. We agree that such a comparison would be very valuable.

      Impact:

      Even though the authors have released code, the pipeline requires heavy pre- and post-processing with numerous manually tuned hyperparameters, which limits its practical applicability to new datasets. Indeed, even within the paper, BrainAlignNet had to be adapted with additional preprocessing to handle the jellyfish data. The broader impact of the work will depend on systematic benchmarking against community datasets and comparison with established methods. As such, readers should view the results as a promising proof of concept rather than a definitive standard for imaging in deformable nervous systems.

      Regarding worms vs jellyfish pre-processing: we actually had the exact opposite reaction to that of the reviewer. We were surprised at how similar the pre-processing was for these two very different organisms. In both cases, it was essential to (1) select appropriate registration problems to be solved; and (2) perform initialization with Euler alignment. Provided that these two challenges were solved, BrainAlignNet mostly took care of the rest. This suggests a clear path for researchers who wish to use this approach in another animal. Nevertheless, we also agree with the reviewer’s caution that a totally different use case could require some re-thinking or re-strategizing. For example, the strategy of how to select good registration problems could depend on the form of the animal’s movement.

      Reviewer #3 (Public review):

      Context:

      Tracking cell trajectories in deformable organs, such as the head neurons of freely moving C. elegans, is a challenging task due to rapid, non-rigid cellular motion. Similarly, identifying neuron types in the worm brain is difficult because of high inter-individual variability in cell positions.

      Summary:

      In this study, the authors developed a deep learning-based approach for cell tracking and identification in deformable neuronal images. Several different CNN models were trained to: (1) register image pairs without severe deformation, and then track cells across continuous image sequences using multiple registration results combined with clustering strategies; (2) predict neuron IDs from multicolor-labeled images; and (3) perform clustering across multiple multicolor images to automatically generate neuron IDs.

      Strengths:

      Directly using raw images for registration and identification simplifies the analysis pipeline, but it is also a challenging task since CNN architectures often struggle to capture spatial relationships between distant cells. Surprisingly, the authors report very high accuracy across all tasks. For example, the tracking of head neurons in freely moving worms reportedly reached 99.6% accuracy, neuron identification achieved 98%, and automatic classification achieved 93% compared to human annotations.

      We thank the reviewer for noting these strengths of our study.

      Weaknesses:

      (1) The deep networks proposed in this study for registration and neuron identification require dataset-specific training, due to variations in imaging conditions across different laboratories. This, in turn, demands a large amount of manually or semi-manually annotated training data, including cell centroid correspondences and cell identity labels, which reduces the overall practicality and scalability of the method.

      We performed dataset-specific training for image registration and neuron identification, and we would encourage new users to do the same based on our current state of knowledge. This highlights how standardization of whole-brain imaging data across labs is an important issue for our field to address and that, without it, variations in imaging conditions could impact software utility. We refer the reviewer to an excellent study by Sprague et al. (2025) on this topic, which is cited in our study.

      However, at the same time, we wish to note that it was actually reasonably straightforward to take the BrainAlignNet approach that we initially developed in C. elegans and apply it to jellyfish. Some of the key lessons that we learned in C. elegans generalized: in both cases, it was critical to select the right registration problems to solve and to preprocess with Euler registration for good initialization. Provided that those problems were solved, BrainAlignNet could be applied to obtain high-quality registration and trace extraction. Thus, our study provides clear suggestions on how to use these tools across multiple contexts.

      (2) The cell tracking accuracy was not rigorously validated, but rather estimated using a biased and coarse approach. Specifically, the accuracy was assessed based on the stability of GFP signals in the eat-4-labeled channel. A tracking error was assumed to occur when the GFP signal switched between eat-4-negative and eat-4-positive at a given time point. However, this estimation is imprecise and only captures a small subset of all potential errors. Although the authors introduced a correction factor to approximate the true error rate, the validity of this correction relies on the assumption that eat-4 neurons are uniformly distributed across the brain - a condition that is unlikely to hold.

      We respectfully disagree with this critique. We considered the alternative suggested by the reviewer (in their private comments to the authors) of comparing against a manually annotated dataset. But this annotation would require manually linking ~150 neurons across ~1600 timepoints, which would require humans to manually link neurons across timepoints >200,000 times for a single dataset. These datasets consist of densely packed neurons rapidly deforming over time in all 3 dimensions. Moreover, a single error in linking would propagate across timepoints, so the error tolerance of such annotation would be extremely low. Any such manually labeled dataset would be fraught with errors and should not be trusted. Instead, our approach relies on a simple, accurate assumption: GFP expression in a neuron should be roughly constant over a 16min recording (after bleach correction) and the levels will be different in different neurons when it is sparsely expressed. Because all image alignment is done in the red channel, the pipeline never “peeks” at the GFP until it is finished with neuron alignment and tracking. The eat-4 promoter was chosen for GFP expression because (a) the nuclei labeled by it are scattered across the neuropil in a roughly salt-and-pepper fashion – a mixture of eat-4-positive and eat-4-negative neurons are found throughout the head; and (b) it is in roughly 40% of the neurons, giving very good overall coverage. Our view is that this approach of labeling subsets of neurons with GFP should become the standard in the field for assessing tracking accuracy – it has a simple, accurate premise; is not susceptible to human labeling error; is straightforward to implement; and, since it does not require manual labeling, is easy to scale to multiple datasets. We do note that it could be further strengthened by using multiple strains each with different ‘salt-and-pepper’ GFP expression patterns.

      (3) Figure S1F demonstrates that the registration network, BrainAlignNet, alone is insufficient to accurately align arbitrary pairs of C. elegans head images. The high tracking accuracy reported is largely due to the use of a carefully designed registration sequence, matching only images with similar postures, and an effective clustering algorithm. Although the authors address this point in the Discussion section, the abstract may give the misleading impression that the network itself is solely responsible for the observed accuracy.

      Our tracking accuracy requires (a) a careful selection of registration problems, (b) highly accurate registration of the selected registration problems, and (c) effective clustering. We extensively discussed the importance of the choosing of the registration problems in the Results section (lines 218-234 and 318-321), Discussion section (lines 704-708), and Methods section (955-970 and 1246-1250) of our paper. We also discussed the clustering aspect in the Results section (lines 247-259), Discussion section (lines 708-712), and Methods section (lines 1162-1206). In addition, our abstract states that the BrainAlignNet needs to be “incorporated into an image analysis pipeline,” to inform readers that other aspects of image analysis need to occur (beyond BrainAlignNet) to perform tracking.

      (4) The reported accuracy for neuron identification and automatic classification may be misleading, as it was assessed only on a subset of neurons labeled as "high-confidence" by human annotators. Although the authors did not disclose the exact proportion, various descriptions (such as Figure 4f) imply that this subset comprises approximately 60% of all neurons. While excluding uncertain labels is justifiable, the authors highlight the high accuracy achieved on this subset without clearly clarifying that the reported performance pertains only to neurons that are relatively easy to identify. Furthermore, they do not report what fraction of the total neuron population can be accurately identified using their methods-an omission of critical importance for prospective users.

      The reviewer raises two points here: (1) whether AutoCellLabeler accuracy is impacted by ease of human labeling; and (2) what fraction of total neurons are identified. We address them one at a time.

      Regarding (1), we believe that the reviewer overlooked an important analysis in our study. Indeed, to assess its performance, one can only compare AutoCellLabeler’s output against accurate human labels – there is simply no way around it. However, we noted that AutoCellLabeler was identifying some neurons with high confidence even when humans had low confidence or had not even tried to label the neurons (Fig. 4F). To test whether these were in fact accurate labels, we asked additional human labelers to spend extra time trying to label a random subset of these neurons (they were of course blinded to the AutoCellLabeler label). We then assessed the accuracy of AutoCellLabeler against these new human labels and found that they were highly accurate (Fig. 4H). This suggests that AutoCellLabeler has strong performance even when some human labelers find it challenging to label a neuron. However, we agree that we have not yet been able to quantify AutoCellLabeler performance on the small set of neuron classes that humans are unable to identify across datasets.

      Regarding (2), we agree that knowing how many neurons are labeled by AutoCellLabeler is critical. For example, labeling only 3 neurons per animal with 100% accuracy isn’t very helpful. We wish to emphasize that we did not omit this information: we reported the number of neurons labeled for every network that we characterized in the study, alongside the accuracy of those labels (please see Figures 4I, 5A, and 6G; Figure 4I also shows the number of human labels per dataset, which the reviewer requested). We also showed curves depicting the tradeoff between accuracy and number of neurons labeled, which fully captures how we balanced accuracy and number of neurons labeled (Figures 5D and S4A). It sounds like the reviewer also wanted to know the total number of recorded neurons. The typical number of recorded neurons per dataset can also be found in the paper in Fig. 2E.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      The study investigated how individuals living in urban slums in Salvador, Brazil, interact with environmental risk factors, particularly focusing on domestic rubbish piles, open sewers, and a central stream. The study makes use of the step selection functions using telemetry data, which is a method to estimate how likely individuals move towards these environmental features, differentiating among groups by gender, age, and leptospirosis serostatus. The results indicated that women tended to stay closer to the central stream while avoiding open sewers more than men. Furthermore, individuals who tested positive for leptospirosis tended to avoid open sewers, suggesting that behavioral patterns might influence exposure to risk factors for leptospirosis, hence ensuring more targeted interventions. 

      Strengths: 

      (1) The use of step selection functions to analyze human movement represents an innovative adaptation of a method typically used in animal ecology. This provides a robust quantitative framework for evaluating how people interact with environmental risk factors linked to infectious diseases (in this case, leptospirosis). 

      (2) Detailed differentiation by gender and serological status allows for nuanced insights, which can help tailor targeted interventions and potentially improve public health measures in urban slum settings. 

      (3) The integration of real-world telemetry data with epidemiological risk factors supports the development of predictive models that can be applied in future infectious disease research, helping to bridge the gap between environmental exposure and health outcomes. 

      Weaknesses: 

      (1) The sample size for the study was not calculated, although it was a nested cohort study. 

      We thank Reviewer #1 for highlighting this weakness. We will make sure that this is explained in the next version of the manuscript. At the time of recruiting participants, we found no literature on how to perform a sample size calculation for movement studies involving GPS loggers and associated methods of analysis. Therefore, we aimed to recruit as many individuals as possible within the resource constraints of the study.  

      “Participants who were already enrolled in the cohort study were recruited to take part in the movement analysis study. At the time of recruitment, we found no published scientific studies detailing how to perform sample size calculations for research using GPS data in humans. Therefore, we opted to use convenience sampling instead. A target of 30 people per study area, balanced by gender and blind to their serological status, was chosen for this study.” [Lines 163 - 169]

      (2) The step‐selection functions, though a novel method, may face challenges in fully capturing the complexity of human decision-making influenced by socio-cultural and economic factors that were not captured in the study. 

      We agree with Reviewer #1 that this model may fail to capture the full breadth of human decisionmaking when it comes to moving through local environments. We included a section discussing the aspect of violence and how this influences residents’ choices, along with some possibilities on how to record and account for this. Although it is outside of the scope of this study, we believe that coupling these quantitative methods with qualitative studies would provide a comprehensive understanding of movement in these areas.  

      (3) The study's context is limited to a specific urban slum in Salvador, Brazil, which may reduce the generalizability of its findings to other geographical areas or populations that experience different environmental or socio-economic conditions. 

      We thank the reviewer for highlighting this limitation. We have made this more clear in the discussion section: 

      “As a result, the findings are biased towards the more represented individuals, limiting their generalisability. Additionally, all participants are from specific areas in Salvador, which may further limit the generalisability to similar contexts.” [Lines 561 - 564]

      (4) The reliance on self-reported or telemetry-based movement data might include some inaccuracies or biases that could affect the precision of the selection coefficients obtained, potentially limiting the study's predictive power. 

      We agree that telemetry data has inherent inaccuracies, which we have tried to account for by using only those data points within the study areas. We would like to clarify that there is no self-reported movement data used in this study. All movement data was collected using GPS loggers.  

      (5) Some participants with less than 50 relocations within the study area were excluded without clear justification, see line 149. 

      We found that the SSF models would not run properly if there weren’t enough relocations. Therefore, we decided to remove these individuals from the analysis. They are also removed from any descriptive statistics presented. We have now clarified this in the manuscript.  

      “Individuals with less than 50 relocations within the study area were excluded from the analysis to ensure good model convergence. Details of these excluded individuals can be found in Supplementary Material I.” [Lines 183 – 186]

      (6) Some figures are not clear (see Figure 4 A & B). 

      We have improved the resolution of the image and believe it is more clear now. Please let us know if the resolution still is not clear enough.  

      (7) No statement on conflict of interest was included, considering sponsorship of the study. 

      The conflict of interest forms for each author were sent to eLife separately. I believe these should be made available upon publication, but please reach out if these need to be re-sent.  

      Reviewer #2 (Public review): 

      Summary: 

      Pablo Ruiz Cuenca et al. conducted a GPS logger study with 124 adult participants across four different slum areas in Salvador, Brazil, recording GPS locations every 35 seconds for 48 hours. The aim of their study was to investigate step-selection models, a technique widely used in movement ecology to quantify contact with environmental risk factors for exposure to leptospires (open sewers, community streams, and rubbish piles). The authors built two different types of models based on distance and based on buffer areas to model human environmental exposure to risk factors. They show differences in movement/contact with these risk factors based on gender and seropositivity status. This study shows the existence of modest differences in contact with environmental risk factors for leptospirosis at small spatial scales based on socio-demographics and infection status. 

      Strengths: 

      The authors assembled a rich dataset by collecting human GPS logger data, combined with fieldrecorded locations of open sewers, community streams, and rubbish piles, and testing individuals for leptospirosis via serology. This study was able to capture fine-scale exposure dynamics within an urban environment and shows differences by gender and seropositive status, using a method novel to epidemiology (step selection). 

      Weaknesses: 

      Due to environmental data being limited to the study area, exposure elsewhere could not be captured, despite previous research by Owers et al. showing that the extent of movement was associated with infection risk. Limitations of step selection for use in studying human participants in an urban environment would need to be explicitly discussed. 

      The environmental factors used in the study required research teams to visit the sites and map the locations. Given that individuals travelled throughout the city of Salvador, performing this task at a large scale would be unachievable. Therefore, we limited the data to only those points within the study area boundaries to avoid any biases from interactions with unrecorded environmental factors.  

      Reviewing Editor Comments: 

      The manuscript would benefit from clearer articulation of SSF assumptions, data exclusions, and buffer choices, as well as improvements in figure clarity, to strengthen its generalizability and impact. 

      Please see replies to Reviewer #2 below regarding the assumptions (2.3), data exclusions (2.1) and buffer choices (2.2). We have improved Figure 4 clarity, please let us know if this is not sufficient.  

      Reviewer #1 (Recommendations for the authors): 

      (1) Provide comprehensive details on telemetry data collection for improved data quality and reproducibility. 

      Details for this are included under the “Methods/GPS Data” section. We have included a sentence to explain that we used to GPS device manufacturer’s software to programme them. We believe this provides enough information on how to collect the data for reproducibility, but please let us know if there is further information that we could provide.  

      “Individuals who consented to take part in this study were asked to wear GPS loggers for continuous periods of up to 48 hours, which could be repeated. The GPS loggers used were i-got U GT-600, set to record their location every 35 seconds. We used the manufacturer’s software to programme the devices. Data were collected between March and November 2022.” [Lines 172 - 176]

      (2) Check all figures and improve on clarity (see Figure 4). 

      We have updated Figure 4 and believe the resolution is better now. Please let us know if this it not the case from the readers perspective.  

      (3) Revisit sentence structures to improve readability and reduce overly complex phrasing. 

      We have reviewed the manuscript and made some changes to improve readability. 

      Reviewer #2 (Recommendations for the authors): 

      I thank Ruiz Cuenca et al. for putting together this interesting manuscript on the use of step selection functions for understanding exposure to leptospires in urban Brazil. I thoroughly enjoyed reading it and have a few suggestions that may improve the manuscript. 

      I also apologise, but I was not able to find some of the supplementary materials, for instance, Supplementary Material I. That may have been my oversight. 

      To eLife: These should have been included with the submitted manuscript file. Please let me know if it has to be resubmitted to eLife.

      (1) Descriptive statistics 

      Some more descriptive statistics would be helpful. For instance, what was the leptospirosis infection status of the six individuals who were removed due to having <50 points inside the area? As part of the analysis relies on exposure, defined as GPS locations within a 20m buffer of open sewers, community streams, and rubbish piles, it would be good to have some descriptive statistics around this. How many visits to these different sites did people make, and how did these statistics vary by study area, age, gender, and leptospirosis infection status? 

      We thank Reviewer #2 for highlighting this. Thanks to their comment, we noticed a mistake in the code which excluded more individuals from the summary statistics table than were actually excluded from the full analysis. There were only 2 individuals that had less than 50 relocations across the whole day (5 am to 9 pm) which were excluded from further analysis. The mistake has been rectified and the summary statistics updated. (see table 1)

      We have included the demographic details of excluded participants as a table in the supplementary material, which we have referenced to in the manuscript. We have also explained that the exclusion is to aid model convergence, as we found that too few relocations would result in SSF models not working properly.  

      “Individuals with less than 50 relocations within the study area were excluded from the analysis to ensure good model convergence. Details of these excluded individuals can be found in Supplementary Material I.” [Lines 183 – 186]

      We have also now included a table (Table 2),  to show more descriptive statistics of how much time individuals spent within each of the environmental buffers. 

      (2) Definitions of buffers 

      I was surprised that the authors chose a 20m buffer for each factor but 10m around the household.Could this be more clearly justified, especially given that there will be location errors in both the GPS location point and the GPS logger points? These buffers do appear quite small, particularly in an urban environment where obstruction from buildings can be expected to yield substantial GPS errors. 

      The 20 meter buffer represents an intense interaction with the point of interest. This distance was decided after visiting the sites and seeing the points of interest in person. The 10 meter buffer accounts for the size of dwellings in these areas. We have included these explanations in the new manuscript:  

      “The buffer rasters, one for each factor, were created using a 20 meter buffer around each reference point. The size of this buffer was decided after visiting the study areas and represented an area within which it could be considered a strong interaction with the point of interest.” [Lines 198 – 202]

      “Buffer rasters were also created for each individual’s household location, with a 10 meter buffer around each location.This represented space within and immediately outside each house.  This buffer size accounted for the size of dwellings in these study areas.” [Lines 205 - 208]

      (3) Assumptions of the step selection function 

      Step selection functions (SSFs) rely on a number of assumptions. Whether these assumptions are met needs to be critically discussed within the article. (For a discussion of the assumptions, I am relying on points raised in this article: Integrated step selection analysis: bridging the gap between resource selection and animal movement (2015): Tal Avgar, Jonathan R. Potts, Mark A. Lewis, Mark S. Boyce, DOI: https://doi.org/10.1111/2041-210X.12528). 

      First, SSFs typically assume each step is independent, conditional only on the previous step (Markovian process). This is violated in circular movements, for instance. Circular movements are highly likely in human movement as people will leave and return to their homes during the day. While this is partially addressed by conducting separate analyses by time of day, circular journeys can still exist within these segments. 

      Second, SSFs do not account for goal-oriented behaviour like intentional destination-seeking. So, for instance, when someone executes a plan to visit a specific stream to fetch drinking water, such behaviour is poorly approximated using SSFs because SSFs compare observed steps to random alternatives drawn from a movement kernel, assuming movement is opportunistic rather than intentional. 

      This is true of SSF that do not include movement attributes. However, in our SSF we have included both step lengths and turning angles, which, according to Avgar et al, should be enough to account for this goal-oriented behaviour. It may be clearer to call the model an integrated step selection function (iSSF), as they do in Avgar et al., which we can change in the next version of the manuscript.  

      Third, turning angles in human movement are often sharp due to regular street layout, which can violate the assumptions of SSFs, which usually assume smooth, correlated movement. 

      As this paper proposes SSFs as a novel method to measure exposure to environmentally transmitted pathogens, a discussion on the extent to which assumptions of SSFs are valid for this purpose should be included in the paper. 

      We thank Reviewer #2 for highlighting these points. We have included a section discussing these assumptions in detail: 

      “Additionally, these models have some underlying assumptions that may be violated in this study. Step-selection functions assume each step is independent, conditioned on the previous step. This can be violated by circular journeys. Although we attempted to account for these by analysing specific periods of the day, a higher temporal resolution of analysis may be needed if circular journeys are still present within each period. Another assumption is that movement is smooth through the environment. In urban environments this may not hold true, as street layouts may force sharp corners in movements. The effect of violating this assumption is not immediately clear and requires further methodological research to understand its significance. Finally, we assumed that by including movement characteristics (step lengths and turning angles) into our models, we were accounting for goal-oriented behaviour. These assumptions need to be considered in future studies that attempt to use step-selection functions to analyse human mobility.” [Lines 593 - 607]

      (4) Abstract 

      While it is highlighted in the abstract that this "study introduces a novel method for analysing human telemetry data in infectious disease research, providing critical insights for targeted interventions", I did not see any discussion about how the findings can inform interventions. 

      We thank Reviewer #2 for highlighting this. We have now removed this wording from the abstract to avoid misunderstanding.  

      (5) Effect sizes 

      It would have helped me if there had been some discussion around the size of these effects. Especially for the distance-based models, the effects seem very small. Maybe this is a misinterpretation on my part, but it would help to contextualise if the observed effect were small or large. 

      We agree with Reviewer #2 on this point and have now included a paragraph explaining that these effect sizes are indeed very small. We believe that this may be linked to the spatial scale of the rasters used (1 meter), as the selection coefficients represent changes with regards to increasing distances of 1 meter. This may not be that significant for human mobility. However, given the focus on analysing fine scale movement, we decided to keep the spatial scale of the rasters as small as possible. 

      “It is important to highlight that the effect sizes of the selection coefficients for the distance based rasters are very small and could be considered negligible. This may be linked to the spatial scale used, as these values represent increases of 1 meter. A coarser scale may have produced larger effect sizes that may have been easier to conceptualise. However, given the focus on fine-scale movement, we decided to keep this spatial scale for the analysis.” [Lines 421 - 427]

    1. Reviewer #2 (Public review):

      This work presents theoretical results concerning the effect of punctuated mutation on multistep adaptation and empirical evidence for that effect in cancer. The empirical results seem to agree with the theoretical predictions. However, it is not clear how strong the effect should be on theoretical grounds, and there are other plausible explanations for the empirical observations.

      For various reasons, the effect of punctuated mutation may be weaker than suggested by the theoretical and empirical analyses:

      (1) The effect of punctuated mutation is much stronger when the first mutation of a two-step adaptation is deleterious (Figure 2). For double inactivation of a TSG, the first mutation--inactivation of one copy--would be expected to be neutral or slightly advantageous. The simulations depicted in Figure 4, which are supposed to demonstrate the expected effect for TSGs, assume that the first mutation is quite deleterious. This assumption seems inappropriate for TSGs, and perhaps the other synergistic pairs considered, and exaggerates the expected effects.

      (2) More generally, parameter values affect the magnitude of the effect. The authors note, for example, that the relative effect decreases with mutation rate. They suggest that the absolute effect, which increases, is more important, but the relative effect seems more relevant and is what is assessed empirically.

      (3) Routes to inactivation of both copies of a TSG that are not accelerated by punctuation will dilute any effects of punctuation. An example is a single somatic mutation followed by loss of heterozygosity. Such mechanisms are not included in the theoretical analysis nor assessed empirically. If, for example, 90% of double inactivations were the result of such mechanisms with a constant mutation rate, a factor of two effect of punctuated mutagenesis would increase the overall rate by only 10%. Consideration of the rate of apparent inactivation of just one TSG copy and of deletion of both copies would shed some light on the importance of this consideration.

      Several factors besides the effects of punctuated mutation might explain or contribute to the empirical observations:

      (1) High APOBEC3 activity can select for inactivation of TSGs (references in Butler and Banday 2023, PMID 36978147). This selective force is another plausible explanation for the empirical observations.

      (2) Without punctuation, the rate of multistep adaptation is expected to rise more than linearly with mutation rate. Thus, if APOBEC signatures are correlated with a high mutation rate due to the action of APOBEC, this alone could explain the correlation with TSG inactivation.

      (3) The nature of mutations caused by APOBEC might explain the results. Notably, one of the two APOBEC mutation signatures, SBS13, is particularly likely to produce nonsense mutations. The authors count both nonsense and missense mutations, but nonsense mutations are more likely to inactivate the gene, and hence to be selected.

    1. Reviewer #1 (Public review):

      Summary:

      This study focuses on characterizing the EEG correlates of item-specific proportion congruency effects. In particular, two types of learned associations are characterized. One being associations between stimulus features and control states (SC), and the other being stimulus features and responses (SR). Decoding methods are used to identify SC and SR correlates and to determine whether they have similar topographies and dynamics.

      The results suggest SC and SR associations are simultaneously coactivated and have shared topographies, with the inference being that these associations may share a common generator.

      Strengths:

      Fearless, creative use of EEG decoding to test tricky hypotheses regarding latent associations.

      Nice idea to orthogonalize the ISPC condition (MC/MI) from stimulus features.

      Weaknesses:

      (1) I'm relatively concerned that these results may be spurious. I hope to be proven wrong, but I would suggest taking another look at a few things.

      While a nice idea in principle, the ISPC manipulation seems to be quite confounded with the trial number. E.g., color-red is MI only during phase 2, and is MC primarily only during Phase 3 (since phase 1 is so sparsely represented). In my experience, EEG noise is highly structured across a session and easily exploited by decoders. Plus, behavior seems quite different between Phase 2 and Phase 3. So, it seems likely that the classes you are asking the decoder to separate are highly confounded with temporally structured noise.

      I suggest thinking of how to handle this concern in a rigorous way. A compelling way to address this would be to perform "cross-phase" decoding, however I am not sure if that is possible given the design.

      The time courses also seem concerning. What are we to make of the SR and SC timecourses, which have aggregate decoding dynamics that look to be <1Hz?

      Some sanity checks would be one place to start. Time courses were baselined, but this is often not necessary with decoding; it can cause bias (10.1016/j.jneumeth.2021.109080), and can mask deeper issues. What do things look like when not baselined? Can variables be decoded when they should not be decoded? What does cross-temporal decoding look like - everything stable across all times, etc.?

      (2) The nature of the shared features between SR and SC subspaces is unclear.

      The simulation is framed in terms of the amount of overlap, revealing the number of shared dimensions between subspaces. In reality, it seems like it's closer to 'proportion of volume shared', i.e., a small number of dominant dimensions could drive a large degree of alignment between subspaces.

      What features drive the similarity? What features drive the distinctions between SR and SC? Aside from the temporal confounds I mentioned above, is it possible that some low-dimensional feature, like EEG congruency effect (e.g., low-D ERPs associated with conflict), or RT dynamics, drives discriminability among these classes? It seems plausible to me - all one would need is non-homogeneity in the size of the congruency effect across different items (subject-level idiosyncracies could contribute: 10.1016/j.neuroimage.2013.03.039).

      (3) The time-resolved within-trial correlation of RSA betas is a cool idea, but I am concerned it is biased. Estimating correlations among different coefficients from the same GLM design matrix is, in general, biased, i.e., when the regressors are non-orthogonal. This bias comes from the expected covariance of the betas and is discussed in detail here (10.1371/journal.pcbi.1006299). In short, correlations could be inflated due to a combination of the design matrix and the structure of the noise. The most established solution, to cross-validate across different GLM estimations, is unfortunately not available here. I would suggest that the authors think of ways to handle this issue.

      (4) Are results robust to running response-locked analyses? Especially the EEG-behavior correlation. Could this be driven by different RTs across trials & trial-types? I.e., at 400 ms post-stim onset, some trials would be near or at RT/action execution, while others may not be nearly as close, and so EEG features would differ & "predict" RT.

      (5) I suggest providing more explanation about the logic of the subspace decoding method - what trialtypes exactly constitute the different classes, why we would expect this method to capture something useful regarding ISPC, & what this something might be. I felt that the first paragraph of the results breezes by a lot of important logic.

      In general, this paper does not seem to be written for readers who are unfamiliar with this particular topic area. If authors think this is undesirable, I would suggest altering the text.

    1. Reviewer #2 (Public review):

      Summary:

      In this paper, the authors used a multi-level modelling approach to reanalyse trial data from Takeda's Phase III randomised control trial investigating the efficacy of the TAK-003 vaccine against dengue. The aim of the paper is to refine uncertainty by incorporating all the available data into the model and pooling across stratifications that are correlated. A major challenge in constructing a likelihood that allows for data available at differing levels of aggregation by group and outcome, and at different time intervals. This is done by first supposing that the data is available without aggregation for all groups, outcomes and time points, and then marginalising over the aggregated levels. The model is validated using simulations and then applied to trial data from Takeda. Results appear to corroborate those of Takeda with reduced uncertainty in the estimates.

      Strengths:

      The main strength of the paper is the multi-level modelling approach. It is a particularly natural one for this setting. One reason for this, as discussed in the paper, is that correlations across stratifications can arise when there are similarities in their underlying causal structure. It is more realistic to model this nested data structure hierarchically. Another reason, also well discussed in the paper, is the reduction in uncertainty you get when you pool estimates across related groups. Multi-level modelling is also beneficial when group sizes are different. For example, there were too few cases of DENV-4 from seronegatives, which resulted in hospitalised disease for the original analysis to produce estimates, but by using multi-level modelling, this paper can produce estimates. The modelling framework developed in this paper will be simple to extend to further trial data collected in the future.

      Another strength is that it is reanalysing existing trial data, which is both cost-effective and beneficial for scientific reproducibility. This approach also helps to assess the robustness of conclusions about the efficacy of the TAK-003 vaccine to use of different analytical methods.

      The paper is well-written. The tables and figures presented in this paper are particularly informative. Protection conferred by the vaccine varies depending upon which variant a person is exposed to, their serostatus, and time since vaccination. The analysis presented supports the discussed conclusions. Comparisons between the results of this paper and the results of the original trial analysis are also shown and demonstrate a reduction in the uncertainty of parameter estimates, as desired.

      Weaknesses:

      The weakness of the paper is that it reports per-exposure protection instead of vaccine efficacy. This is methodologically sound, but it does limit the comparability of this study with the original trial analyses, which reported vaccine efficacy. It is therefore unclear whether the reduction in uncertainty observed is due solely to the multi-level modelling approach or whether it may be due in part to the parameters of interest being slightly different.

    1. Reviewer #2 (Public review):

      Summary:

      Across two experiments, this work presents a novel spatial predictive inference paradigm that facilitates the investigation of meta-learning across multiple environments with distinct statistics, as well as more local learning from sequences of observations within an environment. The authors present behavioral data indicating that people can indeed learn to distinguish between noise levels and calibrate their learning rates accordingly across environments, even on initial trials when revisiting an environment. They complement their behavioral results with computational modeling, further bolstering claims of both local and global adaptation. Additional fMRI results support the role of OFC in this meta-learning process, with central OFC activity reflecting similarity between environments. This similarity emerges over time with task experience. Holistically, this paradigm and these data add to our understanding of how humans dynamically adapt their behavior on different timescales.

      Strengths:

      The novel paradigm represents a clever and creative expansion of spatial predictive inference tasks. The cover story was well chosen to facilitate an intuitive understanding of both the differences between environments and the estimation of the mean within environments.

      Additionally, the authors present complementary results from two experiments, which strengthen the behavioral findings. This is especially effective as the initial experiment's results were a bit noisy, and the modifications within the second experiment increased both power and the specificity/accuracy of participant predictions. Taken together, the behavioral results provide convincing evidence that participants did distinguish environments based on their underlying statistics and adapted their initial behavior accordingly.

      Beyond this, the combination of behavioral results, computational modeling, and neuroimaging enhances the impact of the work. It paints a fuller picture of whether and how humans meta-learn the global statistics of environments, and this is an important direction for the field of adaptive learning.

      Weaknesses:

      (1) The authors make the distinction between meta-learned "global" learning rates and within-environment learning rate adaptation in response to "local" fluctuations/observations. Though the experimental paradigm is novel, there are certainly links to prior work - for instance, though change point structures don't entail revisiting unique environments, they do require meta-learning from environmental statistics that is distinct from transient local adaptation to prediction errors. This tendency to increase one's learning rate after large prediction errors is appropriate in change point environments, though, as is true in this study, the amount of increase should be dependent on. This represents a similar kind of slower-timescale learning or reuse of more "global" parameters, and can be seen to different extents in prior work. It might benefit readers if the authors were to link the current work to previous research more explicitly to draw clearer connections between the approaches and findings.

      (2) Throughout much of the paper, the authors refer to the distinctions between environments primarily as differences in "initial learning rates" or "environment-specific learning rates." This is particularly prominent when discussing fMRI results. Though the optimal initial learning rate did differ across environments, this was the result of differences in underlying task statistics. It will be important to clarify this throughout the text, because of the confounds between task statistics and initial learning rate (and to some extent, the position on the screen), it is not possible to separate the impact of these specific variables. This is also relevant to understanding the justification for using methods like RSA to test whether brain regions represent task states similarly. If the main hypothesis is that neural activity reflects the (initial) learning rate itself, then a univariate analysis approach would seem more natural.

      (3) For the neuroimaging results in particular, the specificity of some of the results (e.g. ventral striatum showing an effect of prediction error only in the low noise condition in the second half of task experience, only on the first trial) is a bit surprising. Additional justification of or context for these results would be useful to help readers gauge how expected or surprising these findings are.

      (4) There are some methodological details that are unclear (e.g., how were the positions of the crabs selected relative to the location they emerged from? Looking at Figure 1C, it looks like the crabs spread out unevenly, and that the single position they emerge from is not necessarily at the center of the crab locations.) Additional detail and clarity would help address some unanswered questions (more details below).

    1. The working definition of prior restraint used in this Article willtherefore incorporate the following factors: (1) the form of a restric-tion and its timing, not the content of an expression or the anticipateddanger arising from it; (2) the difference between prior restraints andsubsequent sanctions; (3) a sufficiently broad perspective to enableconsideration of a proposed "constitutionalism of means"; (4) prox-imity to the accepted legal conception of the term; and (5) the nexusto the common-sense meaning of the words

      This quote lays out a detailed way to understand prior restraint, focusing on the timing and method of stopping speech rather than its content. It also separates it from consequences that happen after the fact and connects the idea to both legal reasoning and everyday understanding, making it easier to see how the concept really works in practice.

    1. Reviewer #1 (Public review):

      In this well-written and timely manuscript, Rieger et al. introduce Squidly, a new deep learning framework for catalytic residue prediction. The novelty of the work lies in the aspect of integrating per-residue embeddings from large protein language models (ESM2) with a biology-informed contrastive learning scheme that leverages enzyme class information to rationally mine hard positive/negative pairs. Importantly, the method avoids reliance on the use of predicted 3D structures, enabling scalability, speed, and broad applicability. The authors show that Squidly outperforms existing ML-based tools and even BLAST in certain settings, while an ensemble with BLAST achieves state-of-the-art performance across multiple benchmarks. Additionally, the introduction of the CataloDB benchmark, designed to test generalization at low sequence and structural identity, represents another important contribution of this work.

      I have only some minor comments:

      (1) The manuscript acknowledges biases in EC class representation, particularly the enrichment for hydrolases. While CataloDB addresses some of these issues, the strong imbalance across enzyme classes may still limit conclusions about generalization. Could the authors provide per-class performance metrics, especially for underrepresented EC classes?

      (2) An ablation analysis would be valuable to demonstrate how specific design choices in the algorithm contribute to capturing catalytic residue patterns in enzymes.

      (3) The statement that users can optionally use uncertainty to filter predictions is promising but underdeveloped. How should predictive entropy values be interpreted in practice? Is there an empirical threshold that separates high- from low-confidence predictions? A demonstration of how uncertainty filtering shifts the trade-off between false positives and false negatives would clarify the practical utility of this feature.

      (4) The excerpt highlights computational efficiency, reporting substantial runtime improvements (e.g., 108 s vs. 5757 s). However, the comparison lacks details on dataset size, hardware/software environment, and reproducibility conditions. Without these details, the speedup claim is difficult to evaluate. Furthermore, it remains unclear whether the reported efficiency gains come at the expense of predictive performance.

      (5) Given the well-known biases in public enzyme databases, the dataset is likely enriched for model organisms (e.g., E. coli, yeast, human enzymes) and underrepresents enzymes from archaea, extremophiles, and diverse microbial taxa. Would this limit conclusions about Squidly's generalisability to less-studied lineages?

    1. List all folders/files created/modified over the last two month by the owner/curator of the Peergos Name: hyperpost

      Gyuri Lajos Lead Envisioneer for the Indy Learning Commons on the IndyWeb

      Annotate this listing for providing publicly shared information related to non public documents/folders

      add secret links with passwords and expiry dates to provide controlled access to information or anchoring threaded conversations

      Yes Indywiki is the Way

      the alternative that completes it to o bbecome what it always dreamed to be but failed to become

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewers for providing thoughtful and constructive feedback, which will help us improve the clarity and rigor of the paper. On balance, the reviews were positive. Reviewer 1 mentioned that “This is a strong manuscript with few problems and all important findings well justified, indeed this is a nicely polished…..high-quality manuscript,” and that “this paper makes a major breakthrough, showing that cell autonomous defects in hTSCs are very likely at the heart of the pathology observed in GIN-prone murine mutants.” Reviewer 3 stated that “The study is well designed, and the manuscript is very well written. The conclusions are supported by the evidence presented.” Reviewer 2 was less enthusiastic, with main concerns being that “The paper is mostly descriptive and often quite confusing leaving one not much closer to understanding the mechanistic basis for the interesting sex-biased semi-lethal phenotype.” and felt that figure titles/section headers overstated the results, and finally recommended to improve some technical aspects and tempering conclusions. The proposed edits we think address most issues raised by the reviewers either with re-writing or adding data as described below.

      In response to reviewer #1 comments:

      Major comments:

      • I am confused as to the basis of the sex-skewing phenomenon? Is the problem that lack of maternally loaded WT Mcm4 worsens the phenotype, or is the issue that Mcm4C3/C3 dams are less able to retain pregnancies, perhaps being a more inflammatory environment? Also, while there quite consistent evidence for reduced viability of Mcm4C3/C3McmGt/+ progeny, especially for female progeny, how confident can we be that the genotype of the dam vs. sire is important? Notably on a Ddx58 background, the progeny of the Mcm4C3/C3 sire included seven live male Mcm4C3/C3McmGt/+ but no female.

      Regarding the first point (sex skewing only when female is C3/C3), we also suspected either: 1) the maternal uterine environment, or 2) reduced oocyte quality. Although not reported in this manuscript, we tested #1 by performing embryo transfer experiments. Transferring 2-cell stage embryos from sex-skewing mating to WT females did not rescue the sex-bias. We then examined oocytes from C3/C3 females. We found evidence for compromised mitochondria and transcriptome disruption. However, we are not sure why this happens (poor follicle support? Oocyte intrinsic phenomenon?). We are reserving these results and additional experiments for another paper, especially since this one mainly deals with GIN and placenta development. If the reviewers feel strongly that the embryo transfer data is crucial, we can include it.

      Regarding how confident we are that the genotype of the dam vs. sire is important, this stems from our previous paper by McNairn et al 2019 (the percentage of female C3/C3 M2/+ from sex-skewing mating is 20% compared to 60% from the reciprocal mating), which was quite dramatic. Consistent with this, MCM levels were significantly reduced in the placentae only when the dam was C3/C3 and the sire C3/+ M2/+, but not in the reciprocal cross. The reviewer makes a good observation about the Ddx58 cross; we can only hypothesize that the mutation somehow sensitizes females in this scenario and will make mention of it in the revision. We also realize that we neglected to write in Methods that the Ddx58 allele was coisogenic in the C3H background.

      • I'm not sure what Supplementary Figure 6 is showing (faster differentiation of C3 but less TGC?). Regardless, it's hard to draw too much conclusion from one not-very-pretty Western blot. This figure requires both additional replicates and a better explanation of how it fits with the other conclusions of the paper..

      We hypothesized that the JZ defect observed in the semi-lethal genotype placentas could arise either from impaired maintenance of the progenitor pool or from reduced capacity of mutant trophoblast progenitors to differentiate into the JZ lineage. The blot in Supplementary Figure 6 was intended as a qualitative demonstration that mutant trophoblast stem cells can differentiate into JZ lineages. We recognize that the figure is not definitive and will revise the text to clarify its purpose. A replicate(s) of the Western will be performed as suggested.

      • Supplementary Figure 7F-G is puzzling. Half of the mESCs have gamma-H2AX at all times, including most in S or G2 phase? In Figure S7E, do the quadrants correspond to being negative or positive for gamma-H2AX? At very least, IF images showing clear gamma-H2AX foci would be much more convincing.

      The gates for γH2AX FACS analysis were established using negative controls lacking primary antibody. As reported previously, embryonic stem cells display high basal levels of γH2AX staining (Chuykin et al., Cell Cycle 2008; Turinetto et al., Stem Cells 2012; Ahuja et al., Nat Comm 2016), which likely explains the broad signal observed across cell cycle phases. Regardless, we will provide immunofluorescence staining of γH2Ax and foci count in our revision.

      • The methods section is well detailed, but it would be ideal to clarify how many replicates each Western Blot or flow cytometry experiment is representative of.

      Thanks for the suggestion. We will update this for Fig4 and Fig5.

      Minor comments:

      • Is it possible that cGAS-STING and RIG pathways act redundantly to cause inflammation and lethality, or that other innate immune components are involved? I don't expect the authors to make compound mutants to test this but at least this possibility should be discussed textually.

      We appreciate the reviewer’s point, and had the same suspicion. Supporting this, we will add new RNA-seq analysis of Tmem173 KO placentas revealed elevated inflammatory gene expression compared to C3/C3 M2/+ controls, consistent with potential redundancy or feedback regulation. We will update in supplementary figures to reflect this.

      In response to reviewer #2 comments:

      Major comments:

      A major concern throughout the paper is that conclusions are often overstating their data. The title of figure 2 is "placentae with replication stress have smaller junctional and labyrinth zones". However, there is no measure of replication stress in this figure, just a histological evaluation of the placentae from the different mutants. The title of figure 3 is "Impact of GIN on LZ is less than JZ," but there is no measure of GIN, but instead measurement of number of cells in cell cycle and some bulk RNA-seq analysis. Title of figure 4 is "TSCs with increased genomic instability exhibit abnormal phenotypes." Again there is no measure of GIN, but instead staining of derived TSCs for proliferation, cell death, and a TSC marker. Title of figure 5 is "DNA damage responses and G2/M checkpoint activation drive premature TSC differentiation." However, there does not appear to be a difference in gH2AX between the two mutant genotypes. Checkpoint proteins might be up, but need quantification and reproduction. > 4C is the only marker of differentiation. Importantly, all the analyses here are associations, not connections, so cannot use the word "drive". Similar issues can be raised with a number of the supplementary figures.

      The Chaos3 (chromosome aberrations occurring spontaneously 3) model is a well-established system of intrinsic chronic replication stress and GIN. It is characterized by ~20 fold elevation of blood micronuclei (Shima et al., Nature 2007), a hallmark of GIN (Soxena et al., Mol Cell 2022); a destabilized MCM2-7 helicase prone to replication fork collapse (Bai et al., PLoS Genet 2016); and increased mitotic chromosome abnormalities and decreased dormant origins (Kawabata et al., Mol Cell 2011; Chuang et al., Nucleic Acid Res 2012) that are known to cause GIN and replication stress (Ibarra et al., PNAS 2008 ). Also, in our previous work (McNairn et al Nature 2019), we showed that placentae from C3/C3 dams exhibit significantly elevated γH2Ax as well as reduced MCM2 and MCM4 protein levels. In our current study, we also observe elevated γH2Ax in mutant TSCs (C3/C3 and C3/C3 M2/+), consistent with genomic instability. Nevertheless, we acknowledge that in TSCs, we did not formally demonstrate replications stress(RS), so where appropriate, we will advise figure titles, for example to say that “cells/placentae with a GIN or RS genotype.”

      We acknowledge the reviewers concern regarding western blots. We will provide quantification and statistics in our revision.

      1) A deeper analysis of the cell lines is likely to be the most fruitful path to reveal interesting mechanisms. It is very surprising that there is no phenotype in ESCs. Authors should check for increased apoptosis. Maybe the phenotypic cells are lost. Or do ESCs use different MCMs/mechanisms of DNA replication or are they better able to handle replication stress and GIN? How many passages were the TSCs and ESCs cultured for? Does GIN (i.e. aneuploidy, CNVs) develop in TSCs and ESCs with passaging? How do the MCM mutations impact the molecular identity of the ESC and TSC cells including their heterogeneity in the population.

      We assessed apoptosis using cleaved caspase 3 flow cytometry in mutant ESCs and observed no difference compared to controls (we will add this data as Supplementary Fig. 7).

      We believe there are intrinsic differences in TSCs and ESCs in their ability to respond to and counteract replication stress and DNA damage. ESCs are known to license more replication origins than somatic cells at a higher rate, which protects them from short G1-induced replication stress (Ahuja et al., Nat Comm 2016; Ge et al., Stem Cell Rep 2015; Matson et al., eLife 2017). Human placental cells physiologically exhibit high levels of mutation rate and chromosomal instability in vivo (Coorens et al., Nature 2021). Supporting this, Wang, D., et al (Nat Comm 2025) reported that several cell cycle and DDR regulators are differentially expressed in human TSCs vs human pluripotent stem cells. Whether such transcriptional differences directly contribute to functional outcomes remains to be determined.

      All experiments in this study were conducted using early-passage ESCs and TSCs (i.e. Finally, we showed that close to 90% mutant ESCs are KLF4+ (a naive pluripotency marker) whereas EOMES+ cells were significantly reduced in TSCs carrying the GIN genotype (Fig. 4E–F and Supplementary Fig. 7), highlighting lineage-specific differences.

      Minor Comments:

      1) There is a lack of quantification and repeats for all Westerns. At minimum there should be three repeats for each experiment, quantification including normalization to a reference protein, and stats confirming any proposed differences between conditions.

      We will update our revision with quantification and statistics for western blots.

      2) I would recommend moving the results in supp table 1 to figure 1. While negative, they are the newer results. The results shown in current figure 1 are essentially a reproduction of their previous work.

      The placental observations presented in Fig.1 are new. In particular, the placental and embryonic weight measurements graphed in Fig1B and C have not been published by our group. Fig1A reproduces our previous observation on embryo viability in GIN mutants (McNairn et al., Nature 2019), while the schematic was provided for better flow and readability given the complex mating schemes. We are agnostic on the Suppl Table 1. It could be changed to a new Table 1 in the main section depending on the journal.

      In response to reviewer #3 comments:

      Major Comments

      While the inclusion of bulk RNAseq data of whole placental tissue is appreciated, the interpretation of the results is somewhat problematic, as it is acknowledged that the cell type composition of the placentas is drastically different between groups. Making conclusions based upon GSEA analysis of two different groups with drastically different cell type composition is somewhat misleading, as based on the results, it is a direct reflection of the cell types present. It would be more helpful to perform cell type deconvolution of the RNAseq data to estimate the proportion of each cell type within the bulk samples and compare that to what is seen histologically and not dive too deeply into the pathways since the results could just be a reflection of the cell types e.g. angiogenesis pathways from more endothelial cells. Additionally, the RNAseq data can be leveraged to look at expression of inflammatory genes by sex, which may show interesting patterns based on the other results.

      We agree that the representation of cell types in the placenta is problematic especially for underrepresented genes. We propose to use the BayesPrism tool (Chu et al., Nat Cancer 2022) to deconvolute bulk RNA-seq for better representation of transcriptional changes in the placenta.

      Section: GIN impairs trophoblast stem cell establishment and maintenance. To support the assertion in the first paragraph, beyond measuring apoptosis, it would be helpful at this stage to look at RNA expression levels indicative of the activation of DNA damage checkpoint genes

      We have performed RNA-seq on mutant ESC and TSCs and are in the process of data analysis. We will update these results in the revision.

      Please include additional methodological details in the methods section on the statistical analysis done for differential expression analysis. Specifically, what type of normalization was used, if lowly expressed genes were filtered out and at what cutoff, what statistical model was used (did you include covariates?), what comparisons were made? Did you stratify by sex? What cutoff was used for statistical significance? Did you perform multiple testing correction?

      We will update RNA-Seq data analysis methods in our full revision.

      2. Description of the revisions that have already been incorporated in the transferred manuscript

      Reviewer #1 comments:

      • Supplementary Table 1. would be enhanced greatly showing comparable tables for Mcm4C3/C3 x Mcm4C3/+McmGt/+ in mice without the Tmem173 or Ddx58 mutations. It is fine to recycle data from McNairn 2019 here, as long as the source is indicated, but a comparison is needed.

      Thanks for pointing this out. We have updated this suggestion in Supp table 1.

      • In Figure S3E-F, is the box above each graph supposed to show the genotype of the dam?

      Yes. Thanks for pointing this out. We have added a description in the figure legend to make it clear.

      • "Indeed, the placenta and embryo weights of E13.5 Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ animals were significantly improved vs. Mcm4C3/C3 Mcm2Gt/+ animals, rendering them similar to Mcm4C3/C3 littermates (Fig. 6A-C). The JZ (but not LZ) area in Mcm4C3/C3 Mcm2Gt/+ Mcm3Gt/+ placentae also increased to the level of Mcm4C3/C3 littermates (Fig. 6D-H)." There are two problems here. First, the figure calls are wrong. Second, the description of the data is not quite right, it looks like the C3/C3 and C3/C3 M2/+ M3/+ LZs are a similar size to each and are statistically indistinguishable.

      Thanks for catching this. We have updated these in the main text.

      *Reviewer #2 comments: *

      Minor comment

      • Need to review citations to figures. For example, no citations are made to figure 4a and 4c.

      Thanks for catching this. We have updated the text.

      Reviewer #3 comments:

      Define the first use of >4C DNA content to help readers understand this potentially unfamiliar term.

      We have edited this part to indicate cells with more than 4C DNA content for better clarity.

      iDEP tool - please include citation to manuscript instead of link

      We have updated this citation.

      Check citations. Some citations to BioRxiv that are now published e.g. 13.

      We have updated this citation.

      3. Description of analyses that authors prefer not to carry out

      Reviewer 2

      2) Along similar lines, most of the in vivo phenotypic analyses are performed at E13.5, long after defects are likely beginning to express themselves especially given that they see phenotypes in the TSCs, which represent the polar TE of a E4.5. To understand the primary defects of the in vivo phenotype, they should be looking much earlier. Supplemental figure 5 is a start but represents a rather superficial analysis.

      The peri-implantation period, namely E4.5, represents a “black box” of embryonic development given that this is a critical stage for implantation. Aside from being an extremely difficult stage to analyze technically, we don’t think it is essential to the conclusions (or doable in a timely manner), especially given the use of TSCs. If we complete EdU studies on E6.5 embryos, we will include them.

      3) Fig. 6 would benefit from evidence that MCM3 mutant is rescuing MCM4 levels in the chromatin fraction of cells and the DNA damage phenotype.

      The genetic evidence presented is strong, and although we didn’t do the suggested experiment, we feel that our previous studies (McNairn et al., Nature 2019 and Chuang et al., PLoS Genet 2010) on the effects of MCM3 as a nuclear export factor (as it is in yeast (Liku et al., Mol Biol Cell 2005)) are a reasonable basis for not repeating such experiments. Furthermore, we are no longer maintaining the Mcm3 line and it would take over a year to reconstitute and rebreed triple mutants.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The manuscript, "Chronic replication stress-mediated genomic instability disrupts placenta development in mice" by Munisha et al follows up a 2019 paper in Nature by the same group where they show that mutations to the MCM genes lead to a sex-skewed semi-lethal phenotype starting after embryonic day 9.5 and extending to birth. In the paper, they hypothesized that the semi-lethality is secondary to genomic instability (GIN) driven inflammation due to activation of the innate immune pathways sensing cytoplasmic DNA. In this paper, they start by disproving that hypothesis and then go on to present data arguing lethality is due to a placental development defect rather than inflammation. The paper is mostly descriptive and often quite confusing leaving one not much closer to understanding the mechanistic basis for the interesting sex-biased semi-lethal phenotype that was described in their original paper. The most interesting aspect of the paper is the derivation of TSC and ESCs and initial analysis suggesting that the TSCs are more sensitive to the MCM mutations, but the analysis is rather shallow. Importantly it is unclear how the phenotype explains the sex-skewing of the phenotype. Are the TSC phenotypes sex-skewed and if so why? Also, why is the JZ and especially GlyTCs most effected?

      A major concern throughout the paper is that conclusions are often overstating their data. The title of figure 2 is "placentae with replication stress have smaller junctional and labyrinth zones". However, there is no measure of replication stress in this figure, just a histological evaluation of the placentae from the different mutants. The title of figure 3 is "Impact of GIN on LZ is less than JZ," but there is no measure of GIN, but instead measurement of number of cells in cell cycle and some bulk RNA-seq analysis. Title of figure 4 is "TSCs with increased genomic instability exhibit abnormal phenotypes." Again there is no measure of GIN, but instead staining of derived TSCs for proliferation, cell death, and a TSC marker. Title of figure 5 is "DNA damage responses and G2/M checkpoint activation drive premature TSC differentiation." However, there does not appear to be a difference in gH2AX between the two mutant genotypes. Checkpoint proteins might be up, but need quantification and reproduction. > 4C is the only marker of differentiation. Importantly, all the analyses here are associations, not connections, so cannot use the word "drive". Similar issues can be raised with a number of the supplementary figures.

      Major Comments:

      1) A deeper analysis of the cell lines is likely to be the most fruitful path to reveal interesting mechanisms. It is very surprising that there is no phenotype in ESCs. Authors should check for increased apoptosis. Maybe the phenotypic cells are lost. Or do ESCs use different MCMs/mechanisms of DNA replication or are they better able to handle replication stress and GIN? How many passages were the TSCs and ESCs cultured for? Does GIN (i.e. aneuploidy, CNVs) develop in TSCs and ESCs with passaging? How do the MCM mutations impact the molecular identity of the ESC and TSC cells including their heterogeneity in the population.

      2) Along similar lines, most of the in vivo phenotypic analyses are performed at E13.5, long after defects are likely beginning to express themselves especially given that they see phenotypes in the TSCs, which represent the polar TE of a E4.5. To understand the primary defects of the in vivo phenotype, they should be looking much earlier. Supplemental figure 5 is a start but represents a rather superficial analysis.

      3) Fig. 6 would benefit from evidence that MCM3 mutant is rescuing MCM4 levels in the chromatin fraction of cells and the DNA damage phenotype.

      Minor Comments:

      1) There is a lack of quantification and repeats for all Westerns. At minimum there should be three repeats for each experiment, quantification including normalization to a reference protein, and stats confirming any proposed differences between conditions.

      2) I would recommend moving the results in supp table 1 to figure 1. While negative, they are the newer results. The results shown in current figure 1 are essentially a reproduction of their previous work.

      3) Need to review citations to figures. For example, no citations are made to figure 4a and 4c.

      Significance

      As is, the study does not provide much new insight or understanding of how the MCM mutants are driving the sex-skewed semi-lethal phenotype. It would likely take much effort (months) to reach such a goal. However, without such effort, it is unclear what the significance of the story is. It does make the observation that the placenta appears to be impacted more severely and earlier than then the embryo, and that within the placenta, certain zones and cell types are more vulnerable. The reasons for these differential impacts are unclear though.

      If the authors choose not to dig deeper as suggested in the major comments, then at a minimum it would be important to soften their conclusions as raised in the summary and at least perform experiments/edits proposed in minor comments.

    1. Reviewer #2 (Public review):

      Summary:

      Wethington et al. investigated the mechanistic principles underlying antigen-specific proliferation and memory formation in mouse natural killer (NK) cells following exposure to mouse cytomegalovirus (MCMV), a phenomenon predominantly associated with CD8+ T cells. Using a stochastic modeling approach, the authors aimed to develop a quantitative model of NK cell clonal dynamics during MCMV infection. Starting from a single immature Ly49+CD27+ NK cell, a two-state linear model (with a death variant) explained the negative correlation between clone size at 8 dpi and the CD27+ fraction, but failed to reproduce the first and second moments of CD27+ and CD27− NK cell populations at 8 dpi. To address this limitation, the authors added an intermediate maturation state, yielding a three-stage model (CD27+Ly6C− → CD27−Ly6C− → CD27−Ly6C+) that fits the first and second moments under two constraints: CD27+ NK cells proliferate faster than CD27− NK cells, and clone size is negatively correlated with the CD27+ fraction (upper bound of −0.2). The model predicts high proliferation in the intermediate state and high death in mature CD27−Ly6C+ cells, and it was validated using Adams et al. (2021) NK reporter mice tracking CD27+/− populations after tamoxifen, allowing discrimination between bone marrow-derived and pre-existing peripheral NK cells. To test the prediction that mature CD27− NK cells have a higher death rate, the authors measured Ly49H+ NK cell viability in the mouse spleen at different time points post-MCMV infection. Data confirmed lower viability of mature (CD27−) than immature (CD27+) cells during days 4-8 post-infection, and a model variant supported that higher CD27− death increases their proportion in the dead cell compartment. Altogether, the authors propose a three-stage quantitative model of antigen-specific expansion and maturation of naïve Ly49H+ NK cells with the trajectory CD27+Ly6C− (immature) → CD27−Ly6C− (mature I) → CD27−Ly6C+ (mature II), highlighting high proliferation in the mature I state and increased death in the mature II state.

      Strengths:

      Models explaining correlations and first and second moments, supported by analytical investigations, stochastic simulations, and model selection, identify key processes in antigen-specific NK expansion and maturation. The work distinguishes expansion, contraction, and memory in NK cells from CD8+ T cells and informs NK therapy development.

      Weaknesses (relating to initial submission):

      The conclusions of this paper are largely supported by the available data. However, a comparative analysis with more recent works in the field would be desirable. Clarifications:

      (1) Initial Conditions and Grassmann Data: The Grassmann data is used solely as a constraint, while the simulated values of CD27+/CD27− cells could have been directly fitted to the Grassmann data, which assumes a 1:1 ratio of CD27+/CD27− at t = 0. This would allow an alternative initial condition rather than starting from a single CD27+ cell.

      (2) Correlation Coefficients in the Three-State Model: Although the parameter scan of the three-stage model (Figure 2) demonstrates the potential for negative correlations between colony size and the fraction of CD27+ cells, the calculated correlation coefficients using the fitted parameter values are not shown. Including these would validate that the fitted parameters lie in the negative-correlation regime.

      (3) Viability Dynamics and Adaptive Response: The authors measured the time evolution of CD27+/− dynamics and viability over 30 days post-infection (Figure 4). It would be valuable to test whether the three-state model can reproduce the adaptive response of CD27− cells to MCMV infection, particularly the observed drop in CD27− viability at 5 dpi and its rebound at 8 dpi. Demonstrating this would test whether the model can simultaneously explain viability dynamics and moment dynamics, and would enable sensitivity analysis of CD27− viability with respect to model parameters.

    1. Reviewer #2 (Public review):

      Summary:

      Meiotic recombination initiates with the formation of DNA double-strand break (DSB) formation, catalyzed by the conserved topoisomerase-like enzyme Spo11. Spo11 requires accessory factors that are poorly conserved across eukaryotes. Previous genetic studies have identified several proteins required for DSB formation in C. elegans to varying degrees; however, how these proteins interact with each other to recruit the DSB-forming machinery to chromosome axes remains unclear.

      In this study, Raices et al. characterized the biochemical and genetic interactions among proteins that are known to promote DSB formation during C. elegans meiosis. The authors examined pairwise interactions using yeast two-hybrid (Y2H) and co-immunoprecipitation and revealed an interaction between a chromatin-associated protein HIM-17 and a transcription factor XND-1. They further confirmed the previously known interaction between DSB-1 and SPO-11 and showed that DSB-1 also interacts with a nematode-specific HIM-5, which is essential for DSB formation on the X chromosome. They also assessed genetic interactions among these proteins, categorizing them into four epistasis groups by comparing phenotypes in double vs. single mutants. Combining these results, the authors proposed a model of how these proteins interact with chromatin loops and are recruited to chromosome axes, offering insights into the process in C. elegans compared to other organisms.

      Weaknesses:

      This work relies heavily on Y2H, which is notorious for having high rates of false positives and false negatives. Although the interactions between HIM-17 and XND-1 and between DSB-1 and HIM-5 were validated by co-IP, the significance of these interactions was not tested in vivo. Cataloging Y2H and genetic interactions does not yield much more insight. The model proposed in Figure 4 is also highly speculative.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The manuscript by Raices et al., provides novel insights into the role and interactions between SPO-11 accessory proteins in C. elegans. The authors propose a model of meiotic DSBs regulation, critical to our understanding of DSB formation and ultimately crossover regulation and accurate chromosome segregation. The work also emphasizes the commonalities and species-specific aspects of DSB regulation.

      Strengths:

      This study capitalizes on the strengths of the C. elegans system to uncover genetic interactions between a large number of SPO-11 accessory proteins. In combination with physical interactions, the authors synthesize their findings into a model, which will serve as the basis for future work, to determine mechanisms of DSB regulation.

      Weaknesses:

      The methodology, although standard, lacks quantification. This includes the mass spectrometry data , along with the cytology. The work would also benefit from clarifying the role of the DSB machinery on the X chromosome versus the autosomes.

      • We have uploaded the MS data and added a summary table with the number of peptides and coverage.

      • We have added statistics to the comparisons of DAPI body counts.

      • We have provided additional images of the change in HIM-5 localization

      • We have quantified the overlap (or lack thereof) between XND-1 and HIM-17 and the DNA axis

      Reviewer #2 (Public Review):

      Summary:

      Meiotic recombination initiates with the formation of DNA double-strand break (DSB) formation, catalyzed by the conserved topoisomerase-like enzyme Spo11. Spo11 requires accessory factors that are poorly conserved across eukaryotes. Previous genetic studies have identified several proteins required for DSB formation in C. elegans to varying degrees; however, how these proteins interact with each other to recruit the DSB-forming machinery to chromosome axes remains unclear.

      In this study, Raices et al. characterized the biochemical and genetic interactions among proteins that are known to promote DSB formation during C. elegans meiosis. The authors examined pairwise interactions using yeast two-hybrid (Y2H) and co-immunoprecipitation and revealed an interaction between a chromatin-associated protein HIM-17 and a transcription factor XND-1. They further confirmed the previously known interaction between DSB-1 and SPO-11 and showed that DSB-1 also interacts with a nematodespecific HIM-5, which is essential for DSB formation on the X chromosome. They also assessed genetic interactions among these proteins, categorizing them into four epistasis groups by comparing phenotypes in double vs. single mutants. Combining these results, the authors proposed a model of how these proteins interact with chromatin loops and are recruited to chromosome axes, offering insights into the process in C. elegans compared to other organisms.

      Weaknesses:

      This work relies heavily on Y2H, which is notorious for having high rates of false positives and false negatives. Although the interactions between HIM-17 and XND-1 and between DSB-1 and HIM-5 were validated by co-IP, the significance of these interactions was not tested, and cataloging Y2H interactions does not yield much more insight.

      We appreciate that the reviewer recognized the value of our IP data, but we beg to differ that we rely too heavily on the Y2H. We also provide genetic analysis on bivalent formation to support the physical interaction data. We do acknowledge that there are caveats with Y2H, however, including that a subset of the interactions can only be examined with proteins in one orientation due to auto-activation. While we acknowledge that it would be nice to have IP data for all of the proteins using CRISPR-tagged, functional alleles, these strains are not all feasible (e.g. no functional rec-1 tag has been made) and are beyond the scope of the current work.

      Moreover, most experiments lack rigor, which raises serious concerns about whether the data convincingly supports the conclusions of this paper. For instance, the XND-1 antibody appears to detect a band in the control IP; however, there was no mention of the specificity of this antibody.

      We previously showed the specificity of this antibody in its original publication showing lack of staining in the xnd-1 mutant by IF (Wagner et al., 2010). To further address this, however, we have now included a new supplementary figure (Figure S1) demonstrating the specificity of the XND-1 antibody by Western blot. The antibody detects a distinct band in extracts from wild-type (N2) worms, but this band is absent in two independent xnd-1 mutant strains. This confirms that the antibody specifically recognizes XND-1, supporting the validity of the IP results shown in the main figures.

      Additionally, epistasis analysis of various genetic mutants is based on the quantification of DAPI bodies in diakinesis oocytes, but the comparisons were made without statistical analyses.

      We have added statistical analysis to all datasets where quantification was possible, strengthening the rigor and interpretation of our findings.

      For cytological data, a single representative nucleus was shown without quantification and rigorous analysis. The rationale for some experiments is also questionable (e.g. the rescue by dsb-2 mutants by him-5 transgenes in Figure 2), making the interpretation of the data unclear. Overall, while this paper claims to present "the first comprehensive model of DSB regulation in a metazoan", cataloging Y2H and genetic interactions did not yield any new insights into DSB formation without rigorous testing of their significance in vivo. The model proposed in Figure 4 is also highly speculative.

      Regarding the cytology, we provide new images and quantification of HIM-17 and XND-1 overlap with the DNA axes. We also added full germ line images showing HIM-5 localization in wild type and dsb-1 mutants, to provide a more complete and representative view of the observed phenotype. To further support our findings, we’ve also included images demonstrating that this phenotype is consistently observed with both in live worm with the the him-5::GFP transgene and in fixed worms with an endogenously tagged version of HIM-5.

      Reviewer #3 (Public Review):

      During meiosis in sexually reproducing organisms, double-strand breaks are induced by a topoisomerase-related enzyme, Spo11, which is essential for homologous recombination, which in turn is required for accurate chromosome segregation. Additional factors control the number and genome-wide distribution of breaks, but the mechanisms that determine both the frequency and preferred location of meiotic DSBs remain only partially understood in any organism.

      The manuscript presents a variety of different analyses that include variable subsets of putative DSB factors. It would be much easier to follow if the analyses had been more systematically applied. It is perplexing that several factors known to be essential for DSB formation (e.g., cohesins, HORMA proteins) are excluded from this analysis, while it includes several others that probably do not directly contribute to DSB formation (XND-1, HIM-17, CEP-1, and PARG-1).

      We respectfully disagree with the reviewer’s statement regarding the selection of factors included in our analysis. In this work, our focus was specifically on SPO-11 accessory factors — proteins that directly interact with or regulate SPO-11 activity during doublestrand break formation. Cohesins and chromosome axis proteins (such as the HORMA domain proteins) are essential for establishing the correct chromosome architecture that supports DSB formation, but there is no evidence that they are direct accessory factors of SPO-11. Therefore, they were intentionally excluded from this study to maintain a clear and focused scope on proteins that more directly modulate SPO-11 function.

      Conversely, XND-1, HIM-17, CEP-1, and PARG-1 have all been implicated in regulating aspects of SPO-11-mediated DSB formation or its immediate environment. Although their contributions mayinvolve broader chromatin or DNA damage response regulation, prior literature supports their inclusion as relevant modulators of SPO-11 activity, justifying their analysis within the context of this work.

      The strongest claims seem to be that "HIM-5 is the determinant of X-chromosome-specific crossovers" and "HIM-5 coordinates the actions of the different accessory factors subgroups." Prior work had already shown that mutations in him-5 preferentially reduce meiotic DSBs on the X chromosome. While it is possible that HIM-5 plays a direct role in DSB induction on the X chromosome, the evidence presented here does not strongly support this conclusion. It is also difficult to reconcile this idea with evidence from prior studies that him-5 mutations predominantly prevent DSB formation on the sex chromosomes, while the protein localizes to autosomes.

      HIM-5 is not the only protein that is autosomally enriched but preferentially affects the X chromosome: MES-4 and MRG-1 are both autosomally-enriched but influence silencing of the X chromosome. While HIM-5 appears autosomally-enriched, it does not appear to be autosomal-exclusive. While we would ideally perform ChIP to determine its localization on chromatin, this method for assaying DSB sites is likely insufficient to identify DSB sites which differ in each nucleus and for which there are no known hotspots in the worm.

      him-5 mutants confer an ~50% reduction in total number of breaks and a very profound change in break dynamics (seen by RAD-51 foci (Meneely et al., 2012)). Since the autosomes receives sufficient breaks in this context to attain a crossover in >98% of nuclei, this indicates that the autosomes are much less profoundly impacted by loss of DSB functions than is the X chromosome. Indeed, prior data from co-author, Monica Colaiacovo, showed that fewer breaks occur on the X (Gao, 2015) likely resulting from differences in the chromatin composition of the X and autosome resulting from X chromosome silencing.

      The conclusion that HIM-5 must be required for breaks on the X comes from the examination of DSB levels and their localization in different mutants that impair but do not completely abrogate breaks. In any situation where HIM-5 protein expression is affected (xnd-1, him-17, and him-5 null alleles), breaks on the X are reduced/ eliminated. By contrast, in dsb-2 mutants, where HIM-5 expression is unaffected, both X and autosomal breaks are impacted equally. As discussed above, in the absence of HIM-5 function, there are ~15 breaks/ nucleus. The Ppie1::him-5 transgene is expressed to lower levels than Phim-5::him-5, but in the best case, the ectopic expression of this protein should give a maximum of ~15 breaks (the total # of breaks is thought to be ~30/nucleus). By these estimates, Ppie-1::him-5; him-17 and him-5 null mutants have the same number of breaks. Yet, in the former case, breaks occur on the X; whereas in the latter they do not. The best explanation for this discrepancy is that HIM-5 is sufficient to recruits the DSB machinery to the X chromosome.

      The one experiment that seems to elicit the conclusion that HIM-5 expression is sufficient for breaks on the X chromosome is flawed (see below). The conclusion that HIM-5 "coordinates the activities of the different accessory sub-groups" is not supported by data presented here or elsewhere.

      We have reorganized the discussion to more directly address the reviewers’ concerns. We raise the possibility that HIM-5 has an important role in bringing together the SPO-11 and its interacting components (DSB-1/2/3) with the other DSB inducing factors, including those factors that regulating DSB timing (XND-1), coordination with the cell cycle (REC-1), association with the chromosome axis (PARG-1, MRE-11), and coupling to downstream resection and repair (MRE-11, CEP-1).  

      This raises a natural question: if HIM-5 has such a central role, why are the phenotypes of HIM-5 so mild? We propose that while the loss of DSBs on the X appears mild, more profound effects are seen in the total number, timing, and placement of the DSBs across the genome- all of which are diminished or altered in the absence of HIM-5. The phenotypes of him-5 loss reminiscent of those observed in Prdm9-/- in mice where breaks are relocated to transcriptional start sites and show significant delay in formation. As with PRDM9, the comparatively subtle phenotypes of HIM-5 loss do not diminish its critical role in promoting proper DSB formation in most mammals.

      Like most other studies that have examined DSB formation in C. elegans, this work relies on indirect assays, here limited to the cytological appearance of RAD-51 foci and bivalent chromosomes, as evidence of break formation or lack thereof. Unfortunately, neither of these assays has the power to reveal the genome-wide distribution or number of breaks. These assays have additional caveats, due to the fact that RAD-51 association with recombination intermediates and successful crossover formation both require multiple steps downstream of DSB induction, some of which are likely impaired in some of the mutants analyzed here. This severely limits the conclusions that can be drawn. Given that the goal of the work is to understand the effects of individual factors on DSB induction, direct physical assays for DSBs should be applied; many such assays have been developed and used successfully in other organisms.

      We appreciate the reviewer’s thoughtful comments. We agree that RAD-51 foci are an indirect readout of DSB formation and that their dynamics can be influenced by defects in downstream repair processes. However, in C. elegans, the available methods for directly detecting DSBs are limited. Unlike other organisms, C. elegans lacks γH2AX, eliminating the possibility of using γH2AX as a DSB marker. TUNEL assays, while conceptually appealing, have proven unreliable and poorly reproducible in the germline context. Similarly, RPA foci do not consistently correlate with the number of DSBs and are influenced by additional processing steps.

      Given these limitations, RAD-51 foci remain the most widely accepted surrogate for monitoring DSB formation in C. elegans. While we fully acknowledge the caveats associated with this approach — particularly the potential effects of downstream repair defects — RAD-51 analysis continues to provide valuable insight into DSB dynamics and regulation, especially when interpreted in combination with other phenotypic assessments.

      Throughout the manuscript, the writing conflates the roles played by different factors that affect DSB formation in very different ways. XND-1 and HIM-17 have previously been shown to be transcription factors that promote the expression of many germline genes, including genes encoding proteins that directly promote DSBs. Mutations in either xnd-1 or him-17 result in dysregulation of germline gene expression and pleiotropic defects in meiosis and fertility, including changes in chromatin structure, dysregulation of meiotic progression, and (for xnd-1) progressive loss of germline immortality. It is thus misleading to refer to HIM-17 and XND-1 as DSB "accessory factors" or to lump their activities with those of other proteins that are likely to play more direct roles in DSB induction.

      It is clear that we will not reach agreement about the direct vs indirect roles here of chromatin remodelers/transcription factors in break formation. In yeast, there is a precedent for SPP1 and in mouse for Prdm9, both of which could be described as transcription factors as well, as having roles in break formation by creating an open chromatin environment for the break machinery. We envision that these proteins function in the same fashion. The changes in histone acetylation in the xnd-1 mutants supports such a claim.

      We do not know what the reviewer is referring to in statement that “XND-1 and HIM-17 have previously been shown to be transcription factors that promote the expression of many germline genes.” While the Carelli et al paper indeed shows a role for HIM-17 in expression of many germline genes, there is only one reference to XND-1 in this manuscript (Figure S3A) which shows that half of XND-1 binding sites overlap with the co-opted germline promoters. There is no transcriptional data at all on xnd-1 mutants, save our studies (referenced herein) that XND-1 regulates him-5 expression.

      For example, statements such as the following sentence in the Introduction should be omitted or explained more clearly: "xnd-1 is also unique among the accessory factors in influencing the timing of DSBs; in the absence of xnd-1, there is precocious and rapid accumulation of DSBs as monitored by the accumulation of the HR strand-exchange protein RAD-51.

      We are not sure what is confusing here. The distribution of RAD-51 foci is significantly altered in xnd-1 mutants and peak levels of breaks are achieved as nuclei leave the transition zone (Wagner et al., 2010; McClendon et al., 2016). There is no other mutation that causes this type of change in RAD-51 distribution.

      "The evidence that HIM-17 promotes the expression of him-5 presented here corroborates data from other publications, notably the recent work of Carelli et al. (2022), but this conclusion should not be presented as novel here.

      We have clarified this in the text. We note that this paper showed alterations in him-5 levels by RNA-Seq but they did not validate these results with quantitative RT-PCR. Thus, our studies do provide an important validation of their prior results.

      The other factors also fall into several different functional classes, some of which are relatively well understood, based largely on studies in other organisms. The roles of RAD50 and MRE-11 in DSB induction have been investigated in yeast and other organisms as well as in several prior studies in C. elegans. DSB-1, DSB-2, and DSB-3 are homologs of relatively well-studied meiotic proteins in other organisms (Rec114 and Mei4) that directly promote the activity of Spo11, although the mechanism by which they do so is still unclear.

      Whilst we agree that we understand some of the functions of the homologs, there are clearly examples in other processes of conserved proteins adopting unique regulatory function. We should not presume evolutionary conservation until proven. Indeed the comparison between the Mer2 proteins becomes particularly relevant here. For example, the RMM complex in plants does not contain PRD3, although this protein is thought to have function in DSB formation and repair (Lambing et al, 2022; Vrielynck et al., 2021; Thangavel et al., 2023). In Sordaria, as well, the Mer2 homolog has distinct functions (Tesse et al., 2017).  

      Mutations in PARG-1 (a Poly-ADP ribose glycohydrolase) likely affect the regulation of polyADP-ribose addition and removal at sites of DSBs, which in turn are thought to regulate chromatin structure and recruitment of repair factors; however, there is no convincing evidence that PARG-1 directly affects break formation.

      Our prior collaborative studies on PARG-1 showed that is has a non-catalytic function that promote DSBs that is independent of accumulation of PAR (Janisiw et al., 2020; Trivedi et al., 2022)

      CEP-1 is a homolog of p53 and is involved in the DNA damage response in the germline, but again is unlikely to directly contribute to DSB induction.

      We respectfully disagree with the reviewer’s statement. While CEP-1 is indeed a homolog of p53 and plays a major role in the DNA damage response, prior work from Brent Derry’s lab and from our group (Mateo et al., 2016) demonstrated that specific cep-1 separationof-function alleles affect DSB induction and/or repair pathway choice independently of canonical DNA damage checkpoint activation. In particular, defects in DSB formation observed in certain cep-1 mutants can be rescued by exogenous irradiation, supporting a direct or closely linked role in promoting DSB formation rather than merely responding to damage. Thus, based on these functional data, we considered CEP-1 a relevant factor to include in our analysis. We have now clarified this rationale in the revised manuscript.

      HIM-5 and REC-1 do not have apparent homologs in other organisms and play poorly understood roles in promoting DSB induction. A mechanistic understanding of their functions would be of value to the field, but the current work does not shed light on this. A previous paper (Chung et al. G&D 2015) concluded that HIM-5 and REC-1 are paralogs arising from a recent gene duplication, based on genetic evidence for a partially overlapping role in DSB induction, as well as an argument based on the genomic location of these genes in different species; however, these proteins lack any detectable sequence homology and their predicted structures are also dissimilar (both are largely unstructured but REC-1 contains a predicted helical bundle lacking in HIM-5). Moreover, the data presented here do not reveal overlapping sets of genetic or physical interactions for the two genes/proteins. Thus, this earlier conclusion was likely incorrect, and this idea should not be restated uncritically here or used as a basis to interpret phenotypes.

      Actually, there is quite good bioinformatic analysis that the rec-1 and him-5 loci evolved from a gene duplication and that each share features of the ancestral protein (Chung et al., 2015). We are sorry if the reviewer casts aspersions on the prior literature and analyses. The homology between these genes with the ancestral protein is near the same degree as dsb-1, dsb-2, or dsb-3 to their ancestral homologs (<17%).

      DSB-1 was previously reported to be strictly required for all DSB and CO formation in C. elegans. Here the authors test whether the expression of HIM-5 from the pie-1 promoter can rescue DSB formation in dsb-1 mutants, and claim to see some rescue, based on an increase in the number of nuclei with one apparent bivalent (Figure 2C). This result seems to be the basis for the claim that HIM-5 coordinates the activities of other DSB proteins. However, this assay is not informative, and the conclusion is almost certainly incorrect. Notably, a substantial number of nuclei in the dsb-1 mutant (without Ppie-1::him-5) are reported as displaying a single bivalent (11 DAPI staining bodies) despite prior evidence that DSBs are absent in dsb-1 mutants; this suggests that the way the assay was performed resulted in false positives (bivalents that are not actually bivalents), likely due to inclusion of nuclei in which univalents could not be unambiguously resolved in the microscope. A slightly higher level of nuclei with a single unresolved pair of chromosomes in the dsb-1; Ppie-1::him-5 strain is thus not convincing evidence for rescue of DSBs/CO formation, and no evidence is presented that these putative COs are X-specific. The authors should provide additional experimental evidence - e.g., detection of RAD-51 and/or COSA-1 foci or genetic evidence of recombination - or remove this claim. The evidence that expression of Ppie-1::him-5 may partially rescue DSB abundance in dsb-2 mutants is hard to interpret since it is currently unknown why C. elegans expresses 2 paralogs of Rec114 (DSB-1 and DSB-2), and the age-dependent reduction of DSBs in dsb-2 mutants is not understood.

      We have removed this claim in part because we have been unable to create the triple mutants strains to analyze COSA-1 foci.

      To the point about 11 vs 12 DAPI bodies: the literature is actually replete with examples of 11 DAPI bodies vs 12 in mutants with no breaks:

      Hinman al., 2021: null allele of dsb-3 has an average of 11.6 +/- 0.6 breaks;

      Stamper et al, 2013, show just over 60% of dsb-1 nuclei with 12 DAPI bodies and 5-10% with 10 DAPI bodies. (Figure 1);

      In addition, we also previously showed (Machovina et al., 2016) that a subset of meiotic nuclei have a single RAD-51 focus and can achieve a crossover. RAD-51 foci in spo-11 were also reported in Colaiacovo et al., 2003.

      Several of the factors analyzed here, including XND-1, HIM-17, HIM-5, DSB-1, DSB-2, and DSB-3, have been shown to localize broadly to chromatin in meiotic cells. Coimmunoprecipitation of pairs of these factors, even following benzonase digestion, is not strong evidence to support a direct physical interaction between proteins.

      Similarly, the super-resolution analysis of XND-1 and HIM-17 (Figure 1EF) does not reveal whether these proteins physically interact with each other, and does not add to our understanding of these proteins functions, since they are already known to bind to many of the same promoters. Promoters are also likely to be located in chromatin loops away from the chromosome axis, so in this respect, the localization data are also confirmatory rather than novel.

      While the binding to promoters would be expected to be on DNA loops, that has not been definitively shown in the worm germ line. The supplemental data of the Carelli paper suggests that there are ~250 binding sites for each protein at these coopted promoters. This could not account for crossover map seen in C. elegans.

      The reviewer states correct that we do not reveal that these proteins interact, but we have shown that the two proteins co-IP and have a Y2H interaction. This interaction is supporedt by a recent publication (Blazickova et al., 2025) corroborating this conclusion and identifies XND-1 in HIM-17 co-IPs also in the presence of benzonase. We do now show, however, by immuno-localization that the two proteins appear to be adjacent, but nonoverlapping. As now described in the text, AlphaFold 3 modeling and structural analysis suggests that the two proteins do interact directly and that the tagged 5’ end of HIM-17 used in our studies is likely to be at least 200nm from the putative XND-1 binding interface, a distance that is consistent with our confocal images showing frequent juxtaposition of the two proteins.

      The phenotypic analysis of double mutant combinations does not seem informative. A major problem is that these different strains were only assayed for bivalent formation, which (as mentioned above) requires several steps downstream of DSB induction. Additionally, the basis for many of the single mutant phenotypes is not well understood, making it particularly challenging to interpret the effects of double mutants. Further, some of the interactions described as "synergistic" appear to be additive, not synergistic. While additive effects can be used as evidence that two genes work in different pathways, this can also be very misleading, especially when the function of individual proteins is unknown. I find that the classification of genes into "epistastasis groups" based on this analysis does not shed light on their functions and indeed seems in some cases to contradict what is known about their functions. ‘

      As described above, each of the proteins analyzed is thought to have a direct role in regulating meiotic DSB formation and single mutant phenotypes are consistent with this interpretation. In almost all-if not all- of these cases, IR induced breaks suppress univalent phenotypes (or uncover a downstream repair defect (e.g. in mre-11)) supporting this conclusion. We have changed the terminology from “epistasis groups” since this is not strict epistasis, but rather, “functional groups”.  

      The yeast two-hybrid (Y2H) data are only presented as a single colony. While it is understandable to use a 'representative' colony, it is ideal to include a dilution series for the various interactions, which is how Y2H data are typically shown.

      The Y2H data are presented as spots on a plate and are from three to four individual transformants per interaction tested, and are not individual colonies. The experiment was repeated in triplicate from different transformations. We have now made this clearer in the materials and methods section. This approach has been successfully used to examine protein interactions in our prior manuscripts of yeast and human proteins [Gaines et al (2015) Nat. Comms 6:7834; Kondrashova et al (2017) Cancer Discovery 7:984; Garcin et al (2019) PLoS Genetics 15:e1008355; Bonilla et al (2021) eLife 1: e68080) Prakash et al (2022) PNAS 119: e2202727119, etc]

      Additional (relatively minor) concerns about these data:

      (1) Several interactions reported here seem to be detected in only one direction - e.g., MRE-11-AD/HIM-5-BD, REC-1-AD/XND-1-BD, and XND-1-AD/HIM-17-BD - while no interactions are seen with the reciprocal pairs of fusion proteins. I'm not sure if some of this is due to pasting "positive" colony images into the wrong position in the grid, but this should be addressed.

      The asymmetry in the interactions observed is due to the well-known phenomenon in yeast two-hybrid (Y2H) assays where certain plasmids exhibit self-activation when fused in one orientation, making interpretation of reciprocal interactions challenging. In our experiment, some of the plasmids indeed showed self-activation in one direction, which likely accounts for the lack of interaction seen with the reciprocal pairs of fusion proteins. We have clarified this point in the Methods.

      (2) DSB-3 was only assayed in pairwise combinations with a subset of other proteins; this should be explained; it is also unclear why the interaction grids are not symmetrical about the diagonal.

      We have now completed the analysis by adding the interactions of DSB-3 with the remaining proteins that were missing from the initial set.

      (3) I don't understand why the graphic summaries of Y2H data are split among 3 different figures (1, 2, and 3).

      We chose to split the graphic summaries of the Y2H data across Figures 1, 2, and 3 because we felt this organization better aligns with the flow of the results presented in each figure. Each set of interactions is shown in the context of the specific experiments and findings discussed in those sections, which we believe helps provide a clearer and more logical presentation of the data.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Figure 1: B) The IP is difficult to interpret - there is a band of the corresponding size to XND-1 in the control lane calling into question the specificity of the IP/Western.

      We added a supplemental figure with the specificity of the antibody showing that there is a background non-specific band.

      C) More information about the mass spectrometry should be included. No indication of the number of times a peptide was identified, or the overall coverage of the identified proteins.

      Done

      This is important as in the results section (line 114) the authors indicate that there was "strong" interaction yet there is no way to assess this.

      D) Why wasn't hatching measured in the him-5p::him-5; him-17(ok424) strain?

      Great question. I guess we need to do this while back out for review. If anyone has suggestions of what to say here. Clearly we overlooked this point but do have the strain.

      E) Quantification of the cytology should be included.

      We have now quantified overlap between XND-1 and HIM-17

      Figure 2: C) Statistics should be included.

      Done

      E) Quantification should be included for the cytology. I recommend changing the eals15 to HIM-5.

      We included better images showing whole gonads instead of one or two nuclei. We were not sure what the reviewers want us to quantify here since the relocalization of the protein to the cytoplasm is very clear.

      I have a general issue with the use of the term epistasis - this is used to order gene function based on different mutant phenotypes, usually with null alleles. While I think the authors have valid points with how they group the different SPO-11 accessory proteins, I do not think they should use the word epistasis, but rather genetic interactions.

      We appreciate the reviewers thoughts on this matter and have removed the term epistasis and use functional groups or genetic interactions throughout the text.

      Figure 4 and the nature of the X chromosome: First, I think it would help the non-C. elegans reader to include a little more information on the X chromosome with respect to its differences compared to the autosomes. I also think that, if possible, it would be beneficial to include a model of the X in Figure 4.

      We have added more about X/autosome differences in the intro and during the discussion of HIM-5 function and have added a figure showing difference in the behavior of the X/autosomes during DSB/crossover formation.

      Minor points:

      Abstract: Given the findings of Silva and Smolikove on SPO-11 breaks, I recommend removing "early" from line 28 in the Abstract.

      Done

      Introduction (line 93): I think "biochemical studies" is a stretch here - I recommend "interaction studies".

      Done

      Results: (lines 160-161): mutations are not required for breaks. Line 172, there is a problem with the sentence.

      Corrected

      Reviewer #2 (Recommendations For The Authors):

      Major comments:

      (1) Figure 1B- The signal for XND-1 seems to appear both in the control and him-17::HA IP. Do the authors have tested the specificity of the XND-1 antibody?

      We included a supplementary figure demonstrating the specificity of the XND-1 antibody by Western blot. This was also previously published (Wagner et al., 2010)

      (2) Figure 1D - can the authors provide an explanation why the him-5p::him-5 transgene that drives a higher expression than pie-1p::him-5 fails to suppress the Him phenotype seen in him-17? What are the HIM-5 levels like in these two strains compared to N2 and him-17 null mutants? Can this information provide explanation for the differential effect of the him-5 transgenes?

      We previously reported that him-5p::him-5 drives higher expression than pie-1p::him-5 (McClendon et al, 2016).

      The reason that him-5p::him-5 does not rescue, despite higher wild type expression is that HIM-17 directly regulates expression of him-5. Since HIM-17 does not regulate the pie-1 promoter, the pie-1p::him-5 construct can at least partially suppress the him-17 mutation.

      We have (hopefully) explained this better in the text.  

      (3) Line 102- the subheading "HIM-5 is the essential factor for meiotic breaks in the Xchromosome" may not be appropriate for this section. This is what has previously been known. However, the results in Figure 1 demonstrate that a him-5 transgene can partially rescue the him-17 and ¬xnd-1 phenotype, but not that it is essential for meiotic DSB formation on X chromosomes.

      We think some of the concern here is sematic and have changed the phraseology to say that HIM-5 is SUFFICIENT for DSBs on the X… which had not previously been shown.

      Vis-à-vis the X chromosome, in all genetic backgrounds examined, the absence of HIM-5 consistently results in a complete lack of DSBs on the X. For instance, in dsb-2 mutants— where HIM-5 is still expressed—DSBs are reduced genome-wide, but the X chromosome occasionally retains breaks. In contrast, even a weak allele of him-17 results specifically in the loss of X chromosome breaks, underscoring a unique requirement for HIM-5 in promoting DSBs on the X. While Figure 1 shows that a him-5 transgene can partially rescue him-17 and xnd-1 phenotypes, the consistent observation that X breaks are absent without HIM-5 supports its classification as sufficient for DSB formation on the X chromosome.

      (4) Figure 1E - please consider enlarging the images and showing multiple examples.

      Done.

      I also suggest that the authors perform a more rigorous analysis to support the conclusion that XND-1 and HIM-17 localize away from the axis by quantifying multiple images and doing line-scan analysis.

      Provided. New images are provided in both, the main and supplemental figures, and quantification is included. There is no detectable overlap of the two protein with one another or the DNA axes (see quantification of overlap in Fig. 1).

      (5) Line 162 - This is the first mention of DSB-1, DSB-2, and DSB-3 in the paper. DSB-1 and DSB-2 are Rec114 homologs in C. elegans (Tesse et al., 2017), while DSB-3 is a homolog of Mei4 (Hinman et al., 2021). These proteins should be properly introduced in the introduction with appropriate citations.

      Done. We appreciate the reviewer pointing out that this was the first reference to these genes.

      (6) Line 169 - the rationale for this experiment is unclear. Why did the Y2H interaction between HIM-5 and DSB-1 prompt the authors to test the rescue of dsb-1 or dsb-2 phenotypes by the ectopic expression of him-5? Do the authors have evidence that HIM-5 level is reduced in dsb-1 or dsb-2 mutants?

      We have reorganized this section to better explain the motivation for looking at these interactions. We did see a difference in the localization in HIM-5 in the dsb-1 mutant animals and we did have a sense that HIM-5 was critical for breaks on the X. We reasoned that it could have independent functions in promoting breaks that were not yet appreciated so wanted to do this experiment.

      (7) Line 172 - "very slightly reduced". This claim requires statistical analysis.

      We added statistical analysis, but we also removed this claim.

      (8) Figures 2C and 2D - Can the authors provide an explanation why the pie-1p::him-5 transgene fails to suppress the phenotypes in dsb-1, while the him-5p::him-5 trasgene can? Again, the rationale for these experiments is unclear. Because of this, the interpretation is also unclear.

      The difference in rescue between the pie-1p::him-5 and him-5p::him-5 transgenes likely reflects differences in expression levels. As previously shown (McClendon et al., 2016), the him-5p::him-5 construct results in significantly higher expression of HIM-5 protein compared to pie-1p::him-5. This elevated expression likely explains its ability to partially rescue the dsb-1 phenotype. In contrast, the lower expression driven by the pie-1 promoter is insufficient to compensate for the absence of dsb-1 function. We have clarified the rationale and interpretation of these experiments in the revised manuscript to better reflect this point.

      (9) Lines 184-185 - the data for endogenously tagged HIM-5::3xHA are not shown anywhere in the paper. This must be shown.

      We have added this in the supplemental figures.

      (10) Figure 2D and 2E - what does the localization of pie-1p::him-5::GFP (eaIs15) and him5p::him-5::GFP (eaIs4) look like in wild-type and dsb-1 mutants? Are the cytoplasmic aggregates caused by increased levels of HIM-5 expression? Can the differential behavior of him-5 transgenes provide explanation for differential rescues?

      We now show both live and fixed images of Phim-5::him-5::gfp transgenes, as well as the localization of the endogenously HA-tagged HIM-5 locus (Figure 2 and S3). In all cases, the protein is initially nuclear and then absent from meiotic nuclei with similar timing. The Ppie1::him-5 transgene was very difficult to image due to low expression (even in wild type) so it not shown here. We presume it is the slightly elevated level of expression of the Phim5::him-5::gfp that can explain the differential rescue.

      (11) Lines 221-222, where are the results shown? Please refer to Figure S3.

      Done

      (12) Figure S3 - these need statistical analyses.

      Done

      (13) Lines 230-231 - what about the rec-1; parg-1; cep-1 triple mutant?

      This is an excellent suggestion and not one we have not yet pursued. Given the lack of strong phenotypes in all combination of double mutants, we prioritized other experiments . However, we agree that examining the rec-1; parg-1; cep-1 triple mutant would provide a valuable test of whether these factors act in the same pathway, and we appreciate the reviewer highlighting this potential future direction.

      (14) Line 298 - I suggest the authors take a look at the Alphafold prediction of DSB-1/DSB-2/DSB-3 and the comparison to human and budding yeast Rec114/Mei4 complex in Guo et al., 2022 eLife, which could provide insights into the Y2H results.

      We thank the reviewer for these comments and have indeed used these interactions and predicted homologies to zero in a region of interaction between these proteins that resembles what is seen in humans and yeast with a dimer of REC114 like proteins wraps stabilizing a central Mei4 helix . This is now shown in Figure 3H, I. Satisfyingly, this modeling predicts that a trimer comprised of 2 DSB-1 proteins with DSB-3 is more stable than a DSB1-DSB-2-DSB-3 trimer. This might explain why DSB-2 is not required in young adults and only becomes essential as DSB-1 levels drop in older animals (Rosu et al., 2013)

      (15) Can the authors introduce mutations within the DSB-1 interfaces that disrupt the interaction to either SPO-11 or DSB-2?

      We have begun to address this question by introducing targeted mutations within DSB-1. As shown in Figure 3E and 3F, mutations in the C-terminal region of DSB-1—which includes a core of four α-helices—disrupt its interaction with DSB-2 and DSB-3, but not with SPO-11. These findings suggest that the C-terminus mediates interactions specifically with DSB2 and DSB-3

      (16) Line 323 - The him-5 phenotypes are too weak to support the idea that it serves as the linchpin for the whole DSB complex. Do the authors have an explanation for why him-5 mutants exhibit X-chromosome-specific DSB defects?

      In response to the reviewer, above, and in the text, we have included a more detailed explanation of why we think HIM-5 has a key role in coordinating meiotic break formation. Although, identified for its role on the X, the phenotypes associated with DSB formation in the mutant are really quite pleiotropic and severe.

      (17) Line 436 - C. elegans lacks DSB hotspots.

      Removed

      Minor comments:

      (1) Figure 1A - please show the raw data for the yeast two-hybrid.

      We show representative yeast colonies in Figure S3.

      (2) It looks like the labeling for Figure 1B and 1C are switched.

      Fixed.

      (3) Figure 1B - what does the red box indicate? Please explain it in the legend.

      It indicates the XND-1 band. We added that information in the legend.

      (4) Figure 1C - in the legend, it was noted that the results are from GFP pulldowns of HIM17::GFP. However, the method for Figure 1B and the method section noted that HIM-17 was tagged with 3xHA, and the pull-down was performed using anti-HA affinity matrix. Please reconcile this discrepancy.

      That’s because they were done in two different sets of experiments. For the IPs we used a HIM-17::HA strain and for the MS, a HIM-17::GFP strain.

      (5) Also in Figure 1C - please call Table S2 in the main text when discussing the mass spec results. Also, it is not clear what HIM-17 and GFP indicate in the table. What makes CKU80 different from the other proteins listed under GFP? Please explain more clearly in the legend.

      We have move the table to supplemental data where we have included all of the peptide counts and gene coverage. We have included in the revised method rationale for inclusion in this table which explains why CKU-80 differs.

      (6) Line 527 - it is unclear what experiment was done for HIM-17. Please revise it to indicate that this is for "HIM-17 immunoprecipitation". Also please indicate the strain used for HIM17 pull-down (AV280?).

      (7) Line 113- please be specific about how the HIM-17 IP was performed. Which epitope and strains are used for pull-downs?

      This indeed was AV280. This has been added to the text and methods.

      (8) Figure 1D- What does ND mean? In the text, it was stated that there was only a minor suppression of hatching rates. The hatching rate for him-5p::him-5; him-17 must have been measured, and the data must be presented.

      ND does mean not determined. We have removed the statement about “minor suppression”. We only tested the overall population dynamics in the Phim-5::him-5;him17(ok424) and the DAPI body counts. The failure to suppress the latter suggests there would be no enect on hatching rates, although we did not test this directly. Since we had done this for the Ppie-1::him-5;him-17 strain, we provided this information to further support the claims of genetic rescue by ectopic expression.

      (9) Line 151 - please specify that STED was used.

      We have removed the STED images, and just show the confocal images with Lightning Processing.

      (10) Figure 1E- the authors suggested that HIM-17 and XND-1 mainly localize to autosomes but not the X chromosome. However, there is not enough evidence that the chromosome excluded from HIM-17 staining is indeed an X chromosome.

      (11) Figure 1E (Line 154) - what are the active chromatin markers examined? Where are the data?

      We have previously shown that the chromosome lacking XND-1 staining is the X (Wagner et al., 2010). The X is heterochromatic and chromatin marks associated with active transcription, including H3K4me3 and HTZ-1 (a variant H2A), preferentially localize to autosomes, effectively anti-marking the X chromosome. As shown in the new Figure 1E, a single chromosome has very little XND-1 and HIM-17 associated proteins. This is the X chromosome.

      (12) Line 172 - It should be a comma instead of the period after "In dsb-1 mutants".

      Fixed

      (13) Figure S3H-K - I suggest the authors indicate the alleles of mre-11 (null vs. iow1) on the graph, similarly to him-5(e1490) to avoid confusion.

      Done

      (14) Lines 294 and 600 - Guo et al. 2022 is now published in eLife. The authors must cite the published paper, not the preprint.

      Fixed

      (15) Line 407 - the reference Carelli et al., 2022 is missing.

      Added

      (16) Line 766 - please remove "is" before nuclear.

      Done

      Reviewer #3 (Recommendations For The Authors):

      Major issues:

      In my view, the most interesting mechanistic finding in the paper is the evidence that HIM-5 may not bind to chromatin in the absence of DSB-1. If validated, this would suggest that HIM-5 is likely to be directly involved in a process that promotes break formation, in contrast to factors such as HIM-17 and XND-1. It does not, however, support the idea that HIM-5 is at the top of a hierarchy of DSB factors, as it is interpreted here. More importantly, the data supporting this claim are unconvincing; only a single image of an unfixed gonad from an animal expressing HIM-5::GFP is shown. Immunofluorescence should be performed and the results must be quantified.

      We have provided additional images of the HIM-5 relocalization to show that we observed this in both fixed and live worms with two different tagged strains. The exclusion from the nucleus is seen in all scenarios. Whether the protein now accumulates exclusively in the cytoplasm/ is destabilized is challenging to address with the fixed images due to the arbitrariness of defining “background” staining.

      More generally, this type of analysis, looking at the interdependence of different factors for their association with chromosomes, is much more informative than the genetic interaction data presented in the paper, which does not seem to provide any mechanistic insights into the functions of the factors analyzed. The paper could potentially be greatly improved through a more extensive, systematic analysis of the interdependence of DSBpromoting factors for their localization to chromosomes.

      We have at least added this for XND-1 and HIM-17 and show they are not interdependent for chromosome association. We also provide for the first time data on the localization of HIM-5 in the dsb-1 mutant. Many of the other interactions have already been shown in the literature and/or were not warranted base on the lack of genetic interaction we present here.

      Minor issues:

      The title is vague and inconclusive. A more concrete title summarizing the major findings would help readers to assess whether the work is of interest.

      We have discussed the title extensively with all authors and all would like to keep the current title.

      The authors claim that the expression of HIM-5 from a different promoter (Ppie-1::him-5) but not its endogenous promoter (Phim-5::him-5) can partially rescue the DSB defect in him-17 mutants. To support this claim, they should really quantify the germline expression of HIM-5 in wild-type, him-17, him-17; Ppie-1::him-5, and Phim-5::him-5; him-17.

      We had previously reported the expression in the N2 background of both transgenes (McClendon et al., 2016)

      Panel O appears to be missing from Figure S3.

      Fixed

      The evidence for chromosome fusions in cep-1; mre-11 mutants shown in S4D is not convincing and the claim should be removed unless stronger evidence can be obtained.

      A clearer image has been added

      The basis of the following statement is unclear: "Furthermore, rec-1;him-5 double mutants give an age-dependent severe loss of DSBs (like dsb-2 mutants) suggesting that the ancestral function of the protein may have a more profound effect on break formation." The manuscript does not seem to include data regarding age-dependent loss of DSBs and no other publication is cited to support this claim. The interpretation is also perplexing; I think that it may be predicated on the idea that REC-1 and HIM-5 are paralogs, but as stated above, this claim is not well supported and is likely specious.

      We have added the reference. This was shown in Chung et al., 2013 – the paper that presented the cloning of the rec-1 locus.

    1. RISC-V

      RISC-V چیه؟

      RISC-V (ریسک فایو) یک معماری مجموعه دستورالعمل (Instruction Set Architecture – ISA) برای پردازنده‌هاست. یعنی همون چیزی که مشخص می‌کنه یک CPU چه دستوراتی رو می‌تونه اجرا کنه (مثل جمع، ضرب، پرش، load/store و …).


      نکات مهم:

      1. RISC = Reduced Instruction Set Computer

      2. یعنی مجموعه دستورالعمل‌هاش ساده و بهینه طراحی شده تا اجرای دستورات سریع‌تر باشه.

      3. V = Version 5

      4. عدد ۵ به نسل طراحی RISC برمی‌گرده، نه اینکه فقط ۵ دستور داشته باشه 🙂.

      5. Open Source بودن

      6. بزرگ‌ترین فرقش با معماری‌های معروف مثل x86 (اینتل/AMD) یا ARM (پردازنده‌های موبایل) اینه که RISC-V متن‌بازه.

      7. یعنی هر کسی می‌تونه بدون پرداخت لایسنس، پردازنده بر اساس RISC-V طراحی کنه.

      8. کاربردها

      9. از میکروکنترلرهای خیلی کوچک (مثل Arduino)

      10. تا سرورها و حتی سوپرکامپیوترها
      11. چون انعطاف‌پذیره و می‌شه دستورالعمل‌های خاص رو بهش اضافه کرد.

      مقایسه خیلی ساده:

      • x86 (Intel/AMD): بسته، پیچیده، ولی خیلی قدرتمند.
      • ARM: کم‌مصرف و محبوب در موبایل و تبلت، ولی نیاز به لایسنس داره.
      • RISC-V: ساده، متن‌باز، قابل توسعه، و در حال رشد سریع در دنیا.
    1. Dossier d'Information : La Méthode Réconciliations

      Résumé

      La méthode "Réconciliations" est une approche pédagogique innovante, conçue par Jérémie Fontanieu, professeur de sciences économiques et sociales (SES), et son ancien collègue David Benoit, professeur de mathématiques au lycée de Drancy.

      Née en 2012 du constat de la démotivation des élèves, de l'épuisement des enseignants et de la rupture de communication entre l'école et les familles, cette méthode repose sur un principe fondamental : la création d'une alliance solide et proactive entre les professeurs et les parents.

      Le protocole s'articule autour de deux piliers : un appel téléphonique à chaque famille avant même la rentrée scolaire pour établir un contact de confiance, suivi de l'envoi d'un SMS individualisé et hebdomadaire pour maintenir un dialogue constant tout au long de l'année.

      En transformant les parents en "alliés indéfectibles", la méthode change la dynamique de la classe.

      Les élèves, conscients de cette communication permanente, deviennent plus engagés et responsables, ce qui enclenche un cercle vertueux de progrès, d'encouragements et de réussite.

      Les résultats sont probants : la classe de Jérémie Fontanieu affiche 100% de réussite au baccalauréat depuis l'année scolaire 2017-2018.

      Au-delà des performances académiques, la méthode réduit considérablement le temps consacré à la discipline, diminue le sentiment d'isolement des professeurs et réconcilie les enseignants avec leur métier.

      Développée de manière indépendante, sans soutien institutionnel, la méthode se diffuse via un collectif d'enseignants qui comptait 350 membres pour l'année 2023/2024, avec un objectif de 1000 participants.

      Flexible, elle est appliquée avec succès de l'école primaire au lycée, dans des contextes socio-économiques variés, des zones prioritaires aux centres-villes et zones rurales.

      1. Genèse et Contexte de la Méthode

      La méthode Réconciliations est née d'une série de constats alarmants sur l'état du système éducatif, particulièrement exacerbés dans des contextes socio-économiques difficiles comme la Seine-Saint-Denis.

      A. Les constats initiaux

      Jérémie Fontanieu identifie plusieurs sources de frustration et d'échec qui ont motivé le développement de son approche :

      L'épuisement et le désespoir des enseignants : Particulièrement chez les plus jeunes, un sentiment d'abandon par l'institution face à la "violence de ce métier" et une impuissance face à des adolescents qui "gâchent leur potentiel".

      Les professeurs se sentent seuls à porter toutes les responsabilités.

      La démotivation des élèves : Souvent "accros aux écrans et aux réseaux sociaux", les élèves manquent d'implication dans leur scolarité, ce qui accroît la frustration de leurs professeurs.

      Ils ont "la flemme" ou manquent de confiance en eux.

      La rupture entre parents et enseignants : Une distance, voire une confrontation, entre ces deux pôles éducatifs, marquée par des "quiproquos et des failles" que les élèves exploitent.

      Les parents, souvent tenus à l'écart, reçoivent des informations partielles de leurs enfants.

      Les appels de l'école sont quasi systématiquement perçus comme des annonces de mauvaises nouvelles.

      Le sentiment d'abandon généralisé : En Seine-Saint-Denis, élèves et parents se sentent délaissés par l'Éducation nationale et la République, percevant l'école comme une "machine à broyer" incapable de les intégrer.

      Ce sentiment est également partagé par les enseignants face à la pénurie structurelle, les faibles salaires et la dégradation des conditions de travail.

      B. La déconstruction du "Mythe du prof héros"

      Jérémie Fontanieu, dans son livre Le Mythe du prof héros, analyse une construction culturelle qu'il juge toxique.

      Ce mythe, hérité des "hussards noirs de la République" du XIXe siècle, place l'enseignant sur un piédestal et lui attribue des capacités extraordinaires.

      Un mythe à double tranchant : Si l'idée de valoriser les professeurs est belle en apparence, elle conduit les parents à se décharger entièrement sur l'enseignant ("les laisser gérer").

      Une source de culpabilité : Pour les professeurs, cette attente irréaliste engendre un "sentiment de culpabilité de ne pas réussir à être à la hauteur du mythe".

      Un frein à l'implication parentale : Ce mythe dissuade les parents de s'impliquer, alors même qu'ils ont un rôle crucial à jouer.

      Fontanieu insiste sur le fait que l'implication parentale n'est pas forcément intellectuelle mais relève de "l'attention morale", du soutien aux valeurs d'honnêteté et de courage, communes à l'éducation parentale et scolaire.

      2. Principes Fondamentaux et Mécanisme Opérationnel

      La méthode Réconciliations repose sur une stratégie de co-éducation proactive, simple et structurée, visant à faire des parents des partenaires centraux du processus éducatif.

      A. Le socle : l'alliance parents-professeurs

      L'idée centrale est de "nouer le dialogue" entre professeurs et parents pour créer un binôme solide.

      En informant systématiquement les parents, l'enseignant change de statut aux yeux des élèves.

      Ces derniers, réalisant que leurs parents et professeurs "sont devenus des amis", ne peuvent plus exploiter le manque de communication.

      B. Le mécanisme en deux temps

      La méthode s'appuie sur deux actions clés, mises en œuvre dès le début de l'année scolaire.

      Étape

      Description

      Objectifs

      1. L'appel téléphonique initial

      L'enseignant contacte chaque famille personnellement "le 31 août ou le 1er septembre", avant même qu'un problème ne survienne.

      • - Établir la confiance : Les parents, "positivement surpris", apprécient cette attention et deviennent rapidement des alliés.<br>\
        • Présenter la démarche : L'enseignant explique sa méthode et son intention de collaborer.<br>\
        • Dédramatiser la communication : L'appel n'est pas lié à une sanction, ce qui change la perception de l'école.

      2. Les SMS hebdomadaires

      Chaque semaine, un SMS individualisé est envoyé à chaque famille pour l'informer du comportement et du travail de l'enfant, "y compris lorsqu’il n’y a pas de problème".

      • - Maintenir un dialogue constant : Assurer un suivi régulier et éviter les ruptures de communication.<br>
      • - Valoriser et encourager : Les SMS permettent de féliciter les efforts et les progrès, renforçant la motivation.<br>\
        • Assurer la cohésion : Le fait que tous les élèves soient concernés, qu'ils soient en difficulté ou non, évite la stigmatisation et favorise la solidarité.

      Jérémie Fontanieu souligne que cette approche est "contre-intuitive" car elle demande un investissement de travail supplémentaire en début d'année, mais que ce temps est "largement rentabilisé" par la suite.

      3. Impacts et Résultats Observés

      La mise en place de la méthode Réconciliations génère des effets positifs et mesurables sur l'ensemble des acteurs du système éducatif : élèves, enseignants et parents.

      A. Sur les élèves

      Engagement accru : Constatant l'alliance entre parents et professeurs, les élèves deviennent "moins passifs et plus engagés" et prennent leurs responsabilités.

      Cercle vertueux de la réussite : L'investissement croissant des élèves entraîne des progrès, qui sont salués par les adultes (parents et professeurs).

      Ces encouragements renforcent à leur tour l'implication, créant "une dynamique de réussite, de confiance et d'espoir pour tous".

      Amélioration des résultats scolaires : Depuis l'année scolaire 2017-2018, la classe de terminale de Jérémie Fontanieu affiche un taux de 100% de réussite au baccalauréat.

      Les élèves terminent le programme en avance, ce qui leur laisse "beaucoup de temps pour réviser".

      Gain de confiance : La méthode permet aux élèves de réussir "là où ils pensaient ne pas en être capable", ce qui les encourage à poursuivre des études supérieures.

      B. Sur les enseignants

      Réduction de la charge disciplinaire : Le partenariat avec les parents diminue les comportements perturbateurs. C'est "de l'énergie dépenser en moins à faire sa discipline".

      Fin de l'isolement : Les professeurs ne se sentent plus seuls à tout porter sur leurs épaules.

      Ils trouvent un soutien précieux chez les parents, qui deviennent des "alliés indéfectibles" les protégeant des "violences du métier".

      Réconciliation avec le métier : La méthode permet aux enseignants de se consacrer à leur cœur de métier : l'enseignement. Jérémie Fontanieu évoque "une réconciliation entre nous, enseignants, et notre métier".

      Gestion du temps optimisée : Bien que l'approche puisse sembler chronophage, elle fait en réalité gagner "beaucoup de temps" en réduisant les conflits et en augmentant l'implication des élèves.

      C. Sur les parents

      Partenaires actifs : Les parents deviennent des "acteurs clés" et des partenaires fiables, intégrés au processus éducatif.

      Influence positive : Ils réalisent l'influence qu'ils peuvent avoir, même sans compétences académiques spécifiques. Leur rôle est un levier d'attention morale et de soutien.

      Relation apaisée avec l'école : La communication régulière et positive transforme la relation, qui n'est plus basée sur la crainte des mauvaises nouvelles.

      4. Le Collectif "Réconciliations" : Diffusion et Organisation

      La méthode, initialement expérimentale, est aujourd'hui portée par un collectif d'enseignants en pleine croissance, qui fonctionne sur un modèle horizontal et indépendant.

      A. Historique et croissance

      Développement : La méthode a été développée à partir de 2012 au lycée Eugène Delacroix de Drancy.

      Création du collectif : Après avoir constaté des résultats "suffisamment forts", le collectif est créé en 2021 pour partager la méthode.

      Expansion rapide : Le collectif est passé de 200 enseignants à 350 pour l'année 2023-2024, avec des projections autour de 500 pour 2024-2025.

      Objectif : Atteindre une masse critique de 1000 enseignants pour passer à une phase de diffusion à plus grande échelle via un site internet et un manuel.

      B. Philosophie et indépendance

      Le collectif revendique une indépendance totale et refuse "aucun soutien institutionnel". Jérémie Fontanieu explique ce choix : "Nous utilisons notre liberté pédagogique.

      Nous sommes indépendants, et nous sommes très attachés à la diffusion horizontale de cette méthode, de professeur à professeur.

      Nous grandissons plus lentement sans l’aide de l’État, mais de manière plus saine". La diffusion se fait principalement par le bouche-à-oreille.

      C. Portée et applicabilité

      La méthode Réconciliations démontre une grande flexibilité et s'adapte à divers contextes :

      Niveaux scolaires : Elle est appliquée de l'école primaire (CM2) jusqu'au lycée.

      Contextes géographiques et sociaux : Si elle est née en "quartier populaire", environ la moitié des enseignants qui l'utilisent travaillent en zones rurales ou dans des établissements de centre-ville, prouvant sa pertinence au-delà des zones d'éducation prioritaire.

      5. Accès à la Méthode et Ressources Disponibles

      Pour préserver la qualité de l'accompagnement, l'accès à la méthode est actuellement contrôlé par son fondateur en attendant que le collectif atteigne sa taille cible.

      Comment participer : Les enseignants intéressés doivent contacter directement Jérémie Fontanieu par email à l'adresse projet.reconciliations@gmail.com.

      Les détails de la méthode restent "assez secrets" et ne sont pas communiqués publiquement pour le moment.

      Outils de soutien pour les membres : Les professeurs qui rejoignent le collectif ont accès à un ensemble d'outils pratiques :

      • ◦ Vidéos et tutoriels (notamment pour la rédaction des SMS).   
      • ◦ Groupes de discussion en ligne.   
      • ◦ Visioconférences hebdomadaires.   
      • ◦ Une rencontre annuelle pour un suivi et une formation continue.

      Ressources publiques : Pour en savoir plus sur la philosophie de la méthode, le public peut consulter :

      • ◦ Le livre de Jérémie Fontanieu, "Le Mythe du prof héros".   
      • ◦ Le film documentaire "Le Monde est à eux", sorti au cinéma en mars 2024.   
      • ◦ Un deuxième documentaire disponible en ligne, montrant l'application de la méthode en écoles primaires et au collège.
  2. Sep 2025
    1. Yet the same users who integrate these tools into personalworkflows describe them as unreliable when encountered within enterprise systems. Thisparadox illustrates the GenAI Divide at the user level.This preference reveals a fundamental tension. The same professionals using ChatGPT dailyfor personal tasks demand learning and memory capabilities for enterprise work. Asignificant number of workers already use AI tools privately, reporting productivity gains,while their companies' formal AI initiatives stall. This shadow usage creates a feedback loop:employees know what good AI feels like, making them less tolerant of static enterprisetools.Unwillingness to adopt new toolsModel output quality concernsPoor user experienceLack of executive sponsorshipChallenging change management0 1 2 3 4 5 6 7 8 9 10

      This is huge.

    1. Santé Mentale et Handicap : Synthèse de l'Audition de la Défenseure des droits

      Résumé Exécutif

      Ce document de synthèse présente les constats et analyses clés issus de l'audition de Claire Hédon, Défenseure des droits, devant la commission d'enquête sur les défaillances des politiques publiques en matière de santé mentale et de handicap. L'audition révèle une divergence critique entre les droits proclamés par la loi et leur application effective sur le terrain. Le handicap demeure le premier motif de saisine pour discrimination, signalant des failles systémiques dans des domaines essentiels tels que l'éducation, l'emploi et l'accessibilité. La situation de la santé mentale est jugée particulièrement alarmante, avec un système de soins insuffisant et cloisonné pour les adultes, et des conditions critiques pour les mineurs marqués par des délais d'attente insoutenables et des pratiques d'hospitalisation inadaptées. La Défenseure des droits soutient que le non-respect des droits fondamentaux, loin d'être une économie, engendre un coût social et financier élevé à long terme. La collecte de données fiables, l'application rigoureuse des textes existants et une approche décloisonnée sont identifiées comme des leviers indispensables pour remédier à ces défaillances.

      1. Le Défenseur des droits : Un Observatoire des Défaillances Systémiques

      L'institution du Défenseur des droits, par ses cinq domaines de compétence (droits des usagers des services publics, lutte contre les discriminations, droits de l'enfant, déontologie de la sécurité, protection des lanceurs d'alerte), constitue un observateur privilégié des carences des politiques publiques relatives au handicap et à la santé mentale.

      Le Handicap comme Premier Motif de Discrimination

      Claire Hédon souligne que le handicap est, depuis plusieurs années, le premier motif de saisine en matière de discrimination, ce qui témoigne de difficultés persistantes et généralisées.

      Statistiques Clés (2024) :

      Total des saisines pour discrimination : 5 679    ◦ Part concernant le handicap : 22 % (soit 1 249 réclamations)

      Ces discriminations s'exercent dans de multiples domaines : emploi, scolarisation, accès à la santé, à la justice, aux loisirs, au sport et à la culture.

      Le Coût du Non-Respect des Droits

      La Défenseure des droits conteste l'idée selon laquelle l'application des droits fondamentaux représenterait un coût financier excessif. Elle affirme sa conviction que "c'est le non-respect des droits fondamentaux qui entraînera à terme un coût élevé pour la société". L'approche de l'institution se concentre sur "l'écart entre le droit annoncé et son effectivité", soulignant que les défaillances actuelles génèrent un coût social majeur.

      2. La Santé Mentale : Une Crise des Droits Fondamentaux

      L'audition met en lumière une crise profonde dans la prise en charge de la santé mentale en France, exacerbée par la crise sanitaire du Covid-19. Les politiques publiques sont jugées insuffisantes tant en quantité qu'en organisation.

      2.1. Prise en Charge des Adultes : Un Système Insuffisant et Cloisonné

      Le système de soins pour adultes souffre de faiblesses structurelles majeures :

      Offre de soins : Des offres trop faibles, des capacités d'hospitalisation limitées et des déserts médicaux.

      Organisation : Un système mal organisé et cloisonné entre les secteurs sanitaire et médico-social.

      Ressources : Des moyens qui stagnent alors que les besoins augmentent, rendant les conditions d'exercice indignes pour les soignants et les patients.

      Conséquences pour les patients :

      • Délais d'attente excessifs.

      • Ruptures de soins fréquentes et errance sanitaire.

      • Inégalités territoriales criantes, pénalisant particulièrement les personnes précaires.

      Focus : La Situation Critique des Personnes Détenues

      La santé mentale des personnes détenues est une préoccupation majeure, avec une surreprésentation des pathologies et troubles mentaux en milieu carcéral.

      Appels à la plateforme (3141) de juillet 2024 à juillet 2025 :

      7,6 % des appels concernaient des difficultés d'accès aux soins (1 065 appels).    ◦ 106 appels portaient spécifiquement sur un risque suicidaire.

      Causes identifiées :

      1. La politique de désinstitutionnalisation : Menée sans un développement suffisant des services de proximité pour prendre le relais des services hospitaliers psychiatriques.

      2. La diminution des déclarations d'irresponsabilité pénale : Conduisant au maintien en détention de personnes dont l'état de santé nécessiterait une prise en charge dans une structure de soins.

      La Cour européenne des droits de l'homme (arrêt GC c. France, 2012) a qualifié cette situation de "traitement inhumain". Le manque de continuité des soins à la sortie de prison augmente par ailleurs le risque de récidive.

      2.2. Santé Mentale des Mineurs : Une Situation Alarmante

      Les données concernant la santé mentale des jeunes sont particulièrement inquiétantes. Une étude de 2025 (Institut Montaigne, Mutualité française, Institut Terram) révèle que 25 % des jeunes de 15 à 29 ans souffrent de dépression, un chiffre atteignant 39 % en outre-mer.

      Carences Structurelles de la Pédopsychiatrie

      Absence de données fiables : Le manque de données agrégées sur le nombre d'enfants en attente de prise en charge "fragilise le pilotage de nos politiques publiques". Le rapport 2023 de la Cour des comptes estime que sur 1,6 million d'enfants souffrant d'un trouble psychique, seuls 50 à 53 % bénéficient de soins.

      Pénurie de médecins et inégalités territoriales : Malgré des infrastructures dans la moyenne européenne, le secteur est saturé. Les Centres Médico-Psychologiques (CMP) sont inégalement répartis, et les délais pour obtenir un premier rendez-vous dépassent souvent un an.

      Pratiques Inadaptées et Atteintes aux Libertés

      Des pratiques préoccupantes sont régulièrement signalées :

      Hospitalisation en services pour adultes : Des enfants et adolescents sont hospitalisés dans des services de psychiatrie adulte, une situation qui "ne fait que s'aggraver".

      Maintien par défaut : Des jeunes en situation de handicap sont maintenus en structure psychiatrique faute de places dans le secteur médico-social ou en protection de l'enfance.

      Recours à l'isolement et à la contention : Ces mesures de dernier recours sont utilisées de manière trop fréquente, souvent motivées par un manque de personnel. Pour les mineurs hospitalisés en "soins libres" (à la demande des parents mais sans leur propre consentement), il n'existe aucun contrôle systématique par un juge des libertés et de la détention (JLD), contrairement aux adultes.

      Exemple emblématique : Une adolescente de 15 ans, atteinte d'autisme sévère, a été hospitalisée pendant plus de deux ans dans un service psychiatrique pour adultes, sans justification médicale. Elle était confinée dans une chambre d'isolement "plus de 20 heures par jour", déscolarisée et privée de soins somatiques essentiels.

      3. Politiques du Handicap : Des Droits Proclamés mais Non Effectifs

      Vingt ans après la loi de 2005, son application reste partielle et les obstacles à l'inclusion demeurent nombreux.

      3.1. Éducation : Une Inclusion Inachevée

      Bien que la loi de 2005 ait permis une augmentation du nombre d'enfants handicapés scolarisés, l'accès à une éducation de qualité reste difficile.

      Catégorie de réclamation (2024)

      Chiffres et pourcentages

      Discrimination liée à l'éducation/formation

      7 % des 5 679 saisines

      Droits de l'enfant (majorité liée à la scolarisation)

      30 % des 3 073 saisines

      Obstacles persistants :

      Manque d'AESH : Malgré les créations de postes, les besoins ne sont pas couverts.

      Pause méridienne : La loi du 27 mai 2024 prévoyant la prise en charge par l'État de l'accompagnement sur le temps de la pause méridienne est "très loin d'être effective" en raison de blocages administratifs.

      Aménagement des examens : Une augmentation inquiétante des refus d'aménagement pour des élèves ou étudiants handicapés, parfois sous le prétexte paradoxal que "leurs résultats étaient bons".

      3.2. Emploi : Premier Domaine de Discrimination

      L'emploi est le principal secteur où s'exercent les discriminations liées au handicap. Sur les réclamations pour handicap en 2024, 21 % concernent l'emploi privé et 24 % l'emploi public. L'obligation d'emploi de 6 % ne suffit pas à garantir l'égalité de traitement.

      Difficultés récurrentes :

      • Aménagement tardif du poste de travail.

      • Non-respect des préconisations du médecin du travail.

      • Difficultés accrues dans le maintien dans l'emploi pour les personnes dont le handicap ou la maladie survient en cours de carrière.

      3.3. Accessibilité : Un Retard Persistant

      L'accessibilité, condition essentielle à la participation sociale, reste un point faible majeur.

      Transports : L'objectif de mise en accessibilité n'est pas atteint, la loi s'étant limitée aux "points d'arrêt prioritaires".

      Logement : Inquiétude face à l'assouplissement des règles d'accessibilité (loi ELAN).

      Numérique : La dématérialisation produit des effets ambivalents. Selon l'ARCOM, peu de sites publics atteignent 50 % d'accessibilité et seulement 5 % sont totalement conformes.

      3.4. Aides à l'autonomie : Une Compensation Insuffisante et Inégale

      Le droit à la compensation instauré par la loi de 2005 présente des limites flagrantes.

      La barrière des 60 ans : Une différence de traitement persiste selon que le handicap survient avant ou après 60 ans, la fusion des régimes prévue pour 2010 n'ayant jamais eu lieu.

      Limites de la PCH (Prestation de Compensation du Handicap) :

      ◦ L'aide humaine est limitée aux besoins essentiels.    ◦ Les aides techniques sont sous-financées.    ◦ La PCH parentalité est critiquée pour ses critères restrictifs.

      4. Recommandations et Perspectives

      L'audition se conclut sur plusieurs axes d'action prioritaires pour remédier aux défaillances constatées.

      Application des textes : L'urgence est "l'application pure et simple des textes votés par le Parlement", dont beaucoup sont en attente de décrets d'application.

      Données statistiques : Il est impératif de disposer de données fiables et agrégées (ex : nombre d'heures de scolarisation effectives, nombre de places manquantes en IME) pour piloter les politiques publiques.

      Décloisonnement : Renforcer la coordination entre les secteurs sanitaire, médico-social, éducatif et judiciaire est crucial pour assurer la fluidité des parcours.

      Priorité à la jeunesse : La santé mentale des jeunes, érigée en grande cause nationale 2025, nécessite une "véritable prise de conscience collective" et des moyens financiers adéquats, notamment pour les CMP.

      Formation : L'amélioration de la formation des professionnels (enseignants, AESH, employeurs, soignants) est essentielle pour faire évoluer les pratiques et les cultures professionnelles.

      Lutte contre la complexité administrative : Le "mille-feuille" des dispositifs doit être simplifié pour améliorer la lisibilité et l'accès aux droits pour les familles.

    1. Four qualitative factors that maketexts easier or more complex are (1) levels of meaning (e.g., single or multiple) or purpose (e.g., explicitor implicit), (2) structure (e.g., simple or complex),(3) language conventionality and clarity (e.g., literalor figurative), and (4) knowledge demands (e.g., fewor many assumptions about readers' knowledge

      need a human's eyes on these things

    Annotators

    1. Reviewer #2 (Public review):

      Summary:

      The authors quantify human fMRI BOLD responses in pulvinar and mediodorsal thalamic nuclei during a fear conditioning and extinction task across two days, in a large sample size (hundreds of participants). They show that the BOLD responses in these areas differentiate the conditioned (CS+) and safety (CS-) stimuli. Additionally, this changes with repeated trials, which could be a neural correlate of fear learning. They show that the anterior pulvinar is most correlated with the MD, and that this is not due to anatomical proximity. They perform graph analysis on the pulvinar subnuclei, which suggests that the medial pulvinar is a hub between the sensory (lateral/inferior) and associative (anterior) pulvinar. They show different patterns of thalamic activity across conditioning, extinction, recall, and renewal.

      Strengths:

      The data has a large sample size (n=293 in some measures, n=412 in others). This is a validated human fear conditioning/extinction task that Dr Milad's group has been working with for several years. Few labs have investigated the thalamus activity during fear conditioning and extinction, particularly with a large sample size. There is an independent replication of the pulvinar network structure (Figure 3), which suggests that the processing in the more sensory-related inferior and lateral pulvinar is relayed to the anterior pulvinar (and possibly thereby to more action-related prefrontal areas) via an intermediate step in the medial pulvinar - potentially a novel discovery, but that needs more validation.

      Weaknesses:

      (1) The authors cannot make causal claims about their results based on correlational neuroimaging evidence. Causal claims should be pared back. E.g., sentence 1 in the Results section: "The anterior pulvinar and MD contribute to early associative threat learning, as evidenced by increased functional activation in response to CS+ compared to CS- at the block level (Fig. 1b-c)." needs to be reworded to something like "The anterior pulvinar and MD have increased functional activation... This suggests that these areas may contribute to early associate threat learning."

      (2) Figure 1: The fact that the difference in BOLD activity between CS+ and CS- goes away on the third trial is not addressed. This is a very large effect in the data.

      (3) Figure 3: Could the observed network structure be due to anatomical proximity? Perhaps the authors should do an analogous analysis to what they did in Figure 2 for this intra-pulvinar analysis. This analysis doesn't take into account the indirect connections through corticothalamic and thalamocortical connections with the visual cortex and the pulvinar. There is an implicit assumption that there are interconnections between the pulvinar subnuclei, but there are few strong excitatory projections between these subnuclei to my knowledge. If visual areas are included in the graph, it would make things more complex, but would probably dramatically change the story. In this way, the message is somewhat constructed or arbitrary.

      (3) In the results section describing Figures 4-7, there are no statistics supporting the claims made. There needs to be a set of graphs comparing the results across the study sessions and days, with statistical comparisons between the different experiments to confirm differences.

      (4) Figure 7 does not include the major corticothalamic and thalamocortical projections from early, mid-level, and higher visual cortex to the different pulvinar nuclei. I doubt that there are strong direct projections between the pulvinar nuclei; rather, the functional connections are probably mediated through interconnections with cortical visual areas.

      (5) Stylistic: There are a lot of hypotheses and interpretations presented in this primary literature paper, which may be better suited for a review or perspective piece.

      (6) In the discussion, there is an assumption that the fMRI BOLD responses to CS+ and CS- need to be different to indicate that an area is processing these distinctly, but the BOLD signal can only detect large-scale changes in overall activity. It's easy to imagine that an area could be involved in processing these two stimuli distinctly without showing an overall difference in the gross amount of activity.

      (7) There is strong evidence that the BOLD responses to the threat-related and safety-related stimuli are different, modest evidence for their claims of learning/plasticity in these pathways, and circumstantial evidence supporting their hypothesized graph network models. Overall, most of the claims made in the discussion are better considered possible interpretations rather than proven findings - this is not a criticism, as these experiments and subject matter are extremely complex.

      This study continues to validate the power and utility of this in human fear conditioning/extinction paradigm, and extends this paradigm to investigating fear learning beyond the traditional limbic system pathways. It's possible that their models for the pulvinar nuclei interconnections could guide future neuromodulation or DBS studies that could provide more causal evidence for their hypotheses.

    2. Reviewer #3 (Public review):

      Summary:

      The present work was aimed at investigating the specific contributions of thalamic nuclei to associative threat learning and extinction. Using fMRI, they examined activation patterns across pulvinar divisions, the lateral geniculate nucleus (LGN), and the mediodorsal thalamus (MD) during threat acquisition, extinction, and recall. Their goal was to uncover whether distinct thalamic systems support different modes of learning-automatic survival mechanisms versus more deliberate processes - and to propose a hierarchical pulvinar model of fear conditioning. They also try to refine current neuroanatomical models of threat learning and memory, highlighting the role of thalamic nuclei in it.

      Strengths:

      (1) Valuable theoretical elaboration and modeling regarding the differential role of pulvinar subdivisions on feedforward (inferior, lateral) and higher-order integration (anterior), and their functional interplay with other relevant subcortical and cortical structures in associative threat and extinction learning.

      (2) Large sample sizes and multipronged analytical approaches were used for hypothesis testing.

      (3) Exhaustive literature review in the field of associative threat, as well as regarding the role of thalamic nuclei and other brain structures in it.

      Weaknesses:

      (1) Several weaknesses should be pointed out regarding how fMRI data were collected, as well as decisions regarding how the fMRI data were preprocessed and analyzed:

      a) fMRI data have low resolution (3 cubic mm), which certainly limits the examination of small nuclei such as the ones investigated here, and especially the examination of the LGN and inferior pulvinar.

      b) fMRI was normalized to standard space. Analyzing the data in individual-subject space would have given you the options of avoiding altering every participant's brain and of using a probabilistic thalamic atlas that better adapts to each subject's brain and thalamic nuclei (see, for instance, Iglesias et al., 2018). This would have been ideal and would have given the authors more precision, especially considering the low resolution of the fMRI data and the size of the thalamic nuclei of interest.

      c) On top of the two previous points, the authors decided to smooth the data to 6mm, which means that every single voxel within these small nuclei was blurred/mixed with the 2 immediately contiguous voxels (if they followed the standard SPM12 normalization resampling default which resamples, or upsamples the data in this case, to 2 x 2 x 2mm). Given the strong changes in structural connectivity and function that can occur, especially in the thalamus, on voxels of this size, this and the previous 2 decisions do not favor anatomical precision.

      d) Motion during scanning was poorly controlled in the preprocessing. Including the motion parameters as covariates of no interest in the GLM does not fully guarantee that motion is not influencing the results, and that motion is not differentially influencing some experimental conditions more than others.

      (2) It is not clearly indicated in the manuscript how many subjects and how many trials went into each of the analyses. It would be important to indicate this in the text and/or the figures.

      (3) It is not clear either, why, given the large sample size, some of the results were not conducted using reproducibility strategies such as dividing the sample into 2 or 3 groups or using further cross-validation strategies.

      (4) Limited testing of alternative hypotheses. The results clearly seem to be a selection of the findings supporting the hypotheses that the authors sought to confirm. (just one example: in the analysis reported in Figures 1-2; are there other correlations between the activation of the anterior pulvinar and MD with other pulvinar nuclei? only the MD-anterior Puv is reported).

      (5) The manuscript does not contain a limitations subsection. Practically every study has limitations, and this one is not an exception. Better to tell the limitations to the readers upfront so they can factor them into their evaluation of the relevance of the manuscript and reported evidence.

      (6) Data should be made available to the scientific community. Code too. Even if you just used standard fMRI toolboxes, any code used to run analyses will be helpful to the community, or if someone decides to try to replicate your findings.

      Despite these weaknesses and what can be derived from them, this manuscript constitutes a valuable contribution to the field to start characterizing and conceptualizing the involvement of thalamic nuclei and their interactions with other brain regions in the associative threat learning circuitries. It also paves the road for further testing of the functional dynamics among these regions and circuitries, and modeling testing.

    1. Document d'information : Enjeux et Défis de la Santé Mentale en France

      Synthèse

      L'audition devant la commission d'enquête de l'Assemblée nationale met en lumière une crise profonde et multidimensionnelle du système de santé mentale en France.

      Les analyses des experts révèlent un fardeau économique et social colossal, estimé à 163 milliards d'euros, plaçant les maladies mentales au premier rang des dépenses de l'Assurance Maladie.

      Ce coût est principalement tiré par les hospitalisations, qui représentent jusqu'à 85 % des dépenses directes pour des pathologies comme la schizophrénie, tandis que les coûts des médicaments, majoritairement génériqués, restent faibles.

      Le système de soins psychiatriques est caractérisé par des défaillances systémiques majeures. La prévention est quasi inexistante, entraînant des retards de diagnostic dramatiques (plus de 10 ans pour les troubles bipolaires).

      L'organisation des soins, jugée obsolète, reste hospitalo-centrée, inégalitaire sur le territoire et cloisonnée, notamment entre les soins somatiques et psychiatriques.

      Ce cloisonnement a des conséquences mortelles, réduisant l'espérance de vie des patients de 15 à 20 ans, principalement à cause de maladies cardiovasculaires et de cancers non ou mal soignés.

      Malgré ce tableau sombre, des innovations organisationnelles et technologiques ont prouvé leur efficacité.

      Les "Centres Experts", par des bilans complets, réduisent significativement les hospitalisations et améliorent le pronostic des patients.

      De même, des projets pilotes utilisant des outils numériques (expérimentation "Article 51") ont divisé par deux les tentatives de suicide et généré des économies substantielles.

      Cependant, ces innovations peinent à être déployées à grande échelle en raison de freins structurels, d'un manque de vision stratégique et d'un sous-investissement chronique dans la recherche et le développement.

      La psychiatrie souffre en parallèle d'une grave crise d'attractivité, exacerbant les pénuries de personnel et la saturation du système.

      Enfin, la recherche et le pilotage des politiques publiques sont handicapés par un manque criant de données structurées, notamment sur le volet du handicap.

      1. Le Fardeau Économique et Social des Maladies Mentales

      L'analyse économique présentée par Isabelle Duranzaleski, professeur de médecine et docteur en économie, révèle l'empreinte considérable des pathologies psychiatriques sur la société française.

      L'étude, qui reproduit des méthodologies internationales pour permettre les comparaisons, agrège plusieurs types de coûts pour obtenir un chiffre global.

      1.1. Une Estimation Globale de 163 Milliards d'Euros

      En combinant l'ensemble des coûts, l'étude arrive à un total de 163 milliards d'euros.

      Ce chiffre, bien que sujet à des risques de double compte, a pour objectif principal d'alerter les décideurs publics sur l'ampleur du fardeau.

      Il se décompose en quatre catégories principales :

      1. Dépenses de l'Assurance Maladie : Les coûts directs des soins médicaux, qui placent les maladies mentales comme le premier ou deuxième poste de dépense avec le cancer.

      2. Dépenses du secteur médico-social : Les coûts réels engagés pour l'accompagnement.

      3. Pertes de production : Le "manque à gagner" pour la société lié à la morbidité et à la mortalité prématurée.

      4. Perte de santé valorisée : Une estimation monétaire de la perte d'années de vie en bonne santé, calculée selon des standards internationaux.

      1.2. La Prépondérance des Coûts d'Hospitalisation

      Une analyse détaillée des dépenses directes pour les patients atteints de schizophrénie et de troubles bipolaires, menée via les Centres Experts, démontre que l'hospitalisation constitue la source principale des coûts.

      Pour la schizophrénie, 85 % des coûts directs sont liés aux hospitalisations.

      Pour les troubles bipolaires, ce chiffre s'élève à 78 %.

      En comparaison, la part des médicaments est très faible, en raison de l'accès quasi-exclusif à des traitements génériqués et du manque d'accès aux innovations thérapeutiques.

      Pathologie

      Part des Hospitalisations

      Part des Médicaments

      Part des Consultations

      Schizophrénie

      85 %

      9 %

      6-7 %

      Troubles Bipolaires

      78 %

      18 %

      6-7 %

      1.3. Le Coût Spécifique du Suicide

      Une étude distincte, utilisant la même méthodologie, a évalué le coût du suicide et des tentatives de suicide, en incluant les coûts directs des soins, la perte de production et la perte d'années de vie valorisée.

      Coût du suicide : 18 milliards d'euros.

      Coût des tentatives de suicide : 5 milliards d'euros.

      2. Défaillances Systémiques et Organisation des Soins

      Les experts s'accordent sur un constat sévère : l'organisation actuelle des soins psychiatriques en France est en échec. Elle est marquée par un retard structurel, une inégalité d'accès et un cloisonnement préjudiciable.

      2.1. L'Échec de la Prévention

      Selon Marion Leboyer, professeur de psychiatrie, et Coralie Gandré, maîtresse de recherche à l'IRDES, le système de soins français présente des défaillances majeures aux trois niveaux de la prévention.

      Prévention primaire : Elle est quasi-inexistante, marquée par une méconnaissance profonde des maladies mentales dans la société et une méfiance envers la psychiatrie, ce qui entraîne une perte de chance considérable.

      La France accuse un retard de 20 ans par rapport aux pays anglo-saxons dans la mise en place de mesures de prévention et de lutte contre la stigmatisation.

      Prévention secondaire : Elle se traduit par un retard au diagnostic catastrophique.

      Troubles bipolaires : Plus de 10 ans en moyenne entre les premiers symptômes et le diagnostic.    ◦ Schizophrénie : Une "durée de psychose non traitée" de 2 à 5 ans, alors que les cinq premières années sont cruciales pour la réponse au traitement.

      Ce retard est attribué à un déficit de formation des médecins de première ligne et à un manque d'information du grand public.

      Prévention tertiaire : L'arrivée tardive dans le soin se fait majoritairement par les urgences, qui sont engorgées par des patients de plus en plus sévères, chroniques et polypathologiques.

      Un patient sur quatre hospitalisé en psychiatrie y entre via les urgences, un indicateur international de mauvaise qualité de l'accès aux soins.

      2.2. Un Modèle de Soins Obsolète, Inégal et Cloisonné

      Le système français est décrit comme :

      Hospitalo-centré : L'approche reste centrée sur l'hôpital, malgré un virage ambulatoire précoce (80 % des prises en charge).

      Un quart des lits en psychiatrie est occupé au long cours sans indication thérapeutique, souvent par défaut d'offre d'hébergement non médicalisé.

      Inégalitaire sur le territoire : L'accès aux soins varie considérablement d'une région à l'autre.

      La prévalence de la schizophrénie et des troubles bipolaires est deux fois plus élevée dans les villes, suggérant la nécessité de politiques de soins adaptées aux facteurs de risque environnementaux (urbanicité, pollution, stress).

      Peu lisible et insuffisamment spécialisé : L'offre de soins est confuse pour les usagers et manque de spécialisation par pathologie, contrairement aux pratiques internationales.

      Manquant de vision stratégique : Les experts déplorent une absence de stratégie claire et de politique d'évaluation de l'organisation des soins.

      2.3. La Fracture entre Soins Somatiques et Psychiatriques

      Un des points les plus critiques soulevés est le cloisonnement total entre la psychiatrie et les autres spécialités médicales.

      Les conséquences pour les patients sont dramatiques :

      Surmortalité massive : La première cause de mortalité n'est pas le suicide, mais les maladies cardiovasculaires et les cancers.

      L'espérance de vie est réduite de 20 ans pour les femmes et 15 ans pour les hommes (données de M. Leboyer) ou de 13 ans pour les femmes et 16 ans pour les hommes (étude IRDES).

      Le taux de mortalité est 2 à 3 fois supérieur à celui de la population générale.

      Sous-dépistage des comorbidités : Des pathologies comme le syndrome métabolique (hypertension, obésité, etc.) sont très fréquentes (24 % pour la schizophrénie, 38 % pour la dépression résistante, contre 10 % en population générale), mais 70 % à 90 % des patients ne sont ni dépistés, ni soignés.

      Moindre recours aux soins préventifs : Les patients ont moins accès au dépistage des cancers, à la vaccination, aux dentistes, ophtalmologues ou gynécologues.

      Retard au diagnostic pour le cancer : Les patients atteints de troubles psychiques sont diagnostiqués plus tardivement pour les cancers, reçoivent des prises en charge plus invasives mais moins intensives, ce qui augmente les pertes de chance.

      3. Innovations et Levier d'Amélioration : Un Potentiel Sous-exploité

      Face à ces défaillances, des innovations organisationnelles, numériques et thérapeutiques ont démontré leur efficacité, mais leur déploiement reste limité par de nombreux freins.

      3.1. Les Centres Experts : Un Modèle d'Évaluation Efficace

      Inspirés des modèles existants pour le cancer ou les maladies rares, les Centres Experts proposent un bilan complet (psychiatrique, cognitif, social, somatique) sur une journée en hôpital de jour.

      Impact démontré : Les études menées avant et après passage dans ces centres montrent une diminution significative du nombre de journées d'hospitalisation, une amélioration du pronostic, une meilleure adhérence au traitement et un meilleur dépistage des comorbidités somatiques.

      Objectif : L'objectif est de déployer ce dispositif sur tout le territoire national et de l'inscrire dans la liste des activités spécifiques nationales pour garantir un accès équitable à tous les patients.

      3.2. Le Numérique au Service du Suivi (Expérimentation Article 51)

      Un projet pilote a testé une innovation organisationnelle combinant un suivi par une infirmière de pratique avancée ("case manager") et des outils digitaux pour des patients bipolaires.

      Résultats positifs : L'évaluation par le ministère de la Santé a montré :

      • Tentatives de suicide divisées par deux (de 7,6 % à 3,9 %).  
      • Journées d'hospitalisation réduites (de 42 % à 24 %).  
      • Impact économique démontré : un gain estimé à 3 500 € par patient et par an, grâce à la baisse des hospitalisations.

      Déploiement en attente : Malgré ces résultats, la décision de généraliser le dispositif est toujours en attente, suscitant l'inquiétude des patients et des soignants.

      3.3. Freins à l'Innovation et à la Recherche

      Plusieurs obstacles entravent la modernisation de la psychiatrie :

      Sous-investissement dans la recherche : La France ne consacre que 2 à 4 % de son budget de recherche biomédicale à la psychiatrie, l'un des taux les plus faibles des pays développés.

      Faible translation des découvertes : De nombreuses innovations (biomarqueurs sanguins, imagerie cérébrale, outils numériques) issues de la recherche peinent à être appliquées dans les soins courants.

      Accès limité aux nouvelles thérapies : Les patients français ont un accès moindre aux innovations thérapeutiques, qu'elles soient médicamenteuses (ex: antipsychotiques de 2e génération disponibles ailleurs en Europe mais pas en France) ou psychosociales.

      4. Enjeux Transversaux et Difficultés Spécifiques

      4.1. Crise d'Attractivité et Pénurie de Personnel

      La psychiatrie fait face à une crise d'attractivité majeure, constituant un frein structurel à toute amélioration.

      Désaffection des jeunes médecins : La psychiatrie est choisie parmi les dernières disciplines par les internes, contrairement à des pays comme le Canada où elle est devenue un premier choix suite à des investissements massifs.

      Pénurie de personnel : La FHF (Fédération Hospitalière de France) estime qu'un quart des postes de psychiatres sont vacants dans plus de la moitié des établissements publics.

      Conséquences directes : Cette pénurie entraîne une fermeture massive de lits d'hospitalisation et une saturation de l'ensemble du système de soins (CMP surchargés, urgences engorgées).

      4.2. Les Pratiques Coercitives et les Droits des Patients

      La France se distingue par un recours élevé à des pratiques restrictives de liberté, avec des variations très importantes entre établissements qui interrogent l'équité des prises en charge.

      Soins sans consentement : Près de 100 000 personnes par an (soit 5% de la file active).

      Isolement : Près de 30 000 personnes par an.

      Contention mécanique : Près de 10 000 personnes par an. Ces chiffres placent la France dans une situation défavorable par rapport aux autres pays développés, avec une tendance à l'augmentation.

      4.3. Le Manque Crucial de Données sur le Handicap

      Maude Espagiac, spécialiste du handicap, souligne une difficulté majeure pour la recherche et le pilotage des politiques : l'accès aux données.

      Cloisonnement des données : Les systèmes d'information des secteurs médical, social et médico-social ne communiquent pas, ce qui empêche d'avoir une vision complète des parcours et des coûts.

      Identification des personnes : Il est très difficile d'identifier les personnes en situation de handicap dans les bases de données de santé existantes, et de mesurer les coûts non publics (reste à charge pour les familles, les aidants).

      Perspective d'amélioration : L'intégration annoncée des données des MDPH (Maisons Départementales des Personnes Handicapées) dans le Système National des Données de Santé (SNDS) d'ici 2026 est une avancée attendue depuis une décennie.

    1. Joint Public Review:

      This manuscript puts forward the provocative idea that a posttranslational feedback loop regulates daily and ultradian rhythms in neuronal excitability. The authors used in vivo long-term tip recordings of the long trichoid sensilla of male hawkmoths to analyze spontaneous spiking activity indicative of the ORNs' endogenous membrane potential oscillations. This firing pattern was disrupted by pharmacological blockade of the Orco receptor. They then use these recordings together with computational modeling to predict that Orco receptor neuron (ORN) activity is required for circadian, not ultradian, firing patterns. Orco did not show a circadian expression pattern in a qPCR experiment, and its conductance was proposed to be regulated by cyclic nucleotide levels. This evidence led the authors to conclude that a post-translational feedback loop (PTFL) clockwork, associated with the ORN plasma membrane, allows for temporal control of pheromone detection via the generation of multi-scale endogenous membrane potential oscillations. The findings will interest researchers in neurophysiology, circadian rhythms, and sensory biology. However, the manuscript has limited experimental evidence to support its central hypothesis and is undermined by several questionable assumptions that underlie their data analysis and model builds, as well as insufficient biological data, including critical controls to validate and/or fully justify the model the authors are proposing.

      Strengths:

      The study is notable for its combination of long-term in vivo tip recordings with computational modeling, which is technically challenging and adds weight to the authors' claims. The link between Orco, cyclic nucleotides, and circadian regulation is potentially important for sensory neuroscience, and the modeling framework itself - a stochastic Hodgkin-Huxley formulation that explicitly incorporates channel noise - is a solid and forward-looking contribution. Together, these elements make the study conceptually bold and of clear interest to circadian and olfactory biologists.

      Major weaknesses:

      At the same time, several limitations temper the conclusions. The pharmacological evidence relies on a single antagonist and concentration, without key controls. The circadian analysis is based on relatively small numbers of neurons, with rhythms detected only in subsets, and the alignment procedure used in constant darkness raises concerns of bias. The molecular evidence is sparse, with only three qPCR timepoints, and the model, while creative, rests on assumptions that are not yet fully supported by in vivo data.

      Detailed comments are provided below:

      (1) The role for Orco proposed in the authors' model largely stems from the effects seen following the administration of (a single dose) of the Orco antagonist, OLC15. However, this hypothesis is undercut by the lack of adequate pharmacological controls, including a basic multipoint OLC15 dose-response series in addition to the administration of blockers for the other channels that are embedded in their model, but which were ruled out as being involved in the modulation of biological rhythms. In addition, these studies would (ideally) also benefit from the inclusion of the same concentration (series) of an inactive OLC15 analog to better control for off-target effects.

      (2) The expression pattern of Orco was assessed using qPCR at only three timepoints. Rhythmic transcripts can easily be missed with such sparse sampling (Hughes et al., 2017). A minimum of six evenly spaced timepoints across a 24-hour cycle would be required to confidently rule out circadian transcriptional regulation. In addition, the use of the timeless mRNA control from another study is not acceptable. Furthermore, qPCR analysis measures transcript abundance, not transcription, as the authors repeatedly state. Transcriptional studies would require nuclear run-off or, more recently, can be done with snRNAseq analysis. Taken together, these concerns undermine the authors' desire to rule out TTFL-based control that directly led them to implicate a PTTF-based model.

      (3) The modelling presented is based on Orco as a ZT-dependent conductance tied to the cAMP oscillations that were reported by this group in the cockroach and from the presence and functionality in Manduca of homomeric Orco complexes that are devoid of tuning ORs. While these complexes have been generated in cell culture and other heterologous expression systems, as well as presumably exist in vivo in the Drosophila empty neuron and other tuning OR mutants, there is no evidence that these complexes exist in wild-type Manduca ORNs. While this doesn't necessarily undermine every aspect of their models, the authors should note the presence of Orco/OR complexes rather than Orco homomeric complexes.

      (4) Some aspects of the authors' models, most notably the decision to phase align/optimize their DD and OLC15 recordings, are likely to bias their interpretations.

      (5) The tip recordings from long trichoid sensilla are critical aspects of this study. These recordings were carried out on upper sensillar tips located on the distal-most second annulus. Since there are approximately 80 annuli on the Manduca antennae, it is unclear whether the recordings are representative of the antennal response.

      (6) The authors do not provide any data in support of their cAMP/cGMP-based Orco gating, and the PTTF model proposed is somewhat disappointing. The model seems to be influenced by their long-held proposal that insect olfactory signaling has a critical metabotropic component involving cyclic nucleotides, PKC, etc, a view that may be influenced by the use of Orco homomeric complexes generated in HEK cells. Nevertheless, structural studies on Orco do not support a cyclic nucleotide binding site, although PKC-based phosphorylation has been implicated in the fine-tuning/adaptation of olfactory signaling.

      (7) Because only 5/11 LD and 7/10 DD animals showed daily rhythms, with averages lacking clear daily modulation, the methods are not sufficiently reliable enough to reveal novel underlying mechanisms of circadian rhythm generation. The reported results are therefore not yet reliable or quantifiable. To quantify their results, the authors should apply tests for circadian rhythmicity using methods such as RAIN, JTK CYCLE, MetaCycle, or Echo. The use of FFT and Wavelet is applauded, but these methods do not have tests of significance for rhythms and can be biased when analyzing data in which there could only be 1-3 circadian cycles. Because the conclusions appear to be based on 11-12 neurons that were recorded for 2-4 days, the reader is concerned that the methods are not yet perfected to provide strong evidence for circadian regulation of spontaneous firing of ORNs. The average data (e.g., Figure 3Bii and 3Cii) highlight the apparent lack of daily rhythms. In summary, the results would be more compelling if more than 50% of the recordings had significant circadian amplitudes and with similar periods and phases.

      (8) The statement that circadian patterns of ORN firing are lost with the Orco antagonist (OLC15) is not strongly supported. The manuscript should be revised to quantify how Orco changed circadian amplitude in the 12 recorded neurons. Measures of circadian amplitude can avoid confusing/vague statements like Line 394 "low and high frequency bands appeared to merge during the activity phase around ZT 0 in the animals that showed clear circadian rhythms (N = 5 of 11 in LD)". The conclusion that Orco blocks circadian firing appears to be contradicted by Figure 6, which indicates that ~6 of these neurons had circadian periods detected by wavelet. The manuscript would be strengthened with details about the specificity and reproducibility of the Orco antagonist. The authors quantify the gradual decrease in firing with the slope of a linear fit to estimate how the "effectiveness [of OLC15] increased over time." They conclude that the drug "obliterated circadian rhythms and attenuated the spontaneous activity in several, but not all experiments (N = 8 of 12)." The report would be greatly strengthened with corroborating data from additional Orco antagonists and additional doses of OLC15 (the authors use only 50 uM OLC15).

      (9) The manuscript includes several statements that are more speculation than conclusion. For example, there is no evidence for tuning or plasticity in this report. Statements like the following should be removed or addressed with experiments that show changes in odor response specificity or sensitivity: "ORN signalosomes are highly plastic endogenous PTFL clocks comprising receptors for circadian and ultradian Zeitgebers that allow to tune into internal physiological and external environmental rhythms as basis for active sensing." (Discussion Line 622). The paper concludes that (line 380) "mean frequency of spontaneous spiking and the frequency of bursting expressed daily modulation, and are both most likely controlled via a circadian clock that targets the leak channel Orco." This is too bold given the available results.

      (10) Because Orco conductance is modulated by cyclic nucleotides, it remains highly plausible that circadian regulation occurs upstream at the level of signaling pathways (e.g., calcium, calcium-binding proteins, GPCRs, cyclases, phosphodiesterases). The possibility that circadian oscillations of cyclic nucleotides are generated by the canonical TTFL mechanism has not been excluded. In fact, extensive work in Drosophila has demonstrated that the TTFL-based molecular clock proteins are required for circadian rhythms in olfaction.

      (11) A defining feature of circadian oscillators is the feedback mechanism that generates a time delay (e.g., PERIOD/TIMELESS repressing their own transcription). While the authors describe how cyclic nucleotides can regulate Orco conductance, they do not provide a convincing explanation of how Orco activity could, in turn, feed back into the proposed PTFL to sustain oscillations. For these reasons, the authors should consider:

      (a) Providing a broader discussion of non-TTFL models of circadian rhythms (e.g., redox cycles, post-translational modifications).

      (b) Reassessing Orco expression using a higher-resolution temporal sampling ({greater than or equal to}6 timepoints per 24 h).

      (c) Clarifying or revising the PTFL model to explicitly address how feedback would be achieved. Alternatively, the data may be more consistent with Orco conductance rhythms being regulated by post-translational mechanisms downstream of the canonical TTFL oscillator, as suggested by the Drosophila olfactory system literature.

      Minor weaknesses:

      (1) The authors should compare the firing patterns of ORN neurons to the bursts, clusters, and packets of retinal efferent spikes reported in Liu JS and Passaglia CL (2011; JBR). By comparing measures in moths to measures in Limulus, the authors might be able to address the question: Is the daily firing pattern of ORN neurons likely a conserved feature of circadian control of sensory sensitivity?

      (2) The methods need further details. For example, it is unclear if or how single neuron activity was discriminated and whether the results were compromised by the relatively large environmental fluctuations in temperature (21-27oC), humidity (35-60%), or other cues known to modulate spontaneous firing.

    2. Author response:

      Joint Public Review

      This manuscript puts forward the provocative idea that a posttranslational feedback loop regulates daily and ultradian rhythms in neuronal excitability. The authors used in vivo long-term tip recordings of the long trichoid sensilla of male hawkmoths to analyze spontaneous spiking activity indicative of the ORNs' endogenous membrane potential oscillations. This firing pattern was disrupted by pharmacological blockade of the Orco receptor. They then use these recordings together with computational modeling to predict that Orco receptor neuron (ORN) activity is required for circadian, not ultradian, firing patterns. Orco did not show a circadian expression pattern in a qPCR experiment, and its conductance was proposed to be regulated by cyclic nucleotide levels. This evidence led the authors to conclude that a post-translational feedback loop (PTFL) clockwork, associated with the ORN plasma membrane, allows for temporal control of pheromone detection via the generation of multi-scale endogenous membrane potential oscillations. The findings will interest researchers in neurophysiology, circadian rhythms, and sensory biology. However, the manuscript has limited experimental evidence to support its central hypothesis and is undermined by several questionable assumptions that underlie their data analysis and model builds, as well as insufficient biological data, including critical controls to validate and/or fully justify the model the authors are proposing.

      We thank the reviewers for their thorough and thoughtful comments and believe that the manuscript will be much stronger once we incorporate the requested changes.

      Please note that we used ORN as acronym for “olfactory receptor neuron” throughout the manuscript. ORNs contain odorant receptors (ORs), and in insects these ORs have to associate with the olfactory receptor co-receptor (Orco) in the cilium of the neuron to form functional OR-Orco complexes for odorant detection. Besides this chaperone function, Orco can form homomers with the potential to act as ionic pacemaker channels; a role which we explore in this study.

      Strengths:

      The study is notable for its combination of long-term in vivo tip recordings with computational modeling, which is technically challenging and adds weight to the authors' claims. The link between Orco, cyclic nucleotides, and circadian regulation is potentially important for sensory neuroscience, and the modeling framework itself - a stochastic Hodgkin-Huxley formulation that explicitly incorporates channel noise - is a solid and forward-looking contribution. Together, these elements make the study conceptually bold and of clear interest to circadian and olfactory biologists.

      Major weaknesses:

      At the same time, several limitations temper the conclusions. The pharmacological evidence relies on a single antagonist and concentration, without key controls. The circadian analysis is based on relatively small numbers of neurons, with rhythms detected only in subsets, and the alignment procedure used in constant darkness raises concerns of bias. The molecular evidence is sparse, with only three qPCR timepoints, and the model, while creative, rests on assumptions that are not yet fully supported by in vivo data.

      Please see our responses to the detailed comments.

      Detailed comments are provided below:

      (1) The role for Orco proposed in the authors' model largely stems from the effects seen following the administration of (a single dose) of the Orco antagonist, OLC15. However, this hypothesis is undercut by the lack of adequate pharmacological controls, including a basic multipoint OLC15 dose-response series in addition to the administration of blockers for the other channels that are embedded in their model, but which were ruled out as being involved in the modulation of biological rhythms. In addition, these studies would (ideally) also benefit from the inclusion of the same concentration (series) of an inactive OLC15 analog to better control for off-target effects.

      The Orco agonist VUAA1 (Jones et al., 2011) binds directly to Orco and increases the channel open time probability. In M. sexta hawkmoths, we have already published that VUAA 1 increases the low spontaneous activity of ORNs in a dose-dependent fashion (Nolte et al., 2016). Chen and Luetje (2012) systematically varied the chemical structure of VUAA1 to identify new Orco ligands and discovered 22 Orco Ligand Candidates (OLC) that either activated or inhibited Orco. In their heterologous expression system, Orco was most sensitive to inhibition by OLC15. Based on these results, we published a dose-response curve of OLC15 inhibition (1-100 µM) using in vivo tip recordings of pheromone-sensitive long trichoid sensilla of M. sexta (Nolte et al., 2016). In that study, we could also demonstrate that OLC15 antagonizes the VUAA1 activation of Orco.

      Furthermore, we tested other published Orco antagonists in in vivo assays in intact hawkmoths, focusing on amiloride-derived antagonists, because we previously identified an amiloride-sensitive cation channel in hawkmoth ORNs. We found that, in contrast to OLC15, the amilorides HMA and MIA were not Orco-specific but instead affected different targets depending on time-of-day (Nolte et al., 2016). Based on those experiments and the dose-response curves we determined that the Orco agonist VUAA1 (Jones et al., 2011) and the Orco antagonist OLC15 (Chen and Luetje, 2012) worked best in hawkmoth ORNs to target Orco pharmacologically. Based on comparative tests with other published Orco antagonists we settled since then in all further experiments on a dose of 50 µM OLC15.

      We will clarify the Methods section accordingly.

      (2) The expression pattern of Orco was assessed using qPCR at only three timepoints. Rhythmic transcripts can easily be missed with such sparse sampling (Hughes et al., 2017). A minimum of six evenly spaced timepoints across a 24-hour cycle would be required to confidently rule out circadian transcriptional regulation. In addition, the use of the timeless mRNA control from another study is not acceptable. Furthermore, qPCR analysis measures transcript abundance, not transcription, as the authors repeatedly state. Transcriptional studies would require nuclear run-off or, more recently, can be done with snRNAseq analysis. Taken together, these concerns undermine the authors' desire to rule out TTFL-based control that directly led them to implicate a PTTF-based model.

      We agree with the referees that more time points and a direct comparison between timeless and Orco mRNA levels should be included in this manuscript. We will include these additional qPCR experiments and edit the manuscript to make clear that we measure transcript abundance, but we will not perform snRNAseq analysis due to time- and financial constraints. We are currently working on the transcriptional control of Orco, both during ontogeny and throughout the day but this work in progress is beyond the scope of this manuscript.

      (3) The modelling presented is based on Orco as a ZT-dependent conductance tied to the cAMP oscillations that were reported by this group in the cockroach and from the presence and functionality in Manduca of homomeric Orco complexes that are devoid of tuning ORs. While these complexes have been generated in cell culture and other heterologous expression systems, as well as presumably exist in vivo in the Drosophila empty neuron and other tuning OR mutants, there is no evidence that these complexes exist in wild-type Manduca ORNs. While this doesn't necessarily undermine every aspect of their models, the authors should note the presence of Orco/OR complexes rather than Orco homomeric complexes.

      Our ELISAs found circadian oscillations in cAMP levels not only in antennae of the Madeira cockroach (Schendzielorz et al., 2014, 2012), but also in hawkmoth antennae (Schendzielorz et al., 2015). We will add the 2015 citation to the Modeling chapter in the Methods section to clarify this.

      We agree with the referees that we cannot distinguish between Orco homo- and heteromers in the different compartments of our hawkmoth ORNs. Thus, as the referee suggests, we will add text regarding the presence and localization of OR-Orco heteromers. However, we have indications that Orco homomers could indeed be present in the hawkmoth ORNs. In a heterologous expression system, MsexOrco expression alone was sufficient to increase intracellular Ca<sup>2+</sup> levels in response to VUAA1 application (Nolte et al., 2013). In differentiating primary cell cultures of hawkmoth antennae, Orco expression started during a developmental time window where ORNs did not yet express pheromone receptors, and Orco affected spontaneous activity (Nolte et al., 2016). Thus, Orco homomers are present in developing hawkmoth ORNs during a time window where ORNs already express spontaneous activity but cannot heteromerize with pheromone receptors. However, we do not know whether and in what ratio homo- and heteromers of Orco and ORs are present in the respective sensillum compartments of adult hawkmoths (Nolte et al., 2013; Stengl, 1994; Stengl and Hildebrand, 1990).

      We will clarify our manuscript accordingly.

      (4) Some aspects of the authors' models, most notably the decision to phase align/optimize their DD and OLC15 recordings, are likely to bias their interpretations.

      It is consensus that insects display daily and circadian rhythms in pheromone-dependent mating, odor-gated feeding, and egg-laying behavior that phase-locks to environmental rhythms, corresponding with daily/circadian rhythms of sensory neuron physiology (e.g., Merlin et al., 2007; Rymer et al., 2007; Schendzielorz et al., 2015, 2012). However, circadian rhythms can be easily masked by stress, like the disturbances during a very challenging long-term recording experiment over several days. In addition, we observed in our animal raising facility that in LD 17:7 light-dark cycles the originally nocturnal hawkmoths M. sexta distribute their activity patterns over the course of the day, finding nocturnal as well as diurnal hawkmoths. Thus, light-dark cycles were not enough to ensure phase-synchronized behavioral rhythms, and it is very likely that the nocturnal hawkmoths rely heavily on pheromone/odor dependent synchronization as also found in other moth species (Ghosh et al., 2024). Here, we used isolated males that were never exposed to the female pheromones so that their circadian activity patterns readily disperse. Therefore, it became necessary in free-running conditions to first determine the respective behavioral rhythm for each animal, and then to phase-align their activity patterns to allow for statistical analysis. Otherwise, circadian differences would average out in a free-running population. As requested by the referees in point (7), we will use additional tests for rhythmicity in each of our recordings and revise the manuscript accordingly.

      Assuming that hawkmoths need pheromone presence as additional Zeitgeber, we are currently working on a new set of experiments where we attempt to improve synchronization by exposure to LD cycles and pheromone before DD and OLC15 recordings. We will add these experiments to the manuscript.

      (5) The tip recordings from long trichoid sensilla are critical aspects of this study. These recordings were carried out on upper sensillar tips located on the distal-most second annulus. Since there are approximately 80 annuli on the Manduca antennae, it is unclear whether the recordings are representative of the antennal response.

      We think the reviewers might have misinterpreted our description of the recording site. In the Methods, we state that we clip off the 20 most distal annuli (leaving a stump of about 60 annuli) and insert the reference electrode into the flagellum up to the second annulus from the cut end, i.e., the recording site is located at 2/3 – 3/4 of the antenna length as seen from the head of the animal. We will make this more clear in the Methods section.

      In addition, our lab did show with antibody stainings against Orco that apparently all ORNs that innervate long and short trichoid sensilla along the whole flagellum express the same staining pattern (Nolte et al., 2016). Furthermore, our patch clamp recordings of primary cell cultures of whole male antennae found largely overlapping ion channel populations across ORNs. This would indicate that all ORNs, whether they express pheromone- or general odorant receptors, could potentially share the same Orco-dependent spontaneous activity rhythms. In our lab, different experimenters from different years that recorded from long trichoid sensilla on different annuli did not detect obvious differences in neither the spontaneous activity nor the pheromone responses (c.f., Dolzer et al., 2003; Gawalek and Stengl, 2018; Schneider et al., 2025). Thus, it is very likely that we are reporting a general encoding mechanism that is not locally restricted along the antennal flagellum.

      (5.1) The authors do not provide any data in support of their cAMP/cGMP-based Orco gating…

      There are publications supporting cyclic nucleotide gating of Orco in Drosophila, but only after previous phosphorylation via protein kinase C (PKC; review: (Wicher and Miazzi, 2021)). Since Orco is very conserved among insect species, it is likely that these PKC and cGMP/cAMP-dependent regulations are present in other insect species. We are currently running thorough tip-recording experiments on the regulation of Orco gating, which are beyond the scope of this manuscript. However, we will add a set of experiments to this manuscript that demonstrates cAMP gating of Orco.

      (5.2)… and the PTTF model proposed is somewhat disappointing.

      For a detailed introduction of our PTFL membrane clock hypothesis please see our opinion paper (Stengl and Schneider, 2024).

      (5.3) The model seems to be influenced by their long-held proposal that insect olfactory signaling has a critical metabotropic component involving cyclic nucleotides, PKC, etc, a view that may be influenced by the use of Orco homomeric complexes generated in HEK cells.

      Indeed, we propose a metabotropic pheromone-transduction cascade, which in moths and cockroaches is based on G-protein-mediated activation of phospholipase C but not on adenylyl cyclase activation. Our hypothesis is not influenced by HEK cell heterologous expression studies of Orco but is supported by our own work comparing in vivo tip recordings of intact hawkmoths with patch clamp experiments on hawkmoth primary cell cultures of olfactory receptor neurons, which are able to respond to their species-specific pheromones in vitro ((Schneider et al., 2025; Stengl, 2010; Stengl and Funk, 2013; Wicher and Miazzi, 2021). In addition, a multitude of publications by other laboratories with in vivo and in vitro studies using physiological, genetic, and immunocytochemical assays all support a metabotropic signal transduction cascade in insect olfaction (reviews: Stengl, 2010; Stengl and Funk, 2013; Wicher and Miazzi, 2021). In contrast, the hypothesis suggesting a solely ionotropic pheromone- and general odor-dependent transduction cascade for all insect species is based on very sparse experimental evidence, based primarily on heterologous expression studies such as HEK cells that lack the insect’s WT molecular surroundings, and thus, cannot predict OR-Orco function in vivo. Furthermore, the ionotropic hypothesis is heavily based upon the argument that an inverse 7TM receptor cannot couple to G-proteins, which lacks careful backup via biochemical and structural studies. In addition, the ionotropic hypothesis lacks support via carefully performed physiological in vivo studies in different insect species that paid attention to analysis of the distinct kinetic components of ORN´s odor/pheromone responses and that employ physiological concentrations and durations of odor/pheromone stimuli (please see our most recent publication by Schneider et al. (2025)).

      (5.4) Nevertheless, structural studies on Orco do not support a cyclic nucleotide binding site, although PKC-based phosphorylation has been implicated in the fine-tuning/adaptation of olfactory signaling.

      While structural studies did not find evidence for conserved known cyclic nucleotide binding sites on Orco, this does not exclude the presence of so far unknown binding sites, or via sites that fold out only after a specific sequence of previous phosphorylations of the many phosphorylation sites on Orco. Indeed, physiological studies in Drosophila presented evidence for cyclic nucleotide dependence of Orco after previous PKC-dependent phosphorylation (Getahun et al., 2013). Our ongoing in vivo experiments in hawkmoths further corroborate a PKC- and cAMP-dependent modulation of Orco. These studies will be published in a follow-up publication.

      (6) Because only 5/11 LD and 7/10 DD animals showed daily rhythms, with averages lacking clear daily modulation, the methods are not sufficiently reliable enough to reveal novel underlying mechanisms of circadian rhythm generation. The reported results are therefore not yet reliable or quantifiable. To quantify their results, the authors should apply tests for circadian rhythmicity using methods such as RAIN, JTK CYCLE, MetaCycle, or Echo. The use of FFT and Wavelet is applauded, but these methods do not have tests of significance for rhythms and can be biased when analyzing data in which there could only be 1-3 circadian cycles. Because the conclusions appear to be based on 11-12 neurons that were recorded for 2-4 days, the reader is concerned that the methods are not yet perfected to provide strong evidence for circadian regulation of spontaneous firing of ORNs. The average data (e.g., Figure 3Bii and 3Cii) highlight the apparent lack of daily rhythms. In summary, the results would be more compelling if more than 50% of the recordings had significant circadian amplitudes and with similar periods and phases.

      The long-term tip-recordings of intact hawkmoths are very challenging and take a very long time to accomplish, thus, we are very happy that we succeeded in obtaining so many of them (N=34). Since 5/11 LD recordings and 7/10 DD recordings revealed daily/circadian rhythmicity and since many other physiological recordings at different ZTs of different members of our laboratory all revealed ZT-dependent pheromone-transduction we can be certain that the physiology of hawkmoth antennae is under strict circadian control. Please see also our response to (4) above commenting the phase-dispersal of activity rhythms observed in our experiments, as well as in the behavior of hawkmoth males in the mating cage.

      Nevertheless, we will follow the advice of the referees to apply additional tests for significance of rhythms in spontaneous activity, and we are thankful for the tests suggested that we were not aware of.

      (7) The statement that circadian patterns of ORN firing are lost with the Orco antagonist (OLC15) is not strongly supported. The manuscript should be revised to quantify how Orco changed circadian amplitude in the 12 recorded neurons. Measures of circadian amplitude can avoid confusing/vague statements like Line 394 “low and high frequency bands appeared to merge during the activity phase around ZT 0 in the animals that showed clear circadian rhythms (N = 5 of 11 in LD)”. The conclusion that Orco blocks circadian firing appears to be contradicted by Figure 6, which indicates that ~6 of these neurons had circadian periods detected by wavelet. The manuscript would be strengthened with details about the specificity and reproducibility of the Orco antagonist. The authors quantify the gradual decrease in firing with the slope of a linear fit to estimate how the “effectiveness [of OLC15] increased over time.” They conclude that the drug “obliterated circadian rhythms and attenuated the spontaneous activity in several, but not all experiments (N = 8 of 12).” The report would be greatly strengthened with corroborating data from additional Orco antagonists and additional doses of OLC15 (the authors use only 50 uM OLC15).

      We will revise our data analysis, according to the valuable suggestions of the referees.

      However, based upon our previous studies with other Orco antagonists and different doses of OLC15 (Nolte et al., 2016) we found that 50 µM OLC15 is the best Orco antagonist dose in M. sexta to target Orco-dependent modulation of spontaneous action potential activity of hawkmoth olfactory receptor neurons. Please see also our response to (1).

      (8) The manuscript includes several statements that are more speculation than conclusion. For example, there is no evidence for tuning or plasticity in this report. Statements like the following should be removed or addressed with experiments that show changes in odor response specificity or sensitivity: "ORN signalosomes are highly plastic endogenous PTFL clocks comprising receptors for circadian and ultradian Zeitgebers that allow to tune into internal physiological and external environmental rhythms as basis for active sensing." (Discussion Line 622). The paper concludes that (line 380) "mean frequency of spontaneous spiking and the frequency of bursting expressed daily modulation, and are both most likely controlled via a circadian clock that targets the leak channel Orco." This is too bold given the available results.

      We will revise the discussion accordingly and clarify which statements are supported via published evidence and which are predictions based upon our novel hypothesis published in our opinion paper (Stengl and Schneider, 2024).

      (9.1) Because Orco conductance is modulated by cyclic nucleotides, it remains highly plausible that circadian regulation occurs upstream at the level of signaling pathways (e.g., calcium, calcium-binding proteins, GPCRs, cyclases, phosphodiesterases).

      We agree with the referees that it is very likely that there are multiple layers of interconnected feedback cycles that control Orco localization and activity. Our novel hypothesis suggests interlocked TTFL and PTFL control of physiological circadian rhythms, not strictly hierarchical TTFL control, which would require a daily turnover of membrane proteins and transcriptional control via the established TTFL clock in insect ORNs. We currently search for TTFL control at all levels of odor/pheromone transduction using ZT-dependent transcriptomics in combination with qPCR and single nuclear transcriptomics, involving also all the molecules suggested by the referees. These studies are ongoing, are very time- and money-consuming, and are beyond the scope of this manuscript.

      (9.2) The possibility that circadian oscillations of cyclic nucleotides are generated by the canonical TTFL mechanism has not been excluded. In fact, extensive work in Drosophila has demonstrated that the TTFL-based molecular clock proteins are required for circadian rhythms in olfaction.

      Our experiments that test circadian TTFL control at different levels of the cAMP transduction cascade in hawkmoth antennae are on the way and are part of another publication. We will revise our discussion accordingly.

      The experiments published for TTFL dependent control of Drosophila olfaction that we are aware of (Krishnan et al., 1999; Tanoue et al., 2004) do not exclude interlinked PTFL and TTFL clocks. Krishnan et al. (1999) demonstrate that the TTFL clock in antennal olfactory receptor neurons correlates with circadian rhythms in odor responses measured in electroantennogram (EAG) recordings, not in single sensillum recordings as in our experiments. EAG recordings comprise not only voltage responses of the olfactory sensory neurons but also voltage changes generated in non-neuronal antennal cells such as trichogen and tormogen cells that built the transepithelial potential gradient via vATPases that generates the high K<sup>+</sup> concentration in the sensillum lymph (Jain et al., 2024; Klein, 1992; Thurm and Küppers, 1980). In addition, EAG recordings most likely contain responses of afferent neurons originating from somata in the brain that maintain central control of the antennae. Thus, EAG recordings are difficult to interpret.

      (11) A defining feature of circadian oscillators is the feedback mechanism that generates a time delay (e.g., PERIOD/TIMELESS repressing their own transcription). While the authors describe how cyclic nucleotides can regulate Orco conductance, they do not provide a convincing explanation of how Orco activity could, in turn, feed back into the proposed PTFL to sustain oscillations. For these reasons, the authors should consider:

      a) Providing a broader discussion of non-TTFL models of circadian rhythms (e.g., redox cycles, post-translational modifications).

      We will revise the discussion accordingly.

      b) Reassessing Orco expression using a higher-resolution temporal sampling ({greater than or equal to}6 timepoints per 24 h).

      We will add those experiments to the revised version of the manuscript (see our response to (2)).

      c) Clarifying or revising the PTFL model to explicitly address how feedback would be achieved. Alternatively, the data may be more consistent with Orco conductance rhythms being regulated by post-translational mechanisms downstream of the canonical TTFL oscillator, as suggested by the Drosophila olfactory system literature.

      We will revise the manuscript accordingly.

      Minor weaknesses:

      (1) The authors should compare the firing patterns of ORN neurons to the bursts, clusters, and packets of retinal efferent spikes reported in Liu JS and Passaglia CL (2011; JBR). By comparing measures in moths to measures in Limulus, the authors might be able to address the question: Is the daily firing pattern of ORN neurons likely a conserved feature of circadian control of sensory sensitivity?

      We will revise the discussion accordingly.

      (2) The methods need further details. For example, it is unclear if or how single neuron activity was discriminated and whether the results were compromised by the relatively large environmental fluctuations in temperature (21-27oC), humidity (35-60%), or other cues known to modulate spontaneous firing.

      We will clarify the Methods section.

      References

      Chen S, Luetje CW. 2012. Identification of New Agonists and Antagonists of the Insect Odorant Receptor Co-Receptor Subunit. PLOS ONE 7:e36784. doi:10.1371/journal.pone.0036784

      Dolzer J, Fischer K, Stengl M. 2003. Adaptation in pheromone-sensitive trichoid sensilla of the hawkmoth Manduca sexta. J Exp Biol 206:1575–1588. doi:10.1242/jeb.00302

      Gawalek P, Stengl M. 2018. The Diacylglycerol Analogs OAG and DOG Differentially Affect Primary Events of Pheromone Transduction in the Hawkmoth Manduca sexta in a Zeitgebertime-Dependent Manner Apparently Targeting TRP Channels. Front Cell Neurosci 12:218. doi:10.3389/fncel.2018.00218

      Getahun MN, Olsson SB, Lavista-Llanos S, Hansson BS, Wicher D. 2013. Insect Odorant Response Sensitivity Is Tuned by Metabotropically Autoregulated Olfactory Receptors. PLOS ONE 8:e58889. doi:10.1371/journal.pone.0058889

      Ghosh S, Suray C, Bozzolan F, Palazzo A, Monsempès C, Lecouvreur F, Chatterjee A. 2024. Pheromone-mediated command from the female to male clock induces and synchronizes circadian rhythms of the moth Spodoptera littoralis. Curr Biol 34:1414-1425.e5. doi:10.1016/j.cub.2024.02.042

      Jain K, Prelic S, Hansson BS, Wicher D. 2024. Expression of Drosophila melanogaster V-ATPases in Olfactory Sensillum Support Cells. Insects 15:1016. doi:10.3390/insects15121016

      Jones PL, Pask GM, Rinker DC, Zwiebel LJ. 2011. Functional agonism of insect odorant receptor ion channels. Proc Natl Acad Sci 108:8821–8825. doi:10.1073/pnas.1102425108

      Klein U. 1992. The insect V-ATPase, a plasma membrane proton pump energizing secondary active transport: immunological evidence for the occurrence of a V-ATPase in insect ion-transporting epithelia. J Exp Biol 172:345–354. doi:10.1242/jeb.172.1.345

      Krishnan B, Dryer SE, Hardin PE. 1999. Circadian rhythms in olfactory responses of Drosophila melanogaster. Nature 400:375–378. doi:10.1038/22566

      Merlin C, Lucas P, Rochat D, François M-C, Maïbèche-Coisne M, Jacquin-Joly E. 2007. An Antennal Circadian Clock and Circadian Rhythms in Peripheral Pheromone Reception in the Moth Spodoptera littoralis. J Biol Rhythms 22:502–514. doi:10.1177/0748730407307737

      Nolte A, Funk NW, Mukunda L, Gawalek P, Werckenthin A, Hansson BS, Wicher D, Stengl M. 2013. In situ Tip-Recordings Found No Evidence for an Orco-Based Ionotropic Mechanism of Pheromone-Transduction in Manduca sexta. PLOS ONE 8:e62648. doi:10.1371/journal.pone.0062648

      Nolte A, Gawalek P, Koerte S, Wei H, Schumann R, Werckenthin A, Krieger J, Stengl M. 2016. No Evidence for Ionotropic Pheromone Transduction in the Hawkmoth Manduca sexta. PLOS ONE 11:e0166060. doi:10.1371/journal.pone.0166060

      Rymer J, Bauernfeind AL, Brown S, Page TL. 2007. Circadian rhythms in the mating behavior of the cockroach, Leucophaea maderae. J Biol Rhythms 22:43–57. doi:10.1177/0748730406295462

      Schendzielorz J, Schendzielorz T, Arendt A, Stengl M. 2014. Bimodal Oscillations of Cyclic Nucleotide Concentrations in the Circadian System of the Madeira Cockroach Rhyparobia maderae. J Biol Rhythms 29:318–331. doi:10.1177/0748730414546133

      Schendzielorz T, Peters W, Boekhoff I, Stengl M. 2012. Time of Day Changes in Cyclic Nucleotides Are Modified via Octopamine and Pheromone in Antennae of the Madeira Cockroach. J Biol Rhythms 27:388–397. doi:10.1177/0748730412456265

      Schendzielorz T, Schirmer K, Stolte P, Stengl M. 2015. Octopamine Regulates Antennal Sensory Neurons via Daytime-Dependent Changes in cAMP and IP3 Levels in the Hawkmoth Manduca sexta. PLOS ONE 10:e0121230. doi:10.1371/journal.pone.0121230

      Schneider AC, Schröder K, Chang Y, Nolte A, Gawalek P, Stengl M. 2025. Hawkmoth Pheromone Transduction Involves G-Protein–Dependent Phospholipase Cβ Signaling. eNeuro 12:ENEURO.0376-24.2024. doi:10.1523/ENEURO.0376-24.2024

      Stengl M. 2010. Pheromone Transduction in Moths. Front Cell Neurosci 4:133. doi:10.3389/fncel.2010.00133

      Stengl M. 1994. Inositol-trisphosphate-dependent calcium currents precede cation currents in insect olfactory receptor neurons in vitro. J Comp Physiol A 174:187–194. doi:10.1007/BF00193785

      Stengl M, Funk NW. 2013. The role of the coreceptor Orco in insect olfactory transduction. J Comp Physiol A 199:897–909. doi:10.1007/s00359-013-0837-3

      Stengl M, Hildebrand JG. 1990. Insect olfactory neurons in vitro: morphological and immunocytochemical characterization of male-specific antennal receptor cells from developing antennae of male Manduca sexta. J Neurosci 10:837–847. doi:10.1523/JNEUROSCI.10-03-00837.1990

      Stengl M, Schneider AC. 2024. Contribution of membrane-associated oscillators to biological timing at different timescales. Front Physiol 14:1243455. doi:10.3389/fphys.2023.1243455

      Tanoue S, Krishnan P, Krishnan B, Dryer SE, Hardin PE. 2004. Circadian Clocks in Antennal Neurons Are Necessary and Sufficient for Olfaction Rhythms in Drosophila. Curr Biol 14:638–649. doi:10.1016/j.cub.2004.04.009

      Thurm U, Küppers J. 1980. Epithelial physiology of insect sensilla In: Locke M, Smith DS, editors. Insect Biology in the Future. Academic Press. pp. 735–763. doi:10.1016/B978-0-12-454340-9.50039-2

      Wicher D, Miazzi F. 2021. Functional properties of insect olfactory receptors: ionotropic receptors and odorant receptors. Cell Tissue Res 383:7–19. doi:10.1007/s00441-020-03363-x

    1. Synthèse de l'Audition sur l'État de la Santé Mentale en France

      Résuméf

      L'audition à la commission d'enquête de l'Assemblée nationale a mis en lumière un consensus alarmant sur l'état désastreux de la psychiatrie en France, particulièrement en pédopsychiatrie, qualifiée de "désastre absolu".

      Les experts, un pédopsychiatre-épidémiologiste et un psychanalyste-chercheur, s'accordent sur plusieurs points critiques : une dégradation continue du bien-être psychique de la population depuis 20 ans, une inversion préoccupante de la tendance à la baisse des suicides chez les jeunes depuis 2017, et une offre de soins totalement inadaptée face à une demande croissante, créant un "effet ciseau" dévastateur.

      Les défaillances du système sont jugées systémiques et profondes, impliquant une responsabilité partagée entre les psychiatres (pour leurs certitudes passées), les administrations ("il nous flingue"), les directeurs d'hôpitaux (gestion par "tableau Excel") et des politiques publiques inadaptées.

      La critique vise particulièrement le "New Public Management", qui applique des logiques de rentabilité au soin, et l'hégémonie d'une approche "scientiste" de l'Evidence-Based Medicine, jugée inefficace et inconsistante dans le champ de la santé mentale de l'enfant.

      Des dispositifs comme "Mon Soutien Psy" sont cités comme des exemples de "leviers qu'il ne fallait pas activer".

      Face à ce constat, les experts appellent à une réorientation radicale. Les leviers d'action prioritaires incluent la prévention, notamment la lutte contre la maltraitance infantile qui explique 50% des troubles futurs, et le renforcement des dispositifs institutionnels existants (CMP, CMPP, hôpitaux) plutôt que la création de nouvelles structures complexes.

      La formation de thérapeutes qualifiés et la revalorisation des pratiques cliniques pluridisciplinaires sont essentielles.

      Enfin, bien que le coût de l'inaction se chiffre en milliards d'euros, les experts soulignent que la solution n'est pas uniquement budgétaire mais réside dans une meilleure organisation des ressources existantes et dans des choix politiques courageux qui replacent la relation humaine et la complexité clinique au cœur du système de soin.

      1. État des Lieux : Un Constat Alarmant

      Les deux intervenants dressent un tableau extrêmement sombre de la situation psychiatrique en France, soulignant la nécessité de distinguer le "vécu anxio-dépressif" des pathologies cliniques et de se méfier des chiffres de prévalence bruts qui, pour des troubles dimensionnels comme la dépression, "ne veulent rien dire".

      1.1. Tendances Épidémiologiques Inquiétantes

      Le professeur Falissard, épidémiologiste, identifie une "quadruple interaction entre le temps, l'âge, le genre et la pathologie".

      Suicide et tentatives de suicide : Si la tendance globale sur 30 ans est à l'amélioration, une rupture nette est observée depuis 2017 chez les jeunes.

      • ◦ L'amélioration s'est stoppée pour les suicides chez les jeunes.  
      • ◦ Les suicides augmentent légèrement chez les jeunes filles.  
      • ◦ Les tentatives de suicide montent de "façon spectaculaire" chez les jeunes filles depuis 2017.

      Vécu anxio-dépressif : Le ressenti subclinique mesuré par Santé publique France se dégrade "de façon homogène" depuis 20 ans, touchant toutes les tranches d'âge et tous les genres.

      Pédopsychiatrie : La situation est décrite comme un "désastre absolu". Certains départements n'ont plus de pédopsychiatres, rendant la permanence des soins impossible.

      1.2. L'Effet Ciseau : Une Offre de Soins Débordée

      M. Tonou met en évidence l'écart croissant entre l'augmentation de la demande et le déficit de l'offre de soins, créant un "effet ciseau" aux conséquences désastreuses.

      Délais d'attente : Les délais pour une consultation spécialisée sont "insupportables", allant de 6 à 18 mois.

      À l'échelle d'un enfant de 3 à 6 ans, cela équivaut à un délai de 5 à 10 ans pour un adulte.

      Conséquences cliniques :

      ◦ Augmentation des hospitalisations en urgence.    ◦ Augmentation des passages à l'acte suicidaire.     ◦ Consommation accrue de psychotropes en pédiatrie, souvent hors autorisation de mise sur le marché (AMM).     ◦ Substitution des psychothérapies recommandées par le seul médicament, faute de moyens.

      Selon les estimations, 13% de la population française serait concernée par un trouble mental, soit 1,5 million d'enfants et 9 millions de personnes au total.

      2. Défaillances Systémiques et Critiques de la Gouvernance

      Les experts s'accordent sur le fait que la crise actuelle est le résultat d'une série de défaillances à tous les niveaux du système. La responsabilité est "partagée" et les erreurs stratégiques des pouvoirs publics sont pointées du doigt.

      2.1. Une Responsabilité Partagée

      Le professeur Falissard insiste : "tout le monde est responsable du fait qu'aujourd'hui c'est une catastrophe".

      Acteur

      Critique

      Les psychiatres

      Ont eu le tort au 20e siècle de croire détenir une vérité unique (la psychanalyse), ce qui a nui aux patients, notamment dans l'autisme. Ce n'est plus le cas pour 90% des praticiens aujourd'hui.

      L'Administration et la Tutelle

      "Il ne nous aide pas, il nous flingue". Des réglementations absurdes (ex: conventions pour les orthophonistes en CMP) bloquent concrètement la prise en charge des enfants.

      Le Délégué Interministériel (TND)

      Est qualifié d'"antipsychiatre", accusé d'empêcher la construction de soins avec une "lubie scientiste" et une recherche de solutions simplistes à des problèmes complexes.

      Les Directeurs d'Hôpitaux

      Privilégient une gestion comptable ("Le tableau Excel il est bien rempli") au détriment de la qualité des soins, menant à des catastrophes (ex: aide-soignant d'orthopédie remplaçant une infirmière psy, conduisant à des abus sexuels).

      Les Parents

      Sont souvent "partie du problème", plaçant les soignants dans une situation complexe entre le devoir de protection de l'enfant et l'impératif d'inclure les parents dans le soin.

      2.2. Erreurs Stratégiques de Gouvernance

      M. Tonou identifie deux phénomènes majeurs qui ont sapé les fondements du soin psychique :

      1. Le New Public Management : La gestion des structures publiques comme des entreprises, substituant "la rentabilité et le profit au principe fondateur des missions de services publics".

      2. L'Evidence-Based Medicine (EBM) : Son déploiement pour évaluer la psychiatrie s'est révélé "tout à fait inconsistant dans le domaine de la santé mentale de l'enfant", avec "aucune avancée en terme de diagnostic, aucune avancée en terme de traitement".

      Plusieurs initiatives sont citées comme des "archétypes de ce qu'il ne faudrait pas faire" :

      • Le dispositif Mon Soutien Psy.

      • La stratégie TND 2023-2027.

      • Le cas des centres experts et de la Fondation FondaMental, critiqués pour leur approche qui dissocie le diagnostic du soin, leur absence de résultats concrets (zéro marqueur biologique trouvé) et les biais scientifiques de leurs études d'efficacité.

      3. Prévention et Pistes d'Amélioration

      Plutôt que de chercher des solutions simplistes, les experts appellent à revenir aux fondamentaux du soin, de la prévention et à mieux organiser les ressources existantes.

      3.1. La Prévention comme Levier Principal

      Lutte contre la maltraitance : La prévention primaire, c'est "éviter les abus sur les enfants".

      Cela "explique 50 % de la variance des problèmes psychiatriques des adolescents et des adultes après".

      L'Aide Sociale à l'Enfance (ASE), qui devrait être le bras armé de cette prévention, est elle-même "un désastre absolu".

      Facteurs de risque sociaux : La lutte contre la pauvreté, la précarité et l'exclusion est une politique de prévention en santé mentale.

      "Lorsque je propose une aide sociale à une famille en difficulté [...] je préviens aussi un risque de santé mentale".

      Milieu scolaire : Réduire le nombre d'élèves par classe (le seuil de 17 élèves est cité comme optimal pour l'apprentissage de la lecture) est un levier puissant pour la santé mentale des enfants.

      3.2. Renforcer l'Existant et Former les Acteurs

      La priorité n'est pas de créer de nouveaux dispositifs, mais de soutenir les institutions et les pratiques qui ont fait leurs preuves.

      Soutenir les institutions : "S'il y avait qu'une chose à retenir, c'est celle-là".

      Il faut "rouvrir des places, des lits à l'hôpital, dans les services spécialisés, les CMP, les CMPP". Il faut cesser de "déshabiller le CMP" pour "abonder les centres experts".

      Repenser la formation :

      ◦ La question centrale n'est pas "quelle est la bonne thérapie ?" mais "qui est un bon thérapeute ?".

      L'effet du clinicien est largement supérieur à l'effet spécifique de la thérapie.    *   ◦ Il n'existe pas de formation universitaire pour les psychothérapies en France.    *  ◦ Il faut recréer des formations intermédiaires pour les infirmiers, sur le modèle d'une spécialisation locale d'un an ("infirmiers plus un"), car le modèle des Infirmières en Pratique Avancée (IPA) est trop lourd et coûteux.

      Valoriser la diversité des approches : La spécificité française réside dans une grande diversité de pratiques (psychanalyse, thérapies familiales, psychothérapie institutionnelle) qui "font leur preuve dans la clinique". Toute tentative de réduire cette diversité "va se payer par du moins de soins".

      4. Enjeux Économiques et Budgétaires

      Le débat sur les moyens financiers révèle une tension entre la nécessité de prouver l'efficacité économique des investissements et la conviction que le soin aux plus vulnérables relève d'un "pacte démocratique" non quantifiable.

      4.1. Le Coût de l'Inaction

      M. Tonou propose une méthode pour chiffrer le "coût de l'inaction", qui se calculerait en milliards d'euros.

      Exemple de calcul : Pour un coût moyen estimé de 10 000 € par an et par patient, une meilleure prise en charge permettant de réduire ces coûts de 20% générerait une économie de 3,9 milliards d'euros en population pédiatrique et de 18 milliards d'euros en population générale.

      Finalité : Démontrer que "le coût de l'inaction est beaucoup plus élevé que des politiques volontaires et cohérentes".

      4.2. Une Question d'Organisation plus que de Moyens ?

      Le professeur Falissard adopte une posture provocatrice : "Ça n'est pas une question de moyen".

      Gains d'efficience : "Nous pourrions faire beaucoup mieux avec la même quantité d'argent". Il pointe l'argent alloué à des dispositifs coûteux et peu efficaces (ex: hospitalisations de semaine pour bilans TDAH) alors que les urgences ne sont pas financées.

      Inefficacité des études médico-économiques : Il affirme que les études prouvant le sous-financement de la psychiatrie "n'ont servi à rien".

      Injustice de l'allocation des ressources : Il dénonce une inégalité fondamentale : "On n'a pas le même argent selon les maladies qu'on a en France".

      Des traitements à 2 millions d'euros sont remboursés pour certaines maladies rares, tandis qu'il n'y a "pas 3000 € pour une tentative de suicide chez une adolescente".

      La rationalité économique ne s'applique pas à la psychiatrie en France.

      La conclusion partagée est que la priorité absolue est de mieux organiser le système, en se basant sur les acteurs de terrain ("pas des gens en costard-cravate") pour redéfinir les priorités et optimiser les dépenses.

    1. Reviewer #1 (Public review):

      Summary:

      This work asks the question of how different organelles and structures in the apicomplexan parasite Toxoplasma gondii are recycled and/or segregated to the daughter cells during cell replication. In particular, they consider an unusual cell structure called the residual body that links replicating cells during the intracellular infection stage of this parasite. The residual body has historically been considered a 'dumping ground' for unnecessary relics of the mother cell during division, but this notion is increasingly being revised. Indeed, cell replication in Toxoplasma is often misinterpreted as cell division (cytokinesis), but in fact, the cell replicates its organelles and structures to multiple 10s of copies in seemingly distinctly formed daughter cells, but cytokinesis is delayed for many such cycles and typically only occurs simultaneously with parasite egress from its host cell. The residual body is, in fact, the connection between these pre-cytokinetic replicated daughters, and effectively, this is still a single cell at this stage. The authors have previously shown that an actin network extends through the residual body between these daughter cells, and ER and mitochondria common to all cells are also linked through this structure. This study examining the fates of organelles during cell replication is timely for continuing our understanding of how this fascinating component of the cell participates in these processes. The authors use Halo-tags as their principal tool to track discrete populations of proteins, labelling their organelle locations, and this provides beautiful insight into these processes.

      Strengths:

      Using dyes conjugated to Halo tags, this work elegantly tracks the fates of proteins synthesised by an original 'mother' cell over several replication cycles of pre-cytokinetic 'daughters'. Using this tool, they show that some organelles are made intact just once and that some of these can be subsequently sorted to the daughters (micronemes and rhoptries) while others are dismantled (IMC) and the daughters must make their own. A third set of organelles (largely synthesis, sorting, and metabolic compartments) is divided and inherited, and new daughter-synthesised proteins are added to the preexisting maternal proteins in these structures. A role for actin and myosin is clearly demonstrated for micronemes and rhoptries, and this correlates with their relatively late inheritance into the developing daughters. Overall, this work gives clarity to the behaviours of several cell structures during replication and paves the way to a better understanding of the mechanisms that drive the differences between structures and the universality of these processes in other apicomplexan parasites.

      Weaknesses:

      In addressing the question of residual body participation in sorting of organelles, it would be useful to clearly define this structure and when and where it is delineated from the posterior of a mother cell during the formation of daughter structures. This might seem like a moot point, but it would give clarity to notions of recycling and 'reservoirs'. Mother cells retain their active invasion apparatus until very late in daughter formation, and the need for micronemes and rhoptries to be released from this service late in the process might explain why they are only then trafficked to the cell posterior and then into the daughters. So, is this a distinct 'residual body' body function/reservoir or just a spatial constraint of this sequence of daughter formation? In subsequent cell replications (4, 8, 16... stages), is there a separation between the residual body that links them all and the posterior of each new 'mother cell', and if so, when is this distinction lost? This is important because without a definition, we might be confusing different processes. Are rhoptries/micronemes that originate in one 'mother' able to be sorted to the 'daughters' from a distinct mother in this syncytium? If so, this would make it a sorting centre, but otherwise we could be just capturing the activities at the posterior of any given cell during replication. The authors' further thoughts on this would be very interesting.

      The Group 2 structures are described as those that are divided between daughters and receive newly synthesised proteins that add to the maternal protein of these compartments. While this is a logical conclusion for several that are mentioned, where the maternal protein signal is seen to be depleted with replication (including for the apicoplast, ER, glideosome, and Golgi). Data for the addition of new proteins to these existing structures is actually only presented in direct support of this for the Golgi.

    2. Reviewer #2 (Public review):

      Summary:

      Toxoplasma gondii is an obligate intracellular parasite and the causative agent of Toxoplasmosis. Parasite invasion into host cells, intracellular replication, and then egress, which results in the destruction of the infected cell, is central to pathogenicity. This manuscript focuses on understanding how maternal resources (in this case, cellular organelles) are shared between daughter parasites during cell division. Many organelles are single copy, meaning that division and inheritance by the daughters is crucial for successful replication. The major strength of this study was the use of a Halo-based pulse chase assay to characterize patterns of organelle inheritance. The results show that both microneme and rhoptries (secretory vesicles) previously thought to be synthesized de novo are inherited by daughter parasites. Thus, this paper adds new insight to our understanding of cell division in this important parasite.

      Strengths:

      This study demonstrated that pulse labeling of proteins can be used to monitor protein synthesis, turnover, and movement. This approach will be of great interest to the field. Using this method, the authors demonstrate three main modes of organelle inheritance.

      (1) Organelles, where there are multiple copies (such as secretory vesicles, micronemes, and rhoptries), are divided between the daughter parasites, with additional contribution of newly formed vesicles. New and old material remain as separate entities in the cell.

      (2) Single-copy organelles, which are expanded to include newly synthesized material prior to division, such as the Golgi and apicoplast.

      (3) Cytoskeletal structures that are synthesized anew during each round of division. These studies provide more refined insight into patterns or organelle inheritance and demonstrate that secretory organelles are not made de novo during each round of division as previously thought. The paper has a logical flow, and overall, the data is presented in a clear and organized fashion.

      Weaknesses:

      (1) Descriptions of methodology and statistical analysis were incomplete.

      (2) There are inconsistencies between the data in Figures 1 and 5. In Figure 1, a small amount of maternal IMC is visible in stage 2 parasites. Although this is a ~90% reduction, these parasites should be quantified as parasites with material IMC. However, the graph in Figure 5C indicates that no material parasites have GAPM1a, given that graph 5C is a binary measure (present vs. absent), one would expect a non-zero percent of parasites to have maternal material.

      (3) The conclusion from Figure 6 was not justified based on the data. I agree with the author's conclusion that the accumulation of micronemes and rhoptries in the residual body was time-dependent. In Figure 6A, the signal observed in the residual body at times 6:30, 13, and 14 hours is not observed in subsequent time points. However, the fate of these micronemes and rhoptries is unclear. It cannot be concluded that these vesicles are recycled back to the mother. They could also have been degraded. In fact, the graphs of microneme inheritance in Figure 2B show a decrease in maternal signal from 100% to 80% between stages 1 and 2, indicating that some microneme degradation is taking place.

      (4) To convincingly demonstrate that the redistribution of micronemes and rhoptries was due to recovery of MyoF protein levels after auxin washout, a Western blot should be performed to show MyoF protein levels over time. In addition, the decrease in mMIC2 protein levels in the residual body in Figure 8F should be measured and normalized for photobleaching. Both apical and basal signals appear to be reduced over the time course of imaging.

    1. Analyse de la Politique de Santé Mentale et de Psychiatrie en France

      Synthèse

      Ce document de synthèse analyse l'état actuel et les perspectives d'évolution des politiques publiques de santé mentale et de psychiatrie en France, en se basant sur les auditions de la délégation ministérielle dédiée.

      Il en ressort un constat central : après plus de trente ans de négligence, où la santé mentale a été le "parent pauvre des politiques publiques", un tournant majeur a été amorcé en 2018 avec la "Feuille de route santé mentale et psychiatrie".

      Cette initiative marque une rupture, symbolisant un engagement politique et financier inédit pour rattraper des décennies de défaillances structurelles.

      La crise Covid-19 a agi comme un révélateur, exacerbant les vulnérabilités préexistantes du système de soins (pénurie de soignants, hospitalo-centrisme, manque de prévention) et de la population (dégradation de la santé mentale des jeunes, des femmes et des précaires).

      Le système actuel est décrit comme largement "illisible", constitué d'une "multitude de particularismes" et freiné par un modèle de financement "anesthésiant" qui décourage l'innovation.

      Face à ces défis, une stratégie de transformation profonde est proposée, articulée autour de plusieurs axes fondamentaux :

      1. Une gouvernance refondée et interministérielle : Inspirée du modèle du handicap, elle vise à coordonner l'ensemble des politiques publiques ayant un impact sur la santé mentale, en impliquant tous les ministères concernés.

      2. Une programmation pluriannuelle : Pour sortir de la logique budgétaire annuelle (PLFSS) et donner de la visibilité financière et stratégique aux réformes sur le moyen et long terme.

      3. Une organisation graduée des soins : Renforcer les premières lignes (médecine générale, psychologues, Maisons des adolescents) pour mieux orienter les patients et réserver les services de psychiatrie hautement spécialisés aux cas les plus complexes.

      4. Des réformes structurelles du financement et des autorisations : Introduire des mécanismes incitatifs pour encourager l'innovation, les soins ambulatoires et la coopération entre tous les acteurs d'un territoire (public, privé, associatif).

      L'enjeu humain reste crucial, avec une crise d'attractivité des métiers liée non seulement à la pénurie mais aussi à une "blessure morale" des soignants due à la perte de sens.

      La dynamique actuelle, portée par une mobilisation politique et sociétale sans précédent, représente un "momentum" unique pour amplifier et accélérer ces réformes.

      Analyse Détaillée

      Un Héritage de Négligence Structurelle

      Pendant plus de 30 ans, la santé mentale a été marginalisée dans les politiques publiques, une situation illustrée par plusieurs facteurs structurels :

      Le "parent pauvre" des politiques de santé : Ce statut s'est traduit par une considération "à part" des patients, des familles et des professionnels, tant dans le système de santé que dans la société.

      Une sous-valorisation financière : L'Objectif national de dépenses d'assurance maladie (Ondam) pour la psychiatrie a systématiquement progressé moins vite que l'Ondam pour la médecine, la chirurgie et l'obstétrique (MCO).

      Un pilotage financier "anesthésiant" : Le modèle de la dotation annuelle de fonctionnement (DAF) a largement contribué à freiner l'innovation et l'adaptation de l'offre de soins face à des besoins populationnels croissants.

      Une offre de soins illisible : L'absence de pilotage stratégique fort a conduit à une "multitude de particularismes" territoriaux, rendant le système complexe et difficile à naviguer pour les usagers, les familles et même les professionnels.

      Des familles en sont réduites à déménager pour accéder à un secteur de psychiatrie jugé plus performant.

      Une faible culture des données probantes : Malgré l'existence de nombreuses études validées sur les prises en charge efficaces, le secteur a peu intégré ces "données probantes" dans ses pratiques.

      La Feuille de Route de 2018 : Une Rupture et un Nouvel Élan

      La "Feuille de route santé mentale et psychiatrie", lancée en 2018, est présentée comme une "rupture par rapport à une aboulie de plus de 30 ans".

      Elle constitue le point de départ d'une politique de rattrapage, marquée par un engagement politique et financier inédit.

      Qualité et alignement international : La feuille de route est jugée de qualité et conforme aux standards internationaux.

      Enrichissements successifs : Elle a été constamment renforcée par des jalons importants comme le Ségur de la santé, les Assises de la santé mentale et de la psychiatrie, les Assises de la pédiatrie, et les annonces dans le cadre de la grande cause nationale.

      Continuité politique : Malgré une forte instabilité ministérielle (le délégué ministériel a "survécu à 10 ministres de la santé"), la feuille de route a été systématiquement reconduite et enrichie, assurant une forme de continuité dans l'action publique.

      Premiers effets visibles : Les réformes structurelles engagées commencent à porter leurs fruits, bien que leur plein impact ne soit attendu qu'à moyen terme.

      Les Vulnérabilités Révélées par la Crise Sanitaire

      La crise du Covid-19 a amplifié des fragilités structurelles anciennes, sans pour autant en être la cause première.

      Vulnérabilités du système de soins :

      ◦ Manque de soignants et départs de l'hôpital public.    ◦ Débits de formation insuffisants.    ◦ Un "hospitalo-centrisme" persistant.    ◦ Pauvreté de la santé primaire et des politiques de prévention.    ◦ Fortes hétérogénéités territoriales.

      Vulnérabilités de la population :

      ◦ Une dégradation de la santé mentale des jeunes, des femmes et des personnes en situation de précarité, un phénomène observé à l'échelle européenne et mondiale.

      Vers un Changement de Paradigme : Stratégie et Réformes Proposées

      Pour répondre à ces défis systémiques, un changement de paradigme est préconisé, s'appuyant sur des réformes profondes de la gouvernance, de la planification et de l'organisation des soins.

      Refonder la Gouvernance sur un Modèle Interministériel

      L'action sur les déterminants de la santé mentale (logement, emploi, lutte contre les violences, addictions) ne relève pas du seul ministère de la Santé. Une gouvernance interministérielle est donc jugée indispensable.

      Structure Proposée

      Fréquence

      Objectif

      Comité Interministériel

      Annuelle

      Définir, coordonner et évaluer les politiques publiques en faveur de la santé mentale, sur le modèle du Comité Interministériel du Handicap (CIH).

      Conférence Nationale

      Triennale

      Débattre des orientations et des moyens des politiques de santé mentale, en réunissant l'ensemble des parties prenantes.

      Comité des Parties Prenantes

      À définir

      Formaliser un organe consultatif pour assurer la participation active des usagers, familles, professionnels et autres acteurs.

      Délégation Ministérielle

      Renforcée

      Renforcer ses moyens (objectif de 8 agents) pour lui confier la coordination et le pilotage de la feuille de route devenue interministérielle.

      Instaurer une Programmation Pluriannuelle

      Le cycle budgétaire annuel du Projet de Loi de Financement de la Sécurité Sociale (PLFSS) est un obstacle à la mise en œuvre de réformes structurelles.

      La nécessité d'une vision à long terme est soulignée, avec un plaidoyer pour une loi de programmation qui garantirait une visibilité financière et stratégique sur plusieurs années.

      Cette approche permettrait d'investir dans des actions de prévention dont les bénéfices, notamment en termes de "coût évité", ne sont mesurables que sur la durée.

      Organiser l'Offre de Soins : Gradation et Coordination

      Le système actuel est marqué par un héritage où "tout allait à la psychiatrie". La stratégie proposée vise à structurer une offre de soins graduée :

      1. Renforcer les premières lignes : La médecine générale, les dispositifs comme "MonSoutienPsy" et les Maisons des adolescents doivent jouer un rôle de filtre pour les troubles légers à modérés et "refroidir" un certain nombre de situations.

      2. Réserver la psychiatrie spécialisée : Le secteur de psychiatrie, avec ses équipes hautement spécialisées (psychiatres, psychologues, IPA, psychomotriciens), doit se concentrer sur les cas les plus complexes et graves.

      3. Coordonner tous les acteurs : La réforme des autorisations vise à obliger les différents offreurs de soins d'un territoire (public sectorisé, public non sectorisé, privé) à sortir de leurs "couloirs de nage" pour s'articuler fonctionnellement et se répartir la charge des besoins.

      4. Inciter à l'innovation : La réforme du mode de financement introduit des compartiments financiers incitatifs pour encourager les pratiques orientées vers le rétablissement, l'ambulatoire et les alternatives à l'hospitalisation, notamment sans consentement.

      La Nécessité d'un "Grand Texte" Clarificateur

      Un texte de loi est jugé crucial pour clarifier la politique de santé mentale, remettre de l'ordre dans les "particularismes" et définir des standards de prise en charge basés sur les données probantes.

      Enjeux Cruciaux : Ressources Humaines et Évaluation

      L'Attractivité des Métiers : Au-delà de la Pénurie

      La crise de l'attractivité en psychiatrie est multifactorielle :

      Pénurie et burnout : La pénurie de personnel génère une surcharge de travail pour les équipes en place, créant un cercle vicieux.

      "Blessure morale" : Plus profondément, les soignants expriment une perte de sens. Ils sont confrontés à des situations qui violent leur éthique professionnelle (ex: maintenir un patient attaché pendant plusieurs jours sur un brancard faute de lit), ce qui génère une "blessure morale".

      Redonner du sens : Les dispositifs innovants (ex: équipes mobiles de crise financées par le Fonds d'Innovation Organisationnelle en Psychiatrie - FIOP) rencontrent moins de difficultés de recrutement car ils s'inscrivent dans un projet clair et porteur de sens.

      Formation : La réforme du DES de psychiatrie, en instaurant un passage obligatoire en pédopsychiatrie plus tôt dans le cursus, vise à améliorer l'attractivité de cette spécialité. Pour les infirmières, un renforcement des modules de santé mentale dans la formation initiale et un meilleur accompagnement à la prise de poste ("onboarding") sont des pistes explorées.

      Développer une Culture de l'Évaluation

      La France est jugée en retard sur l'évaluation de ses politiques publiques.

      Le coût de l'inaction : Des études, notamment anglo-saxonnes, démontrent les coûts socio-économiques phénoménaux de la non-prise en charge des troubles mentaux (estimés à 163 milliards d'euros par an en France par la Fondation FondaMental).

      L'argument du "coût évité" : Investir dans la prévention est économiquement vertueux. Par exemple, il est démontré qu'1€ investi dans des soins psychologiques de première ligne permet d'économiser entre 1,4€ et 1,6€. Cet argument peine cependant à être pris en compte dans les arbitrages budgétaires annuels.

      Évaluation des politiques publiques : Un ensemble d'indicateurs a été mis au point pour suivre les effets de la feuille de route, ce qui constitue une exception. Il reste à développer cette culture au niveau local pour évaluer l'efficacité des dispositifs innovants financés par appels à projets.

    1. Synthèse de l'audition de Santé publique France sur la santé mentale et le handicap

      Résumé

      L'audition de Santé publique France devant la commission d'enquête met en lumière une dégradation persistante de la santé mentale de la population française depuis la pandémie de Covid-19, touchant particulièrement les jeunes de 18 à 24 ans et les femmes.

      Les données de surveillance révèlent une prévalence élevée des troubles dépressifs et anxieux, avec un décalage majeur entre les besoins et le recours effectif aux soins.

      Près de la moitié des adultes ayant connu un épisode dépressif caractérisé n'ont eu aucun recours thérapeutique.

      Les principaux freins identifiés sont le coût des consultations, la difficulté à se confier et le manque d'information.

      Concernant les personnes en situation de handicap, l'agence souligne une lacune importante dans les données de surveillance, rendant difficile la caractérisation fiable de leur état de santé et de leur prise en charge.

      L'accès aux données des Maisons Départementales des Personnes Handicapées (MDPH) est identifié comme un levier majeur d'amélioration.

      Face à ces constats, Santé publique France insiste sur l'importance cruciale de la prévention.

      Les stratégies préconisées incluent le renforcement des compétences psychosociales dès l'enfance, la lutte contre la stigmatisation via des campagnes d'information et la promotion de la santé mentale positive. Des dispositifs comme le programme "Vigilance", qui a démontré un retour sur investissement positif (€1 investi pour €2 économisés en coûts de santé), sont mis en avant comme des modèles à généraliser pour une approche économiquement vertueuse de la santé publique.

      1. Rôle et Méthodes de Surveillance de Santé publique France

      Santé publique France, agence de santé publique créée en 2016, fonde ses missions sur un triple objectif :

      1. Anticiper et répondre aux crises sanitaires, notamment par la gestion des stocks stratégiques de produits de santé et la mobilisation de la réserve sanitaire.

      2. Surveiller l'état de santé de la population sur l'ensemble du territoire, y compris ultramarin, en couvrant les maladies infectieuses, chroniques et les expositions environnementales.

      3. Développer la prévention et promouvoir la santé.

      Pour la surveillance de la santé mentale, l'agence s'appuie sur plusieurs sources de données complémentaires :

      Les enquêtes en population générale :

      • ◦ Réalisées sur des échantillons aléatoires, elles utilisent des questionnaires et des échelles de santé mentale pour évaluer l'état de la population sans poser de diagnostic individuel.  

      • ◦ Exemples notables : le Baromètre de Santé publique France (adultes), l'enquête ENABI (enfants de 3 à 11 ans, réalisée en milieu scolaire en 2022), et l'enquête EnCLASS (collégiens).   

      • ◦ Pendant la crise sanitaire, l'enquête Coviprêve a permis un suivi plus rapide, bien que moins détaillé.

      Les bases de données médico-administratives :

      ◦ Le Système National des Données de Santé (SNDS) est une source majeure d'informations.  

      ◦ Les données des services d'urgence (motifs de passage) et de SOS Médecins permettent un suivi en temps réel de certains indicateurs comme les troubles anxieux ou les tentatives de suicide.

      L'ensemble de ces données permet d'obtenir une "photographie en vie réelle" de la santé mentale des Français, contribuant à l'élaboration de stratégies de prévention et de campagnes d'information.

      2. État des Lieux de la Santé Mentale en France

      2.1. Une Dégradation Post-Covid Durable

      La surveillance épidémiologique confirme une dégradation nette de la santé mentale de la population française par rapport à la période pré-Covid.

      Populations les plus touchées : Les jeunes adultes de 18 à 24 ans, les jeunes filles et les femmes en général présentent les indicateurs les plus dégradés.

      Persistance : Les différents indicateurs de santé mentale se maintiennent à un niveau élevé, sans retour aux niveaux d'avant la crise sanitaire. Les causes sont multifactorielles (éco-anxiété, système économique, mais aussi une plus grande déclaration due à une libération de la parole).

      2.2. Données Clés sur la Prévalence des Troubles

      Les enquêtes récentes fournissent des chiffres préoccupants :

      Population Cible

      Indicateur

      Donnée Chiffrée

      Source (Année)

      Adultes (18-79 ans)

      Épisode dépressif caractérisé (12 derniers mois)

      1 adulte sur 6

      Baromètre SPF (2024)

      Adultes (18-79 ans)

      Trouble anxieux (12 derniers mois)

      6 % de la population

      Baromètre SPF (2024)

      Enfants (6-11 ans)

      Trouble probable de la santé mentale

      Plus d'1 enfant sur 10

      ENABI (2022)

      Enfants (tous âges)

      Consultation d'un professionnel pour des raisons psychologiques/d'apprentissage

      1 enfant sur 5

      ENABI (2022)

      Collégiens

      Consultation d'un psychiatre au cours de leur vie

      1 tiers des collégiens

      EnCLASS

      2.3. Le Non-Recours aux Soins : Un Enjeu Majeur

      Un décalage important est observé entre les besoins exprimés ou mesurés et le recours effectif à une prise en charge.

      Épisodes dépressifs : Près de la moitié (50 %) des personnes déclarant un épisode dépressif n'ont eu "aucun recours thérapeutique" (ni professionnel, ni traitement).

      Troubles anxieux : Cette proportion est de 1 personne sur 3.

      Profils concernés : Le non-recours aux soins est plus élevé chez les hommes que chez les femmes.

      Situations critiques : Près de 40 % des personnes déclarant une tentative de suicide ne se sont pas présentées à l'hôpital et n'ont pas consulté de professionnel de santé par la suite.

      2.4. Les Freins à la Consultation

      L'enquête Coviprêve a permis d'identifier les principaux obstacles au recours aux soins en santé mentale :

      • 1. Le prix de la consultation (cité par près de la moitié des répondants).

      • 2. La difficulté à se confier ou la peur de ce qu'ils pourraient découvrir sur eux-mêmes.

      • 3. Le manque d'information sur les professionnels et leur rôle.

      • 4. La difficulté à obtenir un rendez-vous.

      • 5. La peur que l'entourage l'apprenne (stigmatisation).

      • 3. La Situation Spécifique des Personnes en Situation de Handicap

      Santé publique France reconnaît un manque de données structurées concernant l'état de santé des personnes en situation de handicap.

      Limites de la surveillance : La surveillance de cette population n'entre pas "strictement" dans les missions de l'agence, bien qu'elle soit incluse dans les enquêtes générales.

      Difficultés de caractérisation : Il est difficile d'identifier et de caractériser de manière fiable ces personnes dans les bases de données. L'Allocation Adulte Handicapé (AAH) est le principal repère, mais elle ne couvre que les adultes en âge de travailler avec des handicaps reconnus comme sévères.

      Besoin crucial de données : L'agence attend avec impatience la remontée des données des Maisons Départementales des Personnes Handicapées (MDPH) dans le SNDS, ce qui constituerait un "saut qualitatif et quantitatif" pour mieux orienter les politiques publiques.

      Vulnérabilités observées : Les données existantes montrent que les bénéficiaires de l'AAH sont "proportionnellement plus concernés par des événements cardiovasculaires graves".

      4. Stratégies de Prévention et Pistes d'Amélioration

      Face à ces constats, Santé publique France place la prévention au cœur de sa stratégie.

      4.1. Axes de Prévention

      Prévention primaire :

      Compétences psychosociales (CPS) : Développer dès le plus jeune âge (école, associations sportives) des capacités à gérer le stress, communiquer, résoudre des problèmes.

      Cette approche, inspirée des modèles anglo-saxons, est de plus en plus acceptée et intégrée, notamment par l'Éducation Nationale.  

      Promotion de la santé mentale positive : Informer sur les comportements protecteurs (activité physique, sommeil, altruisme, pensée positive) au même titre que la santé physique.

      Lutte contre la stigmatisation :

      • ◦ Mener des campagnes d'information pour dédramatiser les troubles psychiques.  

      • ◦ Mettre à disposition des ressources grand public comme le site santémentaleinfoservice.fr.

      Prévention tertiaire (prévention de la récidive) :

      ◦ Le dispositif Vigilance, qui consiste à rappeler les personnes ayant fait une tentative de suicide six mois après leur passage aux urgences, a fait l'objet d'une évaluation médico-économique très positive. Il est en cours de déploiement dans toutes les régions.

      4.2. Pistes d'Amélioration

      Santé publique France identifie plusieurs axes pour améliorer la connaissance et l'action :

      • Mieux caractériser les personnes en situation de handicap dans les bases médico-administratives et médico-sociales.

      • Mieux documenter la santé et le rôle des aidants.

      • Poursuivre et développer les enquêtes en milieu scolaire pour un dépistage précoce.

      • Renforcer l'information sur les signes de souffrance psychique et les parcours de soins gradués.

      5. Enjeux Économiques et Décisionnels

      L'audition a souligné la dimension économique de la santé mentale et l'importance de convaincre les décideurs publics d'investir dans la prévention.

      Coût des troubles psychiques : Estimé à 109 milliards d'euros pour la société française, dont près de la moitié en perte de productivité.

      Retour sur investissement de la prévention :

      • ◦ L'évaluation du dispositif Vigilance montre que 1 € investi permet d'économiser 2 € de coûts de santé, avec un coût moyen évité de 248 € par patient.  
      • ◦ Santé publique France s'engage à évaluer de plus en plus le retour sur investissement de ses actions.

      Nécessité d'un plaidoyer : L'agence travaille au développement d'indicateurs sur le "fardeau de la maladie" (Global Burden of Disease) pour objectiver le poids des troubles sur la société et justifier les investissements en prévention.

      Le manque de données fiables, notamment à un niveau territorial fin, reste un obstacle pour convaincre les acteurs locaux.

    1. Briefing : Prise en Charge de la Santé Mentale et du Handicap en France

      Résumé

      Ce document de synthèse analyse les enjeux majeurs de la prise en charge de la santé mentale en France, en se basant sur les échanges tenus à l'Assemblée nationale.

      Il en ressort un paradoxe central : malgré des efforts budgétaires significatifs et le déploiement de dispositifs structurants, le secteur de la psychiatrie est en proie à une crise profonde, principalement due à une pénurie critique de ressources humaines.

      Les points à retenir sont les suivants :

      1. Crise d'Attractivité Sévère : La psychiatrie souffre d'un déficit majeur d'attractivité, avec plus de 23 % de postes de praticiens hospitaliers vacants dans le secteur public et 30 % des postes d'internes non pourvus.

      Cette pénurie, qualifiée de "cercle vicieux", entrave la capacité du système à répondre à la demande croissante.

      2. Dissonance entre Investissements et Réalité de Terrain : Des financements conséquents ont été alloués via des programmes comme le Fonds d'Innovation Organisationnelle en Psychiatrie (FIOP) et des appels à projets pour la pédopsychiatrie.

      Le dispositif "Mon Soutien Psy" a également permis de réaliser plus de 2,5 millions de séances.

      Cependant, ces efforts se heurtent à une réalité marquée par des délais d'attente, des défauts de prise en charge et un manque de diagnostics.

      3. Impératif du Repérage Précoce : Un consensus se dégage sur la nécessité de basculer d'une approche majoritairement curative vers une stratégie axée sur la prévention et le repérage précoce des troubles.

      Les médecins généralistes, la santé scolaire et les maisons des adolescents sont identifiés comme des acteurs clés de cette stratégie, qui est perçue comme un levier de "coûts évités" majeur.

      4. Angle Mort sur les Données et l'Évaluation :

      Il existe un manque critique de données médico-économiques sur l'impact du non-dépistage précoce et des hospitalisations évitées.

      Ce déficit de modélisation affaiblit les plaidoyers pour un investissement accru dans la prévention et le suivi post-hospitalisation.

      5. Structuration des Parcours et Coordination Territoriale :

      Les Projets Territoriaux de Santé Mentale (PTSM) sont considérés comme un outil essentiel pour améliorer la coordination des acteurs.

      Leur renforcement et leur évaluation sont des priorités, tout comme le développement de pratiques innovantes pour fluidifier les parcours entre la ville et l'hôpital.

      Analyse Détaillée des Thématiques

      1. La Crise d'Attractivité des Métiers en Psychiatrie

      Le principal frein à l'amélioration de l'offre de soins en santé mentale est la pénurie de personnel qualifié, en particulier de psychiatres.

      Constat d'une Pénurie Sévère :

      • ◦ La ressource humaine en psychiatrie est qualifiée de "denrée rare".   
      • Postes vacants : Plus de 23 % des postes de psychiatres dans les hôpitaux publics ne sont pas pourvus.  
      • Déficit de formation : Chaque année, 30 % des postes d'internes en psychiatrie restent vacants.  
      • Recours aux praticiens étrangers (Padu) : Même en doublant les postes offerts aux praticiens à diplôme hors Union européenne, le taux de vacance reste extrêmement élevé, atteignant 30 % à 50 % selon les régions.

      Un Cercle Vicieux : Cette crise d'attractivité crée un "cercle vicieux" : les étudiants en médecine réalisant leurs stages dans des services en sous-effectif sont peu enclins à choisir cette spécialité, ce qui perpétue la pénurie.

      Pistes de Solution Évoquées :

      Valoriser les stages : Mettre l'accent sur la qualité de l'encadrement des stagiaires pour améliorer l'image de la profession.  

      Flexibiliser l'exercice : Encourager et faciliter l'exercice mixte (ville-hôpital, public-privé) et le temps partagé, qui correspondent aux aspirations des jeunes médecins ne souhaitant plus un exercice unique et à temps plein.

      Des verrous réglementaires ont été levés depuis 2020 pour faciliter l'exercice mixte ville-hôpital.  

      Formation des paramédicaux : La réforme du métier d'infirmier intègre une obligation de stage en psychiatrie d'une durée minimale.

      Par ailleurs, plus de 540 infirmiers en pratique avancée (IPA) en santé mentale ont déjà été formés.

      2. Dissonance entre Efforts Budgétaires et Réalité de Terrain

      Des investissements financiers importants ont été réalisés, mais leurs effets sont encore insuffisants pour répondre à l'ampleur des besoins.

      Investissements Financiers Conséquents :

      • Fonds d'Innovation Organisationnelle en Psychiatrie (FIOP) : Depuis 2019, 288 millions d'euros ont été mobilisés pour accompagner 268 projets innovants. Le fonds a été reconduit en 2025.  
      • Pédopsychiatrie et Psychiatrie Périnatale : Un appel à projets a permis de financer 435 projets, avec des crédits annuels compris entre 20 et 35 millions d'euros. 
      • Dispositif "Mon Soutien Psy" : Fin 2024, le dispositif comptait plus de 4 100 psychologues conventionnés et avait bénéficié à près de 480 000 patients (dont 26 % de mineurs), pour un total de 2,5 millions de séances réalisées.

      Difficultés Persistantes sur le Terrain :

      ◦ Malgré ces chiffres, une "dissonance" est constatée entre les efforts budgétaires et la réalité vécue par les usagers et les professionnels : délais d'attente prolongés, défauts de prise en charge et manque de diagnostics.  

      ◦ Il est souligné que le système reste trop focalisé sur le "curatif" au détriment du "préventif".

      3. L'Impératif de la Prévention et du Repérage Précoce

      Le repérage précoce est identifié comme un axe stratégique majeur pour éviter l'aggravation des troubles et les conséquences sociales et familiales associées.

      Un Axe Prioritaire : Le dépistage est considéré comme "un des axes forts qu'il nous faut développer". Une mission a été confiée à trois personnalités qualifiées pour formuler des recommandations sur ce sujet.

      Les Acteurs Clés du Repérage :

      Médecins généralistes : Ils sont en première ligne, assurant 76 % des premières consultations pour troubles psychiatriques et traitant 73 % des dépressions.

      L'enjeu est de mieux les "outiller" et de renforcer le lien avec les spécialistes.  

      Santé scolaire : Une circulaire conjointe (Santé/Éducation Nationale) est en cours de rédaction pour formaliser des "circuits courts" entre les établissements scolaires et les Centres Médico-Psychologiques (CMP).  

      Maisons des Adolescents : Leur cahier des charges est en cours de rénovation pour y intégrer pleinement la dimension de repérage. Leurs moyens financiers seront renforcés de façon "considérable".

      L'Enjeu des "Coûts Évités" : L'investissement dans la prévention et le diagnostic précoce est présenté non seulement comme une plus-value pour les personnes concernées, mais aussi comme une source d'économies "majeures" pour la collectivité en évitant des prises en charge plus lourdes à long terme.

      4. Structuration de l'Offre et Coordination des Parcours

      L'organisation des soins sur les territoires et la fluidité des parcours patients sont des défis centraux.

      Projets Territoriaux de Santé Mentale (PTSM) :

      ◦ Considérés comme un "outil intéressant", ils mobilisent les acteurs du sanitaire, du social et du médico-social.  

      ◦ Chaque PTSM bénéficie d'un poste de coordinateur financé.  

      ◦ Une "deuxième génération" de PTSM est en préparation pour aller plus loin dans la structuration des parcours.  

      ◦ Une carte interactive des PTSM sera mise en ligne sur le site du ministère pour améliorer la lisibilité et le partage de bonnes pratiques.

      Défis de la Coordination :

      Post-hospitalisation : L'organisation des sorties d'hospitalisation psychiatrique présente des "vraies difficultés", entraînant des réhospitalisations au coût "relativement conséquent", un point déjà soulevé par la Cour des comptes en 2021. 

      Innovation organisationnelle : Le FIOP vise précisément à soutenir des projets qui testent de nouvelles organisations pour améliorer la coordination ville-hôpital et la graduation des soins.

      5. Manque de Données et Nécessité d'Évaluation

      Un "angle mort" important subsiste concernant les données chiffrées, ce qui freine l'optimisation de l'allocation des ressources.

      Absence de Modélisation Médico-Économique :

      ◦ Il y a un manque de données sur le "coût médico-économique du non-dépistage précoce" et sur les hospitalisations potentiellement évitées.  

      ◦ L'approche culturelle française est perçue comme moins avancée que dans les pays anglo-saxons sur l'utilisation d'outils comme les QALY/DALY pour prioriser les investissements.

      Évaluation des Politiques Publiques :

      Le FIOP comme modèle : Ce fonds est cité en exemple pour son processus d'évaluation "extrêmement rigoureuse", menée par des experts indépendants après trois ans de financement, pouvant mener à la pérennisation, la généralisation ou l'arrêt du projet.  

      L'évaluation des PTSM : Si une évaluation qualitative a été menée (le "Tour de France" de Franck Bélivier), un besoin d'évaluation plus systématique et comparative des performances est exprimé pour mieux identifier et diffuser les bonnes pratiques.

      6. Enjeux Spécifiques à Certaines Populations

      Pédopsychiatrie :

      ◦ Le secteur est "assez dépourvu" en lits, ce qui conduit à des hospitalisations d'enfants dans des services pour adultes.  

      ◦ Une inquiétude est soulevée quant au risque de "surdiagnostic", en référence à un rapport de la Cour des comptes, appelant à une vision plus globale de l'accompagnement.

      Santé Mentale en Milieu Carcéral :

      Prévalence élevée : Environ 30 % des détenus présentent des troubles psychiatriques.

      Pour beaucoup, l'incarcération représente le "premier contact avec le soin". 

      Crise d'attractivité aggravée : Les difficultés de recrutement sont "probablement pires" dans ce milieu. L'exemple du centre pénitentiaire de Fresnes, passé de 19 à 6 psychiatres, est emblématique. 

      Solutions : Le développement de postes à temps partagé est crucial pour attirer des praticiens. Une "feuille de route santé des personnes placées sous main de justice" co-pilotée par les ministères de la Santé et de la Justice vise à travailler sur cet enjeu.

    1. Reviewer #1 (Public review):

      Summary:

      In this study, the authors employed comprehensive proteomics and transcriptomics analysis to investigate the systemic and organ-specific adaptations to IF in males. They found that shared biological signaling processes were identified across tissues, suggesting unifying mechanisms linking metabolic changes to cellular communication, which revealed both conserved and tissue-specific responses by which IF may optimize energy utilization, enhance metabolic flexibility, and promote cellular resilience.

      Strengths:

      This study detected multiple organs, including the liver, brain, and muscle, and revealed both conserved and tissue-specific responses to IF.

      Weaknesses:

      (1) Why did the authors choose the liver, brain, and muscle, but not other organs such as the heart and kidney? The latter are proven to be the largest consumers of ketones, which is also changed in the IF treatment of this study.

      (2) The proteomics and transcriptomics analyses were only performed at 4 months. However, a strong correlation between IF and the molecular adaptations should be time point-dependent.

      (3) The context lacks a "discussion" section, which would detail the significance and weaknesses of the study.

      (4) There is no confirmation for the proteomic and transcriptomic profiling. For example, the important changes in proteomics could be further identified by a Western blot.

    2. Reviewer #2 (Public review):

      Summary:

      Fan and colleagues measure proteomics and transcriptomics in 3 organs (liver, skeletal muscle, cerebral cortex) from male C57BL/6 mice to investigate whether intermittent fasting (IF; 16h daily fasting over 4 months) produces systemic and organ-specific adaptations.

      They find shared signaling pathways, certain metabolic changes, and organ-specific responses that suggest IF might affect energy utilization, metabolic flexibility, while promoting resilience at the cellular level.

      Strengths:

      The fact that there are 3 organs and 2 -omics approaches is a strength of this study.

      Weaknesses:

      The analytical approach of the data generated by the present study is not well posed, because it doesn't help to answer key questions implicit in the experimental design. Consequently, the paper, as it is for now, reads as a mere description of results and not a response to specific questions.

      The presentation of the figures, the knowledge of the literature, and the inclusion of only one sex (male) are all weaknesses.

    3. Reviewer #3 (Public review):

      Summary:

      Fan et al utilize large omics data sets to give an overview of proteomic and gene expression changes after 4 months of intermittent fasting (IF) in liver, muscle, and brain tissue. They describe common and distinct pathways altered under IF across tissues using different analysis approaches. The main conclusions presented are the variability in responses across tissues with IF. Some common pathways were observed, but there were notable distinctions between tissues.

      Strengths:

      (1) The IF study was well conducted and ran out to 4 months, which was a nice long-term design.

      (2) The multiomics approach was solid, and additional integrative analysis was complementary to illustrate the differential pathways and interactions across tissues.

      (3) The authors did not overstep their conclusions and imply an overreached mechanism.

      Weaknesses:

      The weaknesses, which are minor, include the use of only male mice and the early start (6 weeks) of the IF treatment. See specifics in the recommendations section.

    4. Author response:

      Reviewer #1 (Public review):

      Summary: 

      In this study, the authors employed comprehensive proteomics and transcriptomics analysis to investigate the systemic and organ-specific adaptations to IF in males. They found that shared biological signaling processes were identified across tissues, suggesting unifying mechanisms linking metabolic changes to cellular communication, which revealed both conserved and tissue-specific responses by which IF may optimize energy utilization, enhance metabolic flexibility, and promote cellular resilience. 

      Strengths: 

      This study detected multiple organs, including the liver, brain, and muscle, and revealed both conserved and tissue-specific responses to IF.

      We appreciate the recognition of the study’s strengths and the opportunity to clarify the points raised.

      Weaknesses: 

      (1) Why did the authors choose the liver, brain, and muscle, but not other organs such as the heart and kidney? The latter are proven to be the largest consumers of ketones, which is also changed in the IF treatment of this study.

      We agree that the heart and kidney are critical organs in ketone metabolism. Our selection of the liver, brain, and muscle was guided by their distinct metabolic functions and relevance to systemic energy balance, neuroplasticity, and locomotor activity, key domains influenced by intermittent fasting (IF). These tissues also offer complementary perspectives on central and peripheral adaptations to IF. Notably, we have previously examined the effects of IF on the heart (eLife 12:RP89214), and we fully acknowledge the importance of the kidney. We intend to include it in future studies to broaden the scope and deepen our understanding of IF-induced systemic responses.

      (2) The proteomics and transcriptomics analyses were only performed at 4 months. However, a strong correlation between IF and the molecular adaptations should be time point-dependent.

      We appreciate this insightful comment. The 4-month time point was selected to capture long-term adaptations to IF, beyond acute or transitional effects. While we acknowledge that molecular responses to IF are time-dependent, our goal in this study was to establish a foundational understanding of sustained systemic and tissue-specific changes. We fully agree that a longitudinal approach would provide deeper insights into the temporal dynamics of IF-induced adaptations. To address this, we are currently undertaking a comprehensive 2-year study that is specifically designed to explore these time-dependent effects in greater detail.

      (3) The context lacks a "discussion" section, which would detail the significance and weaknesses of the study.

      We appreciate this observation. The manuscript was originally structured to emphasize results and interpretation within each section, but we recognize that a dedicated discussion section would enhance clarity and contextual depth. In the revised version, we will add a comprehensive discussion section addressing broader implications, limitations, and future directions of the study.

      (4) There is no confirmation for the proteomic and transcriptomic profiling. For example, the important changes in proteomics could be further identified by a Western blot. 

      We acknowledge the importance of orthogonal validation to support high-throughput findings. While our study primarily focused on uncovering systemic patterns through proteomic and transcriptomic profiling, we agree that targeted confirmation would strengthen the conclusions. To this end, we have included immunohistochemical validation of a key protein common to all three organs—Serpin A1C. Additionally, we are planning a dedicated follow-up study to expand functional validation of several key proteins identified in this manuscript, which will be pursued as a separate project.

      Reviewer #2 (Public review): 

      Summary: 

      Fan and colleagues measure proteomics and transcriptomics in 3 organs (liver, skeletal muscle, cerebral cortex) from male C57BL/6 mice to investigate whether intermittent fasting (IF; 16h daily fasting over 4 months) produces systemic and organ-specific adaptations. 

      They find shared signaling pathways, certain metabolic changes, and organ-specific responses that suggest IF might affect energy utilization, metabolic flexibility, while promoting resilience at the cellular level.

      Strengths: 

      The fact that there are 3 organs and 2 -omics approaches is a strength of this study. 

      We appreciate the reviewer’s recognition of the breadth of our study design. By integrating proteomics and transcriptomics across three metabolically distinct organs, we aimed to provide a comprehensive view of systemic and tissue-specific adaptations to IF. This multi-organ, multi-omics approach was central to uncovering both conserved and divergent biological responses.

      Weaknesses: 

      (1) The analytical approach of the data generated by the present study is not well posed, because it doesn't help to answer key questions implicit in the experimental design. Consequently, the paper, as it is for now, reads as a mere description of results and not a response to specific questions.

      We thank the reviewer for this important observation. Our initial aim was to establish a foundational atlas of molecular changes induced by IF across key organs. However, we recognize that clearer framing of the biological questions would enhance interpretability. In the revised manuscript, we will have restructured the introduction, results, and discussion to align more explicitly with specific hypotheses, particularly those related to energy metabolism, cellular resilience, and inter-organ signaling. We have also added targeted analyses and clarified how each dataset contributes to answering these questions.

      (2) The presentation of the figures, the knowledge of the literature, and the inclusion of only one sex (male) are all weaknesses.

      We appreciate this feedback and agree that these are important considerations. Regarding figure presentation, we will revise several figures for improved clarity, add more descriptive legends, and reorganize supplemental materials to better support the main findings. On the literature front, we will expand the discussion to include recent and relevant studies on IF, metabolic adaptation, and sex-specific responses. As for the use of only male mice, this was a deliberate choice to reduce hormonal variability and focus on establishing baseline molecular responses. We fully acknowledge the importance of sex as a biological variable and will soon be conducting studies in female mice to address this gap.

      Reviewer #3 (Public review):

      Summary: 

      Fan et al utilize large omics data sets to give an overview of proteomic and gene expression changes after 4 months of intermittent fasting (IF) in liver, muscle, and brain tissue. They describe common and distinct pathways altered under IF across tissues using different analysis approaches. The main conclusions presented are the variability in responses across tissues with IF. Some common pathways were observed, but there were notable distinctions between tissues.

      Strengths: 

      (1) The IF study was well conducted and ran out to 4 months, which was a nice long-term design. 

      (2) The multiomics approach was solid, and additional integrative analysis was complementary to illustrate the differential pathways and interactions across tissues. 

      (3) The authors did not overstep their conclusions and imply an overreached mechanism. 

      We sincerely thank the reviewer for acknowledging the strengths of our study design and analytical approach. We aimed to strike a careful balance between comprehensive data generation and cautious interpretation, and we appreciate the recognition that our conclusions were appropriately framed within the scope of the data.

      Weaknesses: 

      The weaknesses, which are minor, include the use of only male mice and the early start (6 weeks) of the IF treatment. See specifics in the recommendations section.

      We appreciate the reviewer’s thoughtful comments. The decision to use male mice and initiate IF at 6 weeks was based on minimizing hormonal variability and capturing early adult metabolic programming. We acknowledge that sex and developmental timing are important biological variables. To address this, we are conducting parallel studies in female mice and evaluating IF initiated at later life stages. These follow-up investigations will help determine the extent to which sex and timing influence the molecular and physiological outcomes of IF.

    1. Note de Synthèse : La Politique de Santé Mentale en France selon la Direction Générale de la Santé

      Synthèse

      Cette note synthétise les perspectives et les actions de la Direction Générale de la Santé (DGS) concernant la santé mentale en France, telles que présentées lors d'une audition parlementaire.

      Le constat principal est une dégradation "nette et durable" de la santé mentale de la population depuis la crise du Covid-19, se manifestant par une hausse significative des troubles anxieux, dépressifs et des idées suicidaires, particulièrement chez les jeunes.

      En 2023, 23 % des adultes déclaraient un niveau d'anxiété élevé et 16 % se disaient déprimés, des chiffres en nette augmentation depuis 2019.

      Le rôle de la DGS se concentre sur la prévention et la promotion de la santé mentale, en amont de la prise en charge psychiatrique. Son action repose sur quatre leviers stratégiques :

      1. Améliorer les connaissances et lutter contre la stigmatisation via des campagnes de communication et des actions locales.

      2. Promouvoir les comportements bénéfiques, notamment par le développement des compétences psychosociales, l'amélioration du sommeil et la prévention de l'usage problématique des écrans.

      3. Renforcer le repérage précoce à travers des programmes comme les "Premiers secours en santé mentale".

      4. Déployer une stratégie nationale de prévention du suicide, s'appuyant sur des dispositifs éprouvés comme le numéro national 3114 et le programme de recontact Vigilance, qui réduit de 38 % le risque de récidive.

      Malgré ces efforts, des défis majeurs persistent.

      La commission parlementaire souligne le décalage entre un diagnostic largement partagé et la mise en œuvre concrète sur le terrain, due notamment à un manque de professionnels (médecins scolaires, psychologues).

      Un débat central porte sur la faible culture de la prévention en France, qui privilégie historiquement le curatif, et sur la difficulté à sécuriser des financements pluriannuels pour des actions dont les bénéfices ne sont visibles qu'à long terme.

      La "Grande Cause Nationale 2025" est perçue comme une opportunité importante mais dont le démarrage a été freiné par le contexte politique.

      1. Rôle de la DGS et Contexte

      La Direction Générale de la Santé (DGS) positionne la santé mentale au cœur de ses priorités, avec un bureau dédié.

      Son action se distingue de celle de la Direction Générale de l'Offre de Soins (DGOS), qui gère la prise en charge psychiatrique.

      La DGS se concentre exclusivement sur les politiques de prévention et de promotion de la santé mentale.

      L'approche de la DGS est double :

      Intersectorielle : La santé mentale étant multifactorielle, elle est prise en compte dans tous les milieux de vie (école, travail, loisirs) en lien avec les ministères concernés.

      Populationnelle : Une attention particulière est portée aux publics les plus vulnérables, incluant les jeunes, les personnes âgées, les personnes en situation de handicap, en situation de précarité ou détenues.

      Il est précisé que la prise en charge globale du handicap relève de la Direction Générale de la Cohésion Sociale (DGCS).

      2. Diagnostic de la Santé Mentale en France : Un Constat Préoccupant

      L'Impact "Net et Durable" de la Crise Sanitaire

      La crise du Covid-19 a marqué un "véritable tournant", provoquant une dégradation significative et persistante de la santé mentale de la population française.

      Chez les adultes : Selon l'enquête CoviPrev de Santé publique France, en 2023 :

      23 % des personnes interrogées déclaraient un niveau d'anxiété élevé (+6 points par rapport à 2019).    ◦ 16 % se disaient déprimées (+5 points par rapport à 2019).

      Populations vulnérables : Les femmes, les jeunes adultes, les personnes précaires et celles ayant des antécédents de troubles psychiques présentent des indicateurs de santé mentale durablement dégradés.

      Le Suicide : Une Préoccupation Majeure

      Le suicide demeure un indicateur alarmant en France.

      • En 2022, le taux de suicide était de 13,3 pour 100 000 habitants, l'un des plus élevés d'Europe.

      • Ce taux est trois fois plus élevé chez les hommes que chez les femmes.

      • Après une baisse depuis les années 80, le taux a atteint un plateau sur lequel il est devenu difficile d'agir.

      La Vulnérabilité Particulière des Jeunes et des Personnes en Situation de Handicap

      Les jeunes : Les indicateurs sont "particulièrement préoccupants".

      Le nombre de passages aux urgences pour gestes suicidaires chez les 11-17 ans est en hausse.

      En 2023, 86 femmes de 15 à 19 ans sur 100 000 ont été hospitalisées pour gestes auto-infligés, une hausse de 46 % par rapport à 2017.

      Personnes en situation de handicap : Elles présentent un risque suicidaire majoré.

      Des études montrent un risque 7 à 10 fois plus important pour les personnes présentant des troubles du spectre autistique.

      Des Causes Multifactorielles

      Ces évolutions sont attribuées à une combinaison de facteurs :

      Environnementaux : L'éco-anxiété est une réalité, notamment chez les jeunes.

      Géopolitiques et économiques : Les conflits, les attentats et l'instabilité économique.

      Sociétaux : La pression scolaire, les usages numériques et l'exposition aux réseaux sociaux sont corrélés à une dégradation de la santé mentale des plus jeunes.

      3. Les Levier d'Action de la Direction Générale de la Santé

      Face à ces constats, la DGS déploie une stratégie de prévention et de promotion articulée autour de quatre axes principaux.

      Axe 1 : Amélioration des Connaissances et Lutte contre la Stigmatisation

      L'objectif est de lever les freins à l'accès aux soins, notamment l'auto-stigmatisation.

      Campagnes de communication : Santé publique France déploie des campagnes grand public et ciblées, avec un site internet dédié.

      Actions territorialisées : Les "Semaines d'information en santé mentale" (SISM) et les "Conseils locaux de santé mentale" (CLSM) sont déployés pour réunir localement élus, citoyens, associations et professionnels.

      Axe 2 : Promotion des Comportements Bénéfiques à la Santé Mentale

      Développement des compétences psychosociales (CPS) : Une stratégie interministérielle (portée par 7 ministères) vise à développer dès le plus jeune âge des compétences comme l'estime de soi, la relation à l'autre et l'esprit critique pour renforcer la résilience.

      Qualité du sommeil : Une feuille de route interministérielle a été lancée, rappelant qu'un sommeil altéré double le risque de développer une dépression.

      Prévention de l'usage excessif des écrans : Des actions sont menées pour contrer la corrélation observée entre le temps d'écran, l'exposition à des contenus inadaptés et les troubles dépressifs chez les jeunes.

      Axe 3 : Repérage Précoce des Troubles

      Premiers secours en santé mentale : Inspiré d'un programme australien, ce dispositif vise à former plus de 200 000 secouristes capables de repérer les situations de détresse dans leur entourage. Le ministre a annoncé un objectif porté à 300 000 formés d'ici 2027.

      Mon bilan prévention : Mis en place en 2023, ce dispositif invite les citoyens à des âges clés de la vie à faire un bilan global de leurs comportements en santé, incluant la santé mentale.

      Axe 4 : Stratégie Nationale de Prévention du Suicide

      Cette stratégie, pilotée par la DGS, a permis de mettre en place des dispositifs clés qui ont démontré leur efficacité.

      Dispositif

      Description

      Données Clés et Résultats

      3114

      Numéro national d'appel pour la prévention du suicide, accessible 24/7.

      Plus de 1000 appels par jour. Un budget de 23 millions d'euros.

      Vigilance

      Dispositif de recontact des personnes passées aux urgences pour une tentative de suicide.

      Réduit de 38 % le risque de réitération suicidaire. Retour sur investissement de 2 € pour 1 € investi. Aujourd'hui généralisé à 17 régions.

      Prévention de la contagion suicidaire

      Plans d'action locaux menés avec les Agences Régionales de Santé (ARS) et les élus pour prévenir les phénomènes de contagion après un suicide.

      Efficacité démontrée par des retours d'expérience qualitatifs.

      4. Enjeux, Débats et Perspectives

      La "Grande Cause Nationale 2025" : Une Opportunité Mitigée

      Reconnue comme une opportunité indéniable, la mise en œuvre de la "Grande Cause" a subi un "retard à l'embrayage" en raison du contexte politique (changement de gouvernement). Cependant, elle a permis de :

      Relancer la mobilisation interministérielle sur des thématiques transversales.

      Prioriser et concrétiser des projets, comme la campagne grand public de Santé publique France.

      Labelliser plus de 750 projets locaux, démontrant une appropriation territoriale.

      Le Défi des Moyens Humains et de la Mise en Œuvre Locale

      Un consensus émerge sur le fait que le diagnostic est connu, mais que l'action sur le terrain manque cruellement de moyens.

      • La prévention et le repérage précoce se heurtent à une pénurie de professionnels (infirmières scolaires, médecins scolaires, psychologues).

      • Les dispositifs comme les Conseils Locaux de Santé Mentale (CLSM) et les Projets Territoriaux de Santé Mentale (PTSM) visent à améliorer la coordination locale, mais la marche reste haute.

      Le Modèle Économique de la Prévention

      Le débat met en lumière une tension structurelle dans le système de santé français.

      Faiblesse de l'investissement : Les dépenses de prévention en France représentent 2 à 3 % des dépenses de santé, un niveau bas comparé aux standards de l'OCDE. La France a historiquement privilégié une culture du soin curatif.

      Logique de court terme : Les décideurs politiques sont contraints par des arbitrages budgétaires annuels, alors que les retours sur investissement de la prévention s'étalent sur plusieurs années. Le coût sociétal total des suicides et tentatives de suicide a été estimé à 24 milliards d'euros en 2019.

      Débat sur la pluriannualité : La proposition d'une loi de programmation pluriannuelle pour la santé mentale est avancée pour garantir des investissements à long terme. La DGS exprime une réserve, soulignant que la multiplication de telles lois rigidifie la dépense publique.

      Le Dispositif "Mon Soutien Psi"

      Ce dispositif, qui permet une prise en charge de séances de psychologue, est reconnu comme un progrès pour lever les freins financiers.

      Il est cependant noté qu'il a pu contribuer à une "fuite" des psychologues du secteur public (hôpitaux, Centres Médico-Psychologiques) vers le secteur libéral, affaiblissant la prise en charge des troubles plus lourds qui nécessitent une approche pluridisciplinaire.

      5. Focus sur des Populations Spécifiques

      Personnes âgées : Le taux de suicide chez les 85-94 ans est de 35 pour 100 000, soit près du triple du taux de la population générale, un chiffre largement attribué à l'isolement.

      Jeunes : Le harcèlement scolaire est identifié comme un facteur de risque majeur. La DGS collabore étroitement avec l'Éducation Nationale pour déployer les programmes de compétences psychosociales afin de mieux armer les élèves.

      Agriculteurs : Cette population connaît des taux de suicide extrêmement élevés. Des dispositifs spécifiques comme les "sentinelles" sont déployés par la MSA dans le cadre du suivi du mal-être agricole.

    1. AbstractBackground Reference genomes have a wide range of applications. Yet, we are from a complete genomic picture for the tree of life. We here contribute another piece to the puzzle by providing a high-quality reference genome for the Ural Owl (Strix uralensis), a species of conservation concern and efforts affected by habitat destruction and climate change.Results We generated a reference genome assembly for the Ural Owl based on high-fidelity (HiFi) long reads and chromosome conformation capture (Hi-C) data. It figures amongst the best avian genome assemblies currently available (BUSCO completeness of 99.94 %). The primary assembly had a size of 1.38 Gb with a scaffold N50 of 90.1 Mb, while the alternative assembly had a size of 1.3 Gb and a scaffold N50 of 17.0 Mb. We show an exceptionally high repeat content (21.07 %) that is different from those of other bird taxa with repeat extensions. We confirm a Strix characteristic chromosomal fusion and support the observation that bird microchromosomes have a higher density of genes, associated with a reduction in gene length due to shorter introns. An analysis of gene content provides evidence of changes in the keratin gene repertoire as well as modifications of metabolism genes of owls. This opens an avenue of research if this is related to flight adaptations. The population size history of the Ural Owl decreased over long periods of time with increases during the Eemian interglacial and stable size during the last glacial period. Ever since it is declining to its currently lowest effective population size. We also investigated cell culture of progressive passages as a tool for genetic resources. Karyotyping of passages confirmed no large variants, while a SNP analysis revealed a low presence of short variants across cell passages.Conclusions The established reference genome is a valuable resource for ongoing conservation efforts, but also for (avian) comparative genomics research. Further research is needed to determine whether cell culture passages can be safely used in genomic research.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf106), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Jianbo Jian

      The authors provide a high-quality reference genome for the Ural Owl (Strix uralensis), these genomic resources are valuable for conservation and evolution. The manuscript is well-written, and the scientific story with cell culture for conservation is interesting. I have some questions or comments as following: 1、 in abstract, the N50 is contig or scaffold? 2、For the GenomeScope analysis, the estimated genome size is 1.29 Gb with low heterozygosity (0.2%). The assembled genome size is 1.38 Gb. Could there be duplicated genome sequences in the assembly, or did the genome survey evaluation exclude some k-mers? What were the parameters used in GenomeScope2 (e.g., was the -h parameter set to its default value)? 3、How do you perform the decontamination? 4、For the Hi-C contact map, due to some chromosomes are considerably larger while others are much smaller, it is suggested that the larger chromosomes could be displayed independently from the smaller ones to enhance clarity and interpretation.

    1. AbstractBackground Influenza A virus (IAV) poses a significant threat to animal health globally, with its ability to overcome species barriers and cause pandemics. Rapid and accurate IAV subtypes and host source prediction is crucial for effective surveillance and pandemic preparedness. Deep learning has emerged as a powerful tool for analyzing viral genomic sequences, offering new ways to uncover hidden patterns associated with viral characteristics and host adaptation.Findings We introduce WaveSeekerNet, a novel deep learning model for accurate and rapid prediction of IAV subtypes and host source. The model leverages attention-based mechanisms and efficient token mixing schemes, including the Fourier Transform and the Wavelet Transform, to capture intricate patterns within viral RNA and protein sequences. Extensive experiments on diverse datasets demonstrate WaveSeekerNet’s superior performance to existing models that use the traditional self-attention mechanism. Notably, WaveSeekerNet rivals VADR (Viral Annotation DefineR) in subtype prediction using the high-quality RNA sequences, achieving the maximum score of 1.0 on metrics including the Balanced Accuracy, F1-score (Macro Average), and Matthews Correlation Coefficient (MCC). Our approach to subtype and host source prediction also exceeds the pre-trained ESM-2 (Evolutionary Scale Modeling) models with respect to generalization performance and computational cost. Furthermore, WaveSeekerNet exhibits remarkable accuracy in distinguishing between human, avian, and other mammalian hosts. The ability of WaveSeekerNet to flag potential cross-species transmission events underscores its significant value for real-time surveillance and proactive pandemic preparedness efforts.Conclusions WaveSeekerNet’s superior performance, efficiency, and ability to flag potential cross-species transmission events highlight its potential for real-time surveillance and pandemic preparedness. This model represents a significant advancement in applying deep learning for IAV classification and holds promise for future epidemiological, veterinary studies, and public health interventions.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf089), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 3:Weihua Li

      (1) In the abstract, the statement 'WaveSeekerNet achieves scores of up to the maximum 1.0 across all evaluation metrics, including F1-score (Macro Average)' appears to slightly deviate from the actual experimental results. (2) In data preprocessing, the reasoning behind selecting and keeping the earliest collected sequence when duplicate sequences are encountered should be explained. (3) Compared with Figure 4, Figure 5 demonstrates performance improvements in most cases, but why is this not observed for some results in Figure 4d? (4) Could the oversampling/undersampling methods employed in the study introduce any potential biases to the analysis? (5) Given that VADR can provide viral classification and annotation information—which serves as the benchmark in this study, what specific advantages does WaveSeekerNet offer for subtype classification? (6) The paper employs 10-fold cross-validation to assess generalizability, yet the data processing section describes a temporal split (pre-2020 for training). Could the "Model Training and Testing" section provide further clarification on this?

    2. AbstractBackground Influenza A virus (IAV) poses a significant threat to animal health globally, with its ability to overcome species barriers and cause pandemics. Rapid and accurate IAV subtypes and host source prediction is crucial for effective surveillance and pandemic preparedness. Deep learning has emerged as a powerful tool for analyzing viral genomic sequences, offering new ways to uncover hidden patterns associated with viral characteristics and host adaptation.Findings We introduce WaveSeekerNet, a novel deep learning model for accurate and rapid prediction of IAV subtypes and host source. The model leverages attention-based mechanisms and efficient token mixing schemes, including the Fourier Transform and the Wavelet Transform, to capture intricate patterns within viral RNA and protein sequences. Extensive experiments on diverse datasets demonstrate WaveSeekerNet’s superior performance to existing models that use the traditional self-attention mechanism. Notably, WaveSeekerNet rivals VADR (Viral Annotation DefineR) in subtype prediction using the high-quality RNA sequences, achieving the maximum score of 1.0 on metrics including the Balanced Accuracy, F1-score (Macro Average), and Matthews Correlation Coefficient (MCC). Our approach to subtype and host source prediction also exceeds the pre-trained ESM-2 (Evolutionary Scale Modeling) models with respect to generalization performance and computational cost. Furthermore, WaveSeekerNet exhibits remarkable accuracy in distinguishing between human, avian, and other mammalian hosts. The ability of WaveSeekerNet to flag potential cross-species transmission events underscores its significant value for real-time surveillance and proactive pandemic preparedness efforts.Conclusions WaveSeekerNet’s superior performance, efficiency, and ability to flag potential cross-species transmission events highlight its potential for real-time surveillance and pandemic preparedness. This model represents a significant advancement in applying deep learning for IAV classification and holds promise for future epidemiological, veterinary studies, and public health interventions.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf089), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1:Will Dampier

      The manuscript presented by Nguyen et al. is well written, well researched, and well executed. The use of this new "wavelet style" neural network shows both an increased training efficiency and improved accuracy at detecting influenza subtypes for surveillance. However, I think their comparison to a 'plain' Transformer model does not take advantage of the improvements in pre-training and transfer-learning that have become standard practice in deep-learning. I have also included some stylistic suggestions to improve the figures as presented. After addressing these comments, I believe that this will become a very strong manuscript.

      Major Comments:

      The authors present a comparison between their new wavelet architecture and a standard transformer architecture using a one-hot encoded vector of amino-acids. I believe that this is the correct 'null model' to compare your wavelet architecture to, however, it does not represent the 'state of the art' in utilizing transformers for sequence analysis. As I'm sure the authors are aware, the disadvantage of transformers is that they take an extensive amount of training (they note the transformer only models take 2-4X more training epochs to converge). However, the advantage they bring is that they can be extensively trained for one task and then transfer that learning to another related task. A number of models have been pre-trained on giant collections of proteins Asgari et al, https://doi.org/10.1371/journal.pone.0141287 and Rives et al https://doi.org/10.1073/pnas.2016239118 which then allow one to transfer that knowledge to different domains with fewer examples such as demonstrated in Dampier et al https://doi.org/10.3389/fviro.2022.880618. It would be interesting to see whether your wavelet model defeats these pre-trained models with transfer learning. If you showed that, you could argue that there is no need for the extensive expense of 'foundational models'.

      The authors discuss that there is a significant imbalance in the training set and they used up-sampling and limiting to balance out the class representation. Since the classes are not equally represented, the model may not be equally able to predict each class. And the high metrics may only be a representation of its ability to predict the popular classes correctly. The authors should include an additional set of figures (supplemental is fine) that show the metrics broken out by Subtype. It would also be interesting to see a graph of the class-size (before up-sampling) vs F1-score (or another metric) on that class. This could provide lower-bounds for how many samples are needed to train the model.

      Minor Comments:

      Figures 3, 4, and 5: These would benefit from a linked y-axis. It is hard to compare across A/B/C/D when the axes have different y-limits.

    1. AbstractBackground Food contamination by pathogens poses a global health threat, affecting an estimated 600 million people annually. During a foodborne outbreak investigation, microbiological analysis of food vehicles detects responsible pathogens and traces contamination sources. Metagenomic approaches offer a comprehensive view of the genomic composition of microbial communities, facilitating the detection of potential pathogens in samples. Combined with sequencing techniques like Oxford Nanopore sequencing, such metagenomic approaches become faster and easier to apply. A key limitation of these approaches is the lack of accessible, easy-to-use, and openly available pipelines for pathogen identification and tracking from (meta)genomic data.Findings PathoGFAIR is a collection of Galaxy-based FAIR workflows employing state-of-the-art tools to detect and track pathogens from metagenomic Nanopore sequencing. Although initially developed to detect pathogens in food datasets, the workflows can be applied to other metagenomic Nanopore pathogenic data. PathoGFAIR incorporates visualisations and reports for comprehensive results. We tested PathoGFAIR on 130 samples containing different pathogens from multiple hosts under various experimental conditions. For all but one sample, workflows have successfully detected expected pathogens at least at the species rank. Further taxonomic ranks are detected for samples with sufficiently high Colony-forming unit (CFU) and low Cycle Threshold (Ct) values.Conclusions PathoGFAIR detects the pathogens at species and subspecies taxonomic ranks in all but one tested sample, regardless of whether the pathogen is isolated or the sample is incubated before sequencing. Importantly, PathoGFAIR is easy to use and can be straightforwardly adapted and extended for other types of analysis and sequencing techniques, making it usable in various pathogen detection scenarios. PathoGFAIR homepage: https://usegalaxy-eu.github.io/PathoGFAIR/

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf017), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Ann-Katrin Llarena

      Nasr and colleagues present an, at times, well-written manuscript with an interesting and robust pipeline that includes well-known softwares (you must make sure to cite the authors of these). However, the manuscript is, quote "...a collection of Galaxy-based FAIR workflows employing state-of-the-art tools to detect and track pathogens from metagenomic Nanopore sequencing". Its repeated how well it works, they even compare it to other software in table 1 (without proper benchmark). These initial statements are however not supported by the findings. The Salmonelal from the spiked samples are, as expected from food matrix present in low quantity), difficult to do more than state that the genus is present, and only a fraction of the samples can actually "complete" the entire pipeline. Also, the benchmarking is not really benchmarking (compare and measure this software against other competing software). No such comparison is done, and even though the intention of PathoGFAIR as stated throughout the paper, is detection and analysis of metagenomic samples, the benchmarking is done on isolate based wgs. It is also evident that the authors are not microbiologists as the manuscript is riddled with taxonomical misunderstandings about the vast genus Salmonella and when to use capital letters and italics. I am also lacking a proper discussion here on the results found in the spiking experiment in light of current EU legislation on Salmonella. Can this pipeline help in this regard? Sensitivity and specificity metrics are also lacking.

      Abstract: "foodborne pathogen data" / "metagenomic Nanopore pathogenic data" - suggest to rewrite, as what I think you are trying to say is " initially developed to detect foodborne pathogens from metagenomic nanopore data, the workflow can be used to detect any pathogen." "Colony-forming unit and Cycle Threshold values." rewrite sentence, I do not completely understand what you are trying to say. what is "sufficient colony forming units?" It will vary as well between pathogens (infection dose varies). You could rather state your sensitivity of the pipeline here - even though i think that sampling prep, library prep and seq influences that more than the bioinformatics. "In any sample": did you test all matrixes? "sample is isolated or incubated before seq" you cannot isolate a sample, but you isolate a bacteria from a sample. unprecise language.

      Introduction: In general, its well written, but a bit unprecise here and there. The authors also rely a lot on the following words: "rapid" "accurate". "outbreaks and epidemics" - rewrite, these are the same. "efforts to mitigate their spread and ensure food safety" again, complementary terms - rewrite. "global public health authorities" we do have everything from local to global food safety and public health authorities, I think one should highlight this. There is a difference between for instance EFSA and ECDC. "isolation can be complex"? do you mean complicated or work intensive? "The utilisation of Nanopore sequencing data, as exemplified in studies like [7]," citing practices like this is not really reader friendly. Suggest to write what they actually did in seven (as for instance the detection of blah in blah as shown in 7). "Once (meta)genomics data has been generated, bioinformatics approaches enable the rapid and accurate detection"; repetition of chapter above. You write in the former chapter that "the utilisation of nanopore data" which also includes bioinformatics of course. SURPI and Sunbeam is freely available? https://microbiomejournal.biomedcentral.com/articles/10.1186/s40168-019-0658-x https://chiulab.ucsf.edu/surpi/

      "PathoGFAIR: pathogen identification and tracking from metagenomics". Im not convinced that it can perform tracing in an outbreak where only a few SNPs are allowed. PathoGFAIR does not really speed up the process of sampling, does it. Actually, it takes more time to extract crude dna from a sample than to place it in a enrichment broth or do a dilution series, so the presteps are not really a part of this. "Tracking pathogens" - again, if species level is the lowest rank it can go to, its not enough to perform tracking.

      Overview chapter "input data is seq data generated w nanopore" basecalling is not included in the workflow? How is this performed? It affects the quality of the reads, so its nice to know what you did. The chapter is very wordy, and contains a lot of fill-words with salespitches almost. I would recommend rewriting it, for instance: Chapter that starts with subsequently and describes the different workflows and how they work together can be compressed. And the last three sections are salespitching.

      WF1: Preprocessing: How stringent filtering and quality control are implemented in the workflow? How good quality do you need for the wp2-4 to work sufficiently well? Did you test? Food vehicle animal? What is that - do you mean that if you extract dna from bovine meat, you map to bovine genome? "a tool ten times faster etc etc." is discussion and should be removed from what I think is materials and methods even though the title of the section is workflow 1. What is a food host? Kalamari database includes many foodborne pathogens, such as Shigella, E. coli, Campylobacter etc etc. how can you just remove all reads that match to this database? Table 1: Innuendo is based on isolate WGS, and not intended for WGS. Also, it has its own built in wgMLST schema employed using chewbbacca, so it definitely has allele-abased pathogen identification. Its intended for illumina data. Victors are strictly a platform to analyse virulence factors and not intended even for taxonomic profiling, and its webinterface doesn't work. IDseq has step-by-step guides available on their webpage, so I think that qualifies as a tutorial. You can also contact them (user support). I guess the same is true for OneCodex, as you actually pay for that one. So the table is unprecise at best and should be corrected (I didn't go through Submeam, SURPI or PAIPline specs to try to check if you got it correctly). Rewrite this. Further, I think you should only include systems / pipelines that are intended for metagenomics. You have a footnote * that I cannot see in the table as well.

      WF2 taxonomy profiling: The first sentence needs rewriting. Two sentences from "Although Kraken2 is a tool design…….." belongs in discussion. WF3: Medaka consensus pipeline : "This task is performed using neural networks applied from a pileup of individual sequencing reads against a draft assembly. " what draft assembly did you use here to create a consensus sequence? Actually, its not polishing contigs, its assemblying them? Again, there is some descriptions of the software which belongs in the discussion, say the perks one gets from using this tool over the other. I do not however get how screening for virulence genes = pathogen identification. The thing is that in a complex food matrix or faecal samples from animals, things like stx phages will also be present. These are not stec pathogens unless the phage is inside an e.coli. How do you make sure of the host for such mobile genetic elements as these virulence and amr genes often are located on? Seeing as this is the basis of your pathogen detection?

      WF4: A bit again on choosing software over the other that is discussion food. Wf4/wf5: I am worried about the reliance on snp based technics for nanopore reads. Is the quality good enough to achieve sufficiently robust results? Easily adaptable workflows Last section is repetition (about each wf operating independently) Use cases: Data generation: Please revise how to write Salmonella names correctly. They should be in italics for genus, species and subspecies names, while the serovar/serotype is non italic and capital letter. So the correct term would be: * Salmonella enterica subsp. enterica serovar Houtenae, or in short; Salmonella Houtenae. * The strain DSM554 is of serovar Typhimurium, and this should referenced like this: Salmonella enterica subsp. enterica serovar Typhimurium strain DSM 554 First two sentences are contradictory to eachother? Sentence starting "15 samples were incubated"; don't start sentence with number, it looks like 33.15 How much meat did you use? What CFU/g does these ct values translate too? Its important to know the sensitivity relative to legislation. The limit is zero in 100grams, but I don't assume you tested 100g? What does adaptive sampling mean? To exclude chicken DNA? The point v sentence under description of supplementary table t1 is a bit weird punctuation Gene-based pathogen identification: Working with meat to detect low abundance pathogenic bacteria is challenging without enrichment of the expected pathogen with selective methods. Just incubating it a x temperature might work for some bacteria, but others need special atmosphere (campylobacter, clostridia) and nutrients. How do you accommodate this? Figure 2 B: The grey bares samples ? why are they collapsed in the left corner? And shy are sdhA and mucD highlighted? Also, please put genes in italics. the grey bars on the right (y-axis) are not annotated? To which reference genome are the barplot in d referring to? I can see for instance in f that there is a number of snps or variants for the Houtenae and Typhimurium, but not Salamae, was the latter used as reference? "an AIDA autotransporter-like protein, only found in Enterica strain samples but not in samples spiked with Houtenae or Salamae strains." All these strains are of the subspecies enterica Figure 3: punctuations a bit off here and there. Why do you operate with cfu/ml? You added it to meat? It should be cfu/g? It would be nice with a presentation of the resistance panel of the three spiked strains before presenting the amr genes. "Similar but inverse relations are observed for CFU/mL value (Figure 3 C & D), with a threshold for VF and AMR gene detection at 106 ." cfu/ml of what? The rinse? Added ml? I don't even know how much meat were included in the dna extractions. "The further the samples are from these thresholds, the higher the number of VF genes and AMR genes identified. Indeed, the three top scattered dots with identified VF genes between 250 and 300 (Figure 3 A, C, E) are the samples with the highest number of reads, higher CFU/mL value, and a relatively lower Ct value compared to other samples." The tendency is ok, but not all. For instance, you have several exceptions here for both amr genes and vf genes. Maybe mark the dots after say spiked strain/enrichment or not?

      Discussion bit here : "enerally, allowing samples to incubate for a short period before se quencing enhances microbial growth, resulting in higher CFU/mL values and lower Ct values. This increase in microbial concentra tion improves the efficiency of direct sequencing by providing more genetic material for analysis, facilitating faster and more accurate pathogen detection. "

      Allele-based pathogen identification: "Salmonella enterica subspecies enterica serovar typhimarium (NC_003197.2)": see earlier comment on writing correct taxonomically for Salmonella. "However, given the diversity among Salmonella subspecies in the samples, a high number of complex variants and SNPs were anticipated. " You only operate with ONE subspecies of Salmonella - S. enterica subsp. enterica. That's the relevant subspecies, and contains over 2500 serovariants. I don't understand this process; in an outbreak setting you are dependent on tracing, i.e. showing that you isolates are clonal. Pathogfair relies on mapping to a reference genome, but that again relies on isolation of suspected isolate and building a high quality assembly for the allel-based pathogen identification to work. Its not enough to just show that you have that or that serotype, you will have to show that they are clonal (i.e. separated by a limited number of SNPs, say max 20 snps over the full length of the chromosome). This method cannot do this. Samples with prior pathogen isolation: Do understand you correctly that you now exstract dna from isolates? Not whole samples matrix? If so, how is this benchmarking a pipeline intended for metagenomics sequencing? If you were to extract dna from feces/ food and then use your pipeline, that would be benchmarking. However, this doesn't prove that your pipeline works as you intend it to/or claim that it does. How were the samples prepared? If isolates, extraction method and sequencing techniques? Species name is written non-capitalized first letter, so Campylobacter jejuni. All gene names should be italicized. Suggest rewriting sentence: The wet lab procedures performed to isolate and prepare these samples for sequencing adhered to standard microbiological techniques, including cultivation, enrich ment, and isolation steps" to reflect actual sequel; enrichment, cultivation and isolation and verification." Conclusion: If for use for solely isolates, I think assemblies are a better way to go than this pipeline; its more reliable for clonality analysis needed in outbreaks. "We further supported the scientific community by introducing new 46 benchmark samples, making them publicly available. This demonstrates our significant investment of time and resources, providing valuable assets for future research." There are now 82000 c. jejuni just on ncbi, of which 600 are complete. Salmonella genomes are clocking on 524500 assemblies on enterobase. The contribution of these strains are not because they are new samples, but because your isolates represent data from an underrepresented region of the world, namely Palestine.

      Supplmentary figure s4 is cropped so that x-line annotation is not visible. SFigure 5 Midpoint root amr phylogenetic tree? Supplementary table 1: its unclear for me if you added this amount of bacteria or it was the result of after 1h or 24h enrichment. Also, I don't understand how much meat you used for the dna extraction. Same goes for ct values.

    2. AbstractBackground Food contamination by pathogens poses a global health threat, affecting an estimated 600 million people annually. During a foodborne outbreak investigation, microbiological analysis of food vehicles detects responsible pathogens and traces contamination sources. Metagenomic approaches offer a comprehensive view of the genomic composition of microbial communities, facilitating the detection of potential pathogens in samples. Combined with sequencing techniques like Oxford Nanopore sequencing, such metagenomic approaches become faster and easier to apply. A key limitation of these approaches is the lack of accessible, easy-to-use, and openly available pipelines for pathogen identification and tracking from (meta)genomic data.Findings PathoGFAIR is a collection of Galaxy-based FAIR workflows employing state-of-the-art tools to detect and track pathogens from metagenomic Nanopore sequencing. Although initially developed to detect pathogens in food datasets, the workflows can be applied to other metagenomic Nanopore pathogenic data. PathoGFAIR incorporates visualisations and reports for comprehensive results. We tested PathoGFAIR on 130 samples containing different pathogens from multiple hosts under various experimental conditions. For all but one sample, workflows have successfully detected expected pathogens at least at the species rank. Further taxonomic ranks are detected for samples with sufficiently high Colony-forming unit (CFU) and low Cycle Threshold (Ct) values.Conclusions PathoGFAIR detects the pathogens at species and subspecies taxonomic ranks in all but one tested sample, regardless of whether the pathogen is isolated or the sample is incubated before sequencing. Importantly, PathoGFAIR is easy to use and can be straightforwardly adapted and extended for other types of analysis and sequencing techniques, making it usable in various pathogen detection scenarios. PathoGFAIR homepage: https://usegalaxy-eu.github.io/PathoGFAIR/

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf017), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Federico Zambelli

      The authors present PathoGFAIR, a set of Galaxy workflows for the metagenomic analysis of shotgun Nanopore sequencing from isolated and non-isolated pathogens in contaminated food samples. They complement their work by analysing and releasing two datasets, one from isolated and the other from non-isolated samples, with the primary objective of illustrating the potentiality of the workflows. These datasets could also be used as benchmarks for future works.

      The manuscript is generally well-written, and the authors highlight the advantages of the proposed workflows in Table 1 by comparing them to similar solutions. The workflows are well integrated into the Galaxy network, are available on the three main usegalaxy instances, and provide a thorough tutorial through the Galaxy training platform. A notable advantage of PathoGFAIR over similar workflows is that, thanks to Galaxy, the final user can easily tailor them by replacing any tool in the workflow with others available in the Galaxy ecosystem. This also allows easy updates for the tools in the workflows.

      A few minor points that, if addressed, in my opinion, could further strengthen the manuscript:

      1 - The rationale behind the tool selection in each of the four workflows is not always clear. While insights are present for workflows 1 and 4, this is not true for workflows 2 and 3. The reader would benefit from understanding why one tool has been preferred over another for the same task, even more so, given the possibility to modify the workflows easily, when this preference could be the other way around in particular use cases or conditions.

      2—One of the main factors for a successful metagenomic analysis is the correctness, completeness, and up-to-dateness of the reference data. The authors should briefly describe how PathoGFAIR addresses this in Galaxy.

      3—While this workflow is clearly stated to be tailored for shotgun metagenomic sequencing, the authors contrast this approach only with targeted sequencing. Instead, they should also discuss the 16s rRNA metagenomic approach, for which Nanopore kits are available, and why PathoGFAIR has been limited to the analysis of shotgun data.

    1. Synthèse de l'Audition de la Défenseure des droits sur la Santé Mentale et le Handicap

      Résumé

      L'audition de la Défenseure des droits devant la commission d'enquête de l'Assemblée nationale dresse un tableau alarmant des défaillances systémiques dans la prise en charge de la santé mentale et du handicap en France.

      Le handicap constitue le premier motif de saisine pour discrimination (22 % des réclamations en 2024), soulignant un écart persistant entre les droits annoncés et leur effectivité.

      Les politiques de santé mentale sont jugées gravement insuffisantes, tant pour les majeurs que pour les mineurs.

      La situation est particulièrement critique en milieu carcéral, où la surreprésentation des troubles mentaux, conjuguée à la surpopulation et au manque de soins, conduit à des traitements qualifiés d'inhumains.

      Pour les jeunes, les données sont alarmantes (25 % souffrent de dépression), mais les services de pédopsychiatrie sont saturés, avec des délais d'attente dépassant un an, et un manque criant de données fiables pour piloter les politiques publiques.

      Le recours abusif à l'isolement et à la contention, notamment sur des mineurs hospitalisés en services pour adultes sans contrôle judiciaire, constitue une atteinte grave aux droits fondamentaux.

      Concernant le handicap, la loi de 2005, bien qu'ayant permis des avancées, est loin d'être intégralement appliquée.

      L'éducation inclusive reste un défi majeur, marqué par un manque d'AESH, des difficultés d'aménagement des examens et une absence de données précises sur le temps de scolarisation réel.

      L'emploi demeure le premier domaine de discrimination, et l'accessibilité (transports, logement, numérique) accuse un retard considérable. Les aides à l'autonomie sont insuffisantes et inégalitaires, notamment en raison du maintien d'une barrière d'âge à 60 ans.

      La Défenseure des droits insiste sur le fait que le non-respect des droits fondamentaux représente un coût social et économique élevé à terme, bien supérieur à celui d'un investissement dans la prévention et une prise en charge effective.

      L'urgence est de commencer par appliquer les textes existants et de prendre conscience de la détresse de la jeunesse.

      Introduction : Rôle et Observations du Défenseur des droits

      L'institution du Défenseur des droits est un observateur privilégié des carences des politiques publiques, car la question des droits des personnes handicapées traverse l'intégralité de ses cinq missions :

      • 1. Droits des usagers des services publics

      • 2. Lutte contre les discriminations

      • 3. Protection des droits des enfants

      • 4. Déontologie des forces de sécurité

      • 5. Protection des lanceurs d'alerte

      L'institution est également chargée du suivi de l'application de la Convention internationale relative aux droits des personnes handicapées (CIDPH), ratifiée par la France en 2019.

      Le Handicap : Premier Motif de Discrimination

      Depuis plusieurs années, le handicap est le premier motif de saisine en matière de discrimination. Cette constance révèle une problématique structurelle profonde.

      Année

      Nombre total de saisines (Discrimination)

      Part relative au handicap

      Nombre de réclamations (Handicap)

      2024

      5 679

      22%

      1 249

      Ces discriminations s'exercent dans de multiples domaines, incluant l'emploi, la scolarisation, la santé, la justice, les loisirs, le sport et la culture. L'institution se positionne comme un "très bon observatoire de ce qui ne va pas", mettant en lumière l'écart entre le droit annoncé et son effectivité sur le terrain.

      L'Argument Central : Le Coût du Non-Respect des Droits

      La Défenseure des droits conteste fermement l'idée que l'application des droits fondamentaux représenterait un coût financier trop important.

      Elle soutient au contraire que "c'est le non-respect des droits fondamentaux qui entraînera à terme un coût élevé pour la société", un argument particulièrement pertinent en période d'incertitude budgétaire.

      Les Défaillances dans la Prise en Charge de la Santé Mentale

      La Situation des Personnes Majeures

      La réponse des pouvoirs publics à la crise de la santé mentale, aggravée par la pandémie de Covid-19, reste insuffisante. Un Français sur trois sera confronté à un trouble psychiatrique au cours de sa vie. Les défaillances sont multiples :

      Quantitatives : Offres de soins trop faibles, capacités d'hospitalisation limitées, déserts médicaux.

      Organisationnelles : Système mal organisé et cloisonné entre le sanitaire et le médico-social.

      Humaines : Le secteur de la psychiatrie peine à recruter alors que les besoins augmentent.

      Ces carences entraînent des atteintes graves et répétées aux droits fondamentaux, notamment le droit à la santé, avec des délais d'attente excessifs, des ruptures de soins et des inégalités territoriales criantes qui pénalisent les plus précaires.

      Cas Spécifique : La Santé Mentale des Personnes Détenues

      La santé mentale des personnes détenues se dégrade faute d'un accompagnement adapté.

      Statistiques : Entre juillet 2024 et juillet 2025, la plateforme d'appel pour les détenus (3141) a reçu 1 065 appels (7,6%) pour des difficultés d'accès aux soins et 106 appels spécifiques pour un risque suicidaire.

      Causes de la surreprésentation des troubles mentaux :

      • 1. Une politique de désinstitutionnalisation qui a réduit les lits en psychiatrie sans développer de services de proximité en relais.  
      • 2. Une diminution du nombre de personnes déclarées pénalement irresponsables, qui se retrouvent de ce fait en prison.

      Conséquence juridique : Maintenir en détention une personne nécessitant une prise en charge médicale revient à lui infliger des "traitements inhumains", comme l'a souligné la Cour européenne des droits de l'homme (arrêt GC c. France, 2012).

      Facteurs aggravants : La surpopulation carcérale et l'absence de continuité des soins à la sortie, qui augmente le risque de récidive.

      La Situation des Mineurs

      Les résultats d'une étude de 2025 (Institut Montaigne, Mutualité française, Institut Teram) sont jugés alarmants :

      25% des jeunes (15-29 ans) souffrent de dépression.

      • Ce chiffre atteint 39% dans les outre-mer, avec des pics à plus de 50% en Guyane, 44% en Martinique et 43% à Mayotte.

      Problèmes Structurels Identifiés

      1. Manque de données fiables : L'absence de données agrégées au niveau national sur le nombre d'enfants en attente de prise en charge fragilise le pilotage des politiques publiques.

      Le rapport de la Cour des comptes de 2023 sur la pédopsychiatrie estime que sur 1,6 million d'enfants avec un trouble psychique, seuls 50 à 53% bénéficient de soins.

      2. Inégalités territoriales et pénurie de médecins : La politique du "virage ambulatoire" a renforcé le rôle des Centres Médico-Psychologiques (CMP), mais leur répartition est inégale (10 par département en moyenne, avec de fortes disparités).

      Les délais pour obtenir un premier rendez-vous dépassent souvent un an, ce qui est incompatible avec la nécessité d'une intervention rapide.

      3. Prise en charge inadaptée : Une source d'inquiétude majeure est l'hospitalisation d'enfants et d'adolescents au sein de services psychiatriques pour adultes, souvent par défaut de solutions dans le secteur médico-social ou en protection de l'enfance.

      Mesures d'Isolement et de Contention : Des Pratiques Abusives

      Le manque d'effectifs conduit trop souvent à des restrictions injustifiées des libertés, telles que des mesures d'isolement et de contention.

      Pour les majeurs : Le contrôle systématique par un juge des libertés et de la détention (JLD) pour les hospitalisations sans consentement est jugé peu efficace.

      Il repose principalement sur l'avis médical, et seules 10% des décisions aboutissent à une levée de la mesure. De plus, aucun contrôle judiciaire n'est prévu pour les soins ambulatoires sans consentement.

      Pour les mineurs : La situation est encore plus préoccupante. Un mineur hospitalisé à la demande de ses parents est placé sous le régime de "soins libres" et ne bénéficie d'aucun contrôle du JLD, même en cas d'isolement ou de contention.

      Ce vide juridique constitue une atteinte grave à leurs droits fondamentaux.

      Cas emblématique

      Une adolescente de 15 ans, atteinte d'autisme sévère, a été hospitalisée pendant plus de deux ans dans un service psychiatrique pour adultes, faute de place en structure médico-sociale. Durant cette période, elle a été :

      • Confinée dans une chambre d'isolement verrouillée plus de 20 heures par jour.

      • Déscolarisée.

      • Privée de soins somatiques essentiels (ex: soins dentaires). Cette situation, loin d'être un cas isolé, illustre les conséquences dramatiques du manque de solutions adaptées.

      Les Lacunes des Politiques Publiques Relatives au Handicap

      Éducation : Un Droit Garanti mais un Accès Difficile

      La loi de 2005 a permis une impulsion mais n'est toujours pas intégralement appliquée.

      Statistiques : 30% des saisines relatives aux droits de l'enfant concernent la scolarisation d'enfants en situation de handicap.

      Obstacles persistants : Inadaptation des locaux et du matériel, rigidité des programmes, formation insuffisante des professionnels.

      Accompagnement (AESH) : Malgré la création de postes, le manque persiste. La loi du 27 mai 2024 prévoyant la prise en charge de l'accompagnement sur le temps méridien par l'État est "très loin d'être effective" en raison de blocages entre les collectivités et les académies.

      Aménagements des examens : Une augmentation inquiétante des réclamations a été constatée en 2024 concernant des refus d'aménagement pour des élèves ou étudiants, parfois au prétexte paradoxal que leurs résultats scolaires étaient bons.

      Emploi : Premier Domaine de Discrimination

      L'emploi est le domaine où s'exercent le plus de discriminations liées au handicap. Sur les 1 249 réclamations de 2024, 21% concernent l'emploi privé et 24% l'emploi public. L'obligation d'emploi de 6% ne suffit pas à garantir l'égalité de traitement.

      Difficultés récurrentes :

      ◦ Aménagement tardif du poste de travail.    ◦ Non-respect par l'employeur des préconisations du médecin du travail.    ◦ Difficultés de maintien dans l'emploi, menant à des licenciements ou des démissions forcées.

      Accessibilité : Un Retard Important et Persistant

      L'accessibilité est une condition essentielle à la participation sociale et à la jouissance des droits.

      Transports : La loi a été modifiée pour ne concerner que les "points d'arrêt prioritaires", ce qui est jugé insuffisant.

      Logement : L'assouplissement des règles via la loi ELAN est une source d'inquiétude.

      Accessibilité numérique : La dématérialisation a des effets ambivalents. Selon l'ARCOM, peu de sites publics atteignent 50% d'accessibilité et seulement 5% sont totalement conformes. L'ordonnance de septembre 2023 renforçant les sanctions est saluée, mais elle doit s'accompagner du maintien d'accueils physiques accessibles.

      Aides à l'Autonomie : Insuffisantes et Inégales

      Vingt ans après la loi de 2005, le droit à la compensation du handicap présente des limites flagrantes.

      Barrière de l'âge : Une différence de traitement persiste selon que le handicap survient avant ou après 60 ans. La fusion des régimes, prévue pour 2010, n'a pas eu lieu.

      Prestation de Compensation du Handicap (PCH) :

      ◦ L'aide humaine est limitée aux besoins essentiels, excluant la vie sociale.    ◦ Les aides techniques sont sous-financées.    ◦ La PCH parentalité (2021) est critiquée pour ses critères restrictifs et son forfait inadapté.

      Conclusion et Recommandations Principales

      La Défenseure des droits conclut en réaffirmant l'engagement de son institution et formule plusieurs pistes d'action prioritaires :

      1. Application des textes existants : La première urgence, notamment pour le handicap, est "l'application pure et simple des textes votés par le parlement" et la publication des décrets d'application en attente.

      2. Priorité à la jeunesse : La santé mentale, grande cause nationale en 2025, doit se traduire par une "véritable prise de conscience collective", en particulier pour les jeunes qui ne peuvent être laissés sans réponse. L'investissement dans les CMP et le dépistage précoce est essentiel.

      3. Nécessité de données fiables : Il est impératif de collecter et d'agréger des données précises (ex: nombre d'heures de scolarisation effectives, nombre d'enfants en attente de place en IME) pour permettre un pilotage éclairé des politiques publiques.

      4. Formation des acteurs : Une meilleure formation des employeurs sur "l'aménagement raisonnable", des enseignants et des professionnels de santé est indispensable pour faire évoluer les pratiques.

      5. Abaisser la barrière d'âge de 60 ans : Il est nécessaire de mettre fin à cette distinction qui crée des inégalités de traitement injustifiées.

      6. Décloisonner les systèmes : Améliorer l'articulation entre les secteurs sanitaire, médico-social et éducatif est crucial pour assurer une fluidité des parcours et éviter les ruptures de prise en charge.

    1. Synthèse de l'Audition de Claire Hédon, Défenseure des droits

      L'audition de Claire Hédon, Défenseure des droits, a été l'occasion de présenter le rapport annuel d'activité de son institution, soulignant son rôle crucial dans la protection et la promotion des droits et libertés en France.

      Elle a exprimé une vive inquiétude face à la fragilisation des droits, exacerbée par un discours qui les présente comme des obstacles, ainsi que par la dématérialisation excessive des services publics et le désengagement de l'État dans les territoires.

      Deux alertes majeures ont été mises en lumière : l'ampleur croissante des discriminations et les défaillances de l'administration numérique, notamment pour les étrangers.

      1. Mission et Cadre d'Action de l'Institution

      La Défenseure des droits a rappelé que son institution, inscrite dans la Constitution, a pour mission de "veiller au respect des droits et des libertés". Ses cinq domaines de compétence sont :

      • La défense des droits des usagers dans leurs relations avec les services publics.
      • La défense et la promotion des droits de l'enfant et de l'intérêt supérieur de l'enfant.
      • La lutte contre les discriminations et la promotion de l'égalité.
      • Le respect de la déontologie des forces de sécurité.
      • L'information, l'orientation et la protection des lanceurs d'alerte.
      • L'institution, forte de 256 agents professionnels du droit et de 620 délégués bénévoles répartis sur tout le territoire, a traité près de 141 000 réclamations en 2024. Hédon a souligné l'efficacité de la médiation, avec 80 % des réclamations traitées par cette voie et les trois quarts aboutissant à un règlement amiable, permettant d'éviter la judiciarisation des conflits. Elle a insisté sur l'indépendance de son rôle, qui lui permet de "dire et d'obtenir des avancées" et de "faire émerger des sujets dans le débat public".

      2. Une Inquiétante Fragilisation des Droits

      Claire Hédon a exprimé une profonde inquiétude quant à la "fragilisation et l'éloignement des services publics liés à une dématérialisation excessive, un désengagement de l'État dans les territoires", ce qui "conduisent immanquablement à une fragilisation et à un éloignement des droits".

      Cette dynamique "mine l'effectivité des droits, génère d'ailleurs du ressentiment contre les institutions, génère aussi des tensions dans la société et abîme le sentiment d'appartenance à la République".

      3. L'Ampleur Croissante des Discriminations

      Les discriminations sont un "phénomène très préoccupant" dont l'ampleur "inquiète à la mesure d'ailleurs du non recours en la matière".

      Elle a fourni des chiffres éloquents :

      • 18 % de la population de 18 à 49 ans déclarait avoir été discriminée en 2020, contre 14 % en 2008.
      • Un jeune sur trois (18-34 ans) a été victime de discrimination, selon le 14ème baromètre avec l'OIT.
      • Hausse de 52 % du nombre de victimes de discrimination entre 2021 et 2022.

      • Malgré ces chiffres, les réclamations auprès de l'institution ont baissé de 15 % en 2024, ce qui interpelle la Défenseure. Le "non recours" s'explique par la "peur des représailles, le sentiment d'inutilité, le découragement, les difficultés à établir les faits [et la] méconnaissance des droits".

      Les discriminations liées à l'origine (cumulées avec nationalité, apparence physique et conviction religieuse) représentent 25 % des réclamations, avec un pic d'appels (+53%) en mai-juin 2024 pour des propos haineux.

      Claire Hédon regrette un "essoufflement des politiques publiques" et que la "non-discrimination comme objectif politique a pratiquement disparu du débat public et des discours des décideurs qui préfèrent parler de diversité de lutte contre les discours de haine".

      Elle a insisté sur l'importance de faire appliquer le droit existant plutôt que d'ajouter de nouveaux critères.

      Des exemples concrets de médiations réussies ont été cités, comme l'accès à un logement décent ou le non-renouvellement de CDD lié à un état de grossesse.

      4. Défaillances de la Dématérialisation et Droits des Étrangers

      • Le deuxième point d'alerte concerne les "atteintes aux droits causées par la fragilisation du service public dématérialisé", en particulier le cas de l'Administration Numérique des Étrangers en France (ANEF).

      Les réclamations concernant les relations avec les services publics représentent 90 % des saisines, dont 37 % pour les droits des étrangers en 2024 (contre 10 % en 2019 et un quart en 2023).

      La "multiplication des dysfonctionnements de l'administration numérique des étrangers en France prive trop souvent les personnes étrangères de la possibilité même de formuler une telle demande".

      Ces défaillances touchent majoritairement des personnes déjà intégrées, les plaçant en situation irrégulière et leur faisant perdre emploi et droits.

      Les recommandations incluent la reconnaissance du droit à un accès multicanal, la modification du téléservice pour permettre plusieurs démarches simultanées, la facilitation du renouvellement des attestations de prolongation d'instruction (API), et le renforcement des moyens humains dans les préfectures.

      Claire Hédon a réaffirmé que, contrairement aux dires du Ministre de l'Intérieur, les dysfonctionnements de l'ANEF n'ont pas diminué.

      5. Autres Domaines de Compétence

      • Droits de l'Enfant : Le rapport 2024 a porté sur le droit à un environnement sain.

      Les 3073 réclamations en 2024 alertent sur les difficultés scolaires des enfants handicapés (manque d'accompagnement), le problème des "lycéens sans lycée" (plus de 23 600 en 2024), et les "violences éducatives au sein d'établissements scolaires".

      La Défenseure a plaidé pour un contrôle renforcé des établissements et un suivi rigoureux des professionnels.

      • Déontologie des Forces de Sécurité : Sur 2434 saisines en 2024, des "défaillances dans le contrôle hiérarchique" et des "rapports incomplets, erronés ou minimisant les incidents" ont été constatés.

      Une décision de décembre 2024 sur la prise en charge d'une femme sous "soumission chimique" a mis en lumière la difficulté à distinguer cet état de l'alcoolisation, appelant à l'amélioration des techniques de détection et à la formation des forces de l'ordre.

      • Protection des Lanceurs d'Alerte : Le nombre de saisines a significativement augmenté, passant de 134 en 2022 à 519 en 2024 (soit une hausse de 70 % en 2024). Un pôle spécialisé a été créé. Les lanceurs d'alerte, souvent confrontés à des représailles, sont des "vigies de l'intérêt général".

      Les recommandations incluent l'amélioration de la communication autour du dispositif légal, le soutien financier et psychologique, et la réévaluation du périmètre des autorités externes de recueil des signalements (ex: inclure les ARS).

      6. Enjeu Prospectif : L'Intelligence Artificielle

      • L'IA est une "source de progrès indéniable mais aussi de menaces sur les droits et libertés", notamment par le biais d'algorithmes utilisés dans le recrutement, la gestion des ressources humaines, l'accès aux biens et services, et la lutte contre la fraude.

      Le recours au data mining dans la lutte contre la fraude présente des "risques de biais discriminatoires", touchant particulièrement les populations précaires.

      Un travail est en cours pour garantir une "action humaine" dans les procédures d'affectation scolaire (Parcoursup, Affelnet).

      7. Échanges avec les Parlementaires

      Les députés ont posé des questions variées, reflétant les préoccupations locales et nationales :

      • Antisémitisme : Bien que l'institution ne soit compétente que sur les discriminations et non les propos, Claire Hédon a noté une augmentation des actes antisémites et des appels au 3928 liés à des propos haineux en général. Elle a insisté sur l'absence de saisines spécifiques pour discrimination antisémite, mais un lien est fait avec le CRIF pour aborder les situations de harcèlement discriminatoire.
      • Discrimination liée à la grossesse : Claire Hédon a jugé la loi actuelle "très protectrice" et s'est dite "excessivement inquiète" du nombre de cas, notamment dans la fonction publique et le secteur privé, où des femmes sont poussées à la démission après un congé maternité.
      • Services Publics et Dématérialisation : Elle a réaffirmé que le problème n'est pas la dématérialisation en soi, mais le fait d'en faire la "seule porte d'entrée". Elle a appelé à la possibilité de déposer des dossiers papier et à renforcer le contact humain, saluant les initiatives comme la "pirogue France Service" en Guyane.
      • Droits des Étrangers : Les parlementaires ont confirmé les difficultés de leurs administrés. Claire Hédon a souligné que les atteintes aux droits des étrangers sont un "marqueur essentiel du niveau de protection plus généralement accordé aux droits et aux libertés dans notre pays". Elle a insisté sur la nécessité du renouvellement automatique des API pour désengorger les préfectures.
      • Discrimination des Seniors : Les chiffres du baromètre OIT montrent qu'un quart des seniors a subi une discrimination liée à l'âge, et un sur deux a connu des relations de travail dévalorisantes. Les seniors "non blancs" ou en mauvaise santé sont particulièrement touchés. Des recommandations incluent la sensibilisation, la formation des employeurs et l'anticipation des fins de carrière.
      • Refus de Soins Discriminatoires : Un rapport récent a mis en lumière l'ampleur du phénomène, avec des refus de rendez-vous et des minimisations de la douleur. Des "15 500 récits" de patients et soignants ont été reçus. Les recommandations visent à élaborer une stratégie nationale, faciliter les recours et prononcer des sanctions effectives.
      • Refus de Dépôt de Plainte : La persistance de refus de dépôt de plainte, particulièrement pour les femmes victimes de violence ou les personnes vulnérables (gens du voyage, handicapés), est une préoccupation.

      Bien que les délégués puissent intervenir, il est préférable d'éviter ce refus initial.

      • Accès à l'Éducation en Détention : L'institution est "excessivement inquiète" de la situation dans les établissements pour mineurs (EPM), notamment à Marseille, où les enfants sont confrontés à des heures d'éducation insuffisantes et un manque d'activités sportives. Les délégués de la Défenseure sont présents dans les lieux de détention et constatent ces entraves aux droits.
      • Statistiques par Sexe : Les statistiques sont disponibles et montrent des différences notables : les femmes saisissent majoritairement sur les droits des enfants, les hommes sur la déontologie des forces de sécurité.
      • En conclusion, Claire Hédon a rappelé que la défense des droits et libertés est une "nécessité pour les personnes concernées" et contribue à une "société plus apaisée et plus juste".

      Elle a souligné le rôle de son institution comme "pôle de stabilité et de permanence" dans un contexte où les droits sont fragiles, appelant à travailler sur l'effectivité du droit existant plutôt que d'en rajouter.

      Elle a enfin mis en garde contre le fait de "monter les populations les unes contre les autres", estimant que cela n'est bénéfique pour personne.

    1. AbstractBotryllus schlosseri (Tunicata) is a colonial chordate that has long been studied for its multiple developmental pathways and regenerative abilities and its genetically determined allorecognition system based on a polymorphic locus that controls chimerism and cell parasitism. We present the first chromosome-level genome assembly from an isogenic colony of B. schlosseri clade A1 using a mix of long and short reads scaf-folded using Hi-C. This haploid assembly spans 533 Mb, of which 96% are found in 16 chromosome-scale scaffolds. With a BUSCO completeness of 91.2%, this complete and contiguous B. schlosseri genome assembly provides a valuable genomic resource for the scientific community and lays the foundation for future investigations into the molecular mechanisms underlying coloniality, regeneration, histocompatibility, and the immune system in tunicates.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf097), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Tilman Schell

      Review of

      First chromosome-level genome assembly of the colonial chordate model Botryllus schlosseri (Tunicata)

      from

      Olivier De Thier, Marie Lebel, Mohammed M. Tawfeeq, Roland Faure, Philippe Dru, Simon Blanchoud, Alexandre Alié, Federico D. Brown, Jean-François Flot and Stefano Tiozzo

      Comments to the authors

      De Thier et al. present a high-quality chromosome scale de novo assembly of the tunicate Botryllus schlosseri from mainly PacBio HiFi and Arima Hi-C reads. Further WGS Illumina and ONT data was applied to resolve assembly errors or support the correctness of the assembly structure. Structural and functional annotations are conducted thoroughly. Downstream analyses include a synteny comparison of different Tunicata based on ancestral linkage groups and Hox genes.

      The manuscript is well written and methods are mostly described to ensure reproducibility. Despite the good shape of the manuscript, I would like to give some remarks, which should be addressed in a revised manuscript before publication.

      General remarks

      I like the quote in the beginning of the introduction.

      The authors conducted downstream analyses with different related tunicate genome assemblies on chromosome level. For assembly metrics, there is a comparison regarding BUSCO assessment only. I would point out the high quality of the B. schlosseri assembly in Table 2 and 4 by comparison with the other chromosome level and annotated tunicate genome assemblies as well.

      I am not an expert regarding tunicates, so please excuse my basic, curiosity driven question: In the results section "The laboratory model Sub-clade A1" you state that a part of COI is used as a barcode to differentiate ascidian species. In the introduction you state that wild colonies are able to fuse resulting in mixed genotypes. Since sample E derived from the wild at some point, it might be theoretically possible to have not only mixed nuclear genotypes but mixed mitotypes too. Depending on how old sample E is and how fast fixation of a mitotype can happen within a colony, this might be reflected in your data. Furthermore, this thought could be expanded to nuclear genotypes, which could hamper scientific findings.

      Contamination filtering was based on a sequence similarity search and taxonomic assignment of blobtools only. Despite blobtools/blobtoolkit was applied I was not able to find a blobplot in the supplemental files. I would like to encourage the authors to add blobplots before and after contamination filtering at least to the supplement. In my opinion, blobplots are most powerful when considering GC content and coverage in the first place - especially, when dealing with taxa, which are underrepresented in public databases. Therefore, using taxonomic assignment only for contamination filtering might generate false positives (e.g. conserved sequences across the tree of life with taxonomic assignment different than Chordata but with similar GC and coverage as the target) and false negatives (e.g. short sequences of the assembly, which couldn't be assigned with different GC and coverage as the target).

      In the paragraphs "Results and Discussion" (Haplotype-resolved assembly) as well as in "Methods" (Haploid genome assembly) you use the term "haploid assembly" multiple times. I find this term misleading, since the genome is not haploid and the assembly represents both haplotypes at the same time. I assume that primary contigs from hifiasm were used to generate this assembly. Therefore, I would suggest to e.g. call this assembly "based on primary contigs", "non phased", "haplotype mixed" or "haplotype unresolved" (as opposite to "haplotype resolved").

      Particular remarks

      Results and Discussion

      Sequencing and genome size estimation

      Table 1 Please specify what "round 1" and "round 2" are referring to. Was one library sequenced twice or were two different libraries created and sequenced?

      Haploid genome assembly

      "We identified 28 contigs that belong to spore-forming unicellular parasites of the microsporidia group [32]. This represents the first report of this fungal group in a tunicate species." Is this identification based on blobtools taxonomic assignment? This is not described in the methods. Furthermore, can you rule out that identification or taxonomic assignment is false positive? If not you should tune down the second sentence and maybe discuss this.

      "We then performed Hi-C scaffolding using YaHS [34], which reduced the number of contigs to 256, before [...]" Technically, scaffolding with yahs can only increase the number of contigs because original (hifiasm) contigs are split because of the Hi-C signal (at least as long the option --no-contig-ec isn't applied). I would substitute "contigs" with "sequences".

      "Finally, a manual curation was performed, resulting in an assembly made up of 16 major scaffolds [...]" Is there any previous study on the karyotype of B. schlosseri? If so, citing it here would strengthen your results. Otherwise, I would recommend to state the karyotypes or the number of chromosome scale scaffolds of other tunicates here and discuss, if your findings are in line.

      Table 2 Please substitute "No. of scaffolds" with "No. of sequences". Please add the contig N50 values. As pointed out above, I would like to see a comparison to the other chromosome level tunicate genome assemblies here, instead of showing basically the same stats twice.

      "[…] highlighted the presence of two large-scale genomic palindromes located within Bs1 and a smaller one in Bs3 (Figure 3)." The figure shows the presence but maybe you can highlight them in the figure and the caption even more?

      "To find out whether these palindromes may result from assembly artifacts [40], we checked the localization of the duplicated BUSCO genes along the chromosomes and did another run of CRAQ [...]" You could support your findings by showing an even coverage distribution within the palindromes, which is similar to the coverage distribution of whole assembly. Either as a histogram or a zoomed in version of the read coverage across reference as in the outer layer of the circos plot could show this nicely.

      Methods

      Sampling, DNA isolation, and sequencing

      "HiFi PacBio long reads" Please provide more details on how PacBio libraries (was it actually one library sequenced twice or two different libraries?) were created and sequenced. Were low or ultra-low protocols used? On which machine was sequencing conducted?

      RNA-seq data

      Is downloading public data a method? In any case you should cite the original papers and provide a list of accession numbers (supplement) but I would remove this paragraph and add the information to the paragraph "Genome annotation", e.g. "Public available RNA-seq reads [23, 25, 8] were aligned to the soft-masked assemblies [...]"

      Data preprocessing

      Depending on how the PacBio libraries were created and which PacBio machine was utilized for sequencing, you should state how HiFi calling was conducted (e.g. Sequel II) and how PCR adapter and duplicates were filtered out (e.g. ultra-low).

      Haploid genome assembly

      "To this aim, contigs were aligned to the NCBI nucleotide database (accessed 2023 March 18) using BLAST+ [78]" Please state the version of BLAST+.

      "Finally, a BLASTN search for fragments of the mitochondrial genome among the contigs was performed using the published complete mitochondrial genome of B. schlosseri (RefSeq NC_021463.1) [28]." Were the fragments filtered out based on the blast search? Please explain what was done in detail. Which hits were considered (e.g. cutoffs)? The mitochondrial genome of E was assembled with NOVOPlasty, which is by the way not stated in the methods but in the results only. Was the assembled mt genome of E added to the assembly, once the fragments were filtered out?

      Haplotype-resolved assembly

      If I understand correctly, the rapid curation pipeline was applied but no dual-curation was conducted. When aiming for haplotype-resolved assemblies, I would recommend to apply this method, e.g. concatenating both haplotypes and creating a combined contact map of haplotype 1 and 2, which can be curated as usual, with the advantage of being able to exchange (parts of) sequences between the haplotypes. In some cases phasing from hifiasm is not correct and can be easily corrected with this approach.

    1. AbstractBackground Reference genomes for the entire sea turtle clade have the potential to reveal the genetic basis of traits driving the ecological and phenotypic diversity in these ancient and iconic marine species. Furthermore, these genomic resources can support conservation efforts and deepen our understanding of their unique evolution.Results We present haplotype-resolved, chromosome-level reference genomes and high-quality gene annotations for five sea turtle species. This completes the catalog of reference genomes of the entire sea turtle clade when combined with our previously published reference genomes. Our analysis reveals remarkable genome synteny and collinearity across all species, despite the clade’s origin dating back more than 60 million years. Regions of high interspecific genetic distance and intraspecific genetic diversity are consistently clustered in genomic hotspots, which are enriched with genes coding for immune response proteins, olfactory receptors, zinc fingers, and G-protein-coupled receptors. These hotspot regions may offer insights into the genetic mechanisms driving phenotypic divergence among species, and represent areas of significant adaptive potential. Ancient demographic analysis revealed a synchronous population expansion among sea turtle species during the Pleistocene, with varying magnitudes of demographic change, likely shaped by their diverse ecological adaptations, and biogeographic contexts.Conclusions Our work provides genomic resources for exploring genetic diversity, evolutionary adaptations, and demographic histories of sea turtles. We outline genomic regions with increased diversity, linked to immune response, sensory evolution, and adaptation to varying environments that have historically been subject to strong diversifying selection, and likely will underpin sea turtle’s responses to future environmental change. These reference genomes can assist conservation by providing insights into the demographic and evolutionary processes that sustain and threaten these iconic species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf105), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 3: Xiaoli Liu

      (1)It is recommended to add keywords such as "conservation genomics" or "adaptive evolution" to better align with the content. (2)In the background section, after discussing the current status of sea turtles and existing genomic research, the study's content is introduced directly without adequately explaining why it is necessary to sequence the genomes of the remaining five species of sea turtles on top of the existing partial genomic data. The introduction of the research objectives appears somewhat abrupt. (3)Last line of page four"Previous analyses in particular of the……within this ancient clade [34,38]":When introducing the broad context of genomics and biodiversity conservation, it is important to provide detailed explanations for key concepts such as 'genomic synteny' and 'colinearity'. Although these concepts are covered later in the analysis of the turtle genome, providing initial elaboration can help readers better understand subsequent content. (4)Page 6 Section 2.2:The range of this quality value, 38.7, is incorrect. Please verify carefully. (5)Result 3.1:High conservation at the chromosomal level is supported, but repetitive sequences must be excluded from synteny analysis. (6)Section 3.4, Second Paragraph:The reliability of PSMC in low-diversity species, such as N. depressus, may be limited; it is recommended to validate findings with other methods, such as MSMC2. (7)It is recommended to include a detailed description of sample selection in the methods section, covering aspects such as geographic distribution, population size, and sample collection methods, to demonstrate the representativeness and reliability of the selected samples.

    2. AbstractBackground Reference genomes for the entire sea turtle clade have the potential to reveal the genetic basis of traits driving the ecological and phenotypic diversity in these ancient and iconic marine species. Furthermore, these genomic resources can support conservation efforts and deepen our understanding of their unique evolution.Results We present haplotype-resolved, chromosome-level reference genomes and high-quality gene annotations for five sea turtle species. This completes the catalog of reference genomes of the entire sea turtle clade when combined with our previously published reference genomes. Our analysis reveals remarkable genome synteny and collinearity across all species, despite the clade’s origin dating back more than 60 million years. Regions of high interspecific genetic distance and intraspecific genetic diversity are consistently clustered in genomic hotspots, which are enriched with genes coding for immune response proteins, olfactory receptors, zinc fingers, and G-protein-coupled receptors. These hotspot regions may offer insights into the genetic mechanisms driving phenotypic divergence among species, and represent areas of significant adaptive potential. Ancient demographic analysis revealed a synchronous population expansion among sea turtle species during the Pleistocene, with varying magnitudes of demographic change, likely shaped by their diverse ecological adaptations, and biogeographic contexts.Conclusions Our work provides genomic resources for exploring genetic diversity, evolutionary adaptations, and demographic histories of sea turtles. We outline genomic regions with increased diversity, linked to immune response, sensory evolution, and adaptation to varying environments that have historically been subject to strong diversifying selection, and likely will underpin sea turtle’s responses to future environmental change. These reference genomes can assist conservation by providing insights into the demographic and evolutionary processes that sustain and threaten these iconic species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf105), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Brendan Reid

      The authors of this work provide a fantastic addition to the genomic resources currently available for marine turtles with five new, apparently high-quality reference genomes. These new resources enable a number of interesting cross-species analyses in this group, including phylogenetic reconstruction, inference of demographic history, and identification of hotspots of diversity and divergence. I though this paper was quite clearly written and easy to read overall, and I have one major and a few more minor comments/suggestions.

      Major comment: there is an extensive literature on hybridization among marine turtle lineages (see Vilaca et al. 2021, https://doi.org/10.1111/mec.16113, for a recent genomic example), with lots of evidence for ancient gene flow after initial lineage divergence as well as recent hybridization. The authors do not really mention this phenomenon at all, and since I think it has a lot of bearing on all of the results it would make sense to re-think your findings in light of the fact that some level of gene flow has occurred. Would extensive synteny/lack of genomic rearrangements potentially enable hybridization? Is overall low divergence among lineages potentially a function of gene flow? Are regions of high divergence the result of selection (as you suggest), or could these regions potentially be resistant to gene flow? I believe that IQtree assumes a strictly bifurcating tree, and gene flow can influence PSMC inferences (see Mazet et al. 2016, https://doi.org/10.1038/hdy.2015.104) - how would gene flow among lineages affect your inference of divergence dates and demographic histories?

      MInor commentsL [note - line numbers would have been helpful for providing comments on specific items! I will refer to the lower-left page numbers and paragraph instead]:

      page 3, paragraph 2: Some of the applications you refer to here don't seem terribly germane to the relevance of "genomic resources" in management and conservation per se, and several are just methods using some kind of genetic data ... e.g., "abundance"/close-kin mark recapture doesn't require full genomes (and the reference you cite used microsat data), and the "community"/eDNA applications don't generally rely on genomes but instead on databases of a few (usually mitochondrial) genes. Either include methods that truly benefit from the development of high-quality reference genomes or broaden this to something like "growth in molecular ecology techniques".

      page 4, paragraph 2: last sentence is a bit of a run-on, could break this up a bit.

      page 10, paragraph 3: for me, the ROH methods need some additional explanation and interpretation. The more detailed methods indicate that the ROH were identified on the basis of lower-than-average heterozygosity rather than true homozygosity - I can understand why this might have been done (since the baseline level of heterozygosity varies across species) but it still seems a bit arbitrary and could risk mistaking stretches with simply low variation for IBD tracts. I wonder if a ROH-detection method like ROHan that explicitly incorporates baseline genomic heterozygosity into its model would be more appropriate for comparing results across species and could give different results. I also question a bit the interpretation of these low-diversity tracts as evidence of inbreeding per se. The authors do not comment much on the length distributions of these ROH - given that many of them are quite short I would expect that if there was mating between close kin it probably happened far back in the past and the IBD tracts have been broken up by recombination.

      page 11, paragraph 2: for PSMC analyses it is important to note the method assumes that differences in coalescence time/Ne across the genome result from demography alone. If portions of the genome are under balancing/diversifying selection (such as the areas of high diversity that you detect in this study), the local Ne for inferred these regions would be expected to be larger than the rest of the genome, which could lead to the spurious detection of population expansion or contraction (more likely a contraction for balancing selection). See Boitard et al. 2022 (https://doi.org/10.1093/genetics/iyac008) for a more detailed treatement. I would try excluding the regions putatively under diversifying selection and re-run PSMC to see if your inferences change.

    3. AbstractBackground Reference genomes for the entire sea turtle clade have the potential to reveal the genetic basis of traits driving the ecological and phenotypic diversity in these ancient and iconic marine species. Furthermore, these genomic resources can support conservation efforts and deepen our understanding of their unique evolution.Results We present haplotype-resolved, chromosome-level reference genomes and high-quality gene annotations for five sea turtle species. This completes the catalog of reference genomes of the entire sea turtle clade when combined with our previously published reference genomes. Our analysis reveals remarkable genome synteny and collinearity across all species, despite the clade’s origin dating back more than 60 million years. Regions of high interspecific genetic distance and intraspecific genetic diversity are consistently clustered in genomic hotspots, which are enriched with genes coding for immune response proteins, olfactory receptors, zinc fingers, and G-protein-coupled receptors. These hotspot regions may offer insights into the genetic mechanisms driving phenotypic divergence among species, and represent areas of significant adaptive potential. Ancient demographic analysis revealed a synchronous population expansion among sea turtle species during the Pleistocene, with varying magnitudes of demographic change, likely shaped by their diverse ecological adaptations, and biogeographic contexts.Conclusions Our work provides genomic resources for exploring genetic diversity, evolutionary adaptations, and demographic histories of sea turtles. We outline genomic regions with increased diversity, linked to immune response, sensory evolution, and adaptation to varying environments that have historically been subject to strong diversifying selection, and likely will underpin sea turtle’s responses to future environmental change. These reference genomes can assist conservation by providing insights into the demographic and evolutionary processes that sustain and threaten these iconic species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf105), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Laura Caquelin

      1. Summary of the Study The authors aimed to create high-quality reference genomes for five sea turtle species to better understand their genetic diversity, evolutionary adaptations, and ecological traits. They used haplotype-resolved, chromosome-level reference genomes and gene annotations to reveal conserved genome structures, genetic hotspots linked to immune response and sensory evolution, and patterns of demographic expansion. Their findings highlight areas of genetic diversity critical for adaptation and conservation efforts.

      2. Scope of reproducibility

      According to our assessment the primary objective is: Investigation of multi-copy gene family enrichment in genomic hotspots of sea turtles.

      • Outcome: Significant enrichment of "MHC", "Immunology-related", "G-Protein Coupled Receptor" (GPCR), "Olfactory Receptor" or "Zinc-Finger" in genomic hotspots with high genetic divergence, diversity, and gene density.
      • Analysis method outcome: Fisher's exact test followed by Benjamini-Hochberg correction
      • Main result: "Following functional annotation of the genes found in these hotspots, we found enrichment for multi-copy gene families coding for proteins with functions in immune response, olfactory receptors (ORs), zinc fingers, and G-protein-coupled receptors (GPCRs_ (Fig 4c, Tables S6 & S7). This included enrichment of immunology-related genes, GPCRs, ORs, and Zinc-finger genes in chromosome 13 (adjusted p < 10-42, 10-47, 10-79, 0.01, respectively), MHC genes, Immunology-related genes, GPCRs, ORs, and Zinc-finger genes in chromosome 14 (adjusted p < 10-24, 10-6, 10-2, 10-10, 10-52, respectively) and Immunology-related genes and GPCRs in chromosome 24 (adjusted p < 10-3 and 10-3, respectively)." (page 10).

      • Availability of Materials a. Data

      • Data availability: Open
      • Data completeness: Complete
      • Access Method: Repository
      • Repository: https://git.imp.fu-berlin.de/begendiv/sea_turtlegenomes
      • Data quality: The data files have been shared and appear sufficient for running the analyses. However, no metadata is provided to describe the content, structure, or origin of the files which limits interpretability and reusability. b. Code
      • Code availability: Open
      • Programming Language(s): R (for the enrichment test)
      • Repository link: https://git.imp.fu-berlin.de/begendiv/sea_turtlegenomes
      • License: MIT license
      • Repository status: Public
      • Documentation: Short README, describe only the presentation of the directory.

      • Computational environment of reproduction analysis

      • Operating system for reproduction: MacOS 14.7.4

      • Programming Language(s): R
      • Code implementation approach: Using shared code
      • Version environment for reproduction: R version 4.4.1/RStudio 2024.09.0

      • Results

      5.1 Original study results

      Results 1: The main results are presented in Figure 4 and the numerical p-values are available on supplementary table 6 and table 7.

      5.3 Steps for reproduction -> Run the code "enrichment_test.R" shared on Git - Issue 1: Files needed to run the code are not shared in the Git repository: "GCF_009764565.3_rDerCor1.pri.v4_genomic.longest.aa.tsv", "hotspots_chr13.longest.aa.tsv", "hotspots_chr14.longest.aa.tsv", "hotspots_chr24.longest.aa.tsv". -- Resolved: These analysis data are not shared in the internal Gigascience FTP server or the Git repository. After request, the authors uploaded all the files into the Git repository.

      5.4 Statistical comparison Original vs Reproduced results - Results: The table S6 and S7 was reproduced: -- Supplementary table S6: see screenshot from R console -- Supplementary table S7: see screenshot from R console

      • Comments: The original R code "enrichment_test.R" simply stored the p-values results in a value object. To simplify the comparison process, directly obtain the final table, and ensure reproducibility while minimizing errors, we implemented the creation of the table.

      ------------------ Start of R code ------------------ Creating final tables Corresponding to supplementary table S6 table_S6 <- data.frame( enrichment = c("MHC", "Immunology", "GPCR", "Olfactory", "Zinc-finger"), Chr13 = c(p_mhc13, p_immune13, p_gpcr13, p_or13, p_zinc13), Chr14 = c(p_mhc14, p_immune14, p_gpcr14, p_or14, p_zinc14), Chr24 = c(p_mhc24, p_immune24, p_gpcr24, p_or24, p_zinc24))

      Corresponding to supplementary table S7 Create a vector of names for rows and columns ( ! warning the pvalues in fdrs are not in the same order as the table S7) enrichment <- c("MHC", "Olfactory", "GPCR", "Immunology", "Zinc-finger") chromosomes <- c("Chr13", "Chr14", "Chr24")

      Reorganizing fdrs in a matrix table_S7 <- matrix(fdrs, nrow = length(enrichment), byrow = TRUE) rownames(table_S7) <- enrichment colnames(table_S7) <- chromosomes

      Organizing rows as the original table S7 library(dplyr) table_S7 <- as.data.frame(table_S7) # Convert matrix to data frame table_S7 <- table_S7 %>% slice(match(c("MHC", "Immunology", "GPCR", "Olfactory", "Zinc-finger"), enrichment)) ------------------- End of R code -------------------

      • Errors detected: The statement "MHC genes, Immunology-related genes, GPCRs, ORs, and Zinc-finger genes in chromosome 14 (adjusted p < 10^-24, 10^-6, 10^-2, 10^-10, 10^-52, respectively)" (page 10) appears to contain an error. Specifically, the p-value for Olfactory Receptors (5.583367e-10) is greater than the threshold of 10^-10, suggesting that this value should instead be below 10^-9. Therefore, the threshold for Olfactory Receptors should be revised to 10^-9.

      • Statistical Consistency: The p-values are consistent (see screenshot from R console).

      • Conclusion

      • Summary of the computational reproducibility review The inferential statistics for the objective "Investigation of multi-copy gene family enrichment in genomic hotspots of sea turtles" were successfully reproduced using the original analysis code provided by the authors. The input data needed to run the code were initially unavailable but were subsequently shared through the Git repository. An inconsistency was noted in the text of the manuscript reporting a threshold for Olfactory Receptors, where the stated 10^-10 should be revised to 10^-9 based on the observed p-value (5.583367e-10).

      • Recommendations for authors While the original analysis code was successfully used to reproduce the results, we recommend improving the documentation to enhance clarity and reproducibility. In particular: -- Code annotation: The scripts would benefit from more detailed comments within the code to clarify the logic of each step. This would greatly help users follow the analyses more easily and understand the purpose of specific commands or operations. -- README file: The current README provides only a general overview. We suggest expanding it to include: --- A brief description of each script or analysis pipeline. --- An indication of which figure, table, or result in the manuscript each script corresponds to. --- Clear instructions on how to execute the analyses in the correct order, if applicable. -- Metadata: For the datasets used or generated by the scripts, it would be helpful to include accompanying metadata files that explain: --- The definition of each variable name. --- The origin of each dataset (raw, processed, etc). --- Any preprocessing steps applied before analysis. -- Data availability: At this stage, we have only verified the reproducibility of one part of the study. To facilitate full reproducibility of the entire study, we recommend sharing all necessary data files required to run every script present in the repository.

      These improvements would make the repository significantly more user-friendly and would strengthen the reproducibility of the study.

    1. Synthèse de l'Audition de la CNAF sur la Santé Mentale et le Handicap

      Résumé

      L'audition de la Caisse Nationale des Allocations Familiales (CNAF) devant la commission d'enquête a mis en lumière son rôle substantiel, bien que souvent discret, dans la prise en charge du handicap et de la santé mentale en France.

      L'intervention de la CNAF s'articule autour de deux axes majeurs : le versement de prestations financières (légales et pour le compte de tiers) et le financement de services aux familles via sa politique d'action sociale.

      Les points critiques à retenir sont les suivants :

      1. Gestion de Prestations Financières Clés : La CNAF gère des allocations majeures telles que l'Allocation aux Adultes Handicapés (AAH), représentant 13,8 milliards d'euros pour 1,3 million de bénéficiaires en 2024, et l'Allocation d'Éducation de l'Enfant Handicapé (AEEH), s'élevant à 1,6 milliard d'euros pour 500 000 bénéficiaires.

      2. Impact de la Déconjugalisation de l'AAH : La réforme de la déconjugalisation, effective depuis novembre 2023, est un succès opérationnel.

      Elle a bénéficié à 66 000 allocataires avec une hausse moyenne de 400 € par mois, incluant 22 300 nouveaux bénéficiaires. Un système parallèle est maintenu pour 31 000 personnes afin d'éviter toute perte de droits.

      3. Promotion de l'Inclusion en Milieu Ordinaire : La CNAF promeut activement l'inclusion des enfants en situation de handicap dans les structures de droit commun (crèches, accueils de loisirs) via des "bonus inclusion handicap".

      Le succès de ce dispositif pour les accueils de loisirs est notable, avec des dépenses en 2024 ( 53 millions d'euros) dépassant déjà plus du double de l'objectif initial pour 2027.

      4. Santé Mentale des Jeunes : La CNAF finance 200 Points Accueil Écoute Jeunes (PAEJ), des structures de première ligne pour les adolescents en difficulté, et travaille à un accord-cadre avec le ministère de la Santé pour mieux articuler ces dispositifs au sein de l'écosystème de santé mentale.

      5. Relation avec les MDPH : Bien que des progrès significatifs aient été réalisés grâce à la dématérialisation des flux, les délais de traitement des dossiers par les Maisons Départementales des Personnes Handicapées (MDPH) demeurent un enjeu majeur, obligeant les CAF à mettre en place des procédures de prolongation de droits pour éviter les ruptures de versement.

      6. Complexité de l'AAH : Un axe d'amélioration majeur identifié est la simplification de l'AAH. Sa complexité actuelle, notamment pour les bénéficiaires qui travaillent, peut créer des freins à l'emploi et générer des situations d'incompréhension.

      La CNAF suggère d'intégrer cette prestation dans le mouvement global de modernisation et de simplification des aides sociales.

      Interventions de la CNAF : Un Double Axe d'Action

      La CNAF structure son action en faveur des personnes en situation de handicap et de leurs familles autour de deux piliers fondamentaux, inscrits dans sa Convention d'Objectifs et de Gestion (COG) 2023-2027 avec l'État.

      1. Versement de Prestations Légales

      La CNAF est l'opérateur de versement pour plusieurs prestations essentielles, certaines financées par la branche Famille, d'autres gérées pour le compte de l'État ou de la branche Autonomie. Ce rôle s'effectue en partenariat étroit avec les MDPH, qui sont en charge de l'évaluation médicale et de la détermination des taux d'incapacité.

      Prestation

      Description

      Chiffres Clés (2024)

      Allocation aux Adultes Handicapés (AAH)

      Assurer un revenu minimal aux personnes en situation de handicap de plus de 20 ans.

      13,8 milliards € versés à 1,3 million de bénéficiaires.

      Allocation d'Éducation de l'Enfant Handicapé (AEEH)

      Compenser les dépenses liées au handicap d'un enfant de moins de 20 ans.

      1,6 milliard € versés à 500 000 bénéficiaires.

      Allocation Journalière de Présence Parentale (AJPP)

      Compenser la perte de revenus pour un parent cessant son activité pour s'occuper d'un enfant malade ou handicapé.

      2 848 bénéficiaires (au 30 juin), coût de 261 millions €.

      Allocation Journalière du Proche Aidant (AJPA)

      Compenser la perte de revenus pour un proche aidant cessant son activité ponctuellement.

      1 652 bénéficiaires (au 30 juin), coût de 11 millions €.

      2. Financement de Services aux Familles via l'Action Sociale

      Le second pilier est financé par le Fonds National d'Action Sociale (FNAS) et vise à rendre les services de droit commun accessibles aux familles concernées par le handicap, promouvant ainsi une politique d'inclusion active.

      Inclusion dans la Petite Enfance (Crèches) :

      • ◦ Un bonus "inclusion handicap" majore le financement des crèches qui accueillent des enfants en situation de handicap.   
      • ◦ En 2023, 25 millions d'euros ont été dépensés à ce titre.  
      • ◦ Près de 50 % des crèches en France bénéficient de ce bonus, témoignant de son adoption massive.

      L'objectif est de favoriser une inclusion précoce pour fluidifier le parcours ultérieur, notamment la scolarisation.

      Inclusion dans les Accueils de Loisirs (Périscolaire et Extrascolaire) :

      • ◦ Généralisé en 2024, un bonus similaire existe pour les Accueils de Loisirs Sans Hébergement (ALSH).  
      • ◦ Le dispositif a rencontré un succès immédiat et supérieur aux prévisions : 53 millions d'euros ont été engagés en 2024, soit plus du double de l'estimation pour toute la durée de la COG (jusqu'en 2027).  
      • ◦ Cela indique une forte mobilisation des collectivités pour adapter leurs offres et garantir une continuité de prise en charge après l'école.

      Soutien au Répit Familial et aux Vacances :

      ◦ Les CAF mènent une politique active de soutien au départ en vacances, avec des dispositifs et financements spécifiques pour les familles concernées par le handicap (enfants ou parents).    ◦ Le dispositif VACAF permet de faire partir environ 500 000 personnes chaque année.    ◦ Des offres spécifiques (séjours passerelles) existent pour les familles nécessitant un accompagnement renforcé.

      Pôles de Ressources Handicap :

      ◦ La CNAF finance, à l'échelle départementale, des pôles de ressources visant à faciliter la connexion entre les familles et les structures d'accueil de droit commun (crèches, ALSH), levant ainsi les obstacles pratiques et informationnels.

      Enjeux et Réformes Clés

      La Déconjugalisation de l'Allocation aux Adultes Handicapés (AAH)

      Mise en œuvre le 1er novembre 2023, cette réforme très attendue visait à individualiser l'AAH sans tenir compte des revenus du conjoint.

      Un bilan quantitatif significatif :

      • 66 000 allocataires ont bénéficié de la réforme, avec une hausse moyenne de 400 € par mois.   
      • ◦ Parmi eux, 44 000 étaient déjà allocataires et ont vu leur AAH augmenter de 327 €/mois en moyenne. 
      • 22 300 nouvelles personnes, auparavant inéligibles à cause des revenus de leur conjoint, sont entrées dans le dispositif avec un gain mensuel moyen de 554 €.

      Une réforme "sans perdant" :

      • ◦ Pour les 31 000 personnes pour qui le nouveau mode de calcul aurait été désavantageux, l'ancien système est maintenu.  
      • ◦ Cela conduit à la coexistence de deux systèmes de calcul de l'AAH, qui perdurera plusieurs décennies.

      Articulation avec les MDPH : Fluidité et Prévention des Ruptures

      La qualité des échanges d'information avec les MDPH est cruciale pour le versement des prestations.

      Progrès et défis : La dématérialisation des flux a considérablement amélioré et sécurisé les échanges par rapport à la situation d'il y a cinq ans, où les flux papier étaient encore nombreux.

      Gestion des délais : Les délais d'instruction longs au sein des MDPH restent une difficulté majeure.

      Pour éviter les ruptures de droits, notamment lors des renouvellements, les CAF pratiquent la prolongation des droits en attendant la décision de la MDPH.

      Cette pratique, bien que créant un risque financier (génération d'indus si le droit n'est pas renouvelé), est jugée préférable pour ne pas précariser les familles.

      La Santé Mentale des Jeunes : Le Rôle des Points Accueil Écoute Jeunes (PAEJ)

      La CNAF a repris le financement des PAEJ, qui constituent une offre de première ligne pour les jeunes en difficulté psychologique.

      Un maillage territorial : 200 structures ont été financées en 2023 sur tout le territoire.

      Un rôle de pivot : Les PAEJ travaillent en réseau, en amont avec le milieu scolaire pour le repérage, et en aval en orientant vers des structures de soin (CMP, Maisons des Adolescents) lorsque nécessaire.

      Vers un cadre national : Des discussions sont en cours avec le ministère de la Santé pour établir un accord-cadre national afin de clarifier les rôles et d'assurer la complémentarité des dispositifs, notamment face au risque de désengagement de certains co-financeurs comme les Agences Régionales de Santé (ARS).

      Lutte contre les Erreurs et Gestion des Indûs

      La CNAF utilise un algorithme de "datamining" depuis 2011 pour cibler ses contrôles, non pas sur des populations, mais sur des risques d'erreur pouvant générer des versements indus.

      Logique du ciblage : Le système identifie les situations où le risque d'erreur déclarative est le plus élevé.

      Il s'agit principalement des prestations sensibles aux variations de revenus déclarées trimestriellement (RSA, Prime d'activité).

      Cas de l'AAH : Les bénéficiaires de l'AAH ne sont pas plus contrôlés en tant que tels. Le risque est plus élevé pour la population spécifique des bénéficiaires de l'AAH qui travaillent, en raison de la complexité des règles de cumul et de la variabilité des revenus à déclarer.

      Solution à la source : La réforme de la "solidarité à la source" est présentée comme la solution principale.

      En instaurant des déclarations pré-remplies pour le RSA et la Prime d'activité, elle vise à réduire drastiquement les erreurs à la base et, par conséquent, les contrôles a posteriori et les indus.

      L'extension de ce principe à la partie "activité" de l'AAH est une piste de réflexion.

      Perspectives et Axes d'Amélioration

      Interrogée sur les pistes d'amélioration du système, la CNAF a souligné plusieurs points :

      1. Moderniser et Simplifier l'AAH : L'AAH est décrite comme une prestation d'une "grande complexité", qui s'est "hybridée" avec la déconjugalisation (à la fois minimum social et prestation plus large).

      Cette complexité peut être un frein à l'emploi et fragiliser les bénéficiaires.

      La CNAF plaide pour que l'AAH soit intégrée au mouvement global de simplification des prestations sociales, afin d'améliorer la lisibilité et de ne pas décourager le travail.

      2. Reconnaître les Coûts de l'Inclusion : Une étude financée par la CNAF a mis en évidence les "coûts très importants de l'inclusion" scolaire, largement portés par les mères, avec des conséquences parfois lourdes sur leur vie professionnelle (jusqu'à l'arrêt de l'activité).

      Cet enjeu justifie les efforts financiers importants des CAF pour soutenir la prise en charge périscolaire.

      3. Renforcer le Soutien sur les Temps Périscolaires : La CNAF a intensifié son soutien financier aux ALSH et a étendu depuis 2024 son financement à la pause méridienne.

      L'effort financier est considérable : un enfant en situation de handicap en ALSH est financé par la CAF à hauteur de 4,50 € de l'heure, contre 0,60 € pour un autre enfant.

      Ce soutien est essentiel pour permettre le maintien dans l'emploi des parents.

    1. AbstractThe vast majority of cancers exhibit Somatic Copy Number Alterations (SCNAs)—gains and losses of variable regions of DNA. SCNAs can shape the phenotype of cancer cells, e.g. by increasing their proliferation rates, removing tumor suppressor genes, or immortalizing cells. While many SCNAs are unique to a patient, certain recurring patterns emerge as a result of shared selectional constraints or common mutational processes. To discover such patterns in a robust way, the size of the dataset is essential, which necessitates combining SCNA profiles from different cohorts, a non-trivial task.To achieve this, we developed CNSistent, a Python package for imputation, filtering, consistent segmentation, feature extraction, and visualization of cancer copy number profiles from heterogeneous datasets. We demonstrate the utility of CNSistent by applying it to the publicly available TCGA, PCAWG, and TRACERx cohorts. We compare different segmentation and aggregation strategies on cancer type and subtype classification tasks using deep convolutional neural networks. We demonstrate an increase in accuracy over training on individual cohorts and efficient transfer learning between cohorts. Using integrated gradients we investigate lung cancer classification results, highlighting SOX2 amplifications as the dominant copy number alteration in lung squamous cell carcinoma.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf104), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Stefano Monti

      This is a well-written paper that aims to develop a tool that can integrate SCNA from large datasets possibly generated using different platforms to identify alteration patterns that are often undetected in smaller data subsets. Authors have used CNN-based method for integrating the data, extracting features and predicting cancer types from SCNA profiles. The tool has the potential to significantly simplify the integration and analysis of large scale SCNA studies. However, some (hopefully addressable) weaknesses are noted:

      1. The choice of a classification task as the (only) way to evaluate the proposed method is questioned. I would argue that the most important use of SCNA detection is in support of mechanistic investigations, by identifying novel candidate loci likely to harbor tumor suppressors (copy losses) and oncogenes (copy gains). This type of analysis is hardly mentioned in the manuscript, and it is not clear how well the proposed tool would support it. I surmise it can, but the authors should discuss (and present results about) it.

      2. If we were to focus on the task of recurrent SCNA detection, then meta-analysis approaches (where separate analyses are performed on each of the datasets, and only the results are integrated) would need to be considered as an alternative to the approach here proposed (e.g., application of GISTIC to each of PCAWG, TCGA, TRACERx separately, followed by meta-analysis integration of the results). I am not saying meta-analysis would be superior, but the authors should discuss it, and possibly evaluate it.

      3. The reported metrics to quantify the quality of the integration are insufficient to assess the results. There is some lack of clarity about the classification accuracy results reported, since it is not clear whether all the components of the model building were adequately brought into the cross-validation (or train/test) loop. More specifically, when reporting the accuracy of the cancer type classification, it is reported that 1 megabase segmentation yields the best results. It is not clear if this size selection was performed within the train set only (and/or within the CV loop) or across the entire dataset. If the latter, this may significantly affect the accuracy results, which could not be deemed (unbiased) "test set" results. This should be clarified, and if the segment size selection was indeed performed outside the train/test split, accuracy measures should be computed again by performing the segment size selection properly (which of course it would mean a potentially different size would be selected for each of the folds).

      4. Comparisons with other methods: The authors only compare their method to random forest (RF). Related to the previous point: I presume the RF model used the segment size that was optimized for the CNN model (i.e., 1Mb). If this is the case, it would be an unfair comparison, since RF might favor a different size. Also, additional classifiers should be evaluated (e.g., Elastic Net, SVM, etc.).

      5. There is no sufficient discussion of existing tools/methods. This should be corrected (see also my comment about meta-analysis approaches).

      6. Metadata effects: Age influences the copy number alterations. The authors don't consider age or any other metadata and their implication in the classification task.

      7. Run time statistics and user requirement: While the authors report runtime curves per command (S Fig 6), it is difficult to translate this to total runtime. It would be useful if runtime for the entire training of a model were reported. Additionally, if available, comparison of run time stats with the established model that they cite would be useful.

      8. IG-based explanation. I found this section sort of perfunctory, not sufficiently justified, and adding little to the manuscript. IG is computationally expensive, and it does not provide any way to statistically quantify the found associations. Simpler methods, such as testing for association between SCNA occurrence and cancer type should be evaluated and compared to.

      9. Model selection: No adequate justification of why they picked CNN for this task when the referenced paper itself claims the DNN architecture performs better. Not sure but is this because of the varying segment size? Again, this is not clearly stated. https://pmc.ncbi.nlm.nih.gov/articles/PMC9203194/#tab1

    1. AbstractPolyadenylation is a dynamic process which is important in cellular physiology. Oxford Nanopore Technologies direct RNA-sequencing provides a strategy for sequencing the full-length RNA molecule and analysis of the transcriptome and epi-transcriptome. There are currently several tools available for poly(A) tail-length estimation, including well-established tools such as tailfindr and nanopolish, as well as two more recent deep learning models: Dorado and BoostNano. However, there has been limited benchmarking of the accuracy of these tools against gold-standard datasets. In this paper we evaluate four poly(A) estimation tools using synthetic RNA standards (Sequins), which have known poly(A) tail-lengths and provide a valuable approach to measuring the accuracy of poly(A) tail-length estimation. All four tools generate mean tail-length estimates which lie within 12% of the correct value. Overall, Dorado is recommended as the preferred approach due to its relatively fast run times, low coefficient of variation and ease of use with integration with base-calling.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf098), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Jesse Daniel Brown

      This manuscript addresses a relevant and timely question: benchmarking poly(A) tail-length estimation tools (BoostNano, tailfindr, nanopolish, and Dorado) using synthetic RNA standards (Sequins) with known tail lengths. Poly(A) tail-length estimation is increasingly important for understanding mRNA stability, processing, and regulation at the single-molecule level. As direct RNA sequencing expands in use, reliable methods to measure poly(A) tail lengths are needed. The study's desiign—leveraging Sequins as a "gold standard" to benchmark tools—is strong and fills an area is need in current literature. The analysis is thorough in its basic comparisons, and the results are likely to be useful to researchers who need to choose suitable software for poly(A) tail analysis. However, the manuscript would benefit from deeper contextualization, more rigorous statistical methodology, and clearer reporting of computational details. Ensuring reproducibility and providing clearer guidance on interpreting the results in real biological contexts would strengthen the mannuscript. The suggestions below are aimed at making the study more valuable to the community. For this reason, my recommendation is Revisions ARE Needed

      Introduction

      Abstract: ★★★★☆ (4/5) Actually in place of the introduction, it has it strengths: The introduction adequately outlines why polyadenylation is biologically important and why direct RNA sequencing provides a unique opportunity for poly(A) tail-length estimation. It justifies the use of Sequins as synthetic standards, which is a robust approach to derive ground-truth tail lengths.

      Areas for Improvement:The introduction could better connect poly(A) tail-length estimation to downstream applications. For instance, mention how accurate tail-length estimation could improve understanding of mRNA decay rates, translation efficiency, or isoform-specific regulation.

      Adding references that contextualize poly(A) tail dynamics in broader biological phenomena would help readers understand the significance. For example, it is almost a necessity to cite work such as "Roles of mRNA poly(A) tails in regulation of eukaryotic gene expression" by Lori A. Passmore & Jeff Coller (2022, Nature Reviews Molecular Cell Biology) which provides a comprehensive analysis of poly(A) tail dynamics and their impact on mRNA decay, stability, and translation regulation. P & C (2022) also expands on these principles by discussing the mechanistic underpinnings of poly(A)-mediated decay and translation regulation, making it a broader and more recent contribution to polyadenylation biology, which the authors should consider.

      Grammar of the abstract: Error: "There are currently several tools available for poly(A) tail-length estimation, including well-established tools such as tailfindr and nanopolish, as well as two more recent deep learning models: Dorado and BoostNano." Suggestion: "Several tools are currently available for poly(A) tail-length estimation, including well-established methods like tailfindr and nanopolish, as well as two more recent deep learning models: Dorado and BoostNano."

      Error: "which lie within 12% of the correct value." Suggestion: "that lie within 12% of the correct value."

      Clarify the library preparation steps to avoid confusion about the "direct" nature of RNA sequencing. The text currently implies that no reverse transcription is required, but then references an ONT Reverse Transcription Adapter. Distinguish between a full-length cDNA synthesis step (not required) and the use of a poly(T)-containing adapter for sequencing library preparation.

      Methods

      Methods: ★★★★☆ (4/5) The methods section has its strengths; the data sources and preparation (Sequins spiked into host RNA) are clearly described. Versions of tools are provided, enhancing reproducibility.

      Areas for Improvement are statistical analysis, comparisons and tests, hardware and computation details, and understanding of run time differences. Currently, the study models distributions as normal and uses mean and SD, but no normality tests or justification for these choices are presented. Consider performing normality tests or using nonparametric measures. Additionally, providing confidence intervals or other robust statistics (median, interquartile ranges) would clarify variability.

      For the comparisons and tests, the authors should explain why you chose root mean square error (RMSE) minimization and other metrics. Could alternative tests, like Wilcoxon signed-rank tests or paired t-tests (Wilcocoxon: this non-parametric test is suitable for paired comparisons when the assumption of normality is not met. -useful to compare the predicted tail lengths from each tool against the expected lengths, especially if the data distribution is skewed.), be used to compare the distribution of tail-length estimates more rigorously? Paired t-Test, because this test could be applied if the normality assumption holds, providing a straightforward way to assess whether the mean difference between predicted and expected values is statistically significant. (If so, justification should be provided for why or why not)

      There are some additional metrics to explore: ---Median Absolute Deviation (MAD): Consider adding MAD as it is robust to outliers and could complement RMSE to provide a better understanding of central tendencies and variability. ---Mean Absolute Error (MAE): MAE is another alternative that simplifies the interpretation by focusing solely on the magnitude of errors without squaring them, potentially offering more intuitive insights for readers. The authors should address testing for normality, explicitly stating whether normality tests were conducted on the data (e.g., Shapiro-Wilk or Kolmogorov-Smirnov tests). If normality is confirmed, justify the use of parametric tests like RMSE or t-tests. If not, justify why non-parametric tests (e.g., Wilcoxon) were not employed or discuss plans to include them in future studies.

      Explain the choice of statistical methods over time by discussing how the choice of statistical tests aligns with the study's goals. For example, emphasize whether the focus was on understanding overall error distribution, tool consistency, or accuracy in predicting specific tail lengths.

      The authors could use visual representations of error complementing the statistical tests with visual aids such as boxplots, violin plots, or Bland-Altman plots to illustrate the error distributions and discrepancies between predicted and actual tail lengths across tools.

      The authors should provide hardware and computational details like providing explicit details on the computational environment—CPU/GPU models, RAM, OS—for each tool's run. While the Git-hub read me suggests how to run the system, it lacks any details about system requirements. Readers need this to understand runtime differences and attempt to replicate performance measurements.

      The authors should consider tool parameterization and indicate if any specific parameters (beyond defaults) were used in tailfindr, nanopolish, Dorado, or BoostNano runs. If no changes were made from defaults, state this explicitly.

      Results

      The result's strengths are that they are presented clearly, showing density distributions and discussing short-tail anomalies. The identification of Dorado as a preferred tool due to speed, integration, and conservative filtering is well-supported by the data. The study acknowledges that all tools achieve broadly similar accuracy, differing mainly in runtime and filtering criteria, which is a practical insight for users.

      The results have areas for improvement: Regrading the short-tail reads explanation, the authors attribute short (<10 nt) poly(A) tails to truncated transcripts or mis-priming. For this reason, it is suggested that the authors strengthen this discussion with additional evidence or reasoning. For instance, is there a correlation between read quality and short-tail length estimates? Do truncated reads consistently align to internal A-rich stretches? Multiple peaks in distributions: Some density plots (Figure 1) show multiple peaks or shoulder peaks. Discuss potential reasons for these patterns. Are they related to tool-specific biases, read quality, or adapter/poly(T) truncation? Application Context: The results focus on method performance, but it would help readers to understand how these differences might influence downstream tasks. For example, if a method overestimates poly(A) length slightly, how could this affect conclusions about RNA stability or differential tail-length analysis between experimental conditions? Figures and tables: Figure 1: Clear density plots, but consider adding vertical lines at expected tail lengths (30 nt and 60 nt) to guide interpretation. Splitting the figure into separate panels for R1 and R2 or using insets might clarify multiple peaks. Figure 2: The IGV snapshots are informative. Enhance interpretability by adding annotations (arrows or boxes) highlighting truncated vs. full-length reads. Increase font sizes for readability. Figure 3: Useful comparison of reads filtered by Dorado but retained by BoostNano. Add a brief note or labeling to indicate expected tail lengths. Discuss possible reasons for Dorado's conservative filtering here or in the main text. Tables: Provide definitions for abbreviations (nt, CPU, GPU) in captions. For Table 2, adding confidence intervals around the mean tail-length estimates would strengthen statistical rigor. For Table 3, specify hardware details as recommended above.

      Grammar Mistakes and errors in the results section: Results Section: Sentence: "The four methods display a similar pattern in the density distribution, with a prominent normal-like peak near the expected poly(A) length, but also with a over-representation of shorter poly(A) tails, ranging at approximately ~0-10 nt (Figure 1)." Issue: "a over-representation" Correction: "an over-representation"

      Sentence: "We expected that these shorter peaks were derived from either fragmentation of the transcript, mis-priming of internal poly(A) stretches or degradation of the poly(A) tails." Issue: tense mismatch ("expected" vs. "were derived"). Correction: "We expect" -- "were derived", loses context and tense contformity-- therefore the sentence should be adjusted- "We hypothesize that these shorter peaks are derived from either fragmentation of the transcript, mis-priming of internal poly(A) stretches, or degradation of the poly(A) tails."

      Sentence: "Interestingly, upon investigating these earlier peaks, we found that Dorado excludes reads which are retained in the analysis by BoostNano, despite them being classified as passed reads (Figure 3)." Issue: Ambiguous pronoun "them." (them could incorrectly identify three possible targets in the sentence) Correction: "Interestingly, upon investigating these earlier peaks, we found that Dorado excludes reads retained in the analysis by BoostNano, even though these reads are classified as passed reads (Figure 3)."

      Sentence: "Therefore, Dorado appears to be a more conservative approach than BoostNano." Issue: No grammar issues, but the statement could be more precise. Suggested improvement: "Thus, Dorado demonstrates a more conservative approach compared to BoostNano."

      Sentence: "In order to determine which normal distribution fit the peak best, we found the parameters (mean, SD) which minimize the root mean square error between the candidate normal distribution and the density distribution for an interval of 10 nt to the right of the mode." Issue: Verb tense consistency ("fit"). Correction: "To determine which normal distribution fits the peak best, ..."

      Sentence: "The peaks also lose their normal-like behavior for larger values." Issue: Could use a more formal tone. Correction: "The peaks also deviate from their normal-like behavior at larger values."

      Sentence: "Next, we compared the computational time required by each method to predict the tail-length of 4000 reads." Issue: Hyphenation of "tail-length." Correction: "Next, we compared the computational time required by each method to predict the tail length of 4,000 reads."

      Sentence: "BoostNano also offers the option of using the Application Programming Interface (API) call instead of the direct method, which omits the file copy implemented in the direct approach, reducing the run time to 8 m 8 s." Here, the sentence is extremely overwritten which cuases a lack of clarity. Correction: "BoostNano offers an alternative API-based method, which skips the file copy step of the direct approach, reducing the runtime to 8 minutes and 8 seconds."

      Discussion

      Discussion: ★★★☆☆ (3/5) The discussion as its strengths as it correctly identifies that Dorado's advantages (speed, integration with basecalling) make it appealing as a default choice. The authors acknowledge that all tools are within a similar accuracy range, suggesting the deciding factor may be speed or integration rather than raw performance differences. HOWEVER- there are areas for improvement: Further dissect the limitations of each tool. For example, BoostNano shows good SD but slightly off mean for R1; what does this mean for its use cases? Address the discrepancy between tailfindr, nanopolish, and Dorado in terms of how they define and detect poly(A) boundaries. Why does Dorado not evaluate start/end positions of poly(A) tails in event space, and how might this influence results? Include a brief discussion about how results might generalize to more complex transcriptomes. Real samples have varying GC content, fragment lengths, and potentially modified bases. A short commentary acknowledging these factors would show awareness that synthetic standards cannot capture the full complexity of natural RNA opulations. For these reasons, it is suggested that the authors suggest future directions. For instance, how could tool developers incorporate these findings to improve their methods? Could future benchmarking sets include a gradient of tail lengths to better understand length-specific biases?

      Grammar Mistakes and errors in the discussion section: Sentence: "BoostNano and tailfindr tools provided estimation of the starting and ending positions of the poly(A) tails in event space while this information was absent in Dorado outputs." Issue: "provided estimation" should be "provide estimation" to align with present tense. Correction: "BoostNano and tailfindr tools provide estimation of the starting and ending positions of the poly(A) tails in event space, while this information is absent in Dorado outputs."

      Sentence: "On the R1 dataset, BoostNano showed a tighter distribution with the smallest SD, but its peak was the furthest from the correct value." The issue here is that the test results are still speaking about genneral truths leading to verb tense inconsistency; "showed" should match other verbs in the section. Correction: "On the R1 dataset, BoostNano shows a tighter distribution with the smallest SD, but its peak is the furthest from the correct value."

      Sentence: "tailfindr had the most accurate estimation but also the largest error interval."

      The issue here is the verb tense mismatch; "had" should be consistent with present tense to show truth, not past truth. Correction: "tailfindr has the most accurate estimation but also the largest error interval."

      Sentence: "Furthermore, Boostnano is more lenient in keeping reads for poly(A) estimation than Dorado."

      Issue: "Boostnano" capitalization error; it should be "BoostNano." Correction: "Furthermore, BoostNano is more lenient in keeping reads for poly(A) estimation than Dorado."

      Sentence: "Overall, our results suggest that the four tools investigated in this study - BoostNano, tailfindr, nanopolish and Dorado have similar performance with their accuracy varying from one dataset to the other, with a potential length bias."

      Issue: Missing commas for clarity; replace "with their accuracy varying from one dataset to the other" for conciseness. Correction: "Overall, our results suggest that the four tools investigated in this study—BoostNano, tailfindr, nanopolish, and Dorado—have similar performance, with accuracy varying across datasets and showing potential length bias."

      Sentence: "Therefore, we expect Dorado to be implemented as the default method of poly(A) tail estimation in the near future, with the rapid estimation timeframe, comparable estimation lengths to other tools, conservative nature and the added benefit of ease of obtaining this information during basecalling."

      There are several issues here including verbosity and lack of parallelism. Correction: "Therefore, we expect Dorado to be implemented as the default method for poly(A) tail estimation, given its rapid estimation timeframe, comparable accuracy to other tools, conservative nature, and ease of integration with basecalling."

      Sentence: "This work demonstrates the value of having access to synthetic RNA molecules with known poly(A) tail-lengths for validating the accuracy of poly(A) tail estimation algorithms."

      Issue: The phrase "validating the accuracy of" could be simplified for readability. Correction: "This work demonstrates the value of synthetic RNA molecules with known poly(A) tail lengths for validating poly(A) tail estimation algorithms."

      Sentence: "As methods improve, we anticipate that these datasets will be valuable for assessing improvements in estimation of poly(A) tails."

      Issue: "improvements in estimation of" is awkward. Correction: "As methods improve, we anticipate that these datasets will be valuable for assessing advancements in poly(A) tail estimation."

      References need to be added to accomodate the suggested material review, but existing references are good-

      NEEDS REVISION Jesse Daniel Brown PD AASU

      Note:

      I previously reviewed this paper previously in Research Hub and you can read these comments via the Research Hub review page here: https://www.researchhub.com/paper/8634403/using-synthetic-rna-to-benchmark-polya-length-inference-from-direct-rna-sequencing/reviews#threadId=55398.

      The original preprint linked to the Research Hub review is here: https://doi.org/10.1101/2024.10.25.620206

    2. AbstractPolyadenylation is a dynamic process which is important in cellular physiology. Oxford Nanopore Technologies direct RNA-sequencing provides a strategy for sequencing the full-length RNA molecule and analysis of the transcriptome and epi-transcriptome. There are currently several tools available for poly(A) tail-length estimation, including well-established tools such as tailfindr and nanopolish, as well as two more recent deep learning models: Dorado and BoostNano. However, there has been limited benchmarking of the accuracy of these tools against gold-standard datasets. In this paper we evaluate four poly(A) estimation tools using synthetic RNA standards (Sequins), which have known poly(A) tail-lengths and provide a valuable approach to measuring the accuracy of poly(A) tail-length estimation. All four tools generate mean tail-length estimates which lie within 12% of the correct value. Overall, Dorado is recommended as the preferred approach due to its relatively fast run times, low coefficient of variation and ease of use with integration with base-calling.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf098), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Christoph Dieterich

      In this manuscript, the authors present a benchmark to assess the performance of different tools designed for estimation of polyA tail length from Nanopore direct RNA-sequencing data. These tools include tailfindr, nanopolish, Dorado and Boost Nano. Benchmarks on tools and algorithms to analyze Nanopore data, both third party tools and official ONT releases, are of utmost importance for the field. The use of synthetic constructs with known ground truth is recommended as well. Consequently, this study has the potential to provide a significant contribution to the field.

      In the current form, I can however not recommend it for publication in GigaScience. My major concerns are: a) Use of only RNA002 data. This chemistry is outdated and thus the Benchmark is only relevant for old, possibly already published data. A comprehensive Benchmark should also include RNA004 and available tools there (at least Dorado). b) The current data set only contains two polyA tail length, which are relatively short and do not cover longer polyA tails that are common e.g. in mammalian cells. A proper Benchmark should show the performance of the analyzed tools over a range of polyA tail lengths.

      Minor comments: 1) Abstract: "All four tools generate mean tail-length estimates which lie within 13% of the correct value." The value of 13% is given in the Abstract from the submission system, wherease the abstract in the Main text says 12%. Which value is correct? 2) Background, first paragraph: the role of the polyA tail in RNA circularization, which is required for efficient translation of cellular mRNAs is not mentioned. Reference is missing for "is increasingly recognised as a dynamic process which influences timing and degree of protein production." 3) Background, second paragraph: Chiron seems to be a relatively old basecaller (no models for new chemistries). It should be mentioned here that it is required for BoostNano. 4) Mis-priming of internal polyA sites may an important confounding (and currently overlooked) source of errors in Nanopore sequencing. This should be quantified properly and analyzed in more detail (length of these stretches, influence of other nucleotides within the A-rich stretch, etc.). Should be done as well on whole transcriptome data with more possible mispriming sites. 5) Why do the authors think that the poly(T) stretch of the RTA might be truncated? This is composed of DNA oligos, which should be quite stable 6) What are the parameters for filtering used by Dorado and BoostNano? Can the authors explain, why the filtered reads differ? 7) Dorado seems to systematically underestimate polyA tail length. Is this true also for data generated with RNA004 chemistry and longer polyA tails?

    1. Synthèse du webinaire : Le programme EVARS, un outil indispensable pour la protection des enfants

      Résumé

      Ce document de synthèse résume les points clés du webinaire organisé par la FCPE nationale le 23 septembre 2025, consacré au programme d'Éducation à la Vie Affective, Relationnelle et à la Sexualité (EVARS).

      Entré en vigueur à la rentrée 2025, ce programme vise à garantir l'application effective de la loi Aubry de 2001, qui rendait obligatoire trois séances annuelles d'éducation à la sexualité, mais qui n'était appliquée que pour 15 % des élèves en 2024.

      Les intervenants — Marc Pelletier du Ministère de l'Éducation nationale, Sarah Durocher du Planning familial et l'animateur Didier Valentin — ont unanimement présenté le programme comme un enjeu nécessaire et indispensable pour la protection de l'enfance.

      Il répond aux missions fondamentales de l'École : promouvoir l'égalité, lutter contre les discriminations, enseigner le consentement et prévenir toutes les formes de violence.

      Le programme est également une réponse directe aux défis contemporains auxquels la jeunesse est confrontée, notamment l'exposition précoce à la pornographie, le harcèlement et les violences sexistes et sexuelles.

      Élaboré suite à un vaste processus consultatif et validé par le Conseil d'État, le programme repose sur trois principes directeurs : l'unité thématique, la progressivité stricte des contenus adaptés à l'âge, et la complémentarité avec les autres enseignements. Il est obligatoire et les parents ne peuvent y soustraire leurs enfants.

      La mise en œuvre s'appuie sur une formation massive des personnels de l'Éducation nationale et, dans le second degré, sur des interventions complémentaires d'associations agréées, toujours dans le cadre de projets co-construits avec les équipes pédagogiques.

      Face aux campagnes de désinformation, les intervenants ont insisté sur la nécessité d'une communication claire auprès des familles pour dissiper les malentendus et réaffirmer que l'objectif n'est pas d'enseigner des pratiques sexuelles, mais de construire une culture du respect, de l'égalité et du bien-être.

      Contexte et Justification du Programme EVARS

      Un Impératif Légal et une Nécessité Sociale

      Le programme EVARS a été conçu pour répondre à un déficit majeur dans l'application de la législation française.

      Bien que la loi Aubry de 2001 ait rendu l'éducation à la sexualité obligatoire à raison de trois séances par an, un constat alarmant a été dressé en 2024 : seuls 15 % des élèves en avaient réellement bénéficié.

      L'objectif principal du nouveau programme est donc de garantir l'effectivité de cette loi sur tout le territoire.

      Marc Pelletier, de la Direction générale de l'enseignement scolaire (DGESCO), a souligné que l'EVARS s'inscrit pleinement dans les missions fondamentales que la Nation confie à l'École, telles que définies dans le Code de l'éducation :

      Promouvoir l'égalité, notamment entre les femmes et les hommes.

      Lutter contre toutes les formes de discrimination, y compris celles fondées sur le sexe, l'identité de genre ou l'orientation sexuelle.

      Éduquer au principe du consentement et au respect du corps humain.

      Prévenir toutes les formes de violence, en particulier les violences sexistes et sexuelles, et contribuer au repérage des situations de violences intrafamiliales, y compris l'inceste.

      Répondre aux Enjeux Contemporains de la Jeunesse

      Le programme a été jugé indispensable pour outiller les enfants et les adolescents face aux réalités et aux risques de leur époque. Plusieurs statistiques alarmantes ont été citées pour justifier son déploiement :

      Enjeu

      Donnée Clé

      Exposition à la pornographie

      23 millions de mineurs y sont exposés chaque mois.

      Agressions sexuelles

      Un enfant est victime toutes les trois minutes en France.

      Violences sexuelles sur mineurs

      80 % des victimes sont des filles.

      Harcèlement scolaire

      Concerne 5 % des écoliers, 6 % des collégiens et 4 % des lycéens qui se trouvent dans une situation de vulnérabilité.

      Inceste

      160 000 enfants en sont victimes en France.

      Pour Sarah Durocher, présidente du Planning familial, l'un des principaux leviers pour contrer la désinformation massive à laquelle les jeunes sont exposés via Internet est une éducation structurée et fiable dispensée à l'école.

      Le Soutien des Fédérations de Parents et des Associations

      La FCPE, organisatrice du webinaire, a exprimé son soutien "avec force et convictions" au programme.

      Pour la fédération, l'EVARS est essentiel pour informer, prévenir, construire une société plus égalitaire, libérer la parole, donner des repères clairs, apprendre à dire non et comprendre la notion de consentement.

      La FCPE fait également partie du Collectif pour une véritable éducation à la sexualité, aux côtés du Planning familial et d'autres organisations, afin de parler d'une même voix et de fournir des outils concrets aux familles et aux établissements pour contrer la désinformation.

      Élaboration, Contenu et Principes Directeurs

      Un Processus de Création Consultatif et Validé

      Le programme EVARS n'a pas été créé de manière arbitraire. Son élaboration a suivi un processus rigoureux et consultatif :

      1. Groupe de travail (2023) : Mis en place pour analyser les raisons de la faible application de la loi de 2001.

      2. Saisine du Conseil Supérieur des Programmes (CSP) : Le ministre Pap Ndiaye a mandaté le CSP pour élaborer un projet de programme, avec une attention particulière à la distinction entre le premier et le second degré.

      3. Consultations : La DGESCO a mené de larges consultations sur la base du projet du CSP, incluant des professionnels de l'éducation, des organisations syndicales, des partenaires institutionnels et une consultation publique.

      4. Adoption (Janvier 2025) : Le projet a été adopté à l'unanimité des votants au sein des instances consultatives.

      5. Validation Juridique (Juin 2025) : Le Conseil d'État a rejeté deux recours administratifs demandant son annulation, confirmant ainsi sa conformité légale et son caractère "neutre et objectif".

      Trois Principes Fondamentaux

      Le programme est structuré autour de trois principes essentiels pour garantir sa cohérence et son adéquation.

      1. Unité : À tous les niveaux, l'enseignement s'articule autour de trois questions structurantes :

      • ◦ Comment se connaître, vivre et grandir ?  
      • ◦ Comment rencontrer les autres, construire avec eux des relations respectueuses et s'y épanouir ?  
      • ◦ Comment trouver sa place dans la société, y être libre et responsable ?

      2. Progressivité : Le principe le plus fondamental est l'adaptation stricte des contenus et des modalités à l'âge et à la maturité des élèves. Le nom même du programme change pour marquer cette distinction :

      • Premier degré (école) : Éducation à la Vie Affective et Relationnelle (EVAR).  
      • Second degré (collège/lycée) : Éducation à la Vie Affective, Relationnelle et à la Sexualité (EVARS).

      Le mot "sexualité" n'apparaît dans le programme qu'à partir de la classe de quatrième.

      3. Complémentarité : Les trois séances annuelles forment un parcours cohérent.

      L'EVARS est conçu pour compléter les enseignements disciplinaires (SVT, Enseignement Moral et Civique) et les actions éducatives globales de l'établissement (ex: programme de lutte contre le harcèlement).

      Une Approche Progressive et Adaptée à Chaque Âge

      Niveau

      Dénomination

      Thèmes Abordés

      Maternelle

      EVAR

      Émotions, identification des parties du corps, notion d'intimité, reconnaissance des adultes de confiance.

      Élémentaire (CP-CM2)

      EVAR

      Sentiments, stéréotypes de sexe, lutte contre les discriminations, consentement (abordé sans forcément nommer le terme), dangers d'Internet, harcèlement.

      Collège

      EVARS

      Changements liés à la puberté, vie privée, respect de l'intimité, sentiments amoureux, respect des différences, prévention des violences (sexuelles, emprise).

      Lycée

      EVARS

      Engagement dans une relation, droit d'être soi, acceptation et pression sociales, construction de relations saines à soi et aux autres.

      Il est crucial de noter que le terme "sexualité" est entendu dans un sens global, incluant les dimensions psychologiques, affectives, juridiques et sociales, et non comme un cours sur les pratiques sexuelles.

      Mise en Œuvre Pratique et Pédagogie

      Le Rôle Central des Personnels de l'Éducation Nationale

      Un effort de formation massif est en cours pour accompagner les équipes. Cela inclut des séminaires nationaux, des formateurs académiques, et des parcours de formation en ligne ("parcours magister") accessibles à tous les professeurs.

      N'importe quel professeur volontaire peut animer ces séances, pas uniquement les enseignants de SVT.

      Les personnels de santé scolaire (infirmières, psychologues) sont des acteurs clés.

      Leur connaissance des élèves permet d'adapter les séances aux problématiques locales.

      Des protocoles clairs existent pour l'accueil de la parole des enfants en cas de révélation de violences, garantissant que l'enseignant n'est jamais seul face à ces situations.

      L'Intervention des Associations Agréées

      Le recours à des partenaires extérieurs est encadré :

      Recommandé dans le second degré : Les interventions d'associations sont encouragées au collège et au lycée pour leur expertise complémentaire.

      Non prioritaire dans le premier degré : Le ministère préconise que les séances soient menées par les professeurs des écoles, intégrées au quotidien de la classe.

      Conditions strictes :

      ◦ L'association doit être agréée par le Ministère, un label garantissant son respect des valeurs de la République et la pertinence de son approche pédagogique.    ◦ L'intervention doit s'inscrire dans un projet pédagogique co-construit avec l'équipe de l'établissement.    ◦ Un professionnel de l'établissement doit toujours être présent pendant la séance.

      Le Planning familial, qui intervient auprès de 3600 établissements, a précisé refuser autant de demandes qu'il en accepte, illustrant la forte demande du terrain.

      Déroulement Type d'une Séance : L'Approche de Didier Valentin

      Didier Valentin a illustré la pédagogie active et non-jugeante utilisée lors des séances.

      Philosophie : "N'essayons pas de convaincre, tentons de faire réfléchir." L'objectif est la réduction des risques et le développement de l'esprit critique.

      Focus sur le "Relationnel" : Une grande partie du travail porte sur la manière dont les jeunes interagissent, se parlent et vivent ensemble, bien avant d'aborder la sexualité.

      Outils interactifs : Les séances ne sont pas des cours magistraux. Elles s'appuient sur des outils participatifs qui partent du vécu des jeunes :

      • Exemple 1 : Un tableau où les élèves collent des post-it sur les "avantages et inconvénients" d'être une fille, un garçon ou une personne non-binaire, pour lancer un débat sur les stéréotypes et l'empowerment.  
      • Exemple 2 : Diffusion de courtes vidéos vues sur les réseaux sociaux (TikTok) pour lancer un débat contradictoire et analyser les discours (ex: masculinistes).

      Questions des Parents et Lutte Contre la Désinformation

      Cadre Réglementaire et Communication

      Caractère obligatoire : Il a été rappelé que l'EVARS est un enseignement obligatoire. Un parent ne peut pas demander une dispense pour son enfant.

      Information des familles : Le Ministère recommande fortement que les établissements communiquent de manière transparente sur les objectifs du programme, par exemple lors des réunions de rentrée, afin de "dissiper les malentendus".

      Rôle des parents d'élèves : Les représentants des parents ont un rôle à jouer dans les instances comme le Comité d'Éducation à la Santé, à la Citoyenneté et à l'Environnement (CESCE) pour participer à l'élaboration du projet d'établissement.

      Répondre aux Inquiétudes et aux "Infox"

      Les intervenants ont reconnu l'existence d'une "panique morale" et de campagnes de désinformation actives. Sarah Durocher a mentionné que certains groupes tentent de se faire élire comme représentants de parents d'élèves dans le but de faire barrage au programme.

      Pour rassurer les familles, plusieurs points ont été martelés :

      Formation des intervenants : Les professionnels des associations sont formés (ex: 160 à 400 heures pour le Planning familial) et leur casier judiciaire est vérifié.

      Développement des compétences psycho-sociales : Le programme vise à renforcer les compétences émotionnelles, cognitives et relationnelles des élèves, qui sont des vecteurs de réussite scolaire et de bien-être.

      Une éducation féministe pour tous : Didier Valentin a résumé l'objectif comme une "éducation féministe" visant à déconstruire les stéréotypes de genre pour créer des relations plus égalitaires et, in fine, faire baisser les violences.

    1. Synthèse de l'Audition sur le Service Civique

      Résumé

      L'audition de la présidente de l'Agence du service civique met en lumière la dualité d'un dispositif de 15 ans, largement salué comme un "vrai succès" par la Cour des Comptes et plébiscité par les jeunes et les structures d'accueil, mais aujourd'hui menacé par des restrictions budgétaires drastiques.

      Avec plus de 868 000 participants depuis sa création, le Service Civique s'est imposé comme un outil majeur de cohésion sociale, de mixité et un tremplin d'insertion pour la jeunesse.

      Cependant, l'annulation de crédits pour 2025 réduit la cible de 150 000 à 135 000 jeunes, supprimant de fait 15 000 missions et fragilisant un écosystème associatif déjà sous tension.

      Les débats ont révélé un large consensus sur la pertinence du dispositif, mais aussi des inquiétudes profondes concernant son financement, les risques de substitution à l'emploi, les allégations de dévoiement idéologique et la tension structurelle entre sa vocation d'engagement citoyen et son rôle de facto dans l'insertion professionnelle.

      1. Le Service Civique : Bilan et Impact en Chiffres

      Créé par la loi du 10 mars 2010, le Service Civique est un dispositif d'engagement volontaire qui a démontré un impact significatif en 15 ans d'existence.

      Fondamentaux du Dispositif

      Public : Jeunes de 16 à 25 ans (jusqu'à 30 ans pour les jeunes en situation de handicap).

      Mission : Mission d'intérêt général auprès d'associations ou d'institutions publiques.

      Durée : Environ 6 mois, avec un maximum de 12 mois.

      Intensité : En 2023, la durée moyenne était de 7 mois avec une intensité hebdomadaire de 27 heures.

      Indemnisation : 620 € par mois.

      Bénéfices : Accompagnement, formation civique et citoyenne (incluant les premiers secours), couverture sociale complète et validation de trimestres de retraite de base.

      Bilan Quantitatif

      Total de participants : 868 000 jeunes ont réalisé une mission depuis 2010.

      Missions à l'étranger : 15 000 jeunes ont effectué leur mission à l'international.

      Volume annuel : Près de 90 000 nouvelles missions ont été engagées en 2023.

      Pour 2024, le chiffre s'élève à 86 431 entrées en mission, correspondant à l'atteinte de la cible annuelle (avant réduction) de 150 000 jeunes en service civique sur l'année.

      Taux d'occupation : 100 % des places disponibles sont occupées depuis 2023.

      Profil des Volontaires et Structures d'Accueil

      Le dispositif se caractérise par une forte mixité sociale et de parcours.

      Catégorie

      Données Clés

      Profil à l'entrée

      1/3 étudiants, 1/3 demandeurs d'emploi, 1/3 inactifs.

      Publics spécifiques

      3,3 % de jeunes en situation de handicap.

      14 % de jeunes issus des quartiers prioritaires de la ville (QPV).

      31 % de jeunes issus de la ruralité.

      Structures d'accueil

      62 % en associations.

      28 % dans l'État et ses opérateurs (ex: Ministère de l'Éducation Nationale).

      9 000 organismes d'accueil différents au total.

      Taux de Satisfaction et Impact

      Le Service Civique est un dispositif très connu et apprécié, tant par les volontaires que par les recruteurs.

      Notoriété : Plus de 9 jeunes sur 10 connaissent le dispositif.

      Satisfaction des volontaires : 85 % des jeunes ayant effectué une mission se déclarent satisfaits.

      Satisfaction des recruteurs : Près de 70 % portent un avis favorable.

      Impact sur le parcours :

      Professionnel : 73 % des jeunes déclarent avoir mobilisé leur expérience pour leur parcours professionnel un an après leur sortie.    ◦ Orientation : 63 % l'ont utilisée pour leur orientation ou réorientation.    ◦ Insertion : 80 % des jeunes sont en emploi ou en formation 6 mois après la fin de leur mission.

      Impact sur l'engagement : 56 % des jeunes poursuivent une activité bénévole après leur mission, contre 36 % avant d'y entrer.

      2. La Crise Budgétaire : Un Tournant pour le Dispositif

      La principale menace pesant sur le Service Civique est d'ordre budgétaire, remettant en cause le consensus politique et la trajectoire de croissance du dispositif.

      La Cible Historique de 150 000 Jeunes

      Depuis 2017, un consensus national s'est établi autour d'une cible de 150 000 jeunes en service civique sur l'année, ce qui correspond à environ 85 000 nouvelles entrées en mission par an, soit un peu plus de 10 % d'une classe d'âge.

      La loi de finances initiale pour 2025 prévoyait les moyens nécessaires pour atteindre cet objectif.

      L'Impact des Annulations de Crédits

      Annulation pour 2024 : Plus de 70 millions d'euros ont été annulés.

      Décret d'annulation pour 2025 : Un décret a ramené la cible à 135 000 jeunes sur l'année, supprimant de fait 15 000 missions.

      Conséquences sur la trésorerie : La trésorerie de l'Agence a été réduite d'une norme prudentielle d'un mois à 15 jours, puis à une hypothèse de 6 jours (9 millions d'euros) pour 2025.

      Gel supplémentaire ("surgel") : Un surgel a été appliqué, dont le dégel partiel est espéré par la ministre.

      Conséquences sur l'Écosystème

      La réduction du nombre de missions a un double effet :

      1. Pour les jeunes : 15 000 jeunes seront privés de cette opportunité, alors que la demande est déjà très forte (3 candidatures enregistrées pour 1 mission disponible).

      2. Pour les associations : Cette réduction fragilise le tissu associatif, qui accueille la majorité des volontaires et dépend de leur contribution.

      Plusieurs intervenants ont souligné que les associations, déjà confrontées à des baisses de subventions, verront leur capacité d'action et d'accueil diminuée.

      3. Thèmes Stratégiques et Initiatives Clés

      Malgré les difficultés budgétaires, l'Agence du service civique développe des axes stratégiques pour répondre aux priorités nationales et aux aspirations de la jeunesse.

      Les Nouvelles Priorités Thématiques

      Service Civique Écologique : Lancé en avril 2024 avec un objectif de 50 000 missions d'ici 2027. La première étape de 1 000 missions supplémentaires a été dépassée, témoignant d'un "réel engouement" de la part des jeunes et de l'écosystème.

      Service Civique Solidarité Senior : Développé dans le cadre du plan "bien vieillir" pour répondre aux enjeux de société liés au vieillissement.

      Lutte contre le harcèlement scolaire : 1 000 missions ont été dédiées à la prévention et à la lutte contre ce fléau en milieu scolaire, un exemple jugé "archétypal" d'une mission réussie où les jeunes complètent l'action des agents publics sans s'y substituer.

      Le Lien avec le Service National Universel (SNU)

      L'abandon de la généralisation du SNU a eu un impact. Il était anticipé qu'une généralisation aurait massivement augmenté la demande de Service Civique, portant la cible théorique à 25 % d'une classe d'âge.

      L'abandon de ce projet évite une amplification de la tension actuelle entre l'offre et la demande, mais la question du décalage reste "posée de manière cruelle".

      Le Déploiement dans les Collectivités Territoriales

      Le développement du Service Civique s'est historiquement appuyé sur des partenariats avec de grandes associations nationales.

      Le déploiement dans les collectivités territoriales reste un axe de progression : seules 192 intercommunalités sur 1254 disposent d'un agrément.

      Un travail a été engagé avec Intercommunalité de France pour faciliter l'accueil de volontaires au niveau local, notamment dans les petites communes.

      4. Controverses et Préoccupations Soulevées

      L'audition a été l'occasion pour les députés d'exprimer plusieurs critiques et inquiétudes majeures concernant le fonctionnement et la finalité du dispositif.

      Le Risque de Substitution à l'Emploi

      Préoccupation : Des députés (notamment du groupe Écologiste) craignent que le Service Civique ne soit utilisé pour remplacer de "vrais emplois", notamment dans les services publics (ex: missions d'accueil).

      Réponse de l'Agence : C'est une "préoccupation constante" et essentielle. Le Code du service national l'interdit. L'Agence contrôle en amont (agrément) et en aval (signalements). La présidente note que le risque de substitution est plus élevé dans le secteur sportif associatif que dans les services publics, où la satisfaction des jeunes est par ailleurs plus élevée.

      Allégations de Dévoiement et Questions de Neutralité

      Préoccupation : Le Rassemblement National, s'appuyant sur un article du Journal du Dimanche, a soulevé le risque de "dévoiement" du dispositif au profit de "structures exclusivement tournées vers l'aide aux migrants" ou d'"écoles privées musulmanes", questionnant le respect de la neutralité républicaine.

      Réponse de l'Agence : La présidente a fermement réfuté ces allégations, qualifiant l'article de "mal documenté". Elle précise que l'association La SIMAD n'a accueilli que deux volontaires depuis 2020 et que l'association La Plume Bleue n'en a jamais accueilli. Elle a rappelé que l'Agence travaille avec les cellules préfectorales de lutte contre l'islamisme radical (CLIR) pour renforcer les contrôles.

      Un Outil d'Insertion Professionnelle plutôt que d'Engagement Citoyen ?

      Préoccupation : Un député (groupe UDR) a avancé que le dispositif s'est transformé en "simple contrat jeune", servant davantage l'insertion professionnelle que l'engagement citoyen.

      Il s'appuie sur une étude de l'INJEP montrant une corrélation entre le taux de chômage des jeunes et le recours au Service Civique, ainsi que sur les fortes disparités territoriales (27,4 % de participation dans les DROM contre 9,5 % dans l'Hexagone).

      Réponse de l'Agence : La présidente reconnaît que les motivations professionnelles sont une évidence et que le dispositif est un "tremplin vers l'emploi".

      Elle insiste cependant sur le fait qu'il s'agit d'une expérience allant au-delà d'un "simple contrat", car elle offre une "expérience concrète des valeurs de la République" et vise à "humaniser le service public".

      Inclusivité et Accessibilité

      Préoccupation : Le faible taux de participation des jeunes en situation de handicap (3,3 %) a été souligné (groupe Liot).

      Réponse de l'Agence : Ce chiffre est jugé "certainement insuffisant" mais en progression (+1,5 point en 4 ans).

      La principale réponse pour améliorer l'accessibilité de tous les publics est de développer une offre "d'ultra-proximité" sur tout le territoire, afin de ne pas rendre un déménagement nécessaire.

      5. Citations Marquantes

      Sur le succès et la menace (Présidente de la commission) : "La Cour des comptes a souligné, je cite, que le service civique est un vrai succès malgré quelques fragilités.

      Ce constat est donc favorable aujourd'hui et menacé par certaines interrogations pour ne pas dire inquiétude sur le devenir de ce dispositif."

      Sur l'essence du dispositif (Priska Tevenot, Ensemble) : "S'engager et apprendre de soi, c'est ce qui distingue le volontariat en service civique du simple job étudiant. [...] Le service civique, c'est une école de l'engagement, une école de la vie."

      Sur la rigueur budgétaire (Florence Joubert, Rassemblement National) : "Ce dispositif mérite d'être soutenu à condition qu'il ne soit pas dévoyé.

      Car nous parlons tout de même d'un financement public de près de 600 millions d'euros par an."

      Sur la substitution à l'emploi (Sophie Tailler Paulian, Écologiste) : "Comment éviter que le service civique ne vienne finalement remplacer de vrais emplois et ne soit pas finalement aussi une sorte de sas [...] avant d'entrer dans un vrai emploi ?"

      Sur le sacrifice du Service Civique (Florence Erouin Léotet, Socialiste) : "C'est pourtant pour tenter de sauver ce dispositif [le SNU] en échec que l'on choisirait de sacrifier le service civique, un outil d'émancipation et de fraternité républicaine."

      Sur la confusion des genres (Maxime Michelet, UDR) : "Le service civique semble être parfois davantage un outil d'insertion professionnelle que d'engagement citoyen."

      Sur la finalité du dispositif (Présidente de l'Agence) : "La promesse [du Service Civique] n'est autre encore une fois que de faire l'expérience de l'intérêt général et de la cohésion républicaine, de la mixité sociale. Donc c'est une promesse effectivement supérieure à celle d'un simple contrat jeune."

      Sur la valeur ajoutée (Présidente de l'Agence) : "Il [le Service Civique] ne se substitue pas à l'emploi, aux agents publics, mais il humanise le service public. [...]

      C'est vraiment un des moteurs qui fait la différence entre l'engagement de service civique et une simple expérience professionnelle."

    1. Synthèse des Auditions de la Cour des Comptes : Enseignement Primaire et CVEC

      Résumé

      L'audition de la Cour des comptes à l'Assemblée nationale a mis en lumière des diagnostics critiques concernant deux piliers du système éducatif français : * l'enseignement primaire et * la Contribution de Vie Étudiante et de Campus (CVEC).

      Concernant l'enseignement primaire, le rapport dresse un "constat d'échec" de la politique publique.

      Malgré une dépense croissante (55 milliards d'euros en 2023, soit 2% du PIB), le niveau des élèves français est alarmant, se classant dernier de l'Union européenne en mathématiques en CM1.

      Le système aggrave les inégalités sociales et territoriales, avec une organisation du temps scolaire jugée "en décalage avec les besoins de l'enfant", notamment la semaine de 4 jours.

      La Cour préconise une refonte du modèle scolaire, incluant la systématisation des regroupements d'écoles, la réforme du statut des directeurs pour leur accorder plus d'autonomie, l'amélioration de l'attractivité du métier d'enseignant et une meilleure association des collectivités territoriales via des conventions triennales.

      Pour la CVEC, le bilan est contrasté. Depuis 2018, près de 900 millions d'euros ont été collectés, finançant des actions bénéfiques pour la vie étudiante (santé, culture, social).

      Cependant, le dispositif souffre d'un manque de transparence, d'une gestion complexe et d'une sous-utilisation notable des fonds, avec un reliquat de 100 millions d'euros. Le nombre d'étudiants assujettis n'est même pas connu précisément par le ministère.

      La Cour recommande de résorber les crédits inutilisés, de renforcer l'information et l'association des étudiants, de clarifier les règles de calcul de la contribution et d'assurer un suivi rigoureux de son utilisation, notamment par un rapport annuel au Parlement.

      I. Rapport sur l'Enseignement Primaire : Un Modèle à Réinventer

      Le rapport de la Cour des comptes sur les 6,3 millions d'élèves des 48 000 écoles françaises est le fruit d'une analyse nationale et territoriale approfondie.

      Il s'articule autour de quatre constats majeurs qui appellent à une refonte structurelle du système.

      1. Constat d'Échec : Baisse de Niveau et Aggravation des Inégalités

      La Cour qualifie sans équivoque la politique publique d'enseignement primaire d'« échec ». Les indicateurs de performance sont particulièrement préoccupants :

      Niveau Scolaire en Chute Libre : Malgré une dépense par élève en hausse, le niveau suit une tendance inverse.

      Mathématiques (CM1) : La France se classe dernière des 21 pays de l'Union européenne participant à l'enquête.    ◦ Français :

      Après une baisse continue depuis 2001, le niveau stagne, plaçant la France à l'antépénultième place des 18 pays de l'UE évalués.

      Explosion des Inégalités : L'école primaire non seulement reproduit mais "creuse les inégalités".

      Déterminisme Social : Une corrélation "très nette" existe entre les difficultés scolaires et l'origine sociale des parents.

      Les enfants de cadres améliorent leurs résultats, tandis que ceux des ouvriers voient les leurs diminuer.    ◦ Disparités Territoriales :

      Des inégalités aigües sont observées, notamment dans les académies ultramarines où, malgré un coût par écolier supérieur de 30%, le niveau des élèves est particulièrement bas.

      2. Organisation Inadaptée et Crise d'Attractivité du Métier

      L'organisation même de l'école est pointée du doigt comme étant déconnectée des besoins fondamentaux des élèves.

      Rythmes Scolaires : S'appuyant sur l'avis de l'Académie nationale de médecine, la Cour souligne que "l'organisation du temps scolaire n'apparaît pas prioritairement conçu en fonction des élèves".

      Le rapport met en évidence le "rôle néfaste de la semaine dite de 4 jours", une spécificité française au sein des pays de l'OCDE où le modèle dominant est la semaine de 5 jours.

      Crise du Recrutement des Enseignants : Le manque d'attractivité du métier est devenu structurel.

      Postes non pourvus : En 2024, 1 350 postes de professeurs des écoles n'ont pas été pourvus sur 10 270 offerts (près de 13%).

      Dans certaines académies comme Créteil et Versailles, il y a moins d'un candidat par poste.  

      Facteurs Multiples : Faible reconnaissance sociale, rémunération peu attractive en début de carrière, conditions de travail dégradées et carrières peu évolutives.

      3. Le Paradoxe d'une Dépense Croissante pour des Résultats Décevants

      Alors que les effectifs sont en forte baisse (prévision de 350 000 élèves en moins entre 2023 et 2028), la dépense publique pour l'école primaire ne cesse d'augmenter.

      Indicateur

      Données Clés

      Dépense Totale (2023)

      55 milliards d'euros (2% du PIB)

      Part de la Dépense Nationale d'Éducation

      29 %

      Croissance (2013-2022)

      +12 % (+6 milliards d'euros hors inflation)

      Répartition du Financement (hors pensions)

      État : ~20 milliards d'euros (2022)

      Collectivités territoriales : 19 milliards d'euros (2022)

      Cette augmentation continue, couplée à la dégradation des résultats, impose selon la Cour de s'interroger sur "l'efficience de la politique éducative".

      4. Recommandations pour une Refondation

      Face à ce diagnostic, la Cour formule plusieurs recommandations structurelles pour "repenser le modèle actuel de l'école" :

      Gouvernance :

      Statut du Directeur d'École : Engager une réforme pour généraliser progressivement la fonction de directeur à temps complet, en commençant par les écoles regroupées, afin de leur donner les leviers pour piloter le projet pédagogique.     ◦ Regroupement d'Écoles : Systématiser les regroupements pédagogiques dans les territoires en déclin démographique (18% des écoles comptent déjà une ou deux classes).     ◦ Partenariat avec les Collectivités : Établir des conventions triennales entre les services de l'Éducation nationale et les collectivités pour objectiver la politique éducative locale (carte scolaire, bâti, périscolaire).

      Attractivité et Formation des Enseignants :

      • Recrutement : Diversifier les viviers en ouvrant plus de postes au 3ème concours et permettre des recrutements sur contrats de moyen terme dans les académies en tension. 

      • Formation Continue : Assurer le remplacement systématique des enseignants en formation, favoriser les formations en équipe de proximité et mieux utiliser les crédits budgétaires alloués (60% non consommés en 2022).

      Pédagogie et Bien-être :

      • Centrer sur l'Élève : Faire du bien-être une priorité, en améliorant la cohérence entre les temps scolaire, périscolaire et extrascolaire.  
      • Numérique : Mieux intégrer le numérique comme outil pédagogique, en renforçant la formation des enseignants.  
      • Transition Écologique : Adapter le bâti scolaire, dont 52% présente des risques climatiques de grande ampleur (canicules, inondations).

      II. Rapport sur la Contribution de Vie Étudiante et de Campus (CVEC) : Entre Utilité et Opacité

      Issu d'une saisine citoyenne, le rapport sur la CVEC analyse l'utilisation d'une contribution qui a généré près de 900 millions d'euros depuis sa création en 2018.

      1. Un Dispositif Utile mais Perfectible

      La CVEC a permis de financer des actions diversifiées qui ont contribué à améliorer la vie étudiante : services d'écoute psychologique, épiceries solidaires, ateliers sportifs et culturels, aide à l'équipement numérique. Paradoxalement, sa création s'est accompagnée d'un gain de pouvoir d'achat pour la majorité des étudiants, car elle a remplacé la cotisation à la sécurité sociale étudiante, bien plus élevée (217 € en 2017-2018).

      2. Principaux Enseignements et Dysfonctionnements

      L'enquête de la Cour met en évidence six points critiques :

      1. Sous-utilisation des Fonds : Environ 100 millions d'euros sur les 900 millions collectés entre 2018 et 2024 n'avaient pas été dépensés à la date de l'enquête.

      2. Gestion Complexe : Le dispositif est jugé complexe, avec une redistribution par péréquation.

      De plus, une sous-évaluation des plafonds a conduit à des reversements de 14 millions d'euros au budget général de l'État.

      3. Augmentation du Montant : Le montant est passé de 90 € en 2018 à 105 € pour la rentrée 2024, soit une hausse de plus de 16%, sans que les modalités de calcul soient clairement définies pour en maîtriser la progression.

      4. Recouvrement Imprécis : Ni le ministère, ni le réseau des œuvres universitaires ne connaissent le nombre précis d'étudiants assujettis, empêchant de vérifier que tous ceux qui le doivent paient la contribution.

      5. Manque de Cadrage : Il n'existe pas de définition claire de la "vie étudiante", ce qui nuit à la cohérence des dépenses.

      Les seuils d'affectation (30% pour les projets étudiants et le social, 15% pour la médecine préventive) ne sont pas uniformément appliqués.

      6. Manque de Transparence : Les étudiants ont une connaissance "extrêmement limitée" de l'utilisation de la CVEC.

      L'information du Parlement est également jugée insuffisante.

      3. Recommandations de la Cour des Comptes

      Pour remédier à ces faiblesses, la Cour émet cinq recommandations principales :

      1. Résorber les reliquats de crédits inutilisés d'ici 2026.

      2. Préciser la méthode d'indexation de la CVEC sur l'inflation, en prévoyant un mécanisme de plafonnement de la hausse.

      3. Mettre en place des outils pour s'assurer du complet recouvrement de la taxe.

      4. Accroître le financement des projets pour les étudiants inscrits dans des établissements non bénéficiaires.

      5. Renforcer l'information des étudiants et transmettre au Parlement un rapport annuel détaillé sur l'utilisation de la CVEC.

      La Cour souligne enfin que la CVEC "ne pouvait à elle seule répondre à tous les besoins des étudiants", qui relèvent de politiques publiques de plus grande ampleur (logement, santé, précarité).

      III. Perspectives et Débats Parlementaires

      Les interventions des députés ont reflété une large adhésion aux constats de la Cour, tout en soulignant des points de divergence sur les solutions et des préoccupations politiques spécifiques.

      Sur l'enseignement primaire, un consensus s'est dégagé sur la gravité de la situation.

      Les députés ont interrogé la Cour sur les leviers prioritaires à actionner, le bien-fondé du retour à la semaine de 4,5 jours, la nécessité de se concentrer sur les savoirs fondamentaux, et le besoin d'une plus grande autonomie pour les établissements.

      Sur la CVEC, les critiques ont été vives concernant le manque de transparence, les fonds non utilisés et l'augmentation de son montant.

      Plusieurs groupes (Écologiste, LFI-NFP, GDR) ont qualifié la CVEC de "taxe injuste" et appelé à sa suppression au profit d'un financement direct par l'État.

      Le groupe RN a également dénoncé le financement présumé "d'événements à caractère politique et communautaire".

      En réponse, la Cour a insisté sur quatre pistes pour améliorer la transparence de la CVEC : une meilleure association des étudiants aux commissions de décision (en visant un quota de 50%), une communication plus large sur les projets financés (via des portails en ligne, des "ambassadeurs CVEC"), une harmonisation des bilans financiers des établissements, et une clarification des dispositifs pour éviter les doublons.

    1. Synthèse de la Mission Flash sur l'Accompagnement à l'Orientation des Élèves

      Synthèse

      Ce document de synthèse présente les conclusions de la mission flash sur l'évaluation de l'accompagnement des élèves à la découverte des métiers et à l'orientation, menée par les rapporteurs Arnaud Bonet et Laurent Croisier.

      Après quatre mois de travaux et plus de 24 auditions, le rapport dresse le constat d'un système d'orientation perçu comme un "chantier perpétuel" et un "chemin escarpé", source d'angoisse pour les élèves, les familles et les équipes éducatives, en raison de l'absence d'une stratégie nationale claire et de la succession de réformes.

      Les conclusions s'articulent autour de cinq axes majeurs :

      1. Un parcours d'orientation continu : L'orientation doit être un processus de long terme, débutant dès l'école primaire pour déconstruire les stéréotypes et s'étendant tout au long de la scolarité, en impliquant étroitement les familles.

      2. Un accompagnement individualisé : La mise en place d'un référent orientation issu du corps enseignant dans chaque établissement est jugée indispensable, tout comme la création d'un droit effectif à la réorientation et la valorisation des compétences non académiques.

      3. La lutte contre les inégalités : Le rapport souligne que l'orientation reste fortement déterminée socialement et propose des mesures pour combattre l'autocensure, revaloriser la voie professionnelle et mieux accompagner les élèves en situation de handicap et ultramarins.

      4. La mobilisation des moyens : Des investissements significatifs sont nécessaires, notamment pour la formation certifiante des enseignants, le financement d'heures dédiées à l'orientation et la révision de la carte des Centres d'Information et d'Orientation (CIO).

      5. Une coordination renforcée des acteurs : Face aux tensions et à la confusion nées du partage de compétences entre l'État et les Régions depuis 2018, le rapport préconise une clarification des rôles et une meilleure articulation des actions pour offrir un parcours plus cohérent aux élèves.

      Au total, 45 pistes d'amélioration sont proposées pour transformer l'orientation d'un parcours subi en un levier d'égalité des chances et d'émancipation, permettant à chaque jeune de construire un avenir choisi.

      Analyse Détaillée des Conclusions du Rapport

      1. Constat Général : Un Parcours d'Orientation Fragmenté et Anxiogène

      Les rapporteurs ouvrent leur analyse en qualifiant l'orientation de "chantier perpétuel" et de "chemin escarpé et redouté".

      Ce système est marqué par une succession de réformes qui, faute d'une véritable stratégie nationale, ont abouti à une fragmentation des actions.

      L'orientation est trop souvent vécue comme une série de décisions ponctuelles et anxiogènes plutôt que comme un processus continu et réfléchi.

      2. Axe 1 : Pour un Continuum d'Orientation de l'École Primaire au Lycée

      Pour remédier à cette fragmentation, le rapport insiste sur la nécessité de concevoir l'orientation comme un processus s'inscrivant dans la durée.

      Découverte des métiers dès le primaire : Il est proposé d'anticiper la démarche de découverte des métiers dès l'école primaire.

      L'objectif n'est pas d'orienter précocement les élèves, mais d'élargir leurs horizons et de "déconstruire les représentations conduisant à l'autocensure", car "la construction des stéréotypes n'attend pas la classe de 5e".

      Implication des familles : Considérant que les parents sont les "premiers prescripteurs de l'orientation", le rapport préconise d'instaurer un dialogue régulier entre les familles et les équipes éducatives, avec un premier temps d'échange formel dès la classe de 5e.

      Transparence de l'information :

      ◦ Face à une information abondante mais parfois "paralysante", le rôle de l'ONISEP comme acteur de référence est salué.

      La nouvelle plateforme "Avenir(s)", déployée depuis décembre 2023, a vocation à devenir l'outil central pour l'accompagnement de la 5e à la terminale.

      Son adoption reste cependant un défi, avec 86 000 élèves connectés au 30 mai 2024, pour un objectif initial de 200 000.    *  ◦ Une alerte est lancée sur les intitulés des diplômes et des formations, jugés souvent sources de confusion.

      Parcoursup : La plateforme est décrite comme "complexe, opaque et anxiogène". Les rapporteurs recommandent :

      • ◦ D'inscrire dans la loi l'obligation de transparence des algorithmes (déjà publics).  

      • ◦ De rendre publics et clairement formulés les critères de sélection des commissions de vœux.  

      • ◦ L'un des rapporteurs recommande de "rechercher une alternative crédible à Parcoursup" pour garantir un accueil inconditionnel dans les filières universitaires non sélectives.

      Réforme des stages :

      • ◦ Pour le stage de 3e, il est proposé de permettre de le scinder en plusieurs expériences courtes pour découvrir un panel de métiers plus varié et lutter contre la reproduction des inégalités sociales.  

      • ◦ Pour le stage de 2de, il est proposé de supprimer son caractère obligatoire pour en faire un "espace de découverte et d'approfondissement d'un projet personnel".  

      • ◦ La diffusion du "job shadowing" (suivi d'un professionnel pendant une journée) est également recommandée.

      3. Axe 2 : La Nécessité d'un Accompagnement Personnalisé

      L'aide individualisée à l'orientation, bien que prévue dans les textes, n'est pas toujours effective.

      Trois pistes sont avancées :

      Un référent orientation dans chaque établissement : La nomination d'un "référent pour l'orientation et la découverte des métiers" est préconisée dans chaque établissement, y compris dans les lycées généraux et technologiques.

      Ce rôle devrait être confié à un personnel enseignant, et non à un psychologue de l'Éducation nationale (Psy-EN), pour plusieurs raisons :

      • ◦ Les enseignants sont au contact quotidien de l'ensemble des élèves.  

      • ◦ Les Psy-EN sont en nombre insuffisant (ratio estimé à 1 pour 1200 à 1300 élèves).    ◦

      Les Psy-EN partagent leur temps entre plusieurs établissements et leurs missions sont désormais majoritairement centrées sur le suivi psychologique.

      Un droit effectif à la réorientation : Les parcours scolaires sont jugés "trop rigides".

      Le rapport appelle à un "véritable droit à la réorientation", perçu non comme un échec mais comme une opportunité, en créant des passerelles effectives entre les différentes voies.

      Valorisation des compétences non académiques : Le rapport insiste sur la nécessité de repérer et de mettre en valeur les compétences et ressources des élèves, y compris ceux en difficulté scolaire.

      4. Axe 3 : Lutter Contre les Déterminismes et les Inégalités

      L'orientation scolaire reste "très largement socialement déterminée". Le rapport cible cinq champs d'action :

      Combattre l'autocensure : Encourager les mécanismes d'inspiration par les pairs ("rôles modèles") en mobilisant d'anciens élèves, des étudiants ou de jeunes professionnels.

      Impliquer toutes les familles : Organiser des événements sur l'orientation dans des tiers-lieux (maisons de quartier, mairies) pour toucher les familles les plus éloignées de l'école.

      Revaloriser la voie professionnelle : Pour lutter contre la perception de la voie professionnelle comme un "choix par défaut" et une "orientation subie", il est proposé d'inciter à la création de lycées polyvalents et d'expérimenter des classes mixtes en seconde (générale, technologique et professionnelle) autour d'un tronc commun.

      Élèves en situation de handicap :

      • ◦ Garantir un accès prioritaire à l'internat. 
      • ◦ Automatiser la transmission des informations sur les aménagements de scolarité entre établissements (avec accord de la famille).

      Néobacheliers ultramarins :

      • ◦ Augmenter le montant de l'aide "Parcours" (actuellement 500 €).  
      • ◦ Rehausser le plafond fiscal (actuellement environ 27 000 €) du "Passeport pour la mobilité des études".

      5. Axe 4 : Moyens Humains et Budgétaires à Mobiliser

      L'atteinte des objectifs nécessite des moyens concrets.

      Formation des personnels : Mettre en place une formation obligatoire et certifiante à l'orientation pour les enseignants, tant en formation initiale (INSPÉ) que continue.

      Financement des heures dédiées : Les volumes horaires prévus (12h en 4e, 36h en 3e, 54h au lycée) sont souvent indicatifs et non financés.

      Le rapport demande que ces heures soient intégrées à l'emploi du temps et que le référent orientation bénéficie d'une décharge horaire sur ses obligations de service, plutôt qu'une simple indemnité via le "Pacte enseignant".

      Rôle des Psy-EN et carte des CIO :

      • ◦ Mettre à jour le Code de l'éducation qui mentionne encore les "conseillers d'orientation-psychologues", un corps abrogé en 2017.  
      • ◦ Formaliser par convention la mission d'appui des Psy-EN aux enseignants.   
      • ◦ Revoir la carte des 411 CIO, dont le nombre a été réduit d'un quart en dix ans, afin de garantir qu'aucun élève ne soit à plus de 45 minutes en transport en commun d'un centre.

      6. Axe 5 : Améliorer la Coordination entre les Acteurs

      La loi de 2018 confiant l'information sur les métiers aux Régions a créé une source de "confusion" et de "tension" avec l'État, responsable du conseil.

      Un partage de compétences flou : Un consensus se dégage sur la nécessité de clarifier les missions de chacun, sans pour autant opérer un nouveau transfert de compétences vers les Régions.

      Une offre régionale méconnue : L'action des Régions est mal connue des établissements.

      Selon la Cour des comptes (2022), seuls 22 % des établissements déclarent avoir recours aux ressources régionales documentaires et 12 % aux dispositifs régionaux.

      Des outils de coordination inopérants : Le programme annuel d'orientation, qui doit articuler les actions de la Région et le projet de l'établissement, n'est que très rarement mis en place.

      Recommandations de coordination :

      • ◦ Améliorer la communication sur l'offre de services des Régions.  
      • ◦ S'assurer de la mise en place du programme annuel d'orientation dans chaque établissement.  
      • ◦ Cartographier les actions régionales pour identifier les zones non couvertes.  
      • ◦ Garantir que la plateforme "Avenir(s)" de l'ONISEP valorise les informations régionales pour éviter la concurrence.
    1. Synthèse du Rapport sur les Impacts des Réformes du Baccalauréat Professionnel

      Résumé

      Ce document de synthèse analyse les conclusions d'un rapport parlementaire sur les réformes successives du baccalauréat professionnel.

      Le diagnostic central est sans appel : malgré un discours politique constant valorisant la voie professionnelle comme une filière d'excellence, celle-ci demeure une "voie de garage" perçue négativement, marquée par une forte ségrégation sociale et scolaire.

      Les réformes successives depuis 2009, notamment le passage du bac en trois ans, sont identifiées comme la source d'une baisse continue du niveau des élèves. Cette érosion est principalement due à une réduction drastique du volume horaire des enseignements, en particulier généraux, ce qui affaiblit les savoirs fondamentaux des bacheliers.

      En conséquence, leur insertion professionnelle se dégrade (taux d'emploi à 6 mois passé de 50% en 2011 à 45% en 2022) et leur poursuite d'études, bien que croissante, se solde par un taux d'échec élevé (41% en BTS), qualifié de "gâchis humain" et de "trahison".

      Les dispositifs récents, tels que le "parcours différencié" en terminale, sont jugés contre-productifs, générant un absentéisme massif et des difficultés d'organisation insolubles.

      Le rapport préconise des mesures correctrices, dont la possibilité d'une quatrième année de formation pour les élèves en difficulté, et critique le manque de vision stratégique et de concertation qui caractérise les politiques menées.

      1. Diagnostic d'une Voie Dévalorisée et Ségrégative

      Le rapport dresse un portrait sombre de la perception et de la composition sociologique du baccalauréat professionnel, soulignant une hypocrisie politique persistante.

      1.1. Une Perception Négative et une Hypocrisie Institutionnelle

      Bien qu'un bachelier sur trois soit titulaire d'un baccalauréat professionnel (173 000 lauréats en 2024), le diplôme souffre d'un déficit d'image majeur.

      Absence de Célébration : Le rapport note que "l'on ne fête que rarement la réussite au baccalauréat professionnel", un détail révélateur du regard porté sur ce diplôme par les élèves eux-mêmes et la société.

      Discours Politique Contredit par les Faits : Les responsables politiques de tous bords promeuvent la voie professionnelle comme une "voie d'excellence", mais cette rhétorique masque une réalité de relégation et de promesses non tenues.

      Le rapport dénonce une "forme d'hypocrisie consistant à porter au Pinacle cette voie de formation tout en tolérant la relégation".

      Double Discours Interne : L'institution scolaire elle-même entretient une ambiguïté, certaines autorités académiques reprochant aux collèges d'orienter "en trop grand nombre" des élèves vers le bac pro, leur "manquant d'ambition".

      1.2. Un Concentré de Difficultés et une Ségrégation Sociale Massive

      Les lycées professionnels concentrent les difficultés du système éducatif et fonctionnent comme une zone de ségrégation sociale.

      Surreprésentation des Milieux Populaires : 70 % des élèves ont des parents employés, ouvriers ou inactifs, contre moins de 40 % dans les voies générale et technologique.

      Poids de l'Éducation Prioritaire : 29 % des élèves de REP+ et 26 % des élèves de REP s'orientent en seconde professionnelle, contre 18 % hors éducation prioritaire et seulement 10 % issus du privé.

      Concentration des Élèves à Besoins Éducatifs Particuliers :

      ◦ Les jeunes en situation de handicap sont cinq fois plus nombreux en lycée professionnel qu'en filière générale.    ◦ 42 % des élèves allophones scolarisés en lycée le sont en formation professionnelle.

      Orientation Subie : La voie professionnelle est majoritairement une orientation par défaut pour les élèves au niveau scolaire jugé insuffisant. Près de 80 % des élèves du décile le plus faible en 6ème rejoignent un CAP ou une seconde professionnelle, contre seulement 1,8 % des élèves du décile le plus élevé.

      2. L'Érosion du Niveau : Causes et Conséquences

      Le rapport conteste fermement la thèse ministérielle d'une élévation du niveau et identifie la réduction du temps de formation comme la cause principale de la baisse des compétences des bacheliers.

      2.1. Le Passage au Bac en 3 ans : "La Mère de Toutes les Contre-réformes"

      La réforme de 2009, passant le cursus de 4 à 3 ans, est considérée comme la décision fondatrice de la dégradation de la filière.

      Logique Erronée d'Égalité : La réforme visait "l'égale dignité" avec les filières générales en alignant la durée des études.

      Le rapport critique cette approche, arguant que "le principe d'égalité impose de traiter de façon identique des situations identiques mais n'impose nullement de traiter de façon identique des situations différentes".

      Les élèves de la voie professionnelle, ayant des acquis scolaires plus faibles, nécessitaient au contraire un soutien renforcé.

      Avertissements Ignorés : Dès 2005, un rapport de l'Inspection Générale prévenait qu'une "grande majorité d'élèves ne peut pas suivre un parcours vers un baccalauréat professionnel en 3 ans".

      2.2. Une Diminution Continue du Volume d'Enseignement

      Les réformes successives ont entraîné une baisse constante du temps de formation, affectant particulièrement les savoirs fondamentaux.

      Année de Réforme

      Volume Horaire Total (sur 3 ans)

      Volume des Enseignements Généraux (sur 3 ans)

      2009

      2900 heures

      1218 heures

      2018

      2520 heures

      -

      2023

      2350 heures

      1070 heures

      Cette réduction a eu pour conséquence une "perte de connaissance générale et de compétences professionnelles", un "déficit de maturité et de savoir-être" unanimement dénoncés par les syndicats, organisations patronales et experts entendus.

      3. Insertion Professionnelle et Poursuite d'Études : Un Double Échec

      La dévalorisation du diplôme se traduit par une insertion sur le marché du travail plus difficile et un parcours du combattant pour ceux qui poursuivent des études supérieures.

      3.1. Une Insertion Professionnelle en Déclin

      Taux d'emploi à 6 mois : Pour les bacheliers professionnels ne poursuivant pas leurs études, ce taux est passé de plus de 50 % en 2011 à 45 % en 2022.

      L'insertion des titulaires de CAP a diminué dans des proportions similaires.

      Comparaison avec le BTS : Le taux d'emploi à 6 mois pour les titulaires d'un BTS atteint 64 %, ce qui explique l'attrait pour la poursuite d'études.

      3.2. Une Poursuite d'Études Risquée et Coûteuse

      Face à une insertion dégradée, de plus en plus d'élèves se tournent vers l'enseignement supérieur, souvent sans y être préparés.

      Augmentation de la Poursuite d'Études : 47 % des bacheliers professionnels poursuivent leurs études, contre 34 % en 2010, majoritairement en BTS.

      Un Taux d'Échec Massif : 41 % des bacheliers professionnels engagés en BTS échouent à obtenir leur diplôme. Leur taux de réussite est inférieur de 15 à 25 points à celui des bacheliers généraux ou technologiques.

      Un "Gâchis Humain" : Le rapport dénonce "les illusions perdues, un incroyable gâchi humain et osons le mot une forme de trahison" envers des élèves encouragés à continuer sans avoir les bases nécessaires ("fossé parfois infranchissable").

      4. Analyse Critique des Dispositifs des Réformes de 2018 et 2023

      Les réformes les plus récentes sont décrites comme une accumulation de dispositifs "cosmétiques" ou "contre-productifs", mis en œuvre sans vision cohérente.

      4.1. Dispositifs Jugés Inefficaces

      Familles de Métiers : Censées permettre une orientation progressive, elles ont en réalité "contribué à complexifier les parcours" et entraînent une "confiscation du choix de la spécialité" en fin de seconde.

      Co-intervention et Chef-d'œuvre : Qualifiés de "simples gadgets" par Daniel Bloc, le créateur du bac pro, ces dispositifs sont jugés inefficaces. Leur mise en œuvre a demandé une énergie considérable aux équipes pour des résultats décevants. Le rapport propose leur suppression.

      4.2. Le "Parcours Différencié" en Terminale : Une Aberration

      La réorganisation de l'année de terminale (réforme de 2023), avec un parcours en "Y" (stage de 6 semaines pour l'insertion ou cours de 6 semaines pour la poursuite d'études), est un échec retentissant.

      Calendrier Intenable : L'avancement des épreuves en mai pour libérer le mois de juin est qualifié d'"aberration" par tous les acteurs auditionnés, contraignant les élèves à un rythme d'apprentissage trop soutenu.

      Surcharge pour les Entreprises : L'augmentation des semaines de stage (PFMP) s'est faite sans concertation avec les organisations patronales, qui n'étaient pas demandeuses. Une "lassitude des structures hôtes" est constatée face à la multiplication des demandes de stage.

      Absentéisme Massif : Le parcours "poursuite d'études" est marqué par un absentéisme dépassant 60 %, voire 95 % dans certains établissements, et une "démobilisation complète".

      Dérives et Difficultés d'Organisation : La mise en place est un casse-tête pour les établissements, et de nombreux stages se déroulent dans des secteurs sans rapport avec la spécialité de l'élève.

      5. Recommandations Principales

      Face à ce constat, le rapport formule plusieurs propositions structurantes.

      1. Instaurer une 4ème Année Optionnelle : Permettre aux élèves les plus en difficulté de suivre une année de formation supplémentaire, en effectif réduit, centrée sur les savoirs fondamentaux.

      2. Supprimer les Dispositifs Inefficaces : Mettre fin à la co-intervention et au chef-d'œuvre.

      3. Réformer l'Organisation de la Terminale : Revenir sur le "parcours différencié".

      4. Développer les Certificats de Spécialisation : En l'absence de retour au bac en 4 ans, développer massivement ces formations de niveau 4 pour faciliter l'insertion, bien que cela soit une "manière détournée de réintroduire une 4e année".

      5. Lancer une Campagne Nationale de Promotion : Travailler sur le long terme pour changer les mentalités et valoriser réellement la voie professionnelle.

      6. Réactions et Perspectives des Groupes Politiques

      Rassemblement National (Roger Chudo) : Partage le diagnostic du "lycée des pauvres" et des "formations parking". Critique une réforme sans "vision prospective". Propose de confier la formation professionnelle aux régions et de rétablir les 4ème et 3ème technologiques.

      Ensemble (Céline Calvez) : Défend l'engagement présidentiel et les dispositifs comme la co-intervention et le chef-d'œuvre, arguant que "ce n'est pas tant le niveau des savoirs fondamentaux qui est en cause que le sens donné à ses savoirs". S'interroge sur les raisons de leur échec (principe ou manque de moyens).

      LFI-NUPES (Rodrigo Arenas) : Dénonce une vision qui considère les élèves comme une "main d'œuvre en devenir... si possible à bas prix", par opposition aux lycéens de la voie générale "éduqués pour devenir des citoyens".

      Plaide pour un lycée unifié où apprentissages manuels et intellectuels sont accessibles à tous.

      Socialistes et apparentés (Aida Adizadet) : Souligne que le premier métier des enseignants en LP est de "redonner confiance".

      Critique la "logique faussement élitiste" qui divise la jeunesse et rappelle le taux d'absentéisme de 95 % dans le parcours "poursuite d'études".

      Les Républicains (Alexandre Portier) : Affirme que le lycée pro devrait être la "voie royale" et la "clé de voûte de notre souveraineté nationale".

      Note que le LP est le seul segment à avoir gagné des élèves. Prône la stabilité : "le plus urgent c'est surtout d'arrêter de changer tout le temps".

      Écologiste - NUPES (Arnaud Bonet) : Déplore l'instabilité créée par les réformes qui s'enchaînent. Voit les difficultés du lycée pro comme "le reflet" des échecs en amont, au primaire et au collège.

      Démocrate (MoDem et indépendants) (Delphine Lingeman) : Pointe l'hypocrisie générale ("y compris parmi nous") et les problèmes cruciaux de mobilité dans les zones rurales qui entravent le libre choix de l'orientation.

      LIOT (Salvator Castiglioni) : Partage les recommandations et le constat d'un "décalage entre les propos ministériels décrivant une voix d'excellence mais vue par les élèves comme une filière par défaut".

      GDR - NUPES (Jean-Hugues Maillot) : Regrette le traitement marginal de l'Outre-mer, où le décrochage et le chômage des jeunes sont très élevés.

      Utilise la métaphore du poisson et du singe pour critiquer un système qui ne reconnaît qu'un type d'intelligence.

      Interventions additionnelles : D'autres interventions ont souligné le "manque criant d'enseignants qualifiés" (RN), la situation aggravée en Seine-Saint-Denis (LFI), et la nécessité de former des "citoyens dotés d'un véritable esprit critique, pas des simples exécutants" (GDR).

    1. These 74 metabolites were compromised of: lactic acid, pyruvate, glycerol, glyceric acid, citric acid, aconitate, isocitric acid, 2-ketoglutaric acid, succinic acid, fumaric acid, malic acid, 2-aminoadipic acid, lysine, valine, leucine, isoleucine, threonine, glycine, serine, alanine, glutamic acid, glutamine, proline, aspartic acid, asparagine, methionine, cysteine, phenylalanine, tyrosine, tryptophan, histidine, ornithine, phosphate, diphosphate, phosphoglycerol, 3-phosphoglycerate, fructose, galactose, glucose-6-phosphate, mannitol, sorbitol, galactitol, inositol, myoinositol phosphate, sucrose-trehalose, lauric acid, myristic acid, palmitelaidic acid, palmitic acid, linoleic acid, oleic acid, elaidic acid, stearic acid, arachidonic acid, 1-monooleoyglycerol, 1-monostearylglycerol, 2-monostearylglycerol, squalene, xanthine, hypoxanthine, uracil, adenosine-5’-monophosphate, erythrosine, erythrose-4-phosphate, tocopherol, B-alanine, 2-ketoisocaproic acid, gluconic acid, ascorbic acid, uric acid, and urea. Total cholesterol, glucose, and creatinine were also measured in this panel to provide a measure of quantitative validation of the panel because they had previously been tested using clinical diagnostics and analyzed for the effect of fasting [5].

      everything they measured during the study to see the effect of fasting periods

    1. AbstractWater buffalo is a cornerstone livestock species in many low- and middle-income countries, yet major gaps persist in its genomic characterization—complicated by the divergent karyotypes of its two sub-species (swamp and river). Such genomic complexity makes water buffalo a particularly good candidate for the use of graph genomics, which can capture variation missed by linear reference approaches. However, the utility of this approach to improve water buffalo has been largely unexplored.We present a comprehensive pangenome that integrates four newly generated, highly contiguous assemblies of Pakistani river buffalo with available assemblies from both sub- species. This doubles the number of accessible high-quality river buffalo genomes and provides the most contiguous assemblies for the sub-species to date. Using the pangenome to assay variation across 711 global samples, we uncovered extensive genomic diversity, including thousands of large structural variants absent from the reference genome, spanning over 140 Mb of additional sequence. We demonstrate the utility of these data by identifying putative functional indels and structural variants linked to selective sweeps in key genes involved in productivity and immune response across 26 populations.This study represents one of the first successful applications of graph genomics in water buffalo and offers valuable insights into how integrating assemblies can transform analyses of water buffalo and other species with complex evolutionary histories. We anticipate that these assemblies, and the pangenome and putative functional structural variants we have released, will accelerate efforts to unlock water buffalo’s genetic potential, improving productivity and resilience in this economically important species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf099), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 4: Wai Yee Low

      Review of "A comprehensive water buffalo pangenome reveals extensive structural variation linked to population specific signatures of selection". This is an impressive work at the frontier of buffalo genomics. I truly enjoy reading the work and my questions/comments are aimed at improving it further. My detailed comments are below: Line 30: I think it is better you include the actual number of publicly available assemblies used to create the pangenome graph. Line 71: There is now a swamp buffalo reference genome with annotation too (NCBI accession: PCC_UOA_SB_1v2). Perhaps consider to cite the swamp buffalo ref https://academic.oup.com/gigascience/article/doi/10.1093/gigascience/giae053/7753516 and rewrite the sentence to say a pangenome can be used for both swamp and river, but a single linear ref from either subspecies for read mapping is not good enough. Line 79: "highlighted" Line 82: What do you mean by "higher quality"? The assemblies have been discussed in this review: https://www.frontiersin.org/journals/genetics/articles/10.3389/fgene.2021.629861/full Line 105: Technically, the graph method for bovine species, which includes water buffalo, is being investigated by the Bovine Pangenome Consortium (BPC). However, nothing useful has been published on the buffalo graph but perhaps consider citing the BPC since your paper overlaps with it (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-023-02975-0). Line 165: It will be good if you add a bit more context of the PanGenie method here as the researchers in buffalo community are not used to this. Additionally, it will be great if all code is made available on GitHub or as Supplementary Info. Line 170: To produce phase pangenome graph, don't you need all input assemblies to be phased? All are input assemblies phased? The UOA_WB_1 is locally phased, not phased throughout the genome. Line 235: "a list of 403 unrelated individuals." What does this translate to in terms that geneticists can understand? Do you mean siblings have been removed? Or individuals sharing the same grandparents were removed? Line 246: Can you please explain how did you get the coordinates to match between the GATK and PanGenie method? You'll need matching coordinates for concordance analysis. As I understand it, the GATK was based on UOA_WB_1? Line 254: Why these 3 chromosomes? Line 257: If you had not filtered for relatedness, how will it impact the selective sweep work? I think including some context will help the readers. Line 259: do you mean at least six samples per group? If yes, is 6 samples enough? Line 261: genotype quality less than 25 according to bcftools? Since you only used biallelic variants, please provide the breakdown between biallelic and multiallelic. Line 281: "… we first PacBio HiFi sequenced one female" Please rewrite this. Line 282: How common are these two breeds in percentage? Line 291: Is this already known? Perhaps cite the literature to show the agreement with previous studies? Fig 1D: This is a bit too small to see especially the SV distribution at the bottom. I can hardly see the median? Line 310: Why did you choose UOA_WB_1 as the reference? Line 311: the ~32.8 mil variants are comprised of SNPs as well? Fig 2: This is probably a panel of a figure but should not be the entire figure. The size of the circle indicates sample size but there should be a legend on the plot for this to say the sizes, right? Darker colour should be used to highlight the countries with samples instead of white? Maybe this could be a Supp figure too. Line 356: S Figure 4 and 5 should be main figures? You will need to annotate the abbreviation of sample-country in the legend of S Figure 5. Line 360: "To enable reuse we have made this dataset available …" The dataset should be made available to reviewers? Line 368: "76% of SNVs were called by both callers" 76% seem low. Also, called does not mean concordant. What is the concordance among called SNVs in both? Did the pangenome approach called most of the variants found in GATK? If not, what might be the reasons? Fig 3B: It is not immediately clear what the difference is, between non repetitive and repetitive regions. The overlapping text in the x-axes makes it hard to read. Line 390: "Analyses such as the study of selective sweeps or genome-wide association studies where low frequency variants are often filtered out will benefit less from the advantages of GATK, particularly given its longer run time." From here on, in this paragraph, it's Discussion, not Results. Line 418: Why human? Could you use cattle? Line 427: I tried the browser and not sure what I can learn from it. It will be helpful if there is a README with some examples on what can be explored. Line 450: How large before you considered it as larger variant? Is this ability to study larger variants still hold despite using only ~10 assemblies in the graph? The use of short reads for selective sweep study will still benefit from being able to incorporate these larger variants? As I understand it, the larger variants were found only from graph, not from the short reads. As such, the selective sweep may not be associated with any larger variants? Line 470: Fig S8 should be a main figure? Line 513: Instead of uniprot link, perhaps consider including this as Supplementary info or text. The info in the link may change in the future. Line 551: However, without scaffolding, the assemblies of Pakistani river buffalo may not be good enough to function as reference genomes for river buffalo? Line 552: When considering new bases, did you do this for each assembly independently or the new bases were discovered cumulatively? Line 581: Some of my questions at Line 450 can be discussed here. Line 586: Perhaps consider discussing the limitations of the small number of assemblies used to create the graph. As such, many SVs are likely still missing and we are still unable to properly assess allele frequency of these larger SVs. Additionally, while some SVs may not be considered as large in this work, it does not mean they have no impact.

    2. AbstractWater buffalo is a cornerstone livestock species in many low- and middle-income countries, yet major gaps persist in its genomic characterization—complicated by the divergent karyotypes of its two sub-species (swamp and river). Such genomic complexity makes water buffalo a particularly good candidate for the use of graph genomics, which can capture variation missed by linear reference approaches. However, the utility of this approach to improve water buffalo has been largely unexplored.We present a comprehensive pangenome that integrates four newly generated, highly contiguous assemblies of Pakistani river buffalo with available assemblies from both sub- species. This doubles the number of accessible high-quality river buffalo genomes and provides the most contiguous assemblies for the sub-species to date. Using the pangenome to assay variation across 711 global samples, we uncovered extensive genomic diversity, including thousands of large structural variants absent from the reference genome, spanning over 140 Mb of additional sequence. We demonstrate the utility of these data by identifying putative functional indels and structural variants linked to selective sweeps in key genes involved in productivity and immune response across 26 populations.This study represents one of the first successful applications of graph genomics in water buffalo and offers valuable insights into how integrating assemblies can transform analyses of water buffalo and other species with complex evolutionary histories. We anticipate that these assemblies, and the pangenome and putative functional structural variants we have released, will accelerate efforts to unlock water buffalo’s genetic potential, improving productivity and resilience in this economically important species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf099), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 3: Laura Caquelin

      1. SummaryoftheStudy This study used graph genomics to better characterize water buffalo genomes. By building a pangenome from new and existing assemblies, the authors analyzed 711 samples. These samples revealed structural variation. These results highlight the value of graph genomics. This method

      2. Scopeofreproducibility According to our assessment the primary objective is: to identify genomic variants within selective sweep regions in the water buffalo genome.

      3. Outcome: Enrichment of high-impact structural variants (SVs), insertions/deletions (indels) and single nucleotide variants (SNVs) in selective sweep regions.
      4. Analysis method outcome: Variants were compared between selective sweep regions and genome-wide. Fisher's exact test was used to assess enrichment of functional variants.
      5. Main result: "Prior to annotation, multiallelic variants were normalized by splitting them into separate biallelic entries, resulting in 6,159,686 indels, 28,669,966 SNVs, and 160,921 SVs entries. Within putative selective sweep regions we identified 208,862 indels, 997,500 SNVs and 6,748 SVs. Notably an enrichment of HIGH impact SVs, indels and SNVs were observed within selective sweep regions (Figure 5A, Supplementary Table S6), with 50-80% more variants in these areas having a HIGH impact compared to genome-wide. Among the high impact variants in selective sweep regions only 20% were SNVs, with the remainder being SVs and indels, suggesting high impact larger variants may underlie putative selective sweeps." (Lines 453 to 461)

      6. AvailabilityofMaterials a. Data

      7. Data availability: Open
      8. Data completeness: Complete, all data necessary to reproduce main results are available
      9. Access Method: Supplementary files - Repository: -
      10. Data quality: Structured b. Code
      11. Code availability: Shared for the review after request - Programming Language(s): R
      12. Repository link: -
      13. License: -
      14. Repository status: -
      15. Documentation: No documentation

      16. Computational environment of reproduction analysis

      17. Operating system for reproduction: MacOS 14.7.4
      18. Programming Language(s): R
      19. Code implementation approach: Creating script according to the methodology description/Using shared code
      20. Version environment for reproduction: R version 4.4.1/RStudio 2024.09.0

      21. Results 5.1 Original study results

      22. Results 1: Results are presented in Figure 5A. 5.2 Steps for reproduction -> Reproduce the results The code was not shared initially, but as the data were provided and the test was a Fisher's exact test, I wrote code to reproduce the p-values.

      23. Issue 1: P-values for the SNVs variant as well as the « Modifier » impact class were not provided. -- Resolved: Authors provided an updated Supplementary table S6 with exact numerical p-values for each variant and each impact class. The code "variantEnrichAtPeaks.R" to generate the Figure 5A and the Supplementary table S6 was also shared. New version of the supplementary Table S6: (see screenshot)

      The comparison between the reproduced results and the original results was then performed using the shared code. (Notably, the results from the R script written allowed for the generation of the same p-value as the one presented in Figure 5A).

      • Issue 2: In the script "variantEnrichAtPeaks.R", only the figures were generated, not the new supplementary Table S6 with the numerical p-values. -- Resolved: Some code lines was added in the function "makePlot" to generate this table in addition to the figure.

      Line 159 to 178 of the script "variantEnrichAtPeaks_RCC."

      1. Supplementary table S6 (add)

      summary_table <- df %>% mutate( Type = variantType, Genome_Wide_Prop = Genome_wide / sum(Genome_wide), Selective_Sweep_peaks_Prop = Sweep / sum(Sweep), Ratio_of_proportions = Selective_Sweep_peaks_Prop / Genome_Wide_Prop) %>% left_join(pval_df, by = "Impact") %>% select( Impact, Type, Genome_Wide = Genome_wide, Selective_Sweep peaks = Sweep, Genome_Wide Prop = Genome_Wide_Prop, Selective_Sweep peaks Prop= Selective_Sweep_peaks_Prop, Ratio of proportions= Ratio_of_proportions, Fishers exact P = p_value)

      return(list(plot = p, summary_table = summary_table))

      5.3 Statistical comparison Original vs Reproduced results - Results: Figure and table S6 were reproduced for each variant type and impact: -- SVs type: (see screenshot) -- Indels type: (see screenshot) -- And SNVs type: (see screenshot)

      • Comments: The shared code was used to compute the p-values and generated the Figures. Minor numerical error discrepancy was observed for some p-values, likely due to rounding differences. The p-values in the original Excel file appear to be stored with less decimal precision than those computed in R. This difference is negligible and does not indicate a reproducibility issue.
      • Errors detected: No error detected.
      • Statistical Consistency: The results were successfully reproduced with the share code.

      • Conclusion

      • Summary of the computational reproducibility review The Fisher's exact tests for enrichment across variant and impact categories, presented in Figure 5A of the manuscript, were successfully reproduced using the data in supplementary table S6 and the shared code. Results were consistent with the original, with only negligible rounding differences in p-values.
      • Recommendations for authors We were able to reproduce study with the data and information provided in the Figure 5A description. To further improve transparency and ensure full reproducibility of your manuscript, the following recommendations are suggested: -- Make the codes to reproduce all analyses in the paper openly available to allow anyone to reproduce the results. Ideally, provide a README or requirements.txt file describing how to run the analysis, including software versions, packages, and dependencies. -- Include statistical outputs, such as exact p-values, in supplementary materials when possible. This ensures clarity and eases verification. Ideally, provide metadata: For the datasets used or generated by the scripts, it would be helpful to include accompanying metadata files that explain: --- The definition of each variable name. --- The origin of each dataset (raw, processed, etc). --- Any preprocessing steps applied before analysis.
    3. AbstractWater buffalo is a cornerstone livestock species in many low- and middle-income countries, yet major gaps persist in its genomic characterization—complicated by the divergent karyotypes of its two sub-species (swamp and river). Such genomic complexity makes water buffalo a particularly good candidate for the use of graph genomics, which can capture variation missed by linear reference approaches. However, the utility of this approach to improve water buffalo has been largely unexplored.We present a comprehensive pangenome that integrates four newly generated, highly contiguous assemblies of Pakistani river buffalo with available assemblies from both sub- species. This doubles the number of accessible high-quality river buffalo genomes and provides the most contiguous assemblies for the sub-species to date. Using the pangenome to assay variation across 711 global samples, we uncovered extensive genomic diversity, including thousands of large structural variants absent from the reference genome, spanning over 140 Mb of additional sequence. We demonstrate the utility of these data by identifying putative functional indels and structural variants linked to selective sweeps in key genes involved in productivity and immune response across 26 populations.This study represents one of the first successful applications of graph genomics in water buffalo and offers valuable insights into how integrating assemblies can transform analyses of water buffalo and other species with complex evolutionary histories. We anticipate that these assemblies, and the pangenome and putative functional structural variants we have released, will accelerate efforts to unlock water buffalo’s genetic potential, improving productivity and resilience in this economically important species.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf099), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Yi Zhang

      This manuscript presents the first high-quality, haplotype-resolved genome assemblies for two representative Pakistani river buffalo breeds (Nili Ravi and Azikheli), integrating them with existing assemblies to construct a water buffalo pangenome. The study leverages graph genomics to characterize structural variation (SV), identifying >140 Mb of non-reference sequence and 111,352 SVs. By genotyping of 711 global samples against this pangenome, the authors uncover population-specific selective sweeps linked to productivity, immunity, and adaptation traits, revealing potentially functional SVs, though these findings are limited by the absence of validation evidence and cross-study comparisons. The work highlights graph genomics as a transformative tool for integrative analyses of evolutionarily related species in an unbiased way and provides resources to accelerate buffalo breeding.

      General Comments 1.The study's methodology is rigorous, combining long-read assembly, graph-based genotyping (PanGenie), and population-level sweep scans. Nevertheless, the manuscript would benefit from discussion of graph limitations, such as bias against rare variants (Fig. 3B) and challenges in graph construction for species with karyotypic divergence. 2. The selection signature analyses were done across a number of population groups but the paper only showcases a limited selection of results. To strengthen the manuscript, the authors could concentrate on a consistent set of populations. This would enable a more in-depth examination of selective signals common across buffalo population groups or unique selective signals specific to certain groups. 3. It could be informative to conduct comparative analyses of selection signatures using variant datasets from PanGenie and GATK. This could reveal whether the pangenome approach might uncover important structural variants within selection signals that GATK fails to identify.

      Specific Comments 1. In Figure 1D and the main text, the rationale behind dividing the SVs into 40 sets is not clearly presented. If the interpretation is correct, the y-axis label of the bar graph should denote the number of SVs rather than size. Moreover, the main title "SVs Size Distribution" at the top seems more relevant to the box plots at the bottom. 2. Lines 325 - 326 state that the newly assembled pangenome graph exhibits a substantial increase in genome size compared to the existing reference genome. It is recommended that the authors describe the distribution of the 147,865,364 bp across the entire genome. Are they found more prevalent in specific regions of certain chromosomes? 3. In lines 410 - 412, there may be an issue with the citation of Table S2. The table contains 402 individuals, whereas the text mentions 282. 4. Figure 3 shows that, when using 30x samples in the variant calling comparison between Pangenie and GATK, there are still a large number of SNV variants detectable only by GATK. A more in-depth technical discussion of these differences would greatly enhance the reader's comprehension of these findings and the relative performance of the two methods. 5. To provide a more intuitive understanding of how SV can influence gene function and contribute to the traits, the authors could include a figure that displays an example gene structure along with the SV of interest within a selection signal peak.

    1. AbstractOpportunistic assessment of vertebral strength from clinical computed tomography (CT) scans holds substantial promise for fracture risk stratification, yet variability in calibration methods and finite element (FE) modeling approaches has led to limited comparability across studies. In this work, we provide a publicly available benchmark dataset that supports standardized biomechanical analysis of the thoracic and lumbar spine using density-calibrated CT data. We extended the VerSe 2019 dataset to include phantomless quantitative CT calibration, automated vertebral substructure segmentation, and vertebral strength estimates derived from both linear and nonlinear FE models. The cohort comprises 141 patients scanned across five CT systems, including contrast-enhanced protocols. Phantomless calibration was performed using automatically segmented tissue references and validated against synchronous calibration phantoms in 17 scans. To evaluate model performance, we implemented a nonlinear elastoplastic FE model and compared it to two linear estimates. A displacement-calibrated linear model (0.2% axial strain) demonstrated excellent agreement with nonlinear failure loads (R = 0.96; mean difference = -0.07 kN), while a stiffness-based approach showed similarly strong correlation (R = 0.92). We evaluated vertebral strength at all thoracic and lumbar levels, enabling level-wise normalization and comparison. Strength ratios revealed consistent anatomical trends and identified T12 and T9 as reliable alternatives to L1 for opportunistic screening and model standardization. All calibrated scans, segmentations, software, and modeling outputs are publicly released, providing a benchmark resource for validation and development of FE models, radiomics tools, and other quantitative imaging applications in musculoskeletal research.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf094), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Karan Devane

      The study uses an open-source dataset collected in a population representative of those who would benefit from opportunistic screening and included physiological variation (i.e. contrast enhanced images and pre-existing fracture), alongside validation of density and FE assessment calibration methods. The methods are described in detail, including software versioning schemes, and links to the software sources as relevant for use in replicating methods. Additionally, the enhanced dataset is being included alongside the publication. The primary purpose of this study was to prepare and make available a public dataset for use in continued testing and development of opportunistic screening methods. The data appears to be conservatively analyzed as such, and the authors make notes of existing limitations of the population and sample characteristics where applicable. Additionally, the phantomless calibration technique is validated within this dataset prior to use in support of the "generalizability of the approach" (178), though the applied sample for this is relatively small (n=17 with in-scan phantoms). The manuscript is well-written and easy to understand but I have a few suggestions and comments that need to be addressed.

      The data are well-controlled for the study cohort, however as mentioned by the authors (228-232), this cohort is biased towards individuals with pre-existing skeletal fragility, as indicated by the average lumbar T-score as assessed by DXA falling in the osteopenic range (-1.5, Table 1). Beyond this, the authors made use of multiple validated calibration techniques to support the use of their internal calibration scheme, as well as analysis of potential confounding variables such as contrast enhanced CT scans. Relative vertebral strength analysis (Figure 6, Table 2), however does not appear to be analyzed with respect to the fractures mentioned as present throughout the cohort (193). While differences in strength may be primarily explained by density or size, it is possible that the incidence of pre-existing fracture occurring in the thoracolumbar segment may influence adaptation of the other vertebrae in the region [1][2][3], and as such analysis for fracture inclusion may be warranted.

      The use of standardized FE modeling techniques supports the goal for reproducibility of assessment in clinical FE modeling. While the authors made efforts to enhance the reproducibility and generalizability of the dataset, they themselves note that the source population is not necessarily descriptive of a general population (lines 227-232). Though this population is representative of those indicated for opportunistic screening, the development of risk curves necessitates the inclusion of healthy individuals, and follow-up analysis to fully flesh out the use of opportunistic FE in clinical settings, however this analysis would require a much larger cohort, and are outside the scope of the current manuscript. Further, while 'voxel-models' are typically regarded as standard, tetrahedral element models may generally provide better representation of complex biological geometries [4]. All approaches to FE have drawbacks, and tetrahedral models may be less-optimal solutions compared to hexahedral elements for convergence and the possibility of artificial stiffening, the high prevalence of osteophytes and degradation [5], particularly in older populations where screening is indicated, may warrant the use of tetrahedral elements which capture the intricacies of vertebral geometry that impact FE derived strength [6]. While again potentially outside the scope of this study, it might be noted as an additional formulative variable for FE approaches to estimating fracture risk.

      Line 269 -> "… applications such as radiomics-driven [approach?] for opportunistic …" As fracture prevalence is included in the dataset, it may be worthwhile to include analysis of fracture-adjacent vertebra in the selection of surrogate vertebra for L1 in opportunistic screening. Does pre-existing fracture influence which vertebrae selected, and should this decision be made on a person-to-person basis, taking into consideration the particular condition of the vertebrae available in the scan?

      [1] https://pmc.ncbi.nlm.nih.gov/articles/PMC8752702/ [2]https://academic.oup.com/jbmr/article/39/12/1744/7825427 [3] https://pmc.ncbi.nlm.nih.gov/articles/PMC7697376/ [4]https://www.sciencedirect.com/science/article/pii/S0021929005003568 [5] https://link.springer.com/article/10.1007/s12565-010-0080-8 [6]https://www.sciencedirect.com/science/article/pii/S1529943018306466

    1. Recent advancements in transcriptomics and proteomics have opened the possibility for spatially resolved molecular characterization of tissue architecture with the promise of enabling a deeper understanding of tissue biology in either homeostasis or disease. The wealth of data generated by these technologies has recently driven the development of a wide range of computational methods. These methods have the requirement of advanced coding fluency to be applied and integrated across the full spatial omics analysis process thus presenting a hurdle for widespread adoption by the biology research community. To address this, we introduce SPEX (Spatial Expression Explorer), a web-based analysis platform that employs modular analysis pipeline design, accessible through a user-friendly interface. SPEX’s infrastructure allows for streamlined access to open source image data management systems,analysis modules, and fully integrated data visualization solutions. Analysis modules include essential steps covering image processing, single-cell and spatial analysis. We demonstrate SPEX’s ability to facilitate the discovery of biological insights in spatially resolved omics datasets from healthy tissue to tumor samples.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf090), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 2: Qianqian Song

      The manuscript presents an advancement in spatial omics analysis but needs improvements in Quantitative benchmarking, Computational scalability assessment, etc. With these revisions, SPEX has the potential to become a widely adopted platform in the spatial omics community. I have specific comments as below:

      1) While the manuscript provides a qualitative comparison of SPEX with other spatial omics tools (e.g., Squidpy, Giotto, Aquilla), quantitative benchmarking is missing. It is needed to include a performance benchmark comparing runtime efficiency, segmentation accuracy, and clustering resolution against existing tools. Also, it is necessary to show computational efficiency metrics (e.g., memory usage, execution time, scalability across datasets of varying sizes).

      2) The study presents compelling results, but there is no independent validation or interpretation of computational outputs using experimental methods.

      3) The manuscript does not discuss hardware requirements, processing speed, or computational limitations. It is needed to provide an assessment of SPEX's performance on different computing environments (e.g., local workstations vs. cloud computing vs. high-performance clusters).

      4) The Colocation Quotient (CLQ) method is well described, but the manuscript does not provide statistical validation (e.g., p-values, confidence intervals) for detected spatial relationships.

    2. Recent advancements in transcriptomics and proteomics have opened the possibility for spatially resolved molecular characterization of tissue architecture with the promise of enabling a deeper understanding of tissue biology in either homeostasis or disease. The wealth of data generated by these technologies has recently driven the development of a wide range of computational methods. These methods have the requirement of advanced coding fluency to be applied and integrated across the full spatial omics analysis process thus presenting a hurdle for widespread adoption by the biology research community. To address this, we introduce SPEX (Spatial Expression Explorer), a web-based analysis platform that employs modular analysis pipeline design, accessible through a user-friendly interface. SPEX’s infrastructure allows for streamlined access to open source image data management systems,analysis modules, and fully integrated data visualization solutions. Analysis modules include essential steps covering image processing, single-cell and spatial analysis. We demonstrate SPEX’s ability to facilitate the discovery of biological insights in spatially resolved omics datasets from healthy tissue to tumor samples.

      This work has been peer reviewed in GigaScience (see https://doi.org/10.1093/gigascience/giaf090), which carries out open, named peer-review. These reviews are published under a CC-BY 4.0 license and were as follows:

      Reviewer 1: Ka Yee Yeung

      Li et al. presented SPEX (Spatial Expression Explorer), a web-based open-source end-to-end analysis platform offering modular design and a user accessible interface. The users demonstrated use cases in spatial transcriptomics (MERFISH lung cancer) and spatial proteomics datasets (tonsil, public multiplex ion beam imaging data). SPEX includes the following analytical modules 1. image processing modules includes a 4-step sequence (image pre-processing, single-cell segmentation, post-processing, feature selection). Image loading supports OMERO integration. Output is a cell by expression matrix in Anndata format. 2. clustering modules for both spatial transcriptomic and proteomic data. 3. spatial analysis module implements the CLQ (Colocation Quotient) method. 4. spatial expression analysis module includes differential expression and pathway analysis. SPEX supports visualization via Vitessce.

      The paper is well written, addresses a rising interest and critical need in the biomedical community. The reviewer would like to request clarifications on how extensible the modules are. The author mentioned a SPEX pipeline builder in which "modules are selected from a library and dragged into a visual pipeline map", and also mentioend the support for "flexible plug-in analysis modules". What are the packages available from the library? Can users import their own code or script or package? How to create new plug-in's?

      The reviewer is also wondering how do the users interact with the results? Can the user click on the resulting image and select regions of interest to zoom in?

    1. ________________

      1- get yourself in right space 2 - avoid distractions 3 - pace yourself 4 - read you most difficult assignments early I think number 4 will take the mode time because it is something that is difficult for you and they generally takes more time.

    1. Use body language (such as giving eye contact, leaning forward, and nodding) to indicate their engagement in the conversation e Pause to paraphrase, ask questions, and summarize the conversation in order to avoid miscommunication e Resist judging the comments that a beginning teacher makes ° Respond in a way that communicates respect and appreciation for what the beginning teacher shares (such as “I hear what you’re saying,” “It sounds like you really feel frus- trated,” or “Thank you for sharing that. How can I help?”) In addition to using active listening during conversations, mentors should pay attention to the non- verbal cues a beginning teacher uses. Look for signs of fatigue (such as slow movements or difficulty concentrating), frustration (such as eye-rolling or crossed arms), or despair (such as puffy eyes or other indicators of crying). By paying attention to both verbal and nonverbal communications, a mentor can see indications of distress before they come to a head and show the beginning teacher that he or she cares. val “\o / Ty \S Yo Conduct Daily Check-Ins rar rene? >» Daily check-ins are short conversations between mentors and mentees about how a mentec is feel- ing and performing, both inside and outside the classroom. Mentors can send emails and text messages to mentees or call them on the phone, even outside school hours. Do not feel obligated to make these check-ins formal or extensive; even a simple “How’s it going?” followed by active listening can make a world of difference. Staying in communication with mentees helps them feel supported but also helps a mentor notice when something is amiss. This easy strategy can facilitate the growth of the mentor- mentee relationship throughout the school year. Validate the Teacher's Feelings Once it becomes clear how a mentee feels, provide reassurance that his or her feelings are normal and will not last forever. Relate the mentee’s experience to the different phases of first-year teaching (Moir, 1999; see figure 1.2 on page 9) to validate his or her feelings and show that many beginning teachers feel the same way. Giving ; new teachers a chance to relate to these j phases can help them feel a Providing Emotional Support 4 por sense of normalcy regarding their emotions and concerns. Some also feel a sense of relief that they are not alone in their journey, particularly during the survival and disillusionment phases. Be sure to point out that teachers do not stay in these phases forever and that the job becomes easier and easier with each passing year. Additionally, share personal reflections and anecdotes from your own first years as a teacher to help the mentee feel a sense of camaraderie. Use the essays and reflection questions in appendix B (page 79), which provide a window into the life of a beginning teacher, or reflect individually on the first- year teaching phases (see figure 1.2 on page 9) to recall the unique challenges and emotions that a new teacher faces. Consider difficult experiences from recent years, as well, and describe the different chal- lenges and rewards that each year brings. Alternatively, collect and share stories from other colleagues in the school building. Point out that even the most seasoned teachers began as novices. These shared | experiences can stimulate a comfortable and reflective dialogue between a mentor and a mentee. Send Encouraging Messages Periodically send positive notes, emails, and text messages to beginning teachers to remind them of your availability and support. Include positive, behavior-specific feedback in letters to mentees to keep their spirits high and to encourage them to press on. For example, write something such as, “I noticed that instead of correcting Jerrod in front of the class today, you spoke privately with him about his behavior—that was very effective!” Sy, support for beginning teachers. Robert J. Marzano and Debra J. Pickering (2011) pointed out that inspirational quotations that demonstrate examples of self-efficacy can be encouraging. As Dale H. Schunk and Frank Pajares (2009) explained, self-efficacy “refers to the perceived capabilities for learn- | ing or performing actions at designated levels” (p. 35). In other words, teachers who have a strong sense of self-efficacy believe that they can execute their duties successfully or learn to execute them successfully. Because a beginning teacher may also be struggling to cultivate self-efficacy, inspirational quotations can serve as powerful reminders of the importance of persevering, striving for goals, and staying optimistic. When providing examples of motivating quotations, mentors can refer to this list of selected BrainyMedia (2014) quotations, as cited in Marzano and Pickering (2011), involving three categories: (1) perseverance, (2) greatness and following hopes and dreams, and (3) optimism. oN Choosing cards that contain reflective quotes or heartening messages can also provide sae oY Perseverance e “Genius is eternal patience.” —Michelangelo e “Without struggle, there can be no progress.” —Frederick Douglass e “Tn the middle of difficulty lies opportunity.” —Albert Einstein e “Don’t fear mistakes, there are none.” —Miles Davis e “T’ve got to keep breathing. It’ll be my worst business mistake if I don’t.” —Steve Martin ¢ “Tf you’re going through hell, keep going.” —Winston Churchill e “Tt’s not whether you get knocked down; it’s whether you get up.” —Vince Lombardi ss) Me, er

      Using positive body language is so important. This is an area I want to grow in, and also ask for feedback as I. might not be aware of how I'm coming across.

    1. Note de Synthèse : L'instrumentalisation des associations et les voies de la coconstruction

      Synthèse

      Ce document de synthèse analyse les conclusions du webinaire "Face à l'instrumentalisation des associations", quatrième épisode du cycle "Renforcement du monde associatif".

      Il met en lumière la menace croissante de l'instrumentalisation, identifiée comme un des quatre facteurs majeurs d'affaiblissement du secteur associatif, aux côtés de la répression des libertés, de la marchandisation et de la managérialisation.

      Cette instrumentalisation se manifeste par une pression exercée sur les associations pour qu'elles s'alignent sur les politiques publiques, une tendance exacerbée par une transformation structurelle des financements publics qui privilégient la commande publique au détriment des subventions.

      Des exemples récents aux niveaux local, national et européen illustrent une stratégie de discrédit visant les associations qui conservent une parole politique critique, résumée par l'injonction :

      "dès lors que les associations reçoivent de l'argent public, elles ont intérêt à se tenir sages".

      Face à ce scénario d'affaiblissement, le webinaire explore en profondeur l'antidote principal : la coconstruction des politiques publiques.

      Loin d'une simple consultation, la coconstruction est présentée dans sa définition la plus exigeante, impliquant un partage du pouvoir et des éléments de codécision.

      Pour être efficace, elle doit s'appuyer sur une méthodologie rigoureuse, commençant par un diagnostic partagé et se poursuivant jusqu'à l'évaluation commune des actions.

      Deux modèles d'action concrets sont examinés :

      1. Les schémas d'orientation (Solima) du secteur culturel, qui offrent un retour d'expérience de près de vingt ans sur des processus de concertation structurés.

      Bien qu'ils aient prouvé leur efficacité pour améliorer l'interconnaissance et la coopération, ils révèlent des limites quant à leur capacité à faire évoluer durablement les politiques publiques et à surmonter la culture du "qui paie, décide".

      2. La démocratie d'interpellation, qui apparaît comme un prérequis essentiel.

      Ce concept vise à doter les citoyens, et notamment les plus marginalisés, des outils (pétitions à seuils, fonds de soutien) leur permettant d'inscrire leurs préoccupations à l'agenda politique, créant ainsi les conditions initiales d'une future coconstruction.

      En conclusion, si la coconstruction représente une voie prometteuse pour renforcer la vitalité démocratique et l'autonomie du monde associatif, sa mise en œuvre reste un défi majeur.

      Elle se heurte à un contexte politique et économique défavorable et nécessite de surmonter des obstacles culturels profonds pour passer d'une logique de prestation de service à un partenariat authentique fondé sur le partage du pouvoir.

      1. Le Scénario de l'Affaiblissement : L'Instrumentalisation comme Menace Centrale

      Le webinaire identifie l'instrumentalisation comme une composante clé d'un "scénario d'affaiblissement" qui pèse sur le monde associatif.

      Ce processus vise à réduire les associations à un rôle d'exécutantes des politiques publiques, les privant de leur capacité d'initiative, de critique et de participation à la vie de la cité.

      Définition et Manifestations

      L'instrumentalisation est un processus par lequel les pouvoirs publics tendent à considérer les associations non plus comme des partenaires autonomes porteurs de projets d'intérêt général, mais comme de simples prestataires de services.

      Marianne Langlais (Collectif des associations citoyennes - CAC) la définit comme une attente que les associations, dès lors qu'elles sont financées par de l'argent public, "se tiennent sages".

      Cela implique :

      • • S'inscrire sans contester dans la ligne politique dominante, qualifiée de "néolibérale et autoritaire".

      • • Ne pas porter un message politique différent de celui attendu par les financeurs.

      • • Rester "politiquement neutre" dans un contexte qui ne l'est pas.

      Exemples Concrets de Discrédit Politique

      Cette pression s'accompagne de campagnes de discrédit visant à délégitimer les associations qui conservent une parole politique. Plusieurs exemples récents ont été cités : Niveau Acteur Cible Discours / Action Objectif Local Christelle Morançais (Présidente, Pays de la Loire)

      Associations culturelles

      Les accuse d'être le "monopole d'associations très politisées qui vivent d'argent public" pour justifier des coupes budgétaires massives. Justifier des coupes budgétaires.

      National

      Bruno Retailleau (Ministre de l'Intérieur)

      La Cimade et autres associations d'aide aux étrangers

      Affirme qu'elles doivent "agir en cohérence avec l'État", remettant en cause leur travail en centre de rétention.

      Aligner les actions des associations sur la politique gouvernementale.

      Européen

      Droite et extrême droite européenne

      Associations environnementales

      Lancement d'une "fake news" les accusant d'être payées par la Commission pour faire du lobbying pro-pacte vert.

      Les priver de financements européens, notamment du programme LIFE (budget de 5,4 milliards d'euros).

      Le Levier Financier : De la Subvention à la Commande Publique

      Au cœur du processus d'instrumentalisation se trouve une transformation profonde des modes de financement public.

      On observe un recul structurel de la subvention de fonctionnement au profit de la commande publique (marchés publics, appels à projets).

      • Contexte Européen : La création du marché unique en 1987 et sa règle d'or d'une "concurrence libre et non faussée" ont conduit à considérer la subvention comme une potentielle aide d'État illicite.

      • Impact en France : La part des subventions dans les recettes associatives a chuté de 41 % entre 2005 et 2017.

      • Conséquences : Le rapport Suxe ("Renforcer le financement des associations :

      une urgence démocratique", mai 2023) souligne que cette évolution entraîne une "fragilisation de leur équilibre financier, mais aussi et surtout par une perte de sens et une invisibilisation de ce qui caractérise l'association".

      Ce changement modifie radicalement le rapport de force :

      • La subvention reconnaît l'association comme étant à l'origine de l'initiative, sans attente de contrepartie directe.

      Elle favorise une politique ascendante ("bottom-up") où les associations agissent en "vigies citoyennes".

      • La commande publique positionne l'État ou la collectivité comme acheteur d'un service, fixant un cadre strict.

      Elle impose une politique descendante ("top-down") où les associations deviennent des prestataires.

      2. L'Antidote : La Coconstruction des Politiques Publiques

      Face à l'instrumentalisation, la coconstruction est présentée comme le principal antidote, permettant de restaurer un dialogue équilibré et de renforcer la vitalité démocratique.

      Fondamentaux et Définition Exigeante

      La coconstruction est définie non pas comme une simple consultation ou concertation – démarches souvent sources de "effets déceptifs" – mais comme un processus exigeant de partage du pouvoir.

      Selon Jean-Baptiste (CAC), on peut parler de coconstruction "à partir du moment où il y a des éléments de codécision".

      Cette approche s'ancre dans une vision de la démocratie en acte, illustrée par la définition de Paul Ricœur :

      "Est démocratique une société qui se reconnaît divisée, c’est-à-dire traversée par des contradictions d’intérêt, et qui se fixe comme modalité d’associer à part égale chaque citoyen dans l'expression, l'analyse, la délibération et l'arbitrage de ces contradictions."

      Une Méthodologie Structurée

      L'expérience montre que la coconstruction est un "chemin escarpé" et ne peut réussir sans méthode.

      Les travaux menés notamment par Laurent Fress dans le cadre d'une recherche-action (2017-2018) ont permis d'identifier cinq étapes clés pour un processus rigoureux :

      1. État des lieux et diagnostic partagé : Coproduire le savoir sur un territoire.

      Cette phase est fondamentale car "savoir, c'est pouvoir". Les Observatoires Locaux de la Vie Associative (OLVA) portés par le Rnma sont des outils privilégiés pour cette étape.

      2. Débat public et priorisation : Dégager collectivement les enjeux prioritaires et définir les modalités de la coconstruction.

      3. Validation des objectifs et plan d'action : Décider d'un plan d'action concret et, point crucial, en déterminer les moyens. C'est souvent à cette étape que les démarches échouent.

      4. Suivi de la mise en œuvre : Piloter conjointement la réalisation du plan d'action.

      5. Bilan commun et évaluation partenariale : Mesurer collectivement les effets et ajuster les priorités.

      Contexte et Obstacles

      La mise en œuvre de la coconstruction se heurte à un contexte général peu favorable :

      • • Une culture politique historiquement jacobine et décisionniste en France.

      • • L'imposition du New Public Management qui cantonne les associations à un rôle de gestionnaires.

      • • Un contexte économique de coupes budgétaires qui fragilise les partenaires associatifs et réduit les marges de manœuvre.

      3. Études de Cas et Modèles d'Action

      Le webinaire a mis en avant deux approches concrètes qui illustrent les potentiels et les défis de la coconstruction.

      L'Expérience du Secteur Culturel : Les Schémas d'Orientation (Solima)

      Présenté par Grégoire Patau (Ufisc), le Schéma d'Orientation des Lieux de Musiques Actuelles (Solima) est une méthode de coconstruction expérimentée depuis près de 20 ans.

      • Principes : Horizontalité (pas de hiérarchie entre les parties prenantes – État, collectivités, acteurs), démarche ascendante, inscription dans la durée.

      • Méthodologie : Un processus cyclique d'observation, conception, mise en œuvre et évaluation.

      • Bilan de l'expérience :

      • ◦ Succès : A systématiquement permis une meilleure connaissance des acteurs du territoire, renforcé les réseaux et généré de nouvelles coopérations.

      • ◦ Limites : A eu un impact plus limité sur la redéfinition concrète des politiques publiques ou l'allocation de nouveaux moyens.

      La posture des pouvoirs publics reste souvent "surplombante" et le principe du "qui paie, décide" difficile à dépasser.

      Le manque de moyens dédiés à l'animation et le risque d'essoufflement sont également des freins majeurs.

      La Démocratie d'Interpellation : Poser les Sujets à l'Agenda

      Léa Galois (Institut Alinski) a introduit le concept de démocratie d'interpellation comme une condition préalable à la coconstruction.

      Il s'agit de permettre aux citoyens, collectifs et associations de faire émerger un sujet et de l'inscrire à l'agenda politique, en particulier pour les voix habituellement "inaudibles".

      • Mécanismes proposés :

      • ◦ Paliers de pétition : Atteindre un certain nombre de signatures déclencherait de nouveaux droits (ex: un droit au dialogue avec les élus, un droit à une contre-expertise, le déclenchement d'un référendum d'initiative citoyenne).
      • ◦ Droit à la ressource : Création d'un "fonds d'interpellation" pour rembourser les frais de campagne et permettre aux groupes disposant de peu de moyens de se mobiliser efficacement.

      • Enjeux : L'un des défis majeurs, observé à Grenoble, est d'éviter que ces dispositifs ne reproduisent les inégalités politiques en étant principalement saisis par les catégories socioprofessionnelles les plus favorisées (CSP+).

      4. Perspectives et Recommandations Stratégiques

      Pour sortir du scénario de l'affaiblissement, plusieurs pistes d'action sont envisagées.

      • Traduire les Rapports en Actions : Il est jugé crucial d'éviter que le rapport Suxe ne reste lettre morte.

      La préconisation 16 est particulièrement mise en avant : abroger le Contrat d'Engagement Républicain (CER), jugé liberticide, et lui substituer la Charte des engagements réciproques, dont une évaluation nationale des déclinaisons locales est appelée de vœux.

      • L'Enjeu Crucial des Ressources : Un constat traverse toutes les interventions : la coconstruction et l'interpellation requièrent des moyens.

      Il est essentiel de faire reconnaître et financer la fonction "d'ingénierie et d'animation des coopérations" pour garantir un équilibre des pouvoirs dans le dialogue.

      • Vers un "Soulèvement Associatif" : Face au durcissement du contexte, le CAC lance un appel à une mobilisation pour un "soulèvement associatif", visant à reprendre une parole politique forte.

      Cette initiative est soutenue par la nécessité de documenter la situation, notamment via l'enquête nationale sur la santé financière des associations lancée par le Rnma, le Mouvement associatif et Hexopée.

      • S'outiller Méthodologiquement : La suite des travaux de la recherche participative ESCAPE devrait se concentrer sur la production d'outils méthodologiques, voire de manuels et de formations, pour aider les acteurs associatifs et les collectivités à mettre en œuvre des démarches de coconstruction rigoureuses et efficaces.

    1. Reviewer #2 (Public review):

      The manuscript by Yang et al. investigates how a prior experience (notably by the activation of sensory/reinforcing dopaminergic neurons) alters olfactory response and memory expression in Drosophila. They refer to a priming effect with the definition: "Priming is a process by which exposure to a stimulus affects the response to a subsequent stimulus in Humans". The authors observed that exposing flies to a series of shocks (or the optogenetic activation of aversively reinforcing dopaminergic neurons) decreases ensuing odour avoidance. Conversely, optogenetic activation of sweet-sensing neurons increases following odour avoidance. They proposed that the reduced odour avoidance was due to the involvement of reward dopaminergic neurons involved during shock (or the optogenetic activation of aversively reinforcing dopaminergic neurons). They indeed show the involvement of reward dopaminergic neurons innervating the mushroom body (the fly learning and memory centre) during shock preexposure. Recording (calcium activity) from reward dopaminergic neurons before and after shock preexposure shows that only a small subset of dopaminergic neurons innervating the mushroom body γ4 compartment increases their response to odour after shock. They then showed the requirement of the γ4 reward dopaminergic neurons during shock preexposure on ensuing odour avoidance. They also tested the role of the dopamine receptor in the mushroom body. They finally recorded from different mushroom body output neurons, including the one (MBON-γ4γ5) likely affected by the increased activity of the corresponding γ4 reward dopaminergic neurons after shock preexposure. They recorded odour-evoked responses from these neurons before and after shock preexposure, but did not find any plasticity, while they found a logical effect during spaced cycles of aversive training.

      Overall, the study is very interesting with a substantial amount of behavioural analysis and in vivo 2-photon calcium imaging data, but some major (and some minor) issues have to be resolved to strengthen their conclusions.

      (1) According to neuropsychological work (Henson, Encyclopedia of Neuroscience (2009), vol. 7, pp. 1055-1063), « Priming refers to a change in behavioral response to a stimulus, following prior exposure to the same, or a related, stimulus. Examples include faster reaction times to make a decision about the stimulus, a bias to produce that stimulus when generating responses, or the more accurate identification of a degraded version of the stimulus". Or "Repetition priming refers to a change in behavioural response to a stimulus following re-exposure" (PMID: 18328508). I therefore do not think that the effects observed by the authors are really the investigation of the neural mechanisms of priming. To me, the effect they observed seems more related to sensitisation, especially for the activation of sweet-sensing neurons. For the shock effect, it could be a safety phenomenon, as in Jacob and Waddell, 2020, involving (as for sugar reward) different subsets for short-term and long-term safety.

      (2) The author missed the paper from Thomas Preat, The Journal of Neuroscience, October 15, 1998, 18(20):8534-8538 (Decreased Odor Avoidance after Electric Shock in Drosophila Mutants Biases Learning and Memory Tests). In this paper, one of the effects observed by the authors has already been described, and the molecular requirement of memory-related genes is investigated. This paper should be mentioned and discussed.

      (3) Overall, the bidirectional effect they observed is interesting; however, their results are not always clear, and the use of a delta PI is sometimes misleading. The authors have mentioned that shocks induced attraction to the novel odour, while they should stick to the increase or decrease in preference/avoidance. As not all experiments are done in parallel logic, it is not always easy to understand which protocol the authors are using. For example, only optogenetics is used in the appetitive preexposure. Does exposing flies to sugar or activating reward dopaminergic neurons also increase odour avoidance? The observed increased odour avoidance after optogenetic activation of sweet-sensing neurons involve reward (e.g., decreased response) and/or punishment (e.g., increased response) to increase odour avoidance? The author should always statistically test the fly behavioural performances against 0 to have an idea of random choice or a clear preference toward an odour. On the appetitive side, the internal hunger state would play an important role. The author should test it or at least discuss it.

      (4) The authors found a discrepancy between genetic backgrounds; sometimes the same odour can be attractive or aversive. Different effects between the T-maze and the olfactory arena are found. The authors proposed that: "Punishment priming effect was still not detected, probably due to the insensitivity of the optogenetic arena". This is unclear to me, considering all prior work using this arena. The author should discuss it more clearly. They mentioned that flies could not be conditioned with air and electric shock. However, flies could be conditioned with the context + shock, which is changing in the T-maze and not in the optogenetic area.

    2. Author response:

      We thank both reviewers for their valuable comments. We have prepared a point-by-point response below.

      Reviewer #1 (Public review):

      Weaknesses:

      (1) The conclusions regarding the links between neural and behavioral mechanisms are mostly well supported by the data. However, what is less convincing is the authors' argument that their study offers evidence of 'priming'. An important hallmark of priming, at least as is commonly understood by cognitive scientists, is that it is stimulus specific: i.e., a repeated stimulus facilitates response times (repetition priming), or a repeated but previously ignored stimulus increases response times (negative priming). That is, it is an effect on a subsequent repeated stimulus, not ANY subsequent stimulus. Because (prime or target) stimuli are not repeated in the current experiments, the conditions necessary for demonstrating priming effects are not present. Instead, a different phenomenon seems to be demonstrated here, and one that might be more akin to approach/avoidance behavior to a novel or salient stimulus following an appetitive/aversive stimulus, respectively.

      (2) On a similar note, the authors' claim that 'priming' per se has not been well studied in non-human animals is not quite correct and would need to be revised. Priming effects have been demonstrated in several animal types, although perhaps not always described as such. For example, the neural underpinnings of priming effects on behavior have been very well characterized in human and non-human primates, in studies more commonly described as investigations of 'response suppression'.

      We thank the reviewer for these critical comments. After careful consideration of both reviews, we agree that “priming” may not be the most accurate term to describe the behavioral phenomenon. We plan to revise our terminology throughout the manuscript accordingly to better capture the generalized nature of the effect we observe.

      (3) The outcome measure - i.e., difference scores between the two odors or odor and non-odor (i.e., the number of flies choosing to approach the novel odor versus the number approaching the non-odor (air)) - appears to be reasonable to account for a natural preference for odors in the mock-trained group. However, it does not provide sufficient clarification of the results. The findings would be more convincing if these relative scores were unpacked - that is, instead of analyzing difference scores, the results of the interaction between group and odor preference (e.g., novel or air) (or even within the pre- and post-training conditions with the same animals) would provide greater clarity. This more detailed account may also better support the argument that the results are not due to conditioning of the US with pure air.

      We use the PI score as a standard metric to quantify all the odor preference in behavioral assays because it allows for robust comparison across different genetic or treatment groups under the same experimental setting. In T-maze, real time tracking of fly trajectories is technically difficult. With olfactory arenas, we showed some examples of fly distribution in quadrants over the entire odor choice test period (Figure 2—figure supplement 2) for both pre-trained and post-trained groups and discussed the trajectories in Discussion. We will ensure this point is clarified in the revised text.                       

      Reviewer #2 (Public review):

      […] They finally recorded from different mushroom body output neurons, including the one (MBON-γ4γ5) likely affected by the increased activity of the corresponding γ4 reward dopaminergic neurons after shock preexposure. They recorded odour-evoked responses from these neurons before and after shock preexposure, but did not find any plasticity, while they found a logical effect during spaced cycles of aversive training.

      We thank the reviewer for the summary. We would like to clarify that we did, in fact, observe plasticity in MBON-γ4γ5 following shock exposure, as shown in Figure 4B.

      Overall, the study is very interesting with a substantial amount of behavioural analysis and in vivo 2-photon calcium imaging data, but some major (and some minor) issues have to be resolved to strengthen their conclusions.

      (1) According to neuropsychological work (Henson, Encyclopedia of Neuroscience (2009), vol. 7, pp. 1055-1063), « Priming refers to a change in behavioral response to a stimulus, following prior exposure to the same, or a related, stimulus. Examples include faster reaction times to make a decision about the stimulus, a bias to produce that stimulus when generating responses, or the more accurate identification of a degraded version of the stimulus". Or "Repetition priming refers to a change in behavioural response to a stimulus following re-exposure" (PMID: 18328508). I therefore do not think that the effects observed by the authors are really the investigation of the neural mechanisms of priming. To me, the effect they observed seems more related to sensitisation, especially for the activation of sweet-sensing neurons. For the shock effect, it could be a safety phenomenon, as in Jacob and Waddell, 2020, involving (as for sugar reward) different subsets for short-term and long-term safety.

      As noted in our response to Reviewer #1, we plan to revise our use of the term “priming” in the manuscript to more accurately interpret the behavioral phenomenon.

      (2) The author missed the paper from Thomas Preat, The Journal of Neuroscience, October 15, 1998, 18(20):8534-8538 (Decreased Odor Avoidance after Electric Shock in Drosophila Mutants Biases Learning and Memory Tests). In this paper, one of the effects observed by the authors has already been described, and the molecular requirement of memory-related genes is investigated. This paper should be mentioned and discussed.

      We thank the reviewer for bringing this important reference to our attention. We will cite the Preat (1998) paper and discuss its relevant findings in relation to our own in the revised manuscript.

      (3) Overall, the bidirectional effect they observed is interesting; however, their results are not always clear, and the use of a delta PI is sometimes misleading. The authors have mentioned that shocks induced attraction to the novel odour, while they should stick to the increase or decrease in preference/avoidance.

      The ΔPI is calculated either as (trained PI – mock PI) for different animals or as (post PI – pre PI) for the same animals, with the specific calculation clarified in each figure legend. A positive ΔPI signifies an increase in preference for the odor, which is equivalent to a relative attraction or a decrease in avoidance.

      As not all experiments are done in parallel logic, it is not always easy to understand which protocol the authors are using. For example, only optogenetics is used in the appetitive preexposure. Does exposing flies to sugar or activating reward dopaminergic neurons also increase odour avoidance? The observed increased odour avoidance after optogenetic activation of sweet-sensing neurons involve reward (e.g., decreased response) and/or punishment (e.g., increased response) to increase odour avoidance?  

      We used different behavioral assays (T-maze or arena), stimuli (real shock or optogenetics), and protocols (different or same animal groups) to robustly demonstrate the phenomenon across platforms. We explained each protocol in the figures or texts, and we’ll make them clearer to follow in the revised version. We focused on activating a clean set of sugar sensing neurons because this optogenetic stimulus is an effective and efficient substitute to real sugar. We agree that testing reward dopaminergic neuron activation is a logical extension and will consider adding these experiments in the revised work.

      The author should always statistically test the fly behavioural performances against 0 to have an idea of random choice or a clear preference toward an odour.

      Our primary focus is on the change in preference induced by training, rather than the innate odor preference itself, which can be highly variable due to physiological and environmental factors. Statistical testing against 0 for innate preference scores is not standard practice in this specific paradigm, as the critical question is whether a treatment alters behavior relative to a control.

      On the appetitive side, the internal hunger state would play an important role. The author should test it or at least discuss it.

      For appetitive experiments, we always starve the flies on 1% agar for two days prior to behavioral tests to standardize their hunger state. We will consider adding fed flies as control groups in the revised work.

      (4) The authors found a discrepancy between genetic backgrounds; sometimes the same odour can be attractive or aversive.

      We observed minor discrepancies in innate odor preferences across genetic backgrounds, which is a known and common occurrence. Different genotypes and temperatures can result in different baseline PI scores. However, the key finding is that the relative change in odor preference following an aversive stimulus is consistent: it increases the relative preference for an odor compared to air. This sometimes reverses valence (aversion to attraction) and other times simply reduces aversion. Our analysis focuses on this consistent, relative change.

      Different effects between the T-maze and the olfactory arena are found. The authors proposed that: "Punishment priming effect was still not detected, probably due to the insensitivity of the optogenetic arena". This is unclear to me, considering all prior work using this arena. The author should discuss it more clearly.

      The punishment effect with CS+ present was reliably detected in the T-maze (Figure 1A) but was not significant in the olfactory arena (Figure 2—figure supplement 1B-C). We hypothesize that the olfactory arena assay is less sensitive than the T-maze for detecting such subtle behavioral changes. This is evidenced by the fact that even classical odor-shock conditioning yields lower PI in the arena (typically ~0.4) than in the T-maze (~0.8), likely due to the greater distance flies must explore and travel. The higher variance in the arena may therefore mask more modest effects. Here the effect under investigation was induced by optogenetically activating only a small subset of aversive dopaminergic neurons, a stimulus that is likely weaker than full electric shock. This reduced stimulus strength may have contributed to the challenge of detecting a significant effect in the less sensitive arena paradigm.

      They mentioned that flies could not be conditioned with air and electric shock. However, flies could be conditioned with the context + shock, which is changing in the T-maze and not in the optogenetic area.

      While flies can be conditioned to context, during the optogenetic stimulation period in the arena, the light is delivered uniformly across all four quadrants. Therefore, any potential context conditioning would be equivalent across the entire chamber and should not bias the final distribution of flies between the odor and air quadrants during the test, nor affect the calculated PI score.

    1. Reviewer #2 (Public review):

      Summary:

      In this single center, single arm, open label non-randomised study the authors tested the use of paclitaxel at 180-220 mg/m2 and cisplatin at 60mg/m2 in patients with squamous NSCLC and pemetrexed at 500mg/m2 and cisplatin at 60mg/m2 in adenocarcinoma of lung origin in the neoadjuvant setting. The chemotherapy appears to have been given at a relatively standard dose; though the platin dose at 60mg/m2 is somewhat lower than has been used in the checkmate 816 trial (75mg/m2/dose), this is a well-established dose for NSCLC.

      Key differences to currently approved neoadjuvant chemo-ICI treatment is that anti-PD1 antibody sintilimab (at 200mg/dose) was given on day 5 and that only 2 cycles of chemotherapy were given pre surgery, but then repeated on two occasions post surgery. Between May/2020 and Nov/2023 50 patients were screened, 38 went on to have this schedule of tx, 31 (~82%) went on to have surgery and 27 had the adjuvant treatment. The rate of surgery is entirely consistent with the checkmate 816 data.

      Question to the authors:

      It would be very helpful to understand why 7 (~18% of the population) patients did not make it to surgery and whether this is related to disease progression, toxicity or other reasons for withdrawal.

      The key clinical endpoints were pCR and mPR rates. 2/38 patients are reported to have achieved a radiological pCR but only 31 patients underwent surgery with histological verification. Supp table2 suggests that 10/31 patients achieved a pCR, 6/31 additional patients achieved a major pathological response and that 13/31 did not achieve a major pathological response

      It would be really helpful for understanding the clinical outcome to present the histopathological findings in the text in a bit more detail and to refer the outcome to the radiological findings. I note that the reference for pathological responses incorrectly is 38 patients as only 31 patients underwent surgery and were evaluated histologically.

      The treatment was very well tolerated with only 1 grade 3 AE reported. The longer term outcome will need to be assessed over time as the cohort is very 'young'. It is not clear what the adjuvant chemo-ICI treatment would add and how this extra treatment would be evaluated for benefit - if all the benefit is in the neoadjuvant treatment then the extra post-operative tx would only add toxicity

      Please consider what the two post-operative chemo-ICI cycles might add to the outcome and how the value of these cycles would be assessed. Would there be a case for a randomised assessment in the patients who have NOT achieved a mPR histologically?

      While the clinical dataset identifies that the proposed reduced chemo-ICI therapy has clinical merit and should be assessed in a randomized study, the translational work is less informative.

      The authors suggest that the treatment has a positive impact on T lymphocytes. Blood sampling was done at day 0 and day 5 of each of the four cycle of chemotherapy with an additional sample post cycle 4. The authors state that data were analysed at each stage.

      The data in Figure 3B are reported for three sets of pairs: baseline to pre day 5 in cycle 1, day 5 to day 21 in cycle 1, baseline of cycle to to day 5. It remains unclear whether the datasets contain the same top 20 clones and it would be very helpful to show kinetic change for the individual 'top 20 clones' throughout the events in individual patients; as it stands the 'top20 clones' may vary widely from timepoint to timepoint. Of note, the figures do not demonstrate that the top 20 TCR clones were 'continuously increased'.

      Instead, the data suggest that there are fluctuations in the relative distributions over time but that may simply be a reflection of shifts in T cell populations following chemotherapy rather than of immunological effects in the cancer tissue.<br /> Consistent with this the authors conclude (line 304/5): "No significant difference was observed in the diversity, evenness, and clonality of TCR clones across the whole treatment procedure" and this seems to be a more persuasive conclusion than the statement 'that a positive effect on T lymphocytes was observed' - where it is also not clear what 'positive' means.

      The text needs a more balanced representation of the data: only a small subset of four patients appear to have been evaluated to generate the data for figure 3B and only three patients (P5, P6, P7) can have contributed to figure 3C if the sample collection is represented accurately in Figure 3A.

      The text refers to flow cytometric results in SF3. However, no information is given on the flow cytometry in M&M, markers or gating strategy.

      Please consider changing the terminology of the 'phases' into something that is easier to understand. One option would be to use a reference to a more standard unit (cycle 1-4 of chemotherapy and then d0/d5/d21).

      Please make it explicit in the text that molecular analyses were undertaken for some patients only, and how many patients contribute to the data in figures 3B-F. Figure 3A suggests paired mRNA data were obtained in 2 patients (P2 and P5) but I cannot find the results on these analyses; four individual blood samples to assess TCR changes int PH1/PH2/PH3and PH4 were only available in four patients (P4,P5,P7,P9). Only three patients seem to have the right samples collected to allow the analysis for 'C3' in figure 3C.

      Please display for each of the 'top 20 clones' at any one timepoint how these clones evolve throughout the study; I expect that a clone that is 'top 20' at a given timepoint may not be among the 'top twenty' at all timepoints.

      Please also assess if the expanded clonotypes are present (and expanded) in the cancer tissue at resection, to link the effect in blood to the tumour. Given that tissue was collected for 31 patients, mRNA sequencing to generate TCR data should be possible to add to the blood analyses in the 12 patients in Figure 3A. Without this data no clear link can be made to events in the cancer.

      Please provide in M&M the missing information on the flow cytometry methodology (instrument, antibody clones, gating strategy) and what markers were used to define T cell subsets (naïve, memory, central memory, effector memory).

      The authors also describe that ctDNA reduces after chemo-ICI treatment. This is well documented in their data but ultimately irrelevant: if the cancer volume is reduced to the degree of a radiological or pathological response /complete response then the quantity of circulating DNA from the cancer cells must reduce. More interesting would be the question whether early changes predict clinical outcome and whether recurrent ct DNA elevations herald recurrence.

      Please probe whether the molecular data identify good radiological or pathological outcomes before cycle 2 is started and whether the ctDNA levels identify patients who will have a poor response and/or who relapse early.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      Cai et al have investigated the role of msiCAT-tailed mitochondrial proteins that frequently exist in glioblastoma stem cells. Overexpression of msiCAT-tailed mitochondrial ATP synthase F1 subunit alpha (ATP5) protein increases the mitochondrial membrane potential and blocks mitochondrial permeability transition pore formation/opening. These changes in mitochondrial properties provide resistance to staurosporine (STS)-induced apoptosis in GBM cells. Therefore, msiCAT-tailing can promote cell survival and migration, while genetic and pharmacological inhibition of msiCAT-tailing can prevent the overgrowth of GBM cells.

      Strengths:

      The CAT-tailing concept has not been explored in cancer settings. Therefore, the present provides new insights for widening the therapeutic avenue. 

      Your acknowledgment of our study's pioneering elements is greatly appreciated.

      Weaknesses:

      Although the paper does have strengths in principle, the weaknesses of the paper are that these strengths are not directly demonstrated. The conclusions of this paper are mostly well-supported by data, but some aspects of image acquisition and data analysis need to be clarified and extended.

      We are grateful for your acknowledgment of our study’s innovative approach and its possible influence on cancer therapy. We sincerely appreciate your valuable feedback. In response, this updated manuscript presents substantial new findings that reinforce our central argument. Moreover, we have broadened our data analysis and interpretation, as well as refined our methodological descriptions.

      Reviewer #2 (Public Review):

      This work explores the connection between glioblastoma, mito-RQC, and msiCAT-tailing. They build upon previous work concluding that ATP5alpha is CAT-tailed and explore how CAT-tailing may affect cell physiology and sensitivity to chemotherapy. The authors conclude that when ATP5alpha is CAT-tailed, it either incorporates into the proton pump or aggregates and that these events dysregulate MPTP opening and mitochondrial membrane potential and that this regulates drug sensitivity. This work includes several intriguing and novel observations connecting cell physiology, RQC, and drug sensitivity. This is also the first time this reviewer has seen an investigation of how a CAT tail may specifically affect the function of a protein. However, some of the conclusions in this work are not well supported. This significantly weakens the work but can be addressed through further experiments or by weakening the text.

      We appreciate the recognition of our study's novelty. To address your concerns about our conclusions, we have revised the manuscript. This revision includes new data and corrections of identified issues. Our detailed responses to your specific points are outlined below.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) In Figure 1B, please replace the high-exposure blots of ATP5 and COX with representative results. The current results are difficult to interpret clearly. Additionally, it would be helpful if the author could explain the nature of the two different bands in NEMF and ANKZF1. Did the authors also examine other RQC factors and mitochondrial ETC proteins? I'm also curious to understand why CAT-tailing is specific to C-I30, ATP5, and COX-V, and why the authors did not show the significance of COX-V.

      We appreciate your inquiry regarding the data.  Additional attempts were made using new patient-derived samples; however, these results did not improve upon the existing ATP5⍺, (NDUS3)C-I30, and COX4 signals presented in the figure.  This is possibly due to the fact that CAT-tail modified mitochondrial proteins represent only a small fraction of the total proteins in these cells.  It is acknowledged that the small tails visible above the prominent main bands are not particularly distinct. To address this, the revised version includes updated images to better illustrate the differences. We believe the assertion that GBM/GSCs possess CAT-tailed proteins is substantiated by a combination of subsequent experimental findings. The figure (refer to new Fig. 1B) serves primarily as an introduction. It is important to note that the CAT-tailed ATP5⍺ plays a vital role in modulating mitochondrial potential and glioma phenotypes, a function which has been demonstrated through subsequent experiments.

      It is acknowledged that the CAT-tail modification is not exclusive to the ATP5⍺protein.  ATP5⍺ was selected as the primary focus of this study due to its prevalence in mitochondria and its specific involvement in cancer development, as noted by Chang YW et al.  Future research will explore the possibility of CAT tails on other mitochondrial ETC proteins. Currently, NDUS3 (C-I30), ATP5⍺, and COX4 serve as examples confirming the existence of these modifications. It remains challenging to detect endogenous CAT-tailing, and bulk proteomics is not yet feasible for this purpose. COX4 is considered significant.  We hypothesize that CAT-tailed COX4 may function similarly to the previously studied C-I30 (Wu Z, et al), potentially causing substantial mitochondrial proteostasis stress.  

      Concerning RQC proteins, our blotting analysis of GBM cell lines now includes additional RQC-related factors. The primary, more prominent bands (indicated by arrowheads) are, in our assessment, the intended bands for NEMF and ANKZF1.  Subsequent blotting analyses showed only single bands for both ANKZF1 and NEMF, respectively. The additional, larger molecular weight band of NEMF, which was initially considered for property analysis (phosphorylation, ubiquitination, etc.), was not examined further as it did not appear in subsequent experiments (refer to new Fig. S1C).

      References:

      Chang YW, et al. Spatial and temporal dynamics of ATP synthase from mitochondria toward the cell surface. Communications biology. 2023;6(1).

      Wu Z, et al. MISTERMINATE Mechanistically Links Mitochondrial Dysfunction With Proteostasis Failure. Molecular cell. 2019;75(4).

      (2) In addition to Figure 1B, it would be interesting to explore CAT-tailed mETC proteins in cancer tissue samples.

      This is an excellent point, and we appreciate the question. We conducted staining for ATP5⍺ and key RQC proteins in both tumor and normal mouse tissues. Notably, ATP5⍺ in GBM exhibited a greater tendency to form clustered punctate patterns compared to normal brain tissue, and not all of it co-localized with the mitochondrial marker TOM20 (refer to new Fig. S3C-E). Crucially, we observed a significant increase in NEMF expression within mouse xenograft tumor tissues, alongside a decrease in ANKZF1 expression (refer to new Fig. S1A, B). These findings align with our observations in human samples.

      (3) Please knock down ATP5 in the patient's cells and check whether both the upper band and lower band of ATP5 have disappeared or not.

      This control was essential and has been executed now. To validate the antibody's specificity, siRNA knockdown was performed. The simultaneous elimination of both upper and lower bands upon siRNA treatment (refer to new Fig. S2A) confirms they represent genuine signals recognized by the antibody.

      (4) In Figure 1C and ID, add long exposure to spot aggregation and oligomer. Figure 1D, please add the blots where control and ATP5 are also shown in NHA and SF (similar to SVG and GSC827).

      New data are included in the revised manuscript to address the queries. Specifically, the new Fig 1D now displays the full queue as requested, featuring blots for Control, ATP5α, AT3, and AT20. Our analysis reveals that AT20 aggregates exhibit higher expression and accumulation rates in GSC and SF cells.

      Fig. 1C has been updated to include experimental groups treated with cycloheximide and sgNEMF. Our results show that sgNEMF effectively inhibits CAT-tailing in GBM cell lines, whereas cycloheximide has no impact. After consulting with the Reporter's original creator and optimizing expression conditions, we observed no significant aggregates with β-globin-non-stop protein, potentially due to the length of endogenous CAT-tail formation (as noted by Inada, 2020, in Cell Reports). Our analysis focused on the ratio of CAT-tailed (red box blots) and non-CAT-tailed proteins (green box blots). Comparing these ratios revealed that both anisomycin treatment and sgNEMF effectively hinder the CAT-tailing process, while cycloheximide has no effect.

      (5) In Figure 1E, please double-check the results with the figure legend. ATP5A aggregated should be shown endogenously. The number of aggregates shown in the bar graph is not represented in micrographs. Please replace the images. For Figure 1E, to confirm the ATP5-specific aggregates, it would be better if the authors would show endogenous immunostaining of C-130 and Cox-IV.

      Labels in Fig. 1E were corrected to reflect that the bar graph in Fig. 1F indicates the number of cells with aggregates, not the quantity of aggregates per cell. The presence of endogenous ATP5⍺ is accurately shown. To address the specificity of ATP5⍺, immunostaining for endogenous NUDS3 was conducted. This revealed NUDS3 aggregation in GBM cells (SF and GSC) lacking TOM20, as demonstrated in the new Fig. S3A, B. These findings suggest NUDS3 also undergoes CAT-tailing modification, similar to ATP5⍺.

      (6) Figure 3A. Please add representative images in the anisomycin sections. It is difficult to address the difference.

      We appreciate your feedback. Upon re-examining the Calcein fluorescence intensity data in Fig. 3A, we believe the images accurately represent the statistical variations presented in Fig. 3B. To address your concerns more effectively, please specify which signals in Fig. 3A you find potentially misleading. We are prepared to revise or substitute those images accordingly.

      (7) Figure 3D. If NEMF is overexpressed, is the CAT-tailing of ATP 5 reversed?

      Thank you. Your prediction aligns with our findings. We've added data to the revised Fig. S6A, B, which demonstrates that both NEMF overexpression and ANKZF1 knockdown lead to elevated levels of CRC. This increase, however, was not statistically significant in GSC cells. A plausible explanation for this discrepancy is that the MPTP of GSC cells is already closed, thus any additional increase in CAT-tailing activity does not result in further amplification.

      (8) Figure 3G. Why on the BN page are AT20 aggregates not the same as shown in Figure 2E?

      We appreciate your inquiry regarding the ATP5⍺ blots, specifically those in the original Fig. 3G (left) and 2E (right). Careful observation of the ATP5⍺ band placement in these figures reveals a high degree of similarity. Notably, there are aggregates present at the top, and the diffuse signals extend downwards. Given that this is a gradient polyacrylamide native PAGE, the concentration diminishes towards the top. Consequently, the non-rigid nature of the Blue Native PAGE gel may lead to slight variations in the aggregate signals; however, the overall patterns are very much alike. To mitigate potential misinterpretations, we have rearranged the blot order in the new Fig. 3M.

      (9) Figure 4D. The amount of aggregation mediated by AT20 is more compared to AT3. Why are there no such drastic effects observed between AT3 and AT20 in the Tunnel assay?

      The previous Figure 4D presents the quantification of cell migration from the experiment depicted in Figure 4C. But this is a good point. TUNEL staining results are directly influenced by mitochondrial membrane potential and the state of mitochondrial permeability transition pores (MPTP), not by the degree of protein aggregation. Our previous experiments showed comparable effects of AT3 and AT20 on mitochondria (Fig. 2E, 3K), which aligns with the expected similar outcomes on TUNEL staining. As for its biological nature, this could be very complicated. We hope to explore it in future studies.

      (10) Figure 5C: The role of NEMF and ANKZF1 can be further clarified by conducting Annexin-PI assays using FACS. The inclusion of these additional data points will provide more robust evidence for CAT-tailing's role in cancer cells.

      In response to your suggestion, we have incorporated additional data into the revised version.

      Using the Annexin-PI kit, we labeled apoptotic cells and detected them using flow cytometry (FACS). Our findings indicate that anisomycin pretreatment, NEMF knockdown (sgNEMF), and ANZKF1 upregulation (oeANKZF1) significantly increase the rate of STS-induced apoptosis compared to the control group (refer to new Fig. S9D-G).

      (11) Figure 5F: STS is a known apoptosis inhibitor. Why it is not showing PARP cleavage?

      Also, cell death analysis would be more pronounced, if it could be shown at a later time point. What is the STS and Anisomycin at 24h or 48h time-point? Since PARP is cleaved, it would also be better if the authors could include caspase blots.

      I guess what you meant to say here is "Staurosporine is a protein kinase inhibitor that can induce apoptosis in multiple mammalian cell lines." Our study observed PARP cleavage even in GSCs, which are typically more resistant to staurosporine-induced apoptosis (C-PARP in Fig. S9B). The ratio of C-PARP to total PARP increased. We selected a 180-minute treatment duration because longer treatments with STS + anisomycin led to a late stage of apoptosis and non-specific protein degradation (e.g., at 24 or 48 hours), making PARP comparisons less meaningful. Following your suggestion, we also examined caspase 3/7 activity in GSC cells treated with DMSO, CHX, and anisomycin. We found that anisomycin treatment also activated caspases (Fig. S9A).

      (12) In Figure 5, the addition of an explanation, how CAT-tailing can induce cell death, would add more information such as BAX-BCL2 ratio, and cytochrome-c release from the mitochondria.

      Thank you for your suggestion. In this study, we state that specific CAT-tails inhibit GSC cell death/apoptosis rather than inducing it. Therefore, we do not expect that examining BAX-BCL2 and mitochondrial cytochrome c release would offer additional insights.

      (13) To confirm the STS resistance, it would be better if the author could do the experiments in the STS-resistant cell line and then perform the Anisomycin experiments.

      Thank you. We should emphasize that our data primarily originates from GSC cells. These cells already exhibit STS-resistance when compared to the control cells (Fig. S8A-C).

      (14) It would be more advantageous if the author could show ATP5 CATailed status under standard chemotherapy conditions in either cell lines or in vivo conditions.

      This is an interesting question. It's worth exploring this question; however, GSC cells exhibit strong resistance to standard chemotherapy treatments like temozolomide (TMZ).

      Additionally, we couldn't detect changes in CAT-tailed ATP5⍺ and thus did not include that data.

      (15) In vivo (cancer mouse model or cancer fly model) data will add more weight to the story.

      We appreciate your intriguing question. An effective approach would be to test the RQC pathway's function using the Drosophila Notch overexpression-induced brain tumor model. However, Khaket et al. have conducted similar studies, stating, "The RNAi of Clbn, VCP, and Listerin (Ltn), homologs of key components of the yeast RQC machinery, all attenuated NSC over-proliferation induced by Notch OE (Figs. 5A and S5A–D, G)." This data supports our theory, and we have incorporated it into the Discussion. While the mouse model more closely resembles the clinical setting, it is not covered by our current IACUC proposal. We intend to verify this hypothesis in a future study.

      Reference:

      Khaket TP, Rimal S, Wang X, Bhurtel S, Wu YC, Lu B. Ribosome stalling during c-myc translation presents actionable cancer cell vulnerability. PNAS Nexus. 2024 Aug 13;3(8):pgae321.

      Reviewer #2 (Recommendations For The Authors):

      Figure 1B, C: To demonstrate that Globin, ATP5alpha, and C-130 are CAT-tailed, it is necessary to show that the high mobility band disappears after NEMF deletion or mutagenesis of the NFACT domain of NEMF. This can be done in a cell line. The anisomycin experiment is not convincing because the intensity of the bands drops and because no control is done to show that the effects are not due to translation inhibition (e.g. cycloheximide, which inhibits translation but not CAT tailing). Establishing ATP5alpha as a bonafide RQC substrate and CAT-tailed protein is critical to the relevance of the rest of the paper.

      Thank you for suggesting this crucial control experiment.

      To confirm the observed signal is indeed a bona fide CAT-tail, it's essential to demonstrate that NEMF is necessary for the CAT-tailing process. We have incorporated data from NEMF knockdown (sgNEMF) and cycloheximide treatment into the revised manuscript. Our findings show that both sgNEMF and anisomycin treatment effectively inhibit the formation of CAT-tailing signals on the reporter protein (Fig. 1C). Similarly, NEMF knockdown in a GSC cell line also effectively eliminated CAT-tails on overexpressed ATP5⍺ (Fig. S2B).

      In general, the text should be weakened to reflect that conclusions were largely gleaned from artificial CAT tails made of AT repeats rather than endogenously CAT-tailed ATP5alpha. CAT tails could have other sequences or be made of pure alanine, as has been suggested by some studies.

      Thank you for your reminder. We have reviewed the recent studies by Khan et al. and Chang et al., and we found their analysis of CAT tail components to be highly insightful. We concur with your suggestion regarding the design of the CAT tail sequence. We aimed to design a tail that maintained stability and resisted rapid degradation, regardless of its length. In the revised version, we clarify that our conclusions are based on artificial CAT tails, specifically those composed of AT repeat sequences (p. 9). We acknowledge that the presence of other sequence components may lead to different outcomes (p. 19).

      Reference:

      Khan D, Vinayak AA, Sitron CS, Brandman O. Mechanochemical forces regulate the composition and fate of stalled nascent chains. bioRxiv [Preprint]. 2024 Oct 14:2024.08.02.606406. Chang WD, Yoon MJ, Yeo KH, Choe YJ. Threonine-rich carboxyl-terminal extension drives aggregation of stalled polypeptides. Mol Cell. 2024 Nov 21;84(22):4334-4349.e7. 

      Throughout the work (e.g. 3B, C), anisomycin effects should be compared to those with cycloheximide to observe if the effects are specific to a CAT tail inhibitor rather than a translation inhibitor.

      We agree that including cycloheximide control experiments is crucial. The revised version now incorporates new data, as depicted in Fig. S5A, B, illustrating alterations in the on/off state of MPTP following cycloheximide treatment. Furthermore, Fig. S6A, B present changes in Calcium Retention Capacity (CRC) under cycloheximide treatment. The consistency of results across these experiments, despite cycloheximide treatment, suggests that anisomycin's role is specifically as a CAT tail inhibitor, rather than a translation inhibitor.

      Line 110, it is unclear what "short-tailed ATP5" is. Do you mean ATP5alpha-AT3? If so this needs to be introduced properly. Line 132: should say "may indicate accumulation of CAT-tailed protein" rather than "imply".

      We acknowledge your points. We have clarified that the "short-tailed ATP5α" refers to ATP5α-AT3 and incorporated the requested changes into the revised manuscript.

      Figure 1C: how big are those potential CAT-tails (need to be verified as mentioned earlier)?

      They look gigantic. Include a ladder.

      In the revised Fig. 1D, molecular weight markers have been included to denote signal sizes. The aggregates in the previous Fig. 1C, also present in the control plasmid, are likely a result of signal overexposure. The CAT-tailed protein is observed just above the intended band in these blots. These aggregates have been re-presented in the updated figures, and their signal intensities quantified.

      Line 170: "indicating that GBM cells have more capability to deal with protein aggregation".

      This logic is unclear. Please explain.

      We appreciate your question and have thoroughly re-evaluated our conclusion. We offer several potential explanations for the data presented in Fig. 1D: (1) ATP5α-AT20 may demonstrate superior stability. (2) GSC (GBM) cells might lack adequate mechanisms to monitor protein accumulation. (3) GSC (GBM) cells could possess an increased adaptive capacity to the toxicity arising from protein accumulation. This discussion has been incorporated into the revised manuscript (lines 166-169).

      Line 177: how do you know the endogenous ATP5alpha forms aggregates due to CAT-tailing? Need to measure in a NEMF hypomorph.

      We understand your concern and have addressed it. Revised Fig. 3G, H demonstrates that a reduction in NEMF levels, achieved through sgNEMF in GSC cells, significantly diminishes ATP5α aggregation. This, in conjunction with the Anisomycin treatment data presented in revised Fig. 3E, F, confirms the substantial impact of the CAT-tailing process on this aggregation.

      Line 218: really need a cycloheximide or NEMF hypomorph control to show this specific to CAT-tailing.

      We have revised the manuscript to include data from sgNEMF and cycloheximide treatments, specifically Fig. 3G, H, and Fig. S5C, D, as detailed in our response above.

      Lines 249,266, Figure 5A: The mentioned experiments would benefit from controls including an extension of ATP5alpha that was not alanine and threonine, perhaps a gly-ser linker, as well as an NEMF hypomorph.

      We sincerely appreciate your insightful comments. In response, the revised manuscript now incorporates control data for ATP5α featuring a poly-glycine-serine (GS) tail. This data is specifically presented in Figs. S2E-G, S4E, S7A, D, E, and S8F, G. Our experimental findings consistently demonstrate that the overexpression of ATP5α, when modified with GS tails, had no discernible impact on protein aggregation, mitochondrial membrane potential, GSC cell mobility, or any other indicators assessed in our study.

      Figure S5A should be part of the main figures and not in the supplement.

      This has been moved to the main figure (Fig. 5C).

    1. Reviewer #2 (Public review):

      This study provides some interesting observations on how different flavour e-cigarettes can affect lung immunology; however, there are numerous flaws, including a low replicate number and a lack of effective validation methods, meaning findings may not be repeated. This is a revised article but several weaknesses remain related to the analysis and interpretation of the data.

      Strengths:

      The strength of the study is the successful scRNA-seq experiment which gives some preliminary data that can be used to create new hypotheses in this area.

      Weaknesses:

      Although some text weaknesses have been addressed since resubmission, other specific weaknesses remain: The major weakness is the n-number and analysis methods. Two biological n per group is not acceptable to base any solid conclusions. Any validatory data was too little (only cell % data) and not always supporting the findings (e.g. figure 3D does not match 3B/4A). Other examples include:

      (1) There aren't enough cells to justify analysis - only 300-1500 myeloid cells per group with not many of these being neutrophils or the apparent 'Ly6G- neutrophils'

      (2) The dynamic range of RNA measurement using scRNAseq is known to be limited - how do we know whether genes are not expressed or just didn't hit detection? This links into the Ly6G negative neutrophil comments, but in general the lack of gene expression in this kind of data should be viewed with caution, especially with a low n number and few cells. The data in the entire paper is not strong enough to base any solid conclusion - it is not just the RNA-sequencing data.

      (3) There is no data supporting the presence of Ly6G negative neutrophils. In the flow cytometry only Ly6G+ cells are shown with no evidence of Ly6G negative neutrophils (assuming equal CD11b expression). There is no new data to support this claim since resubmission and the New figures 4C and D actually show there are no Ly6G negative cells - the cells that the authors deem Ly6G negative are actually positive - but the red overlay of S100A8 is so strong it blocks out the green signal - looking to the Ly6G single stains (green only) you can see that the reported S100A8+Ly6G- cells all have Ly6G (with different staining intensities).

      (4) Eosinophils are heavily involved in lung macrophage biology, but are missing from the analysis - it is highly likely the RNA-sequence picked out eosinophils as Ly6G- neutrophils rather than 'digestion issues' the authors claim

      (5) After author comments, it appears the schematic in Figure 1A is misleading and there are not n=2/group/sex but actually only n=1/group/sex (as shown in Figure 6A). Meaning the n number is even lower than the previous assumption.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      The authors tackled the public concern about E-cigarettes among young adults by examining the lung immune environment in mice using single-cell RNA sequencing, discovering a subset of Ly6G- neutrophils with reduced IL-1 activity and increased CD8 T cells following exposure to tobaccoflavored e-cigarettes. Preliminary serum cotinine (nicotine metabolite) measurements validated the effective exposure to fruit, menthol, and tobacco-flavored e-cigarettes with air and PG:VG serving as control groups. They also highlighted the significance of metal leaching, which fluctuated over different exposure durations to flavored e-cigarettes, underscoring the inherent risks posed by these products. The scRNAseq analysis of e-cig exposure to flavors and tobacco demonstrated the most notable differences in the myeloid and lymphoid immune cell populations. Differentially expressed genes (DEGs) were identified for each group and compared against the air control. Further subclustering revealed a flavor-specific rise in Ly6G- neutrophils and heightened activation of cytotoxic T cells in response to tobacco-flavored e-cigarettes. These effects varied by sex, indicating that immune changes linked to e-cig use are dependent on gender. By analyzing the expression of various genes and employing gene ontology and gene enrichment analysis, they identified key pathways involved in this immune dysregulation resulting from flavor exposure. Overall, this study affirmed that e-cigarette exposure can suppress the neutrophil-mediated immune response, subsequently enhancing T cell toxicity in the lung tissue of mice.

      Strengths:

      This study used single-cell RNA sequencing to comprehensively analyze the impact of e-cigarettes on the lung. The study pinpointed alterations in immune cell populations and identified differentially expressed genes and pathways that are disrupted following e-cigarette exposure. The manuscript is well written, the hypothesis is clear, the experiments are logically designed with proper control groups, and the data is thoroughly analyzed and presented in an easily interpretable manner. Overall, this study suggested novel mechanisms by which e-cigs impact lung immunity and created a dataset that could benefit the lung immunity field.

      Weaknesses:

      The authors included a valuable control group - the PG:VG group, since PG:VG is the foundation of the e-liquid formulation. However, most of the comparative analyses use the air group as the control. Further analysis comparing the air group to the PG:VG group, and the PG:VG group to the individual flavored e-cig groups will provide more clear insights into the true source of irritation. This is done for a few analyses but not consistently throughout the paper. Flavor-specific effects should be discussed in greater detail. For example, Figure 1E shows that the Fruit flavor group exhibits more severe histological pathology, but similar effects were not corroborated by the singlecell data.

      We thank the reviewer for this query. We agree that PG:VG group is the foundation of the e-liquid formulation and hence comparisons with this group are of significance to understand the effect of individual flavors on the cell population. Though we compared the flavored e-cig groups with PG:VG group, we did not discuss it in detail within the manuscript to avoid confusions in interpretation for this study. However, we have now included the comparisons with the PG:VG group as a Supplement File S13-S18 in our revised manuscript to facilitate proper interpretation of our omics data to interested readers.

      While we agree that flavor-specific effects might be of interest, we did not delve into exploring them in detail as the fruit flavor e-liquids have now been regulated/banned from sale in the US. Thus, from regulatory point of view, the effects of tobacco-flavored e-liquids hold most interest. Since at the time of conducting this study, fruit flavors were in the market, we have still included the data. However, studying it further was not the focus of this work.

      The characterization of Ly6g+ vs Ly6g- neutrophils is interesting and potentially very impactful. Key results like this from scRNAseq analyses should be validated by qPCR and flow cytometry.

      Also, a recent study by Ruscitti et al reported Ly6g+ macrophages in the lung which can potentially confound the cell type analysis. A more detailed marker gene and sub-population analysis of the myeloid clusters could rule out this potential confounding factor.

      We agree with the reviewer that the loss of Ly6G on neutrophils is a very interesting finding and we have designed a neutrophil specific experiment to study the impact of e-cig exposure on neutrophil maturation and function which will be discussed in subsequent work by our group. To address the concerns raised by the reviewer, we stained the lung tissue samples from air-and tobacco flavored e-cig aerosol exposed mouse lungs with Ly6G and S100A8 (universal marker for neutrophil) to see the infiltration of Ly6G+ vs Ly6G- neutrophils within the lungs of exposed and unexposed mice. Results from this study showed that exposure to tobacco-flavored e-cig aerosol affects the neutrophil population within the mouse lungs. In fact, the changes were more pronounced for female mice. The data have now been shown in Figure 4.

      Reviewer #2 (Public review):

      This study provides some interesting observations on how different flavors of e-cigarettes can affect lung immunology, however there are numerous flaws including a low number of replicates and a lack of effective validation methods which reduces the robustness and rigor of the findings.

      Strengths:

      The strength of the study is the successful scRNA-seq experiment which gives good preliminary data that can be used to create new hypotheses in this area.

      Weaknesses:

      The major weakness is the low number of replicates and the limited analysis methods. Two biological n per group is not acceptable to base any solid conclusions. Any validatory data was too little (only cell % data) and did not always support the findings (e.g. Figure 4D does not match 4C). Often n seems to be combined and only one data point is shown, it is not at all clear how the groups were analyzed and how many cells in each group were compared.

      We thank the reviewer for recognizing the strengths of this manuscript while pointing out the errors to allow us to improve our analyses. We understand that the low number of replicates in this work makes the analyses difficult to draw solid conclusions, but this was a pilot study to identify the changes in the mouse lung upon acute exposures to flavored e-cig aerosols at a single cell level. So far, the e-cig field has been primarily focused on conducting toxicological studies to help regulatory bodies to set standards and enforce laws to better regulate the manufacture, sale and distribution of e-cig products. However, adolescents and young adults are still getting access to these products, and there is little to no understanding of how this may affect the lung health upon acute and chronic exposures. Single cell technology is a powerful tool to analyze the gene expression changes within cell populations to study cell heterogeneity and function. Yet, it is a costly tool owing to which conducting such analyses on large sample sizes is not ideal. This pilot study was designed to get some initial leads for our future studies involving larger sample sizes and chronic exposures. However, due to the vast information that is provided by a single cell RNA sequencing experiment, we intend to share it with a larger audience to support research and further study in this area. We understand that the validations are limited in our current work and so we have now conducted coimmunostaining to validate the Ly6G+ and Ly6G- neutrophil population. We have now included single cell findings with the validating experiments using classical methods of experimentation including ELISA, immunostaining or flow cytometry and revamped the whole manuscript. However, it is important to mention that such validations are sometimes challenging as many of these techniques still investigate the tissue while the changes shown in single cell analyses are mainly pertaining to a single cell type. This could be well-understood by looking at the flow cytometry results for neutrophils where we use Ly6G as a marker to stain for neutrophils which is only found in mature neutrophil population.

      Only 71,725 cells mean only 7,172 per group, which is 3,586 per animal - how many of these were neutrophils, T-cells, and macrophages? This was not shown and could be too low.

      We do agree that the number of cells could be too low. To avoid this, we did not study gene expression variations at the finest level of cell identity. We classified the cell clusters into general annotations -myeloid, lymphoid, endothelial, stromal and epithelial- and identified the changes in the gene expressions. Of these, only two clusters (myeloid and lymphoid) with more than ~1000 cells per cell type per group were studied in detail. We have included the cell count information to allow better interpretation of our results in the revised manuscript. For a single cell point of view, a cell count of ~3500 each with over 20000 features (genes) has good statistical strength and merit in our opinion.

      The dynamic range of RNA measurement using scRNA seq is known to be limited - how do we know whether genes are not expressed or just didn't hit detection? This links into the Ly6G negative neutrophil comment, but in general, the lack of gene expression in this kind of data should be viewed with caution, especially with a low n number and few cells.

      This is a well-taken point, and we thank the reviewer for this comment. We agree that the dynamic range RNA measurement is limited low cell numbers that could lead to bias. However, none of the clusters with counts lower than 150 were included for differential gene analyses. To avoid confusion, we now show immunofluorescence results to validate the findings. We are certain that with the inclusion of these validation experiments, will convince the reviewer about the loss of Ly6G marker from neutrophils and lack of proper neutrophilic response in exposed mouse lungs as compared to the controls.

      There is no rigorous quantification of Ly6G+ and Ly6G- cells int he flow cytometry data.

      We understand that flow-based quantification of our scRNA seq findings would be interesting. However, flow cytometry and single cell suspension to perform sequencing were performed parallelly for this study. We used a basic flow panel using single markers to identify individual immune cell type. We did identify changes in the Ly6G population in our treated and control samples using scRNA seq and intend to exclude it as a marker for our future studies using flow cytometry. Unfortunately, the same analyses could not be performed for the current batch of samples. We have now included results from IHC staining to identify the Ly6G+ and Ly6G- population in the lung tissues from control and treated mice in revised manuscript to address some of the concerns raised here. 

      Eosinophils are heavily involved in lung biology but are missing from the analysis.

      We use RBC lysis buffer to remove the excess RBCs during lung digestion for preparation of single cell suspension for scRNA seq in this study. Reports suggest that RBC lysis could adversely affect the eosinophil number and function. We did not identify any cell cluster, representing markers for eosinophils through our scRNA seq data and we believe that our lung digestion protocol could be the reason for it. We have studied the eosinophil changes through flow cytometry in these samples and have found significant changes as well. However, due to our inability to find cell clusters for eosinophil through scRNA seq data, we did not include these results in the final manuscript previously. To avoid confusion and maintain transparency, we have now included the changes in eosinophils through flow cytometry in revised manuscript (Figure S4).

      The figures had no titles so were difficult to navigate.

      We have now revamped the figures to make it easier for the readers to navigate.

      PGVG is not defined and not introduced early enough.

      We have made the necessary changes in the revised manuscript.

      Neutrophils are not well known to proliferate, so any claims about proliferation need to be accompanied by validation such as BrdU or other proliferation assays.

      We have now removed the cell cycle scoring information from the revised manuscript. Performing BrDU assay was not possible for these tissues due to limited samples and resources. However, we may consider performing it in our future studies.

      It was not clear how statistics were chosen and why Table S2 had a good comparison (two-way ANOVA with gender as a variable) but this was not used for other data particularly when looking at more functional RNA markers (Table S2 also lacks the interaction statistic which is most useful here).

      We have now included the two-way ANOVA statistics (Supplementary File S3) for other data included in the revised manuscript. It is important to note that since we did not identify any significant changes upon two-way ANOVA, the interaction statistics were not available for the abovementioned statistical test. We have included the interaction information wherever available.

      Many statistics are only vs air control, but it would be more useful as a flavor comparison to see these vs PGVG. In some cases, the carrier PGVG looks worse than some of the flavors (which have nicotine).

      While we agree with this comment of the reviewer, comparisons with PG:VG were not included due to the low cell numbers for PG:VG samples obtained following quality control and filtering of scRNA seq analyses.  However, considering the reviewer’s question we still include the details of comparisons with PG:VG included as supplementary files S13-S18 in the revised manuscript.

      The n number is a large issue, but in Figures such as 4, 6, and 7 it could be a bigger factor. The number of significant genes identified has been determined by chance rather than any real difference, e.g. Is Il1b not identified in Fruit flavor vs air because there wasn't enough n, while in Air vs Tobacco, it randomly hit the significance mark. This is but an example of the problems with the analysis and conclusions.

      While we agree in part with the concern raised here. In our opinion, an omics study is not necessarily aimed at finding the changes at transcript level with absolute certainty, but rather to identify probable cell and gene targets to validate with subsequent work. We did not claim that our findings are absolute outcomes but rather add the limitation of sample number and need for further research at every step. The strength of this work is to be the first study of its kind looking at changes in the lung cell population at single cell level upon e-cig aerosol exposure. This study has provided us with interesting gene and cell targets that we are now validating with future work. We still strongly believe that a dataset like this is a useful resource for a wider audience.  

      The data in Figure 7A is confusing, if this is a comparison to air, then why does air vs air not equal 1? Even if this was the comparison to the average of air between males and females, then this doesn't explain why CCL12 is >1 in both. Is this z-score instead? Regardless the data is difficult to interpret in this format.

      We have now changed the format of data representation in the figure.

      Individual n was not shown for almost all experiments - e.g. Figure 1D - what is this representative of? Figure 2D - is this bulk-grouped data for all cells and all mice? The heatmaps are also pooled from 2n and don't show the variability.

      Wherever needed, the n number has been included in the Figure legend. Additionally, the n number is shown in Figure 1A. However, with respect to the second comment we would like to differ from the reviewer’s opinion. Each scRNA seq data had 2 samples – one for male and another for female which has been clearly shown in the current figures. The pooling of cells as mentioned in the comment happened at the stage of preparation of cell suspension from each sex/group at the start of the sequencing. We show the results of the pooled sample showing the variability amongst pooled samples, which we acknowledge is a shortcoming of our work. In terms of representation of the heat maps and data analyses we have included all the needed information to uphold transparency of our study design and data visualization for each figure and would like to stick to the current representations. However, validation cohort does not involve any pooling of sample and still agrees with most of the deductions made from this study. So we are confident that no over statements have been made in this work and we still provide a useful dataset to inform future research in this area.

      Reviewer #3 (Public review):

      This work aims to establish cell-type specific changes in gene expression upon exposure to different flavors of commercial e-cigarette aerosols compared to control or vehicle. Kaur et al. conclude that immune cells are most affected, with the greatest dysregulation found in myeloid cells exposed to tobacco-flavored e-cigs and lymphoid cells exposed to fruit-flavored e-cigs. The up-and-downregulated genes are heavily associated with innate immune response. The authors suggest that a Ly6G-deficient subset of neutrophils is found to be increased in abundance for the treatment groups, while gene expression remains consistent, which could indicate impaired function. Increased expression of CD4+ and CD8+ T cells along with their associated markers for proliferation and cytotoxicity is thought to be a result of activation following this decline in neutrophil-mediated immune response.

      Strengths:

      (1) Single-cell sequencing data can be very valuable in identifying potential health risks and clinical pathologies of lung conditions associated with e-cigarettes considering they are still relatively new.

      (2) Not many studies have been performed on cell-type specific differential gene expression following exposure to e-cig aerosols.

      (3) The assays performed address several factors of e-cig exposure such as metal concentration in the liquid and condensate, coil composition, cotinine/nicotine levels in serum and the product itself, cell types affected, which genes are up- or down-regulated and what pathways they control.

      (4)Considerations were made to ensure clinical relevance such as selecting mice whose ages corresponded with human adolescents so that the data collected was relevant.

      Weaknesses:

      The exposure period of 1 hour a day for 5 days is not representative of chronic use and this time point may be too short to see a full response in all cell types. The experimental design is not well-supported based on the literature available for similar mouse models.

      This study was not designed to study the effects of chronic exposures on lung tissues. We were interested in delineating the effect of acute exposures for which the proposed study design was chosen. Previous work by our group has performed similar exposures and has been well received by the community. We understand that chronic exposures will be interesting to look at, but that was beyond the scope of this pilot study. Longer / chronic exposures will be conducted considering disease modifying effects of e-cigarettes.

      Several claims lack supporting evidence or use data that is not statistically significant. In particular, there were no statistical analyses to compare results across sex, so conclusions stating there is a sex bias for things like Ly6G+ neutrophil percentage by condition are observational.

      We thank the reviewer for this observation, and we have now included the necessary validations and details of the sex-based statistical analyses in the revised version of this manuscript. 

      Statistical analyses lack rigor and are not always displayed with the most appropriate graphical representation.

      We thank the reviewer and have included all the necessary statistical details with more details in the revised manuscript.

      Overall, the paper and its discussion are relatively limited and do not delve into the significance of the findings or how they fit into the bigger picture of the field.

      As pointed out by the reviewers themselves the strength of this work is in the first ever scRNA seq analyses of mice exposed to differently flavored e-cig aerosols in vivo. We also show cellspecific differential gene expressions and address some of the major queries made around e-cig research including release of metals on a day-to-day basis from the same coil. The limited sample number makes it difficult to draw solid conclusions from this work, which has been discussed as a shortcoming. Nevertheless, the major strength of this work is not in identifying specific trends, but rather to determine the possible cell and gene targets to expand the study for longer (chronic) exposures with a larger sample group. We have mentioned the significance of the study with respect to vaping effects on cellular heterogeneity leading to deleterious effects.

      The manuscript lacks validation of findings in tissue by other methods such as staining.

      We have now included some validation experiments and revamped the revised manuscript to support scRNA seq findings.

      This paper provides a foundation for follow-up experiments that take a closer look at the effects of e-cig exposure on innate immunity. There is still room to elaborate on the differential gene expression within and between various cell types.

      We thank the reviewer for this observation. The cell numbers for some cell clusters (especially epithelial cells) were too low. So, though we have performed the differential gene expression analyses on all the cell clusters, we refrained from discussing it in the manuscript to avoid over interpretation of our results. Only clusters with high enough (> 150) cells per sex per group were used to plot the heatmaps. We have now included the cell numbers for each cell type in the revisions to allow better interpretation of our data. Furthermore, the raw data from this study will be freely available to the public upon publication of this manuscript. This would enable the interested readers to access the raw data and study the cell types of interest in detail based on their study requirements. This data will be a useful resource for all in this community to inform and design future studies. 

      Recommendation For The Author:

      Major comments

      Mouse experiments are extremely variable and an n of 2 is not enough. Because of the complexity of separating male and female mice, the analyses are not adequately powered to support conclusions. The two-way ANOVA style approach to consider sex as a separate variable was a great idea in Table S2 - but this was not used elsewhere, and there is a need to show the interaction statistic (which would say if there is a flavor effect dependent on sex).

      We thank the reviewers for this recommendation. We agree that the experiments are highly variable. However, it is not merely an outcome of a small sample size (which we address as one of the limitations). What is important to mention here is the fact that validating results from single cell technologies using regular molecular biology techniques is challenging and may not completely align. It is because we are comparing single cell population in the former and a heterogeneous cell population in latter. However, considering this comment, we have now toned down our conclusions and performed some extra experiments to validate single cell findings. We also provide the results from two-way ANOVA statistics for all the figures/experiments performed in this work. 

      More validatory data with PCR, immunostaining, and flow cytometry would be very helpful. This includes validating the neutrophil functional and phenotype data and the T-cell data by flow cytometry.

      To validate the presence of Ly6G+ and Ly6G- neutrophil population, we performed coimmunostaining experiments and proved that exposure to tobacco-flavored e-cig aerosols results in increase in cell percentages of two neutrophil population in female mice. We also re-analyzed our Flow cytometry data to align with scRNA seq results. Multiplex protein assay was another technique used to show altered innate/adaptive immune responses upon exposure to differently flavored e-cig aerosol. Of note, considering the short duration of exposure we did not identify significant changes in cell numbers or inflammatory responses. But we have now validated our scRNA seq results using various techniques to draw meaningful conclusions.

      The in vivo experimental design seems to model very short-term exposure. In the literature, including the papers cited in the references, much longer time points are used, extending from several weeks to months of exposure. There seem to be few examples of papers using 5-day exposure and those that do are inspired by traditional cigarette smoke rather than e-cig aerosols or model acute exposure by making the daily duration longer. It is important to consider the possibility that the greatest number of up- or down-regulated genes are found in immune cell populations solely because they are the first to be affected by e-cig exposure and the other cell types just do not have time to become dysregulated in 5 days.

      We thank the reviewers for this comment. We do not refute the fact that our observations of major changes in the immune cell population are due to the short duration of exposure. This was one of the first studies using single cell technologies to look at cell specific changes in the mouse lungs exposed to e-cig aerosols. However, the future experiments being conducted in our lab are using more controlled approach to mimic chronic exposures to e-cig aerosols to identify changes in other cell types and long-term effects of e-cig exposures in vivo. However, since this was not the focus of this work, we have not discussed it in detail.

      The validity of the claims pertaining to septal thickening and mean linear intercept (MLI) are questionable due to the poor lung inflation of the treatment group, which the authors acknowledge. Thus, MLI cannot be accurately used. It is contradictory to state that the fruit-flavored treatment group presented challenges with inflation but then concluded that there is a phenotype. In addition, inflation with low-melting agarose is not an ideal method because it does not use a liquid column to maintain constant pressure. For these metrics to be used and evaluated, it is imperative that all lobes are properly inflated. Therefore, these data should either be repeated or removed.

      We agree with this critique and have removed the MLI quantification from the revised manuscripts, we also do not make claims regarding much histological changes upon exposure. We suggest further work in future to get better understanding of the effect of differently flavored e-cig aerosol exposure on mouse lungs.

      What is the purpose of analyzing cell cycle scores? Why is it relevant that neutrophils are in G2M-phase? Figure 3B shows that neutrophils are clearly in both G1- and G2M-phase and this cluster includes both Ly6G+ and Ly6G- subsets, so it does not seem accurate to claim that they are in the G2M-phase of the cell cycle, nor does it reveal anything novel about Ly6G- neutrophils. Is it possible that the cell cycle score is noting a point in differentiation when neutrophils acquire/begin expressing Ly6G? Ly6G expression in neutrophils has been found to be associated with differentiation and maturation. To rule out the possibility that this is a cell state being identified, differential gene expression between the 2 neutrophil subsets should be shown in a volcano plot. It would also be useful to stain for Ly6G+/- neutrophils using either IF or RNAscope to prove they are present. If the claim is that Ly6G- neutrophils are a "unique" population, it must be established to what extent they are unique. Immune cells cluster together on UMAPs, so what if these are a different cell type entirely, like another immature myeloid lineage, and this is an artifact of clustering? This could be clarified with a trajectory analysis and further subsetting of the immune population.

      We thank the reviewers for this comment. We now realize that analyzing the cell cycle scores was not serving the intended purpose in this work. Moreover, due to the use of pooled samples for scRNA seq analyses, it may not be best to perform such downstream analyses in our datasets. We have thus removed these graphs from the revised version and have tried to simplify the conclusions of our study to the readers. 

      Our main take home from this study is the increase in number of mature (Ly6G+) and immature (Ly6G-) neutrophils in tobacco-flavored e-cig aerosol exposed mouse lungs as compared to air control. This result was validated using co-immunofluorescence in the revised manuscript (Figure 4).

      In vivo validation of findings should be included, especially for the claimed changes. As of now, this paper serves more as a dataset that could be further explored by other groups, which in itself is valuable, but it is just one single cell sequencing experiment without validation.

      We thank the reviewers for this comment. We have used multiple techniques (flow cytometry, multiplex protein assay, co-immunofluorescence) in the revised manuscript to validate the scRNA seq findings. However, this was a preliminary study which was designed to generate a small dataset for future experiments, and we do not have resources to add more validatory experiments for this study. We are currently designing chronic e-cig exposure studies to elaborate upon certain hypothesis generated through this study in future.

      Minor Comments

      There are several examples of typos or small errors in the text that would benefit from proofreading. Examples: line 51 "in the many countries including (the) United States (US), (the) United Kingdom..."; on line 54, the reference cited states that 9.4% of middle schoolers are daily users, not 9.2%; on line 55 the reference cited states that these are the most commonly used flavors, not the most preferred, which explains why the percentages do not add up to 100; line 120 "the lungs were in a collapsed state than the other groups"; line 127 "to confirm out speculations"; line 136 "PGVG" instead of the previously used "PG:VG"; line 140 "(single cell capture))"; line 999 "result in" rather than "results in" for Figure 4 title, etc.

      We thank the reviewer for this comment. The manuscript has been thoroughly proofread and edited to avoid typos and grammatical errors.

      If this is a "pilot study" (as it is stated in the introduction) it is meant to assess the validity of experimental design on a small scale to later test a hypothesis. The authors should change the phrasing.

      We have now changed the phrasing as suggested.

      The introduction lacked the necessary context and background. Some information described in the results section could be addressed in the intro. For example: What is the significance of neutrophils having a Ly6G deficiency? Why was the exposure duration of 1 hour a day for 5 days chosen? Why use nose-only exposure when many models use whole-body exposure? Why look at cell-type-specific changes?

      We have made the necessary amendments in the introduction.

      Some figure titles only address certain panels rather than summarizing the figure as a whole. For example, the title of Figure 1 only refers to panel D and is unrelated to serum cotinine levels, septa thickening, or mean linear intercept. The text discussed conclusions about septa thickening and Lm values for the fruit-flavored treatment group, so they are equally relevant to the figure compared to the metal levels.

      We have now changed the Figures and Figure legends to summarize the figure.

      significance level is not defined in Figure 1 legend although it is used in Figure 1C.

      The Figure legend has now been updated.

      Figure 1E does not include a scale bar.

      We have now included the scale bar in updated figures.

      The multiplex ELISA shown in the experimental design schematic is not further discussed in the paper. Flow cytometry plots should be displayed in addition to the data they generated.

      The flow cytometry plots have now been included (Figures 3&5) and the results for Multiplex ELISA are shown as Figure S3D and lines 327-342 of the revised manuscript.

      In Figure 1F, a multivariate ANOVA should be used so that multiple groups can be compared across sex, rather than plotting in a sex-specific manner and claiming there exists a sex bias. The small sample size also introduces an issue because a p-value cannot be generated with so few samples.

      Per the suggestions made previously, figure 1F has now been removed from the revised manuscript.

      The protocol for achieving a single-cell suspension should be detailed in the methods section. As is, it only describes the sample collection and preparation. This could help elucidate to the reader why the UMAP shows such a large abundance of immune cells.

      We have now included the protocol in the revised manuscript.

      Clarify whether PG:VG was used as a control in the scRNA sequencing in addition to air to generate the UMAP in Figure 2A.

      Yes, PG:VG was used as one of the controls which has now been illustrated as groupwise comparison in Figure 2D. We have also included the comparisons to identify DEGs in myeloid and lymphoid clusters upon comparison of various treatment groups versus PGVG (Supplementary Files S13-S18)

      A UMAP should be shown for each treatment group/flavor. The overall UMAP in Figure 1A is good, but there could be another panel with separate projections for each condition.

      A groupwise UMAP has now been included in Figure 2D.

      In Figure 2C, relative cell percentage is not a reliable method to quantify cell type and the histogram is not a great way to visualize the data or its statistical significance. These claims should also be validated in tissue.

      We thank the reviewers for this comment and have tried to validate the findings using Flow cytometry. However, we may want to add that the changes observed in single cell technologies cannot be validated using simple molecular biology techniques as the markers used to specify cell clusters in scRNA seq is too specific which was not the case for the design of flow panel in this work. Our major purpose of using cell percentages was to show the flavor-specific changes in generalized cell populations in mouse lungs. So, we have still included these graphs in the revised manuscript.

      Figure 2D could be better illustrated with a volcano plot to show which genes are being dysregulated rather than just how many. Knowing which genes are affected is more valuable than knowing just the number of genes.

      Figure 2D is no longer a part of the revised manuscript. For the other comparisons we have still used heatmaps as they also depict sex-specific changes in gene expressions, which would have been difficult to elucidate using volcano plots.

      Assuming Figure 3C is representative of all conditions, then Figures 3C and D demonstrate that Ly6G- neutrophils are present in all conditions including controls. To see whether they are truly present in different abundances between treatment and control groups, separate UMAPs of the neutrophil subsets should be made per condition or use a dot plot for Figure 3A. This also applies to Figure 3B.

      We thank the reviewers for pointing this out. We have now revamped the whole manuscript and used additional validation experiments to show the presence of Ly6G- and Ly6G+ neutrophil population upon exposure to tobacco-flavored e-cig aerosols. 

      Figure 3E shows that there is no statistically significant change in % of Ly6G+ neutrophils across treatment groups, but the text claims that there is "an increase in the levels of Ly6G+ neutrophils in lung digests from mouse lungs exposed to tobacco-flavored e-cig aerosols" (lines 207-209). The text also claims that "The observed increase was more pronounced in males as compared to females" (lines 209-210), but there was no statistical analysis across sexes to support this statement. It is clear that the change in % of Ly6G+ neutrophils is more pronounced in males than females, but it is still not statistically significant. This figure should also be repeated for analysis of Ly6G- neutrophils. Lines 272-274 mention that the % increase is higher for Ly6G- neutrophils than for Ly6G+ neutrophils, but there is not an analogous histogram to demonstrate this. The claims made in lines 275-280 are not clearly shown in any figure.

      We thank the reviewer for this query. This was an error on our part. We have now added sex-specific changes using scRNA seq, flow cytometry and co-immunofluorescence-based experiments to prove that more pronounces changes in the Ly6G+ and Ly6G- neutrophil population occurs in female mice and not males.

      Figures 4 and 6 have an overwhelming amount of heatmaps. Volcano plots with downstream analyses could be used to make some of this data more legible. The main findings should be validated in vivo/in tissue.

      We have now revamped the figures and data distribution to make the data legible and remove overwhelming amount of data from the slides.

      For Figure 5, show cell type by condition and do differential gene expression analysis displayed in a volcano plot. Then, stain tissue to validate the findings. Compare across sex during statistical analysis.

      The necessary changes have been made.

      Figure 6 error: panels E and F should be labeled as "tobacco" rather than "fruit".

      Error has now been fixed.

      Figure 7C can be placed in the supplemental materials.

      It has now been included in supplemental materials.

      The Figure 6E title should have been tobacco instead of fruit.

      This error has now been fixed.

      Line 381 mentioned the wrong subfigure. (Figure 7B instead of 7E).

      We have now made the necessary edits.

    3. Reviewer #1 (Public review):

      Summary:

      The authors assess the impact of E-cigarette smoke exposure on mouse lungs using single cell RNA sequencing. Air was used as control and several flavors (fruit, menthol, tobacco) were tested. Differentially expressed genes (DEGs) were identified for each group and compared against the air control. Changes in gene expression in either myeloid or lymphoid cells were identified for each flavor and the results varied by sex. The scRNAseq dataset will be of interest to the lung immunity and e-cig research communities and some of the observed effects could be important. Unfortunately, the revision did not address the reviewers' main concerns about low replicate numbers and lack of validations. The study remains preliminary and no solid conclusions could be drawn about the effects of E-cig exposure as a whole or any flavor-specific phenotypes.

      Strengths:

      The study is the first to use scRNAseq to systematically analyze the impact of e-cigarettes on the lung. The dataset will be of broad interest.

      Weaknesses:

      scRNAseq studies may have low replicate numbers due to the high cost of studies but at least 2 or 3 biological replicates for each experimental group is required to ensure rigor of the interpretation. This study had only N=1 per sex per group and some sex-dependent effects were observed. This could have been remedied by validating key observations from the study using traditional methods such as flow cytometry and qPCR, but the limited number of validation experiments did not support the conclusions of the scRNAseq analysis. An important control group (PG:VG) had extremely low cell numbers and was basically not useful. Statistical analysis is lacking in almost all figures. Overall, this is a preliminary study with some potentially interesting observations but no solid conclusions can be made from the data presented.

      (1) The only new validation experiment is the immunofluorescent staining of neutrophils in Figure 4. The images are very low resolution and low quality and it is not clear which cells are neutrophils. S100A8 (calprotectin) is highly abundant in neutrophils but not strictly neutrophil-specific. It's hard to distinguish positive cells from autofluorescence in both Ly6g and S100a8 channels. No statistical analysis in the quantification.

      (2) It is unclear what the meaning of Fig. 3A and B is, since these numbers only reflect the number of cells captured in the scRNAseq experiment and are not biologically meaningful. Flow cytometry quantification is presented as cell counts, but the percentage of cells from the CD45+ gate should be shown. No statistical analysis is shown, and flow cytometry results do not support the conclusions of scRNAseq data.

    1. Reviewer #1 (Public review):

      Summary:

      The authors revealed the cellular heterogeneity of companion cells (CCs) and demonstrated that the florigen gene FT is highly expressed in a specific subpopulation of these CCs in Arabidopsis. Through a thorough characterization of this subpopulation, they further identified NITRATE-INDUCIBLE GARP-TYPE TRANSCRIPTIONAL REPRESSOR 1 (NIGT1)-like transcription factors as potential new regulators of FT. Overall, these findings are intriguing and valuable, contributing significantly to our understanding of florigen and the photoperiodic flowering pathway. However, there is still room for improvement in the quality of the data and the depth of the analysis. I have several comments that may be beneficial for the authors.

      Strengths:

      The usage of snRNA-seq to characterize the FT-expressing companion cells (CCs) is very interesting and important. Two findings are novel: 1) Expression of FT in CCs is not uniform. Only a subcluster of CCs exhibits high expression level of FT. 2) Based on consensus binding motifs enriched in this subcluster, they further identify NITRATE-INDUCIBLE GARP-TYPE TRANSCRIPTIONAL REPRESSOR 1 (NIGT1)-like transcription factors as potential new regulators of FT.

      Weaknesses:

      (1) Title: "A florigen-expressing subpopulation of companion cells". It is a bit misleading. The conclusion here is that only a subset of companion cells exhibit high expression of FT, but this does not imply that other companion cells do not express it at all.

      (2) Data quality: Authors opted for fluorescence-activated nuclei sorting (FANS) instead of traditional cell sorting method. What is the rationale behind this decision? Readers may wonder, especially given that RNA abundance in single nuclei is generally lower than that in single cells. This concern also applies to snRNA-seq data. Specifically, the number of genes captured was quite low, with a median of only 149 genes per nucleus. Additionally, the total number of nuclei analyzed was limited (1,173 for the pFT:NTF and 3,650 for the pSUC2:NTF). These factors suggest that the quality of the snRNA-seq data presented in this study is quite low. In this context, it becomes challenging for the reviewer to accurately assess whether this will impact the subsequent conclusions of the paper. Would it be possible to repeat this experiment and get more nuclei?

      (3) Another disappointment is that the authors did not utilize reporter genes to identify the specific locations of the FT-high expressing cells (cluster 7 cells) within the CC population in vivo. Are there any discernible patterns that can be observed?

      (4) The final disappointment is that the authors only compared FT expression between the nigtQ mutants and the wild type. Does this imply that the mutant does not have a flowering time defect particularly under high nitrogen conditions?

      Comments on revisions:

      I think the authors took my comments seriously and addressed most of my concerns. Overall, I find this to be a very interesting paper.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      The authors revealed the cellular heterogeneity of companion cells (CCs) and demonstrated that the florigen gene FT is highly expressed in a specific subpopulation of these CCs in Arabidopsis. Through a thorough characterization of this subpopulation, they further identified NITRATE-INDUCIBLE GARP-TYPE TRANSCRIPTIONAL REPRESSOR 1 (NIGT1)-like transcription factors as potential new regulators of FT. Overall, these findings are intriguing and valuable, contributing significantly to our understanding of florigen and the photoperiodic flowering pathway. However, there is still room for improvement in the quality of the data and the depth of the analysis. I have several comments that may be beneficial for the authors. 

      Strengths: 

      The usage of snRNA-seq to characterize the FT-expressing companion cells (CCs) is very interesting and important. Two findings are novel: 1) Expression of FT in CCs is not uniform. Only a subcluster of CCs exhibits high expression level of FT. 2) Based on consensus binding motifs enriched in this subcluster, they further identify NITRATE-INDUCIBLE GARP-TYPE TRANSCRIPTIONAL REPRESSOR 1 (NIGT1)-like transcription factors as potential new regulators of FT. 

      We are pleased to hear that reviewer 1 noted the novelty and importance of our work. As reviewer 1 mentioned, we are also excited about the identification of a subcluster of companion cells with very high FT expression. We believe that this work is an initial step to describe the molecular characteristics of these FT-expressing cells. We are also excited to share our new findings on NIGT1s as potential FT regulators. We believe this finding will attract a broader audience, as the molecular factor coordinating plant nutrition status with flowering time remains largely unknown despite its well-known phenomenon.

      Weaknesses: 

      (1) Title: "A florigen-expressing subpopulation of companion cells". It is a bit misleading. The conclusion here is that only a subset of companion cells exhibit high expression of FT, but this does not imply that other companion cells do not express it at all. 

      We agree with this comment, as it was not our intention to sound like that FT is not produced in other companion cells than the subpopulation we identified. We revised the title to more accurately reflect the point. The new title is “Companion cells with high florigen production express other small proteins and reveal a nitrogen-sensitive FT repressor.”

      (2) Data quality: Authors opted for fluorescence-activated nuclei sorting (FANS) instead of traditional cell sorting method. What is the rationale behind this decision? Readers may wonder, especially given that RNA abundance in single nuclei is generally lower than that in single cells. This concern also applies to snRNA-seq data. Specifically, the number of genes captured was quite low, with a median of only 149 genes per nucleus. Additionally, the total number of nuclei analyzed was limited (1,173 for the pFT:NTF and 3,650 for the pSUC2:NTF). These factors suggest that the quality of the snRNA-seq data presented in this study is quite low. In this context, it becomes challenging for the reviewer to accurately assess whether this will impact the subsequent conclusions of the paper. Would it be possible to repeat this experiment and get more nuclei?

      We appreciate this comment; we noticed that we did not clearly explain the rationale for using single-nucleus RNA sequencing (snRNA-seq) instead of single-cell RNA-seq (scRNA-seq). As reviewer 1 mentioned, RNA abundance in scRNA-seq is higher than in snRNA-seq. To conduct scRNA-seq using plant cells, protoplasting is the necessary step. However, in our study, protoplasting has many drawbacks in isolating our target cells from the phloem. First, it is technically challenging to efficiently isolate protoplasts from highly embedded phloem companion cells from plant tissues. Typically, at least several hours of enzymatic incubation are required to obtain protoplasts from companion cells (often using semi-isolated vasculatures), and the efficiency of protoplasting vasculature cells remains low. Secondly, for our analysis, restoring the time information within a day is also crucial. Therefore, we employed a more rapid isolation method. In the revision, we will explain our rationale for choosing snRNA-seq due to the technical limitations. In the revised manuscripts, we added four new sentences in the Introduction section to clearly explain these points.

      Reviewer 1 also raised a concern about the quality of our snRNA-seq data, referring to the relatively low readcounts per nucleus. Although we believe that shallow reads do not necessarily indicate low quality and are confident in the accuracy of our snRNA-seq data, as supported by the detailed follow-up experiments (e.g., imaging analysis in Fig. 4B), we agree that it is important to address this point in the revision and alleviate readers’ concerns regarding the data quality. 

      We believe the primary reason for the low readcounts per cell is the small amount of RNA present in each Arabidopsis vascular cell nucleus that we isolated. For bulk nuclei RNAseq, we collected 15,000 nuclei. However, the total RNA amount was approximately 3 ng. It indicates that each nucleus isolated contains a very limited amount of RNA (by the simple calculation, 3,000 pg / 15,000 nuclei = 0.2 pg/nucleus). It appears that the size of cells and nuclei was still small in 2-week-old seedlings; thus, each nucleus may contain lower levels of RNA. During the optimization process, we also tried to fix the tissues that we hoped to restore nuclear retained RNA, but unfortunately, in our hands, we encountered the technical issue of nuclei aggregation that hindered the sorting process, which is not suitable for single-nucleus RNA-seq.

      Reviewer 1 suggested that we repeat the same snRNA-seq experiment. We agree that having more cells increases the reliability of data. However, to our knowledge, higher cell numbers enhance the confidence of clustering, but not readcounts per cell. In our snRNAseq data, our target, FT-expressing cells, were observed in cluster 7, which projected at an obvious distance from other cell clusters. Therefore, we think that having more nuclei does not significantly help in separating high FT-expressing cluster 7 cells and different types of cells, although we may obtain more DEGs from the cluster 7 cells. Considering the costs and time required for additional snRNA-seq experiments, we think that adding more followup molecular biology experiment data would be more practical. We clearly stated the limitations of our approach in the Discussion section. “A drawback of our snRNA-seq analysis was shallow reads per nucleus. It appears mainly due to the low abundance of mRNA in nuclei from 2-week-old leaves. Based on our calculation, the average mRNA level per nucleus is approximately 0.2 pg (3,000 pg mRNA from 15,000 sorted nuclei). Future technological advance is needed to improve the data quality“

      In this revised version of the manuscript, we silenced FT gene expression using an amiRNA against FT driven by tissue-specific promoters [pROXY10, cluster 7; pSUC2, companion cells; pPIP2.6, cluster 4 (for the spatial expression pattern of PIP2.6, please see the new data shown in Fig. S8F); pGC1, guard cells]. Given that both FT and ROXY10 were highly expressed in cluster 7 of our snRNA-seq dataset, we anticipated the late flowering phenotype of pROXY10:amiRNA-ft. As we expected, pROXY10:amiR-ft but not pPIP2.6:amiR-ft lines showed delayed flowering phenotypes (Fig. S14A), supporting the validity of our snRNA-seq approach. We are also now more confident in the resolution of our snRNA-seq analysis, since cluster 4-specific PIP2.6 did not cause late flowering despite its higher basal expression than ROXY10 (Fig. S14B).

      (3) Another disappointment is that the authors did not utilize reporter genes to identify the specific locations of the FT-high expressing cells (cluster 7 cells) within the CC population in vivo. Are there any discernible patterns that can be observed? 

      In the original manuscript, as we showed only limited spatial images of overlap between FT and other cluster 7 genes in Fig. 4B, this comment is totally understandable. To respond to it, we added whole leaf images showing the spatial expression of FT and other cluster 7 genes (Fig. S12). These data indicate that cluster 7 genes including FT are expressed highly in minor veins in the distal part of the leaf but weakly in the main vein. We also added enlarged images of spatial expression of FT and cluster 7 genes (FLP1 and ROXY10) to note that those genes do not overlap completely (Fig. S13).

      In contrast to cluster 7 genes, genes highly expressed in cluster 4, such as LTP1 and MLP28, are reportedly highly expressed in the main leaf vein. To further confirm it, we established a transgenic line that expresses a GFP-fusion protein controlled by the promoter of a cluster 4-specific gene PIP2.6 (Fig. S8F). It also showed strong GFP signals in the main vein, consistent with previous observations of LTP1 and MLP28.   In summary, FT-expressing cells (cluster 7 cells) are enriched in companion cells in the minor vein, and their expression patterns show a clear distinction from genes expressed in the main vein (e.g., cluster 4-specific genes). 

      (4) The final disappointment is that the authors only compared FT expression between the nigtQ mutants and the wild type. Does this imply that the mutant does not have a flowering time defect particularly under high nitrogen conditions? 

      We agree with reviewer 1 that more experiments are required to conclude the role of NIGT1 on FT regulation, in addition to our Y1H data, flowering time data of NIGT1 overexpressors, and FT expression in NIGT1 overexpressors and nigtQ mutant.

      First, to test the direct regulation of NIGT1s on FT transcription, we conducted a transient luciferase (LUC) assay in tobacco leaves using effectors (p35S:NIGT1.2, p35S:NIGT1.4, and p35S:GFP) and reporters [pFT:LUC (FT promoter fused with LUC) and pFTm:LUC (the same FT promoter with mutations in NIGT1-binding sites fused with LUC)]. Our result showed that NIGT1.2 and NIGT1.4, but not GFP, decreased the activity of pFT:LUC but not pFTm:LUC (Fig. 5C). This indicates that NIGT1s directly repress the FT gene.

      Second, to address reviewer 1’s suggestion about the effect of of nigtQ mutation on flowering time, we have grown WT and nigtQ plants on 20 mM and 2 mM NH<sub>4</sub>NO<sub>3</sub>. Under 20 mM NH<sub>4</sub>NO<sub>3</sub>, the nigtQ line bolted at earlier days than WT; under 2 mM NH<sub>4</sub>NO<sub>3</sub>, nigtQ and WT bolted at almost same timing (Fig. S17D and E). This result suggests that the nigtQ mutation affects flowering timing depending on nitrogen nutrient status. However, leaf numbers of bolted plants were not different between WT and nigtQ lines (Fig. S17E). Therefore, it appears that nigtQ mutation also accelerated overall growth of plants rather than flowering promotion. We also have measured flowering time by counting leaf numbers of the nigtQ and WT plants at bolting on nitrogen-rich soil. The mutant generated slightly more leaves than WT when they flowered (Fig. S17G). These results suggest that the NIGT-derived fine-tuning of FT regulation is conditional on higher nitrogen conditions. 

      Minor: 

      (1) Abstract: "Our bulk nuclei RNA-seq demonstrated that FT-expressing cells in cotyledons and in true leaves differed transcriptionally.". This sentence is not informative. What exactly is the difference in FT-expressing cells between cotyledons and true leaves? 

      We modified the sentence to clarify the differences between cotyledons and true leaves. “Our bulk nuclei RNA-seq demonstrated that FT-expressing cells in cotyledons and true leaves showed differences especially in FT repressor genes.”

      (2) As a standard practice, to support the direct regulation of FT by NIGT1, the authors should provide EMSA and ChIP-seq data. Ideally, they should also generate promoter constructs with deletions or mutations in the NIGT1 binding sites. 

      To test direct interaction of NIGT1 to the FT promoter sequences, we performed the transient reporter assay using FT promoter driven luciferase reporter (Fig. 5C). NIGT1.2 and NIGT1.4 repressed the FT promoter activity; however, with NIGT1 binding site mutations, this repression was not observed, indicating that NIGT1 binds to the ciselements in the FT promoter to repress its transcription.

      (3) Sorting: Did the authors fix the samples before preparing the nuclei suspension? If not, could this be the reason the authors observed the JA-responsive clusters (Fig. 2J)? Please provide more details related to nuclei sorting in the Methods section. 

      We added a new subsection in the Materials and Methods section to explain a detail of the nuclei sorting procedure. We did not include a sample fixation step. We have tried formaldehyde fixation; however, it clumped nuclei, which was not suitable for snRNA-seq. Moreover, fixation steps generally reduce readcounts of single-cell RNA-seq according to the 10X Genomics’ guideline.

      We agree that JA responses were triggered during the FANS nuclei isolation. Therefore, we added the following sentence. “Since our FANS protocol did not include a sample fixation step to avoid clumping, these cells likely triggered wounding responses during the chopping and sorting process (Fig. S1B).  

      Reviewer #2 (Public review): 

      This manuscript submitted by Takagi et al. details the molecular characterization of the FTexpressing cell at a single-cell level. The authors examined what genes are expressed specifically in FT-expressing cells and other phloem companion cells by exploiting bulk nuclei and single-nuclei RNA-seq and transgenic analysis. The authors found the unique expression profile of FT-expressing cells at a single-cell level and identified new transcriptional repressors of FT such as NIGT1.2 and NIGT1.4. 

      Although previous researchers have known that FT is expressed in phloem companion cells, they have tended to neglect the molecular characterization of the FT-expressing phloem companion cells. To understand how FT, which is expressed in tiny amounts in phloem companion cells that make up a very small portion of the leaf, can be a key molecule in the regulation of the critical developmental step of floral transition, it is important to understand the molecular features of FT-expressing cells in detail. In this regard, this manuscript provides insight into the understanding of detailed molecular characteristics of the FT-expressing cell. This endeavor will contribute to the research field of flowering time. 

      We are grateful that reviewer 2 recognizes the importance of transcriptome profiling of FTexpressing cells at the single-cell level.

      Here are my comments on how to improve this manuscript. 

      (1) The most noble finding of this manuscript is the identification of NTGI1.2 as the upstream regulator of FT-expressing cluster 7 gene expression. The flowering phenotypes of the nigtQ mutant and the transgenic plants in which NIGT1.2 was expressed under the SUC2 gene promoter support that NIGT1.2 functions as a floral repressor upstream of the FT gene. Nevertheless, the expression patterns of NIGT1.2 genes do not appear to have much overlap with those of NIGT1.2-downstream genes in the cluster 7 (Figs S14 and F3). An explanation for this should be provided in the discussion section. 

      We agree with reviewer 2 that the spatial expression patterns of NIGT1.2 and cluster 7 genes do not overlap much, and some discussion should be provided in the manuscript. Although we do not have a concrete answer for this phenomenon, we obtained the new data showing that NIGT1.2 and NIGT1.4 directly repress the FT gene in planta (Fig. 5C).  As NIGT1.2/1.4 are negative regulators of FT, it is plausible that NIGT1.2/1.4 may suppress FT gene expression in non-cluster 7 cells to prevent the misexpression of FT. We added this point in the Results section.

      (2) To investigate gene expression in the nuclei of specific cell populations, the authors generated transgenic plants expressing a fusion gene encoding a Nuclear Targeting Fusion protein (NTF) under the control of various cell type-specific promoters. Since the public audience would not know about NTF without reading reference 16, some explanation of NTF is necessary in the manuscript. Please provide a schematic of constructs the authors used to make the transformants.

      As reviewer 2 pointed out, we lacked a clear explanation of why we used NTF in this study. NTF is the fusion protein that consists of a nuclear envelope targeting WPP domain, GFP, and a biotin acceptor peptide. It was initially designed for the INTACT (isolation of nuclei tagged in specific cell types) method, which enables us to isolate bulk nuclei from specific tissues. Although our original intention was to profile the bulk transcriptome of mRNAs that exist in nuclei of the FT-expressing cells using INTACT, we utilized our NTF transgenic lines for snRNA-seq analysis. To explain what NTF is to readers, we included a schematic diagram of NTF (Fig. S1A) and more explanation about NTF in the Results section.

      Again, we appreciate all reviewers’ careful and constructive comments. With these changes, we hope our revised manuscript is now satisfactory.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Reviewer #1 (Public review): 

      Summary: 

      The study by Klug et al. investigated the pathway specificity of corticostriatal projections, focusing on two cortical regions. Using a G-deleted rabies system in D1-Cre and A2a-Cre mice to retrogradely deliver channelrhodopsin to cortical inputs, the authors found that M1 and MCC inputs to direct and indirect pathway spiny projection neurons (SPNs) are both partially segregated and asymmetrically overlapping. In general, corticostriatal inputs that target indirect pathway SPNs are likely to also target direct pathway SPNs, while inputs targeting direct pathway SPNs are less likely to also target indirect pathway SPNs. Such asymmetric overlap of corticostriatal inputs has important implications for how the cortex itself may determine striatal output. Indeed, the authors provide behavioral evidence that optogenetic activation of M1 or MCC cortical neurons that send axons to either direct or indirect pathway SPNs can have opposite effects on locomotion and different effects on action sequence execution. The conclusions of this study add to our understanding of how cortical activity may influence striatal output and offer important new clues about basal ganglia function. 

      The conceptual conclusions of the manuscript are supported by the data, but the details of the magnitude of afferent overlap and causal role of asymmetric corticostriatal inputs on some behavioral outcomes may be a bit overstated given technical limitations of the experiments. 

      For example, after virally labeling either direct pathway (D1) or indirect pathway (D2) SPNs to optogenetically tag pathway-specific cortical inputs, the authors report that a much larger number of "non-starter" D2-SPNs from D2-SPN labeled mice responded to optogenetic stimulation in slices than "non-starter" D1 SPNs from D1-SPN labeled mice did. Without knowing the relative number of D1 or D2 SPN starters used to label cortical inputs, it is difficult to interpret the exact meaning of the lower number of responsive D2-SPNs in D1 labeled mice (where only ~63% of D1-SPNs themselves respond) compared to the relatively higher number of responsive D1-SPNs (and D2-SPNs) in D2 labeled mice. While relative differences in connectivity certainly suggest that some amount of asymmetric overlap of inputs exists, differences in infection efficiency and ensuing differences in detection sensitivity in slice experiments make determining the degree of asymmetry problematic. 

      It is also unclear if retrograde labeling of D1-SPN- vs D2-SPN- targeting afferents labels the same densities of cortical neurons. This gets to the point of specificity in some of the behavioral experiments. If the target-based labeling strategies used to introduce channelrhodopsin into specific SPN afferents label significantly different numbers of cortical neurons, might the difference in the relative numbers of optogenetically activated cortical neurons itself lead to behavioral differences? 

      We thank the reviewer for the comments and for raising additional interpretations of our results. We agree that determining the relative number of D1- versus D2-SPN starter cells would allow a more accurate estimate of connectivity. However, due to current technical limitations, achieving this level of precision remains challenging. As the reviewer also noted, differences in the number of cortical neurons targeting D1- versus D2-SPNs could introduce additional complexity to the functional effects observed in the behavioral experiments. Moreover, functional heterogeneity is likely to exist not only among cortical neurons projecting to striatal D1- or D2-SPNs, but also within the striatal D1- and D2-SPN populations themselves. Addressing these questions at the single-neuron level will require more refined viral tools in combination with improved recording and manipulation techniques. Despite these limitations, our results suggest that a subpopulation of cortical neurons selectively targets striatal D1-SPNs, supporting a functional dichotomy of pathway-specific corticostriatal subcircuits in the control of behavior.   

      Reviewer #2 (Public review): 

      Summary: 

      Klug et al. use monosynaptic rabies tracing of inputs to D1- vs D2-SPNs in the striatum to study how separate populations of cortical neurons project to D1- and D2-SPNs. They use rabies to express ChR2, then patch D1-or D2-SPNs to measure synaptic input. They report that cortical neurons labeled as D1-SPN-projecting preferentially project to D1-SPNs over D2-SPNs. In contrast, cortical neurons labeled as D2-SPN-projecting project equally to D1- and D2-SPNs. They go on to conduct pathway-specific behavioral stimulation experiments. They compare direct optogenetic stimulation of D1- or D2-SPNs to stimulation of MCC inputs to DMS and M1 inputs to DLS. In three different behavioral assays (open field, intra-cranial self-stimulation, and a fixed ratio 8 task), they show that stimulating MCC or M1 cortical inputs to D1-SPNs is similar to D1-SPN stimulation, but that stimulating MCC or M1 cortical inputs to D2-SPNs does not recapitulate the effects of D2-SPN stimulation (presumably because both D1- and D2-SPNs are being activated by these cortical inputs). 

      Strengths: 

      Showing these same effects in three distinct behaviors is strong. Overall, the functional verification of the consequences of the anatomy is very nice to see. It is a good choice to patch only from mCherry-negative non-starter cells in the striatum. This study adds to our understanding of the logic of corticostriatal connections, suggesting a previously unappreciated structure. 

      Weaknesses: 

      One limitation is that all inputs to SPNs are expressing ChR2, so they cannot distinguish between different cortical subregions during patching experiments. Their results could arise because the same innervation patterns are repeated in many cortical subregions or because some subregions have preferential D1-SPN input while others do not. 

      Thank you for raising this thoughtful concern. It is indeed not feasible to restrict ChR2 expression to a specific cortical region using the first-generation rabies-ChR2 system alone. A more refined approach would involve injecting Cre-dependent TVA and RG into the striatum of D1- or A2A-Cre mice, followed by rabies-Flp infection. Subsequently, a Flp-dependent ChR2 virus could be injected into the MCC or M1 to selectively label D1- or D2-projecting cortical neurons. This strategy would allow for more precise targeting and address many of the current limitations.

      However, a significant challenge lies in the cytotoxicity associated with rabies virus infection. Neuronal health begins to deteriorate substantially around 10 days post-infection, which provides an insufficient window for robust Flp-dependent ChR2 expression. We have tested several new rabies virus variants with extended survival times (Chatterjee et al., 2018; Jin et al., 2024), but unfortunately, they did not perform effectively or suitably in the corticostriatal systems we examined.

      In our experimental design, the aim is to delineate the connectivity probabilities to D1 or D2-SPNs from cortical neurons. Our hypothesis considered includes the possibility that similar innervation patterns could occur across multiple cortical subregions, or that some subregions might show preferential input to D1-SPNs while others do not, or a combination of both scenarios. This leads us to perform a series behavior test that using optogenetic activation of the D1- or D2-projecting cortical populations to see which could be the case.

      In the cortical areas we examined, MCC and M1, during behavioral testing, there is consistency with our electrophysiological results. Specifically, when we stimulated the D1-projecting cortical neurons either in MCC or in M1, mice exhibited facilitated local motion in open field test, which is the same to the activation of D1 SPNs in the striatum along (MCC: Fig 3C & D vs. I; M1: Fig 3F & G vs. L). Conversely, stimulation of D2-projecting MCC or M1 cortical neurons resulted in behavioral effects that appeared to combine characteristics of both D1- and D2-SPNs activation in the striatum (MCC: Fig 3C & D vs. J; M1: Fig 3F & G vs. M). The similar results were observed in the ICSS test. Our interpretation of these results is that the activation of D1-projecting neurons in the cortex induces behavior changes akin to D1 neuron activation, while activation of D2-projecting neurons in the cortex leads to a combined effect of both D1 and D2 neuron activation. This suggests that at least some cortical regions, the ones we tested, follow the hypothesis we proposed.

      There are also some caveats with respect to the efficacy of rabies tracing. Although they only patch non-starter cells in the striatum, only 63% of D1-SPNs receive input from D1-SPN-projecting cortical neurons. It's hard to say whether this is "high" or "low," but one question is how far from the starter cell region they are patching. Without this spatial indication of where the cells that are being patched are relative to the starter population, it is difficult to interpret if the cells being patched are receiving cortical inputs from the same neurons that are projecting to the starter population. The authors indicate they are patching from mCherry-negative neurons within the region of the mCherry-positive neurons, but since the mCherry population will include both true starter cells and monosynaptically connected cells, this is not perfectly precise. Convergence of cortical inputs onto SPNs may vary with distance from the starter cell region quite dramatically, as other mapping studies of corticostriatal inputs have shown specialized local input regions can be defined based on cortical input patterns (Hintiryan et al., Nat Neurosci, 2016, Hunnicutt et al., eLife 2016, Peters et al., Nature, 2021). 

      This is a valid concern regarding anatomical studies. Investigating cortico-striatal connectivity at the single-cell level remains technically challenging due to current methodological limitations. At present, we rely on rabies virus-mediated trans-synaptic retrograde tracing to identify D1- or D2-projecting cortical populations. This anatomical approach is coupled with ex vivo slice electrophysiology to assess the functional connectivity between these projection-defined cortical neurons and striatal SPNs. This enables us to quantify connection ratios, for example, the proportion of D1-projecting cortical neurons that functionally synapse onto non-starter D1-SPNs.

      To ensure the robustness of our conclusions, it is essential that both the starter cells and the recorded non-starter SPNs receive comparable topographical input from the cortex and other brain regions. Therefore, we carefully designed our experiments so that all recorded cells were located within the injection site, were mCherry-negative (i.e., non-starter cells), and were surrounded by ChR2-mCherry-positive neurons. This configuration ensured that the distance between recorded and starter cells did not exceed 100 µm, maintaining close anatomical proximity and thereby preserving the likelihood of shared cortical innervation within the examined circuitry.

      These methodological details are also described in the section on ex vivo brain slice electrophysiology, specifically in the Methods section, lines 453–459:

      “D1-SPNs (eGFP-positive in D1-eGFP mice, or eGFP-negative in D2-eGFP mice) or D2-SPNs (eGFP-positive in D2-eGFP mice, or eGFP-negative in D1-eGFP mice) that were ChR2-mCherry-negative, but in the injection site and surrounded by cells expressing ChR2-mCherry were targeted for recording. This configuration ensured that the distance between recorded and starter cells did not exceed 100 µm, maintaining close anatomical proximity and thereby preserving the likelihood of shared cortical innervation within the examined circuitry.”

      This experimental strategy was implemented to control for potential spatial biases and to enhance the interpretability of our connectivity measurements.

      A caveat for the optogenetic behavioral experiments is that these optogenetic experiments did not include fluorophore-only controls, although a different control (with light delivered in M1) is provided in Supplementary Figure 3. Another point of confusion is that other studies (Cui et al, J Neurosci, 2021) have reported that stimulation of D1-SPNs in DLS inhibits rather than promotes movement. This study may have given different results due to subtly different experimental parameters, including fiber optic placement and NA.

      We appreciate the reviewer’s thoughtful evaluation and comments. We have added a short discussion of Cui et al.’s study on optogenetic stimulation of D1-SPNs in the DLS (lines 341-343), which reports findings that contrast with ours and those of other studies.

      Reviewer #3 (Public review): 

      Review of resubmission: The authors provided a response to the reviews from myself and other reviewers. While some points were made satisfactorily, particularly in clarification of the innervation of cortex to striatum and the effects of input stimulation, many of my points remain unaddressed. In several cases, the authors chose to explain their rationale rather than address the issues at hand. A number of these issues (in fact, the majority) could be addressed simply by toning done the confidence in conclusions, so it was disappointing to see that the authors by and large did not do this. I repeat my concerns below and note whether I find them to have been satisfactorily addressed or not. 

      In the manuscript by Klug and colleagues, the investigators use a rabies virus-based methodology to explore potential differences in connectivity from cortical inputs to the dorsal striatum. They report that the connectivity from cortical inputs onto D1 and D2 MSNs differs in terms of their projections onto the opposing cell type, and use these data to infer that there are differences in cross-talk between cortical cells that project to D1 vs. D2 MSNs. Overall, this manuscript adds to the overall body of work indicating that there are differential functions of different striatal pathways which likely arise at least in part by differences in connectivity that have been difficult to resolve due to difficulty in isolating pathways within striatal connectivity, and several interesting and provocative observations were reported. Several different methodologies are used, with partially convergent results, to support their main points. 

      However, I have significant technical concerns about the manuscript as presented that make it difficult for me to interpret the results of the experiments. My comments are below. 

      Major: 

      There is generally a large caveat to the rabies studies performed here, which is that both TVA and the ChR2-expressing rabies virus have the same fluorophore. It is thus essentially impossible to determine how many starter cells there are, what the efficiency of tracing is, and which part of the striatum is being sampled in any given experiment. This is a major caveat given the spatial topography of the cortico-striatal projections. Furthermore, the authors make a point in the introduction about previous studies not having explored absolute numbers of inputs, yet this is not at all controlled in this study. It could be that their rabies virus simply replicates better in D1-MSNs than D2-MSNs. No quantifications are done, and these possibilities do not appear to have been considered. Without a greater standardization of the rabies experiments across conditions, it is difficult to interpret the results. 

      This is still an issue. The authors point out why they chose various vectors. I can understand why the authors chose the fluorophores etc. that they did, yet the issues I raised previously are still valid. The discussion should mention that this is a potential issue. It does not necessarily invalidate results, but it is an issue. Furthermore, it is possible (in all systems) that rabies replicates better/more efficiently in some cells than others. This is one possible interpretation that has not really been explored in any study. I don't suggest the authors attempt to do that, but it should be raised as a potential interpretation. If the rabies results could mean several different things, the authors owe it to the readership to state all possible interpretations of data.

      We thank the reviewer for the comments and suggestions. Because the same fluorophore (mCherry) was used in both TVA- and ChR2-expressing viruses, it was not possible to distinguish true starter SPNs from TVA-only SPNs or monosynaptically labeled SPNs. This limitation makes it difficult to precisely assess the efficiency of rabies labeling and retrograde tracing in our experimental setup. Moreover, differences in rabies replication efficiency between D1- and D2-SPNs could potentially lead to an apparent lower connection probability from D1-projecting cortical neurons to D2-SPNs than from D2-projecting cortical neurons to D1-SPNs. We have added this clarification to the Discussion (lines 280-297).

      The authors claim using a few current clamp optical stimulation experiments that the cortical cells are healthy, but this result was far from comprehensive. For example, membrane resistance, capacitance, general excitability curves, etc are not reported. In Figure S2, some of the conditions look quite different (e.g., S2B, input D2-record D2, the method used yields quite different results that the authors write off as not different). Furthermore, these experiments do not consider the likely sickness and death that occurs in starter cells, as has been reported elsewhere. Health of cells in the circuit is overall a substantial concern that alone could invalidate a large portion, if not all, of the behavioral results. This is a major confound given those neurons are thought to play critical roles in the behaviors being studied. This is a major reason why first-generation rabies viruses have not been used in combination with behavior, but this significant caveat does not appear to have been considered, and controls e.g., uninfected animals, infected with AAV helpers, etc, were not included. 

      This issue remains unaddressed. I did not request clarity about experimental design, but rather, raised issues about the potential effects of toxicity. I believe this to be a valid concern that needs to be discussed in the manuscript, especially given what look visually like potential differences in S2. 

      We understand and appreciate the reviewer’s concern regarding the potential cytotoxicity of rabies virus infection. Although we performed the in vivo optogenetic behavioral experiments during a period when rabies-infected cells are generally considered relatively healthy, some deficits in starter cells may still occur and could contribute to the observed effects of optogenetic cortical stimulation. We have added this clarification to the Discussion (lines 298-306).

      The overall purity (e.g., EnvA pseudotyping efficiency) of the RABV prep is not shown. If there was a virus that was not well EnvA-pseudotyped and thus could directly infect cortical (or other) inputs, it would degrade specificity. This issue has not been addressed. Viral strain is irrelevant. The quality of the specific preparations used is what matters.

      While most of the study focuses on the cortical inputs, in slice recordings, inputs from the thalamus are not considered, yet likely contribute to the observed results. Related to this, in in vivo optogenetic experiments, technically, if the thalamic or other inputs to the dorsal striatum project to the cortex, their method will not only target cortical neurons but also terminals of other excitatory inputs. If this cannot be ruled it, stating that the authors are able to selectively activate the cortical inputs to one or the other population should be toned down. 

      The authors added text to the discussion to address this point. While it largely does what is intended, based on the one study cited, I disagree with the authors' conclusions that it is "clear" that potential contamination from other sites does not play a role. The simplest interpretation is the one the authors state, and there is some supporting evidence to back up that assertion, but to me that falls short of making the point "clear" that there are no other interpretations. 

      The statements about specificity of connectivity are not well founded. It may be that in the specific case where they are assessing outside of the area of injections, their conclusions may hold (e.g., excitatory inputs onto D2s have more inputs onto D1s than vice versa). However, how this relates to the actual site of injection is not clear. At face value, if such a connectivity exists, it would suggest that D1-MSNs receive substantially more overall excitatory inputs than D2s. It is thus possible that this observation would not hold over other spatial intervals. This was not explored and thus the conclusions are over-generalized. e.g., the distance from the area of red cells in the striatum to recordings was not quantified, what constituted a high level of cortical labeling was not quantified, etc. Without more rigorous quantification of what was being done, it is difficult to interpret the results. 

      Again, the goal here would be to make a statement about this in the discussion to clarify limitations of the study. I don't expect the authors to re-do all of these experiments, but since they are discussing the corticostriatal circuits, which have multiple subdomains, this remains a relevant point. It has not been addressed. 

      The results in Figure 3 are not well controlled. The authors show contrasting effects of optogenetic stimulation of D1-MSNs and D2-MSNs in the DMS and DLS, results which are largely consistent with the canon of basal ganglia function. However, when stimulating cortical inputs, stimulating the inputs from D1-MSNs gives the expected results (increased locomotion) while stimulating putative inputs to D2-MSNs had no effect. This is not the same as showing a decrease in locomotion - showing no effect here is not possible to interpret. 

      I think that the caveat of showing no clear effects of inputs to D2 stimulation should be pointed out. Yes, I understand that the viruses appeared to express etc., but again it remains possible that the results are driven by a lack of e.g., sufficient ChR2 expression. Aside from a full quantification of the number of cells expressing ChR2, overlap in fiber placement and ChR2 expression (which I don't suggest), this remains a possibility and should be pointed out, as it remains a possibility. 

      In the light of their circuit model, the result showing that inputs to D2-MSNs drive ICSS is confusing. How can the authors account for the fact that these cells are not locomotor-activating, stimulation of their putative downstream cells (D2-MSNs) does not drive ICSS, yet the cortical inputs drive ICSS? Is the idea that these inputs somehow also drive D1s? If this is the case, how do D2s get activated, if all of the cortical inputs tested net activate D1s and not D2s? Same with the results in Figure 4 - the inputs and putative downstream cells do not have the same effects. Given potential caveats of differences in viral efficiency, spatial location of injections, and cellular toxicity, I cannot interpret these experiments. 

      The explanation the authors provide in their rebuttal makes sense, however this should be included in the discussion of the manuscript, as it is interesting and relevant. 

      We thank the reviewer for the valuable comments and suggestions. In line with the reviewer’s recommendation, we have incorporated these explanations into the Discussion (lines 242–279) to help interpret the complex behavioral outcomes of optogenetic stimulation of cortical neurons projecting to D1- or D2-SPNs.

      Reviewer #2 (Recommendations for the authors): 

      I appreciate the authors' responses, which helped clarify some experimental choices. I appreciate that the experiment in Fig S3 serves as a reasonable light control for optogenetics experiments. The careful comparison with methods in Cui et al (2021) is useful, although not added to the main manuscript. Some of the other citations here don't really address the controversy, e.g. Kravitz at al is in DMS, but perhaps fully addressing this issue is outside the scope of the current manuscript and awaits further experiments. I also appreciate the clarification for recording locations that "This configuration ensured that the distance between recorded and starter cells did not exceed 100 µm, maintaining close anatomical proximity and thereby preserving the likelihood of shared cortical innervation within the examined circuitry." However, the statement in the reviewer response does not seem to be added to the manuscript's methods, which I think would be helpful. The criteria for choosing recorded cells are still a bit fuzzy without a map of recording locations and histology. There is also a problem that mCherry-positive cells could be starter cells or could be monosynaptically traced cells, so it is hard to know the area of the starter cell population in these experiments for sure. My evaluation of the manuscript remains largely the same as the original. However, I have adjusted my public review a bit to incorporate the authors' responses. I still think this paper has valuable information, suggesting an interesting and previously unappreciated structure of corticostriatal inputs that I hope this group and others will continue to investigate and incorporate into models of basal ganglia function.

      We thank the reviewer for the valuable suggestions. We have now included a comparison with Cui et al. in the Discussion. In addition, we have added the criteria for selecting recorded cells to the Methods section: ‘This configuration ensured that the distance between recorded and starter cells did not exceed 100 µm, maintaining close anatomical proximity and thereby preserving the likelihood of shared cortical innervation within the examined circuitry.’

    1. Reviewer #3 (Public review):

      This paper introduces the Avian Vocalization Network (AVN), a novel birdsong analysis pipeline using deep learning. By automating vocal annotation tasks, the AVN generates interpretable song features and song similarity scores on novel datasets without retraining. The performance of the network is solid and is comparable to that of human annotators.

      The authors have improved the manuscript in several aspects, such as the comparison with the Goffinet work. Overall, the AVN feature set could become a useful tool for evaluating birdsongs. But the authors also chose not to address a certain number of criticisms, and some issues remain poorly addressed, and the work is not reproducible at this stage. With a little effort, these issues could get resolved in my view. I will just pick on four issues that I think can be easily addressed:

      (1) Limitation of feature set: They claim that AVN satisfies the criteria (line 60) of "creating a common feature space for the comparison of behavioural phenotypes ..."(line 51), but then on LDA analysis, explained on line 910 they say "excluding amplitude and amplitude modulation features as they were found to vary". Since their feature set is not stable and not truly 'common' to all tasks, this limitation needs addressing in the discussion (that some features seem to vary undesirably, and they need exclusion based on some criteria to be defined).

      (2) Missing information on classification training loss: The Authors insist that their triplet loss is not related to classification, and they brush off my request for more information. In their rebuttal, they write: 'The loss function is related to the relative distance between embeddings of syllables with the same or different labels, not the classification of syllables as same or different.' Perplexingly, however, in the revised paper, authors speak themselves of 'classes', in Line 1004: this allows the model to begin learning an easier task, of separating syllables of different classes by a smaller margin.' So it seems the authors actually agree with me that there is an underlying classification task. I am therefore going to make it a bit more explicit here what I'm asking for, hoping this will better resonate with them.

      In line 984 they define their loss function and in lines 994-996 they define 'hard' and 'semi-hard' triplets. Authors then train a system to minimize the loss with a ratio of 75 percent semi-hard triplets and 25 percent hard triplets and a final weighing parameter value alpha=0.7. What I'm asking for is this 'classification' loss their trained model achieves, or in other words, the fraction of triplets that end up producing a loss, either of the 'hard' or 'semi-hard' type. For example, if their model manages to separate all 'possible triplets' by a margin of at least alpha, then the loss would be zero. If the model achieves to separate all triplets except one, then the loss would correspond to the amount by which the separation differences between the anchor and the positive vs negative samples exceeds alpha. So, an important number to provide in the paper is the fraction of triplets that incur a nonzero loss, i.e., the fraction of semi-hard triplets. And another important quantity is the fraction of hard triplets, i.e. the fraction of triplets that would incur a loss if alpha were set to zero, or, in other words, the triplets for which the negative sample is closer to the anchor than the positive sample. By the way, I assume this latter fraction of hard cases will be zero - that their model does not confuse any positive and negative training samples...<br /> Note: the quantification chosen by the authors termed 'contrast index' is interesting, but it is a derived quantity, it is not the quantity authors chose to optimize during training. If authors were to report both the training loss achieved and the 'contrast index', follow-up work could be benchmarked against both these quantities. If for example, a follow-up model achieves smaller loss but worse contrast, then the loss is not a good placeholder measure for optimizing contrast. Alternatively, follow-up work could focus on the contrast index as training objective, obliterating the need for the triplet loss as an intermediate step (I don't buy the authors' argument that such an optimization would be infeasible).

      (3) Reproducibility: they explain the way they train the CNN with triplet loss to produce the embeddings, but we're missing both actual scripts on GitHub to train and inference from scratch, and model weights, or even hyper parameters they used. Authors only provide the architecture, and I don't think that's enough to be considered replicable in today's standards. I would suggest they release complete model checkpoint weights for the result they report, the exact data splits, the hyper parameters they used and training and testing code, so that one can very easily verify their claims and apply their methods to other datasets. Note: for example, the code to extract the embeddings is incomplete (the function definition of single_bird_extract_embeddings cannot be found on GitHub) and the model weights they used are missing.

      (4) With regards to the age prediction model, the authors should specify that this model is mainly useful for comparisons across studies but less so for precise evaluation of the effects of a treatment within a study. Namely, the effect on song of a treatment is best assessed by comparison to within-subject past song, and by comparison to age-matched control birds (ideally siblings) raised in identical conditions, rather than to invoke a generic model trained on other birds and from different colonies and breeding conditions as authors propose to do. In other words, to introduce a generic model for evaluation of song maturity introduces measurement noise in terms of the additional birds and their variable conditions, which can hinder precise assessment of treatment effects. Note that to state that in past work such maturity models were used is not a good justification, scientifically speaking.

      Finally, the authors write that methods for syllable segmentation have not been systematically compared but the whisperseg work they use did such a comparison. So the authors should revise their novelty claim of being the first to compare syllable segmentation methods.

    2. Author Response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary: 

      This paper applies methods for segmentation, annotation, and visualization of acoustic analysis to zebra finch song. The paper shows that these methods can be used to predict the stage of song development and to quantify acoustic similarity. The methods are solid and are likely to provide a useful tool for scientists aiming to label large datasets of zebra finch vocalizations. The paper has two main parts: 1) establishing a pipeline/ package for analyzing zebra finch birdsong and 2) a method for measuring song imitation. 

      Strengths: 

      It is useful to see existing methods for syllable segmentation compared to new datasets.

      It is useful, but not surprising, that these methods can be used to predict developmental stage, which is strongly associated with syllable temporal structure.

      It is useful to confirm that these methods can identify abnormalities in deafened and isolated songs. 

      Weaknesses: 

      For the first part, the implementation seems to be a wrapper on existing techniques. For instance, the first section talks about syllable segmentation; they made a comparison between whisperseg (Gu et al, 2024), tweetynet (Cohen et al, 2022), and amplitude thresholding. They found that whisperseg performed the best, and they included it in the pipeline. They then used whisperseg to analyze syllable duration distributions and rhythm of birds of different ages and confirmed past findings on this developmental process (e.g. Aronov et al, 2011). Next, based on the segmentation, they assign labels by performing UMAP and HDBScan on the spectrogram (nothing new; that's what people have been doing). Then, based on the labels, they claimed they developed a 'new' visualization - syntax raster ( line 180 ). That was done by Sainburg et. al. 2020 in Figure 12E and also in Cohen et al, 2020 - so the claim to have developed 'a new song syntax visualization' is confusing. The rest of the paper is about analyzing the finch data based on AVN features (which are essentially acoustic features already in the classic literature). 

      First, we would like to thank this reviewer for their kind comments and feedback on this manuscript. It is true that many of the components of this song analysis pipeline are not entirely novel in isolation. Our real contribution here is bringing them together in a way that allows other researchers to seamlessly apply automated syllable segmentation, clustering, and downstream analyses to their data. That said, our approach to training TweetyNet for syllable segmentation is novel. We trained TweetyNet to recognize vocalizations vs. silence across multiple birds, such that it can generalize to new individual birds, whereas Tweetynet had only ever been used to annotate song syllables from birds included in its training set previously. Our validation of TweetyNet and WhisperSeg in combination with UMAP and HDBSCAN clustering is also novel, providing valuable information about how these systems interact, and how reliable the completely automatically generated labels are for downstream analysis. We have added a couple sentences to the introduction to emphasize the novelty of this approach and validation.

      Our syntax raster visualization does resemble Figure 12E in Sainburg et al. 2020, however it differs in a few important ways, which we believe warrant its consideration as a novel visualization method. First, Sainburg et al. represent the labels across bouts in real time; their position along the x axis reflects the time at which each syllable is produced relative to the start of the bout. By contrast, our visualization considers only the index of syllables within a bout (ie. First syllable vs. second syllable etc) without consideration of the true durations of each syllable or the silent gaps between them. This makes it much easier to detect syntax patterns across bouts, as the added variability of syllable timing is removed. Considering only the sequence of syllables rather than their timing also allows us to more easily align bouts according to the first syllable of a motif, further emphasizing the presence or absence of repeating syllable sequences without interference from the more variable introductory notes at the start of a motif. Finally, instead of plotting all bouts in the order in which they were produced, our visualization orders bouts such that bouts with the same sequence of syllables will be plotted together, which again serves to emphasize the most common syllable sequences that the bird produces. These additional processing steps mean that our syntax raster plot has much starker contrast between birds with stereotyped syntax and birds with more variable syntax, as compared to the more minimally processed visualization in Sainburg et al. 2020. There doesn’t appear to be any similar visualizations in Cohen et al. 2020. 

      The second part may be something new, but there are opportunities to improve the benchmarking. It is about the pupil-tutor imitation analysis. They introduce a convolutional neural network that takes triplets as an input (each tripled is essentially 3 images stacked together such that you have (anchor, positive, negative), Anchor is a reference spectrogram from, say finch A; positive means a different spectrogram with the same label as anchor from finch A, and negative means a spectrogram not related to A or different syllable label from A. The network is then trained to produce a low-dimensional embedding by ensuring the embedding distance between anchor and positive is less than anchor and negative by a certain margin. Based on the embedding, they then made use of earth mover distance to quantify the similarity in the syllable distribution among finches. They then compared their approach performance with that of sound analysis pro (SAP) and a variant of SAP. A more natural comparison, which they didn't include, is with the VAE approach by Goffinet et al. In this paper (https://doi.org/10.7554/eLife.67855, Fig 7), they also attempted to perform an analysis on the tutor pupil song.  

      We thank the reviewer for this suggestion. We have included a comparison of our triplet loss embedding model to the VAE model proposed in Goffinet et al. 2021. We also included comparisons of similarity scoring using each of these embedding models combined with either earth mover’s distance (EMD) or maximum mean discrepancy (MMD) to calculate the similarity of the embeddings, as was done in Goffinet et al. 2021. As discussed in the updated results section of the paper and shown in the new Figure 6–figure supplement 1, the Triplet loss model with MMD performs best for evaluating song learning on new birds, not included in model training. We’ve updated the main text of the paper to reflect this switch from EMD to MMD for the primary similarity scoring approach.

      Reviewer #2 (Public Review):

      Summary: 

      In this work, the authors present a new Python software package, Avian Vocalization Network (AVN) aimed at facilitating the analysis of birdsong, especially the song of the zebra finch, the most common songbird model in neuroscience. The package handles some of the most common (and some more advanced) song analyses, including segmentation, syllable classification, featurization of song, calculation of tutor-pupil similarity, and age prediction, with a view toward making the entire process friendlier to experimentalists working in the field.

      For many years, Sound Analysis Pro has served as a standard in the songbird field, the first package to extensively automate songbird analysis and facilitate the computation of acoustic features that have helped define the field. More recently, the increasing popularity of Python as a language, along with the emergence of new machine learning methods, has resulted in a number of new software tools, including the vocalpy ecosystem for audio processing, TweetyNet (for segmentation), t-SNE and UMAP (for visualization), and autoencoder-based approaches for embedding.

      Strengths: 

      The AVN package overlaps several of these earlier efforts, albeit with a focus on more traditional featurization that many experimentalists may find more interpretable than deep learning-based approaches. Among the strengths of the paper are its clarity in explaining the several analyses it facilitates, along with high-quality experiments across multiple public datasets collected from different research groups. As a software package, it is open source, installable via the pip Python package manager, and features high-quality documentation, as well as tutorials. For experimentalists who wish to replicate any of the analyses from the paper, the package is likely to be a useful time saver.

      Weaknesses: 

      I think the potential limitations of the work are predominantly on the software end, with one or two quibbles about the methods.

      First, the software: it's important to note that the package is trying to do many things, of which it is likely to do several well and few comprehensively. Rather than a package that presents a number of new analyses or a new analysis framework, it is more a codification of recipes, some of which are reimplementations of existing work (SAP features), some of which are essentially wrappers around other work (interfacing with WhisperSeg segmentations), and some of which are new (similarity scoring). All of this has value, but in my estimation, it has less value as part of a standalone package and potentially much more as part of an ecosystem like vocalpy that is undergoing continuous development and has long-term support. 

      We appreciate this reviewer’s comments and concerns about the structure of the AVN package and its long-term maintenance. We have considered incorporating AVN into the VocalPy ecosystem but have chosen not to for a few key reasons. (1) AVN was designed with ease of use for experimenters with limited coding experience top of mind. VocalPy provides excellent resources for researchers with some familiarity with object-oriented programming to manage and analyze their datasets; however, we believe it may be challenging for users without such experience to adopt VocalPy quickly. AVN’s ‘recipe’ approach, as you put it, is very easily accessible to new users, and allows users with intermediate coding experience to easily navigate the source code to gain a deeper understanding of the methodology. AVN also consistently outputs processed data in familiar formats (tables in .csv files which can be opened in excel), in an effort to make it more accessible to new users, something which would be challenging to reconcile with VocalPy’s emphasis on their `dataset`classes. (2) AVN and VocalPy differ in their underlying goals and philosophies when it comes to flexibility vs. standardization of analysis pipelines. VocalPy is designed to facilitate mixing-and-matching of different spectrogram generation, segmentation, annotation etc. approaches, so that researchers can design and implement their own custom analysis pipelines. This flexibility is useful in many cases. For instance, it could allow researchers who have very different noise filtering and annotation needs, like those working with field recordings versus acoustic chamber recordings, to analyze their data using this platform. However, when it comes to comparisons across zebra finch research labs, this flexibility comes at the expense of direct comparison and integration of song features across research groups. This is the context in which AVN is most useful. It presents a single approach to song segmentation, labeling, and featurization that has been shown to generalize well across research groups, and which allows direct comparisons of the resulting features. AVN’s single, extensively validated, standard pipeline approach is fundamentally incompatible with VocalPy’s emphasis on flexibility. We are excited to see how VocalPy continues to evolve in the future, and recognize the value that both AVN and VocalPy bring to the songbird research community, each with their own distinct strengths, weaknesses, and ideal use cases. 

      While the code is well-documented, including web-based documentation for both the core package and the GUI, the latter is available only on Windows, which might limit the scope of adoption. 

      We thank the reviewer for their kind words about AVN’s documentation. We recognize that the GUI’s exclusive availability on Windows is a limitation, and we would be happy to collaborate with other researchers and developers in the future to build a Mac compatible version, should the demand present itself. That said, the python package works on all operating systems, so non-Windows users still have the ability to use AVN that way.

      That is to say, whether AVN is adopted by the field in the medium term will have much more to do with the quality of its maintenance and responsiveness to users than any particular feature, but I believe that many of the analysis recipes that the authors have carefully worked out may find their way into other code and workflows. 

      Second, two notes about new analysis approaches:

      (1) The authors propose a new means of measuring tutor-pupil similarity based on first learning a latent space of syllables via a self-supervised learning (SSL) scheme and then using the earth mover's distance (EMD) to calculate transport costs between the distributions of tutors' and pupils' syllables. While to my knowledge this exact method has not previously been proposed in birdsong, I suspect it is unlikely to differ substantially from the approach of autoencoding followed by MMD used in the Goffinet et al. paper. That is, SSL, like the autoencoder, is a latent space learning approach, and EMD, like MMD, is an integral probability metric that measures discrepancies between two distributions. (Indeed, the two are very closely related: https://stats.stackexchange.com/questions/400180/earth-movers-distance-andmaximum-mean-discrepency.) Without further experiments, it is hard to tell whether these two approaches differ meaningfully. Likewise, while the authors have trained on a large corpus of syllables to define their latent space in a way that generalizes to new birds, it is unclear why such an approach would not work with other latent space learning methods.  

      We recognize the similarities between these approaches and have included comparisons of the VAE and MMD as in the Goffinet paper to our triplet loss model and EMD.  As discussed in the updated results section of the paper and shown in the new Figure 6–figure supplement 1, the Triplet loss model with MMD performs best for evaluating song learning on new birds, not included in model training. We’ve updated the main text of the paper to reflect this switch from EMD to MMD for the primary similarity scoring approach. 

      (2) The authors propose a new method for maturity scoring by training a model (a generalized additive model) to predict the age of the bird based on a selected subset of acoustic features. This is distinct from the "predicted age" approach of Brudner, Pearson, and Mooney, which predicts based on a latent representation rather than specific features, and the GAM nicely segregates the contribution of each. As such, this approach may be preferred by many users who appreciate its interpretability.  

      In summary, my view is that this is a nice paper detailing a well-executed piece of software whose future impact will be determined by the degree of support and maintenance it receives from others over the near and medium term.

      Reviewer #3 (Public Review):

      Summary: 

      The authors invent song and syllable discrimination tasks they use to train deep networks. These networks they then use as a basis for routine song analysis and song evaluation tasks. For the analysis, they consider both data from their own colony and from another colony the network has not seen during training. They validate the analysis scores of the network against expert human annotators, achieving a correlation of 80-90%. 

      Strengths: 

      (1) Robust Validation and Generalizability: The authors demonstrate a good performance of the AVN across various datasets, including individuals exhibiting deviant behavior. This extensive validation underscores the system's usefulness and broad applicability to zebra finch song analysis, establishing it as a potentially valuable tool for researchers in the field.

      (2) Comprehensive and Standardized Feature Analysis: AVN integrates a comprehensive set of interpretable features commonly used in the study of bird songs. By standardizing the feature extraction method, the AVN facilitates comparative research, allowing for consistent interpretation and comparison of vocal behavior across studies.

      (3) Automation and Ease of Use. By being fully automated, the method is straightforward to apply and should introduce barely an adoption threshold to other labs.

      (4) Human experts were recruited to perform extensive annotations (of vocal segments and of song similarity scores). These annotations released as public datasets are potentially very valuable. 

      Weaknesses: 

      (1) Poorly motivated tasks. The approach is poorly motivated and many assumptions come across as arbitrary. For example, the authors implicitly assume that the task of birdsong comparison is best achieved by a system that optimally discriminates between typical, deaf, and isolated songs. Similarly, the authors assume that song development is best tracked using a system that optimally estimates the age of a bird given its song. My issue is that these are fake tasks since clearly, researchers will know whether a bird is an isolated or a deaf bird, and they will also know the age of a bird, so no machine learning is needed to solve these tasks. Yet, the authors imagine that solving these placeholder tasks will somehow help with measuring important aspects of vocal behavior.  

      We appreciate this reviewer’s concerns and apologize for not providing sufficiently clear rationale for the inclusion of our phenotype classifier and age regression models in the original manuscript. These tasks are not intended to be taken as a final, ultimate culmination of the AVN pipeline. Rather, we consider the carefully engineered 55-interpretable feature set to be AVN’s final output, and these analyses serve merely as examples of how that feature set can be applied. That said, each of these models do have valid experimental use cases that we believe are important and would like to bring to the attention of the reviewer.

      For one, we showed how the LDA model that can discriminate between typical, deaf, and isolate birds’ songs not only allows us to evaluate which features are most important for discriminating between these groups, but also allows comparison of the FoxP1 knock-down (FP1 KD) birds to each of these phenotypes. Based on previous work (Garcia-Oscos et al. 2021), we hypothesized that FP1 KD in these birds specifically impaired tutor song memory formation while sparing a bird’s ability to refine their own vocalizations through auditory feedback. Thus, we would expect their songs to resemble those of isolate birds, who lack a tutor song memory, but not to resemble deaf birds who lack a tutor song memory and auditory feedback of their own vocalizations to guide learning. The LDA model allowed us to make this comparison quantitatively for the first time and confirm our hypothesis that FP1 KD birds’ songs are indeed most like isolates’. In the future, as more research groups publish their birds’ AVN feature sets, we hope to be able to make even more fine-grained comparisons between different groups of birds, either using LDA or other similar interpretable classifiers. 

      The age prediction model also has valid real-world use cases. For instance, one might imagine an experimental manipulation that is hypothesized to accelerate or slow song maturation in juvenile birds. This age prediction model could be applied to the AVN feature sets of birds having undergone such a manipulation to determine whether their predicted ages systematically lead or lag their true biological ages, and which song features are most responsible for this difference. We didn’t have access to data for any such birds for inclusion in this paper, but we hope that others in the future will be able to take inspiration from our methodology and use this or a similar age regression model with AVN features in their research. We have added a couple lines to the ‘Comparing Song Disruptions with AVN Features’ and ‘Tracking Song Development with AVN Features’ sections of the results to make this more clear. 

      Along similar lines, authors assume that a good measure of similarity is one that optimally performs repeated syllable detection (i.e. to discriminate same syllable pairs from different pairs). The authors need to explain why they think these placeholder tasks are good and why no better task can be defined that more closely captures what researchers want to measure. Note: the standard tasks for self-supervised learning are next word or masked word prediction, why are these not used here? 

      This reviewer appears to have misunderstood our similarity scoring embedding model and our rationale for using it. We will explain it in more depth here and have added a paragraph to the ‘Measuring Song Imitation’ section of the results explaining this rationale more briefly.

      First, nowhere are we training a model to discriminate between same and different syllable pairs. The triplet loss network is trained to embed syllables in an 8-dimensional space such that syllables with the same label are closer together than syllables with different labels. The loss function is related to the relative distance between embeddings of syllables with the same or different labels, not the classification of syllables as same or different. This approach was chosen because it has repeatedly been shown to be a useful data compression step (Schorff et al. 2015, Thakur et al. 2019) before further downstream tasks are applied on its output, particularly in contexts where there is little data per class (syllable label). For example, Schorff et al. 2015 trained a deep convolutional neural network with triplet loss to embed images of human faces from the same individual closer together than images of different individuals in a 128dimensional space. They then used this model to compute 128-dimensional representations of additional face images, not included in training, which were used for individual facial recognition (this is a same vs. different category classifier), and facial clustering, achieving better performance than the previous state of the art. The triplet loss function results in a model that can generate useful embeddings of previously unseen categories, like new individuals’ faces, or new zebra finches’ syllables, which can then be used in downstream analyses. This meaningful, lower dimensional space allows comparisons of distributions of syllables across birds, as in Brainard and Mets 2008, and Goffinet et al. 2021. 

      Next word and masked word prediction are indeed common self-supervised learning tasks for models working with text data, or other data with meaningful sequential organization. That is not the case for our zebra finch syllables, where every bird’s syllable sequence depends only on its tutor’s sequence, and there is no evidence for strong universal syllable sequencing rules (James et al. 2020). Rather, our embedding model is an example of a computer vision task, as it deals with sets of two-dimensional images (spectrograms), not sequences of categorical variables (like text). It is also not, strictly speaking, a selfsupervised learning task, as it does require syllable labels to generate the triplets. A common selfsupervised approach for dimensionality reduction in a computer vision task such as this one would be to train an autoencoder to compress images to a lower dimensional space, then faithfully reconstruct them from the compressed representation.  This has been done using a variational autoencoder trained on zebra finch syllables in Goffinet et al. 2021. In keeping with the suggestions from reviewers #1 and #2, we have included a comparison of our triplet loss model with the Goffinet et al. VAE approach in the revised manuscript. 

      (2) The machine learning methodology lacks rigor. The aims of the machine learning pipeline are extremely vague and keep changing like a moving target. Mainly, the deep networks are trained on some tasks but then authors evaluate their performance on different, disconnected tasks. For example, they train both the birdsong comparison method (L263+) and the song similarity method (L318+) on classification tasks. However, they evaluate the former method (LDA) on classification accuracy, but the latter (8-dim embeddings) using a contrast index. In machine learning, usually, a useful task is first defined, then the system is trained on it and then tested on a held-out dataset. If the sensitivity index is important, why does it not serve as a cost function for training?

      Again, this reviewer seems not to understand our similarity scoring methodology. Our similarity scoring model is not trained on a classification task, but rather on an embedding task. It learns to embed spectrograms of syllables in an 8-dimensional space such that syllables with the same label are closer together than syllables with different labels. We could report the loss values for this embedding task on our training and validation datasets, but these wouldn’t have any clear relevance to the downstream task of syllable distribution comparison where we are using the model’s embeddings. We report the contrast index as this has direct relevance to the actual application of the model and allows comparisons to other similarity scoring methods, something that the triplet loss values wouldn’t allow. 

      The triplet loss method was chosen because it has been shown to yield useful low-dimensional representations of data, even in cases where there is limited labeled training data (Thakur et al. 2019). While we have one of the largest manually annotated datasets of zebra finch songs, it is still quite small by industry deep learning standards, which is why we chose a method that would perform well given the size of our dataset. Training a model on a contrast index directly would be extremely computationally intensive and require many more pairs of birds with known relationships than we currently have access to. It could be an interesting approach to take in the future, but one that would be unlikely to perform well with a dataset size typical to songbird research. 

      Also, usually, in solid machine learning work, diverse methods are compared against each other to identify their relative strengths. The paper contains almost none of this, e.g. authors examined only one clustering method (HDBSCAN).  

      We did compare multiple methods for syllable segmentation (WhisperSeg, TweetyNet, and Amplitude thresholding) as this hadn’t been done previously. We chose not to perform extensive comparison of different clustering methods as Sainburg et al. 2020 already did so and we felt no need to reduplicate this effort. We encourage this reviewer to refer to Sainburg et al.’s excellent work for comparisons of multiple clustering methods applied to zebra finch song syllables.

      (3) Performance issues. The authors want to 'simplify large-scale behavioral analysis' but it seems they want to do that at a high cost. (Gu et al 2023) achieved syllable scores above 0.99 for adults, which is much larger than the average score of 0.88 achieved here (L121). Similarly, the syllable scores in (Cohen et al 2022) are above 94% (their error rates are below 6%, albeit in Bengalese finches, not zebra finches), which is also better than here. Why is the performance of AVN so low? The low scores of AVN argue in favor of some human labeling and training on each bird.  

      Firstly, the syllable error rate scores reported in Cohen et al. 2022 are calculated very differently than the F1 scores we report here and are based on a model trained with data from the same bird as was used in testing, unlike our more general segmentation approach where the model was tested on different birds than were used in training. Thus, the scores reported in Cohen et al. and the F1 scores that we report cannot be compared. 

      The discrepancy between the F1<sub>seg</sub> scores reported in Gu et al. 2023 and the segmentation F1 scores that we report are likely due to differences in the underlying datasets. Our UTSW recordings tend to have higher levels of both stationary and non-stationary background noise, which make segmentation more challenging. The recordings from Rockefeller were less contaminated by background noise, and they resulted in slightly higher F1 scores. That said, we believe that the primary factor accounting for this difference in scores with Gu et al. 2023 is the granularity of our ‘ground truth’ syllable segments. In our case, if there was never any ambiguity as to whether vocal elements should be segmented into two short syllables with a very short gap between them or merged into a single longer syllable, we chose to split them. WhisperSeg had a strong tendency to merge the vocal elements in ambiguous cases such as these. This results in a higher rate of false negative syllable onset detections, reflected in the low recall scores achieved by WhisperSeg (see Figure 2–figure supplement 1b), but still very high precision scores (Figure 2–figure supplement 1a). While WhisperSeg did frequently merge these syllables in a way that differed from our ground truth segmentation, it did so consistently, meaning it had little impact on downstream measures of syntax entropy (Figure 3c) or syllable duration entropy (Figure 3–figure supplement 2a). It is for that reason that, despite a lower F1 score, we still consider AVN’s automatically generated annotations to be sufficiently accurate for downstream analyses. 

      Should researchers require a higher degree of accuracy and precision with their annotations (for example, to detect very subtle changes in song before and after an acute manipulation) we suggest they turn toward one of the existing tools for supervised song annotation, such as TweetyNet.

      (4) Texas bias. It is true that comparability across datasets is enhanced when everyone uses the same code. However, the authors' proposal essentially is to replace the bias between labs with a bias towards birds in Texas. The comparison with Rockefeller birds is nice, but it amounts to merely N=1. If birds in Japanese or European labs have evolved different song repertoires, the AVN might not capture the associated song features in these labs well.  

      We appreciate the author’s concern about a bias toward birds from the UTSW colony. However, this paper shows that despite training (for the similarity scoring) and hyperparameter fitting (for the HDBSCAN clustering) on the UTSW birds, AVN performs as well if not better on birds from Rockefeller than from UTSW. To our knowledge, there are no publicly available datasets of annotated zebra finch songs from labs in Europe or in Asia but we would be happy to validate AVN on such datasets, should they become available. Furthermore, there is no evidence to suggest that there is dramatic drift in zebra finch vocal repertoire between continents which would necessitate such additional validation. While we didn’t have manual annotations for this dataset (which would allow validation of our segmentation and labeling methods), we did apply AVN to recordings shared with us by the Wada lab in Japan, where visual inspection of the resulting annotations suggested comparable accuracy to the UTSW and Rockefeller datasets. 

      (5) The paper lacks an analysis of the balance between labor requirement, generalizability, and optimal performance. For tasks such as segmentation and labeling, fine-tuning for each new dataset could potentially enhance the model's accuracy and performance without compromising comparability. E.g. How many hours does it take to annotate hundred song motifs? How much would the performance of AVN increase if the network were to be retrained on these? The paper should be written in more neutral terms, letting researchers reach their own conclusions about how much manual labor they want to put into their data.  

      With standardization and ease of use in mind, we designed AVN specifically to perform fully automated syllable annotation and downstream feature calculations. We believe that we have demonstrated in this manuscript that our fully automated approach is sufficiently reliable for downstream analyses across multiple zebra finch colonies. That said, if researchers require an even higher degree of annotation precision and accuracy, they can turn toward one of the existing methods for supervised song annotation, such as TweetyNet. Incorporating human annotations for each bird processed by AVN is likely to improve its performance, but this would require significant changes to AVN’s methodology, and is outside the scope of our current efforts.

      (6) Full automation may not be everyone's wish. For example, given the highly stereotyped zebra finch songs, it is conceivable that some syllables are consistently mis-segmented or misclassified. Researchers may want to be able to correct such errors, which essentially amounts to fine-tuning AVN. Conceivably, researchers may want to retrain a network like the AVN on their own birds, to obtain a more fine-grained discriminative method.  

      Other methods exist for supervised or human-in-the-loop annotation of zebra finch songs, such as TweetyNet and DAN (Alam et al. 2023). We invite researchers who require a higher degree of accuracy than AVN can provide to explore these alternative approaches for song annotation. Incorporating human feedback into AVN was never the goal of our pipeline, would require significant changes to AVN’s design and is outside the scope of this manuscript.

      (7) The analysis is restricted to song syllables and fails to include calls. No rationale is given for the omission of calls. Also, it is not clear how the analysis deals with repeated syllables in a motif, whether they are treated as two-syllable types or one.  

      It is true that we don’t currently have any dedicated features to describe calls. This could be a useful addition to AVN in the future. 

      What a human expert inspecting a spectrogram would typically call ‘repeated syllables’ in a bout are almost always assigned the same syllable label by the UMAP+HDBSCAN clustering. The syntax analysis module includes features examining the rate of syllable repetitions across syllable types, as mentioned in lines 222-226 of the revised manuscript. See https://avn.readthedocs.io/en/latest/syntax_analysis_demo.html#Syllable-Repetitions for further details.

      (8) It seems not all human annotations have been released and the instruction sets given to experts (how to segment syllables and score songs) are not disclosed. It may well be that the differences in performance between (Gu et al 2023) and (Cohen et al 2022) are due to differences in segmentation tasks, which is why these tasks given to experts need to be clearly spelled out. Also, the downloadable files contain merely labels but no identifier of the expert. The data should be released in such a way that lets other labs adopt their labeling method and cross-check their own labeling accuracy.  

      All human annotations used in this manuscript have indeed been released as part of the accompanying dataset. Syllable annotations are not provided for all pupils and tutors used to validate the similarity scoring, as annotations are not necessary for similarity comparisons. We have expanded our description of our annotation guidelines in the methods section of the revised manuscript. All the annotations were generated by one of two annotators. The second annotator always consulted with the first annotator in cases of ambiguous syllable segmentation or labeling, to ensure that they had consistent annotation styles. Unfortunately, we haven’t retained records about which birds were annotated by which of the two annotators, so we cannot share this information along with the dataset. The data is currently available in a format that should allow other research groups to use our annotations either to train their own annotation systems or check the performance of their existing systems on our annotations.  

      (9) The failure modes are not described. What segmentation errors did they encounter, and what syllable classification errors? It is important to describe the errors to be expected when using the method. 

      As we discussed in our response to this reviewer’s point (3), WhisperSeg has a tendency to merge syllables when the gap between them is very short, which explains its lower recall score compared to its precision on our dataset (Figure 2–figure supplement 1). In rare cases, WhisperSeg also fails to recognize syllables entirely, again impacting its precision score. TweetyNet hardly ever completely ignores syllables, but it does tend to occasionally merge syllables together or over-segment them. Whereas WhisperSeg does this very consistently for the same syllable types within the same bird, TweetyNet merges or splits syllables more inconsistently. This inconsistent merging and splitting has a larger effect on syllable labeling, as manifested in the lower clustering v-measure scores we obtain with TweetyNet compared to WhisperSeg segmentations. TweetyNet also has much lower precision than WhisperSeg, largely because TweetyNet often recognizes background noises (like wing flaps or hopping) as syllables whereas WhisperSeg hardly ever segments non-vocal sounds. 

      Many errors in syllable labeling stem from differences in syllable segmentation. For example, if two syllables with labels ‘a’ and ‘b’ in the manual annotation are sometimes segmented as two syllables, but sometimes merged into a single syllable, the clustering is likely to find 3 different syllable types; one corresponding to ‘a’, one corresponding to ‘b’ and one corresponding to ‘ab’ merged. Because of how we align syllables across segmentation schemes for the v-measure calculation, this will look like syllable ‘b’ always has a consistent cluster label (or is missing a label entirely), but syllable ‘a’ can carry two different cluster labels, depending on the segmentation. In certain cases, even in the absence of segmentation errors, a group of syllables bearing the same manual annotation label may be split into 2 or 3 clusters (it is extremely rare for a single manual annotation group to be split into more than 3 clusters). In these cases, it is difficult to conclusively say whether the clustering represents an error, or if it actually captured some meaningful systematic difference between syllables that was missed by the annotator. Finally, sometimes rare syllable types with their own distinct labels in the manual annotation are merged into a single cluster. Most labeling errors can be explained by this kind of merging or splitting of groups relative to the manual annotation, not to occasional mis-classifications of one manual label type as another.

      For examples of these types of errors, we encourage this reviewer and readers to refer to the example confusion matrices in figure 2f and Figure 2–figure supplement 3b&e. We also added two paragraphs to the end of the ‘Accurate, fully unsupervised syllable labeling’ section of the Results in the revised manuscript. 

      (10) Usage of Different Dimensionality Reduction Methods: The pipeline uses two different dimensionality reduction techniques for labeling and similarity comparison - both based on the understanding of the distribution of data in lower-dimensional spaces. However, the reasons for choosing different methods for different tasks are not articulated, nor is there a comparison of their efficacy.  

      We apologize for not making this distinction sufficiently clear in the manuscript and have added a paragraph to the ‘Measuring Song Imitation’ section of the Results explaining the rational for using an embedding model for similarity scoring. 

      We chose to use UMAP for syllable labeling because it is a common embedding methodology to precede hierarchical clustering and has been shown to result in reliable syllable labels for birdsong in the past (Sainburg et al. 2020). However, it is not appropriate for similarity scoring, because comparing EMD or MMD scores between birds requires that all the birds’ syllable distributions exist within the same shared embedding space. This can be achieved by using the same triplet loss-trained neural network model to embed syllables from all birds. This cannot be achieved with UMAP because all birds whose scores are being compared would need to be embedded in the same UMAP space, as distances between points cannot be compared across UMAPs. In practice, this would mean that every time a new tutor-pupil pair needs to be scored, their syllables would need to be added to a matrix with all previously compared birds’ syllables, a new UMAP would need to be computed, and new EMD or MMD scores between all bird pairs would need to be calculated using their new UMAP embeddings. This is very computationally expensive and quickly becomes unfeasible without dedicated high power computing infrastructure. It also means that similarity scores couldn’t be compared across papers without recomputing everything each time, whereas EMD and MMD scores obtained with triplet loss embeddings can be compared, provided they use the same trained model (which we provide as part of AVN) to embed their syllables in a common latent space. 

      (11) Reproducibility: are the measurements reproducible? Systems like UMAP always find a new embedding given some fixed input, so the output tends to fluctuate.

      There is indeed a stochastic element to UMAP embeddings which will result in different embeddings and therefore different syllable labels across repeated runs with the same input. We observed that v-measures scores were quite consistent within birds across repeated runs of the UMAP, and have added an additional supplementary figure to the revised manuscript showing this (Figure 2–figure supplement 4).

      Reviewer #1 (Recommendations For The Authors):

      (1) Benchmark their similarity score to the method used by Goffinet et al, 2021 from the Pearson group. Such a comparison would be really interesting and useful.  

      This has been added to the paper. 

      (2) Please clarify exactly what is new and what is applied from existing methods to help the reader see the novelty of the paper.  

      We have added more emphasis on the novel aspects of our pipeline to the paper’s introduction. 

      Minor:

      It's unclear if AVN is appropriate as the paper deals only with zebra finch song - the scope is more limited than advertised.

      We assume this is in reference to ‘Birdsong’ in the paper’s title and ‘Avian’ in Avian Vocalization Network. There is a brief discussion of how these methods are likely to perform on other commonly studied songbird species at the end of the discussion section.

      Reviewer #2 (Recommendations For The Authors):

      A few points for the authors to consider that might strengthen or inform the paper:

      (1) In the public review, I detailed some ways in which the SSL+EMD approach is unlikely to be appreciably distinct from the VAE+MMD approach -- in fact, one could mix and match here. It would strengthen the authors' claim if they showed via experiments that their method outperforms VAE+MMD, but in the absence of that, a discussion of the relation between the two is probably warranted.  

      This comparison has been added to the paper.

      (2) ll. 305-310: This loss of accuracy near the edge is expected on general Bayesian grounds. Any regression approach should learn to estimate the conditional mean of the age distribution given the data, so ages estimated from data will be pulled inward toward the location of most training data. This bias is somewhat mitigated in the Brudner paper by a more flexible model, but it's a general (and expected) feature of the approach.

      (3) While the online AVA documentation looks good, it might benefit from a page on design philosophy that lays out how the various modules fit together - something between the tutorials and the nitty-gritty API. That way, users would be able to get a sense of where they should look if they want to harness pieces of functionality beyond the tutorials.

      Thank you for this suggestion. We will add a page on AVN’s design philosophy to the online documentation. 

      (4) While the manuscript does compare AVN to packages like TweetyNet and AVA that share some functionality, it doesn't really mention what's been going on with the vocalpy ecosystem, where the maintainers have been doing a lot to standardize data processing, integrate tools, etc. I would suggest a few words about how AVN might integrate with these efforts.

      We thank the reviewer for this suggestion.

      (5) ll. 333-336: It would be helpful to provide a citation to some of the self-supervised learning literature this procedure is based on. Some citations are provided in methods, but the general approach is worth citing, in my opinion. 

      We have added a paragraph to the results section with more background on self-supervised learning for dimensionality reduction, particularly in the context of similarity scoring.

      (6) One software concern for medium-term maintenance: AVN docs say to use Python 3.8, and GitHub says the package is 3.9 compatible. I also saw in the toml file that 3.10 and above are not supported. It's worth noting that Python 3.9 reaches its end of life in October 2025, so some dependencies may have to be altered or changed for the package to be viable going forward.  

      Thank you for this comment. We will continue to maintain AVN and update its dependencies as needed.

      Minor points:

      (1) It might be good to note that WhisperSeg is a different install from AVN. May be hard for novice users, though there's a web interface that's available. 

      We’ve added a line to the methods section making this clear. 

      (2) Figure 6b: Some text in the y-axis labels is overlapping here. 

      This has been fixed. Thank you for bringing it to our attention. 

      (3) The name of the Python language is always capitalized.  

      We’ve fixed this capitalization error throughout the manuscript. Thank you.

      Reviewer #3 (Recommendations For The Authors):

      (1) I recommend that the authors improve the motivation of the chosen tasks and data or choose new tasks that more clearly speak to the optimizations they want to perform. 

      We have included more details about the motivation for our LDA classification analysis, age prediction model and embedding model for similarity scoring in the results of the revised manuscript, as discussed in more detail in the above responses to this reviewer. Thank you for these suggestions. 

      (2) They need to rigorously report the (classification) scores on the test datasets: these are the scores associated with the cost function used during training.  

      Based on this reviewer’s ‘Weaknesses: 3’ comment in the public reviews, we believe that they are referring to a classification score for the triplet loss model. As we explained in response to that comment, this is not a classification task, therefor there is no classification score to report. The loss function used to train the model was a triplet loss function. While we could report these values, they are not informative for how well this approach would perform in a similarity scoring context, as explained above. As such, we prefer to include contrast index and tutor contrast index scores to compare the models’ performance for similarity score, as these are directly relevant to the task and are established in the field for said task.

      (3) They need to explain the reasons for the poor performance (or report on the inconsistencies with previous work) and why they prefer a fully automated system rather than one that needs some fine-tuning on bird-specific data.

      We’ve addressed this comment in the public response to this reviewer’s weakness points 3, 5, and 6. 

      (4) They should consider applying their method to data from Japanese and European labs.  

      We’ve addressed this comment in the public response to this reviewer’s weakness point 4.

      (5) The need to document the failure modes and report all details about the human annotations.  

      We’ve added additional description of the failure modes for our segmentation and labeling approaches in the results section of the revised manuscript.

      Details: 

      The introduction is very vague, it fails to make a clear case of what the problem is and what the approach is. It reads a bit like an advertisement for machine learning: we are given a hammer and are looking for a nail.  

      We thank the reviewer for this viewpoint; however, we disagree and have decided to keep our Introduction largely unchanged. 

      L46 That interpretability is needed to maximize the benefits of machine learning is wrong, see self-driving cars and chat GPT.  

      This line states that ‘To truly maximize the benefits of machine learning and deep learning methods for behavior analysis, their power must be balanced with interpretability and generalizability’. We firmly believe that interpretability is critically important when using machine learning tools to gain a deeper scientific understanding of data, including animal behavior data in a neuroscience context. We believe that the introduction and discussion of this paper already provide strong evidence for this claim. 

      L64 What about zebra finches that repeat a syllable in the motif, how are repetitions dealt with by AVN?  

      This is already described in the results section in lines 222-226, and in the methods in the ‘Syntax Features: Repetition Bouts’ section.

      L107 Say a bit more here, what exactly has been annotated?  

      We’ve added a sentence in the introduction to clarify this. Line 113-115. 

      L112 Define spectrogram frames. Do these always fully or sometimes partially contain a vocalization? 

      Spectrogram frames are individual time bins used to compute the spectrogram using a short-term Fourier transform. As described in the ‘Methods; Labeling : UMAP Dimensionality Reduction” section, our spectrograms are computed using ‘The short term Fourier transform of the normalized audio for each syllable […] with a window length of 512 samples and a hop length of 128 samples’. Given that the song files have a standard sampling rate of 44.1kHz, this means each time bin represents 11.6ms of song data, with successive frames advancing in time by 2.9ms. These contain only a small fraction of a vocalization. 

      L122 The reported TweetyNet score of 0.824 is lower than the one reported in Figure 2a.  

      The center line in the box plot in Figure 2a represents the median of the distribution of TweetyNet vmeasure scores. Given that there are a couple outlying birds with very low scores, the mean (0.824 as reported in the text of the results section) is lower than the median. This is not an error.

      L155 Some of the differences in performance are very small, reporting of the P value might be necessary. 

      These methods are unlikely to statistically significantly differ in their validation scores. This doesn’t mean that we cannot use the mean/median values reported to justify favoring one method over another. This is why we’ve chosen not to report p-values here.

      L161 The authors have not really tested more than a single clustering method, failing to show a serious attempt to achieve good performance.  

      We’ve addressed this comment in the public response to this reviewer’s weakness point 2.

      L186 Did isolate birds produce stereotyped syllables that can be clustered? 

      Yes, they did. The validation for clustering of isolate bird songs can be found in Figure 2–figure supplement 4. 

      Fig. 3e: How were the multiple bouts aligned?

      This is described in lines 857-876 in the ‘Methods: Song Timing Features: Rhythm Spectrograms” section of the paper.

      L199 There is a space missing in front of (n=8).  

      Thank you for bringing this to our attention. It’s been corrected in the updated manuscript. 

      L268 Define classification accuracy.  

      We’ve added a sentence in lines 953-954 of the methods section defining classification accuracy. 

      L325 How many motifs need to be identified, why does this need to be done manually? There are semiautomated methods that can allow scaling, these should be  cited here. Also, the mention of bias here should be removed in favor of a more extensive discussion on the experimenter bias (traditionally vs Texas bias (in this paper).  

      All of the methods cited in this line have graphical user interfaces that require users to select a file containing song and manually highlight the start and end each motif to be compared. The exact number of motifs required varies depending on the specific context (e.g. more examples are needed to detect more subtle differences or changes in song similarity) but it is fairly standard for reviewers to score 30 – 100 pairs of motifs. 

      We’ve discussed the tradeoffs between full automation and supervised or human-in-the loop methods in response to this reviewer’s public comment ‘weakness #5 and 6’. Briefly, AVN’s aim is to standardize song analysis, to allow direct comparisons between song features and similarity scores across research groups. We believe, as explained in the paper, that this can be best achieve by having different research groups use the same deep learning models, which perform consistently well across those groups. Introducing semi-automated methods would defeat this benefit of AVN. 

      We’ve also addressed the question of ‘Texas bias’ in response to their reviewer’s public comment ‘Weakness #4’. 

      L340 How is EMD applied? Syllables are points in 8-dim space, but now suddenly authors talk about distributions without explaining how they got from points to distributions. Same in L925.  

      We apologize for the confusion here. The syllable points in the 8-d space are collectively an empirical distribution, not a probability distribution. We referred to them simply as ‘distributions’ to limit technical jargon in the results of the paper, but have changed this to more precise language in the revised manuscript.

      L351 Why do authors now use 'contrast index' to measure performance and no longer 'classification accuracy'?  

      We’ve addressed this comment in the public response to this reviewer’s weakness points 1 and 2.

      Figure 6 What is the confusion matrix, i.e. how well can the model identify pupil-pupil pairings from pupiltutor and from pupil-unrelated pairings? I guess that would amount to something like classification accuracy.  

      There is no model classifying comparisons as pupil-pupil vs. pupil-tutor etc. These comparisons exist only to show the behavior of the similarity scoring approach, which consists of a dissimilarity measure (MMD or EMD) applied to low dimensional representations of syllable generated by the triplet loss model or VAE. This was clarified further in our public response to this reviewer’s weakness points 1 and 2. 

      L487 What are 'song files', and what do they contain?   

      ‘Song files’ are .wav files containing recordings of zebra finch song. They typically contain a single song bout, but they can include multiple song bouts if they are produced close together, or incomplete song bouts if the introductory notes were very soft or the bouts were very long (>30s from the start of the file). Details of these recordings are provided in the ‘Methods: Data Acquisition: UTSW Dataset’ section of the manuscript.

      L497 Calls were only labelled for tweetynet but not for other tasks.  

      That is correct. The rationale for this is provided in the ‘Methods: Manual Song Annotation’ section of the manuscript. 

      L637 There is a contradiction (can something be assigned to the 'own manual annotation category' when the same sentence states that this is done 'without manual annotation'?) 

      We believe there is confusion here between automated annotation and validation. Any bird can be automatically annotated without the need for any existing manual annotations for that individual bird. However, manual labels are required to compare automatically generated annotations against for validation of the method.

      L970 Spectograms of what? (what is the beginning of a song bout, L972). 

      The beginning of a song bout is the first introductory note produced by a bird after a period without vocalizations. This is standard.

    1. Reviewer #1 (Public review):

      Summary:

      This paper investigates the control signals that drive event model updating during continuous experience. The authors apply predictions from previously published computational models to fMRI data acquired while participants watched naturalistic video stimuli. They first examine the time course of BOLD pattern changes around human-annotated event boundaries, revealing pattern changes preceding the boundary in anterior temporal and then parietal regions, followed by pattern stabilization across many regions. The authors then analyze time courses around boundaries generated by a model that updates event models based on prediction error and another that uses prediction uncertainty. These analyses reveal overlapping but partially distinct dynamics for each boundary type, suggesting that both signals may contribute to event segmentation processes in the brain.

      Strengths:

      (1) The question addressed by this paper is of high interest to researchers working on event cognition, perception, and memory. There has been considerable debate about what kinds of signals drive event boundaries, and this paper directly engages with that debate by comparing prediction error and prediction uncertainty as candidate control signals.

      (2) The authors use computational models that explain significant variance in human boundary judgments, and they report the variance explained clearly in the paper.

      (3) The authors' method of using computational models to generate predictions about when event model updating should occur is a valuable mechanistic alternative to methods like HMM or GSBS, which are data-driven.

      (4) The paper utilizes an analysis framework that characterizes how multivariate BOLD pattern dissimilarity evolves before and after boundaries. This approach offers an advance over previous work focused on just the boundary or post-boundary points.

      Weaknesses:

      (1) While the paper raises the possibility that both prediction error and uncertainty could serve as control signals, it does not offer a strong theoretical rationale for why the brain would benefit from multiple (empirically correlated) signals. What distinct advantages do these signals provide? This may be discussed in the authors' prior modeling work, but is left too implicit in this paper.

      (2) Boundaries derived from prediction error and uncertainty are correlated for the naturalistic stimuli. This raises some concerns about how well their distinct contributions to brain activity can be separated. The authors should consider whether they can leverage timepoints where the models make different predictions to make a stronger case for brain regions that are responsive to one vs the other.

      (3) The authors refer to a baseline measure of pattern dissimilarity, which their dissimilarity measure of interest is relative to, but it's not clear how this baseline is computed. Since the interpretation of increases or decreases in dissimilarity depends on this reference point, more clarity is needed.

      (4) The authors report an average event length of ~20 seconds, and they also look at +20 and -20 seconds around each event boundary. Thus, it's unclear how often pre- and post-boundary timepoints are part of adjacent events. This complicates the interpretations of the reported time courses.

      (5) The authors describe a sequence of neural pattern shifts during each type of boundary, but offer little setup of what pattern shifts we might expect or why. They also offer little discussion of what cognitive processes these shifts might reflect. The paper would benefit from a more thorough setup for the neural results and a discussion that comments on how the results inform our understanding of what these brain regions contribute to event models.

    2. Reviewer #3 (Public review):

      Summary:

      The aim of this study was to investigate the temporal progression of the neural response to event boundaries in relation to uncertainty and error. Specifically, the authors asked (1) how neural activity changes before and after event boundaries, (2) if uncertainty and error both contribute to explaining the occurrence of event boundaries, and (3) if uncertainty and error have unique contributions to explaining the temporal progression of neural activity.

      Strengths:

      One strength of this paper is that it builds on an already validated computational model. It relies on straightforward and interpretable analysis techniques to answer the main question, with a smart combination of pattern similarity metrics and FIR. This combination of methods may also be an inspiration to other researchers in the field working on similar questions. The paper is well written and easy to follow. The paper convincingly shows that (1) there is a temporal progression of neural activity change before and after an event boundary, and (2) event boundaries are predicted best by the combination of uncertainty and error signals.

      Weaknesses:

      Regarding question 3, I am less convinced by the results. They show that overlapping but somewhat distinct sets of brain regions relate to uncertainty and error boundaries over time. And that some regions show distinct patterns of temporal progressions in pattern change with both types of boundaries. However, most of the effects they observe in this analysis may still be driven by shared variance, as suggested by the results in Figure 6 and the high correlation between the two boundary time series. More specific comments are provided below.

      Impact:

      If these comments can be addressed sufficiently, I expect that this work will impact the field in its thinking on what drives event boundaries and spur interest in understanding the mechanisms behind the temporal progression of neural activity around these boundaries.

      Comments

      (1) The current analysis of the neural data does not convincingly show that uncertainty and prediction error both contribute to the neural responses. As both terms are modelled in separate FIR models, it may be that the responses we see for both are mostly driven by shared variance. Given that the correlation between the two is very high (r=0.49), this seems likely. The strong overlap in the neural responses elicited by both, as shown in Figure 6, also suggests that what we see may mainly be shared variance. To improve the interpretability of these effects, I think it is essential to know whether uncertainty and error explain similar or unique parts of the variance. The observation that they have distinct temporal profiles is suggestive of some dissociation, but not as convincing as adding them both to a single model.

      (2) The results for uncertainty and error show that uncertainty has strong effects before or at boundary onset, while error is related to more stabilization after boundary onset. This makes me wonder about the temporal contribution of each of these. Could it be the case that increases in uncertainty are early indicators of a boundary, and errors tend to occur later?

      (3) Given that there is a 24-second period during which the neural responses are shaped by event boundaries, it would be important to know more about the average distance between boundaries and the variability of this distance. This will help establish whether the FIR model can properly capture a return to baseline.

      (4) Given that there is an early onset and long-lasting response of the brain to these event boundaries, I wonder what causes this. Is it the case that uncertainty or errors already increase at 12 seconds before the boundaries occur? Or if there are other makers in the movie that the brain can use to foreshadow an event boundary? And if uncertainty or errors do increase already 12 seconds before an event boundary, do you see a similar neural response at moments with similar levels of error or uncertainty, which are not followed by a boundary? This would reveal whether the neural activity patterns are specific to event boundaries or whether these are general markers of error and uncertainty.

      (5) It is known that different brain regions have different delays of their BOLD response. Could these delays contribute to the propagation of the neural activity across different brain areas in this study?

      (6) In the FIR plots, timepoints -12, 0, and 12 are shown. These long intervals preclude an understanding of the full temporal progression of these effects.

    1. Reviewer #2 (Public review):

      Summary:

      This manuscript investigates the role of EV-D68 proteases 2A and 3C in nuclear pore complex (NPC) dysfunction and their contribution to motor neuron toxicity. The authors demonstrate that both proteases cleave only a limited number of nucleoporins, with 2A^pro showing the strongest impact by inhibiting nuclear import and export of proteins and disrupting NPC permeability without affecting RNA export. Importantly, treatment with the 2A^pro inhibitor telaprevir reduced neuronal cell death in a dose-dependent manner, achieving neuroprotection at concentrations below those required to inhibit viral replication. The study addresses a relevant mechanism underlying EV-D68-induced neuropathology and explores a potential therapeutic intervention.

      Strengths:

      (1) Provides significant mechanistic insight into how EV-D68 proteases alter NPC function and contribute to neuronal toxicity.

      (2) The use of recombinant 2A and 3C proteins allows clear dissection of the specific contribution of each protease.

      (3) Demonstrates a therapeutic effect of telaprevir, with neuroprotection independent of viral replication inhibition, adding translational value to the findings.

      (4) The topic is highly relevant given the association of EV-D68 with acute flaccid myelitis.

      Weaknesses:

      (1) Most experiments were performed with recombinant proteases, lacking validation in the context of viral infection, where both proteases act simultaneously.

      (2) The conclusion that RNA export is unaffected requires confirmation during actual infection.

      (3) The reduction of neurotoxicity by telaprevir does not fully demonstrate that the protective effect is solely mediated through NPC preservation; additional analyses of eIF4G cleavage, nucleoporin integrity, and stress granules are needed.

      (4) The study would be strengthened by including another 2A inhibitor (e.g., boceprevir) to confirm the specificity of telaprevir's protective effects.

    2. Reviewer #3 (Public review):

      Summary:

      The author showed expression of the viral proteases 2Apro and 3Cpro of EV-D68, which cleaved specific components of the nuclear pore complex (Nup98 and POM121 by 2Apro), and 2A but not 3C expression altered nuclear import and export. Similar nucleocytoplasmic transport deficits are observed in EV-D68-infected RD cells and iPSC-derived motor neurons (diMNs). 2A inhibitor telaprevir partially rescued the nucleocytoplasmic transport deficits and suppressed neuronal cell death after infection. While it's clear that 2A can cleave NPC proteins and affect nuclear transport, the link to neurotoxicity after EV-D68 infection is less convincing.

      This study opens up a very intriguing hypothesis: that EV-D68 2Apro could be directly responsible for motor neuron cell death, mediated by POM121 and possibly Nup98 cleavage, that ultimately results in paralysis known as acute flaccid myelitis. This hypothesis notably does run counter to other published data showing that human neuronal organoids derived from iPSCs can support productive EV-D68 infection for weeks without cell death and that EV-D68-infected mice can have paralysis prevented by depletion of CD8 T cells, still with EV-D68 infection of the spinal cord. However, even if 2Apro is not ultimately responsible for motor neurons dying in human infections, that does not exclude the possibility that cleavage of nups could still disrupt motor neuron function. Notably, most children with AFM have some amount of motor function return after their acute period of paralysis, but most still have some residual paralysis for years to life. It is possible that 2A pro could mediate the acute onset of weakness, while T cells killing neurons could determine the amount of long-term, residual paralysis.

      Strengths:

      The characterization of nuclear pore complex components that appear to be targets of both poliovirus and EV-D68 proteases is quite thorough and expansive, so this data set alone will be useful for reference to the field. And the process by which the authors narrowed their focus to EV-D68 2Apro reducing Nup98 and POM121 as consequential to both import and export of nuclear cargo but not RNA was technically impressive, thorough, and convincing. As will be detailed below, when the authors move from studying over-expressed proteases in transformed cell lines to studying actual virus infection in both transformed cell lines and iPSC-derived neurons, some of the data only indirectly support their conclusions; however, the quality of the experiments performed is still high. So even if the claim that 2Apro causes neurotoxicity is circumstantial, the data certainly are intriguing and certainly justify further study of the effects of EV-D68 2Apro on the NPC and how this impacts pathogenesis. This is a convincing start to an intriguing line of inquiry.

      Weaknesses:

      This study falls a bit shy of actually showing that 2Apro effects are causing motor neuron toxicity because the evidence of this is fairly indirect. At points, the authors do admit these limitations, but at other times, they claim to have shown the link directly. The following are reasons why these claims are only indirectly supported:

      (1) Cleavage of Nup98 and POM121 after EV-D68 infection in RD cells and diMNs is never demonstrated.

      (2) Telaprevir was able to rescue nucleocytoplasmic transport in RD cells at low concentrations (Figure 4A). It is not shown if this correlates with its antiviral effect in RD cells, or could this correlate with inhibition of 2A cleavage of Nup98 or POM121, which is never measured.

      (3) Building off of the prior point, the authors' claim that the neuroprotective effect of telaprevir is independent of its antiviral effect is not well-founded. Figure 4E (neuroprotection) was done with MOI 5, and Figure 4G (virus growth) was MOI 0.5. Telaprevir neuroprotection is not shown at MOI 0.5, nor is the neuroprotective effect correlated with inhibition of 2A cleavage of Nup98 or POM121.

      (4) The use of mixed virus isolates only in the diMNs is problematic because different EV-D68 isolates are known to have drastically different effects on pathogenesis in mice. Since all initial data were generated with the MO isolate, adding the additional MD isolate to the diMN experiments actually adds uncertainty to the conclusions. It is not clear if the authors infected different cultures with the different isolates and combined the data or infected all cultures with a mixture of the two isolates. If the former, then the data should be reported separately to see the effect of each individual strain, which would be interesting to EV-D68 virologists. If the latter, then there is no way to know from these data whether one of the two isolates had increased fitness over the other and exerted a dominant effect. If the MD isolate overtook the MO isolate, from which all other data in this manuscript are derived, then we have much less of an idea how much the data from the first three figures supports the final figure.

    3. Author response:

      We thank the reviewers for their detailed and thoughtful comments on the manuscript.  In general, the reviewers found the data supporting the role of Enterovirus D68 proteases in disrupting the composition of the nuclear pore complex, the 2A protease disrupting nucleocytoplasmic transport of protein cargoes, and the mechanistic dissection of this process to be convincing and potentially relevant to the pathogenesis of AFM.  Reviewers requested additional experiments evaluating our observation that RNA export was not similarly impaired, particularly in the context of viral infection rather than solely expression of recombinant proteases.  They also requested that cleavage of POM121 and Nup98 by 2A protease, which was demonstrated in 2A<sup>pro</sup> transfected cells and in biochemical assays, also be demonstrated in motor neurons infected by EV-D68.  Finally, reviewers noted that while suggestive, the evidence falls short of demonstrating that the toxicity of 2A<sup>pro</sup> is mediated through nuclear pore complex dysfunction.

      To address these critiques, we aim to do the following:

      (1) Determine the impact of live virus infection on RNA export by repeating the ethinyl uridine pulse-chase assay in the setting of live virus infection.  We will also provide representative images for these data and the previously reported data from transfection with GFP-2A<sup>pro</sup> and GFP-3C<sup>pro</sup>.

      (2) Evaluate cleavage of POM121 and Nup98 in EV-D68-infected diMNs and inhibition of cleavage by telaprevir by Western blot.

      (3) Present motor neuron survival data in figure 4 as separate graphs for each of the viral strains tested, rather than pooling the data.  To clarify reviewer #3’s concern, these were not mixed cultures.

      We agree that we have not demonstrated conclusively that the mechanism by which 2A<sup>pro</sup> is toxic to motor neurons is via NPC dysfunction.  Future work will determine the extent to which NPC dysfunction contributes to 2A<sup>pro</sup>-mediated motor neuron toxicity versus other potential targets of 2A<sup>pro</sup>.  We feel that the additional experiments required to achieve this will be extensive and are beyond the scope of the present manuscript, which represents a key first step in this line of inquiry.

      In addition to the above, there were several points of disagreement between reviewers.  We would like to respond to those as follows:

      Reviewer #1: “The hypothesis that infection of motoneurons is the cause of EVD68-induced neurological complications so far is supported by only one autopsy report.  Other data suggest that infection of other cell types, such as astrocytes, and/or inflammatory cell infiltration in the CNS, are likely to be responsible for the symptoms.”

      Reviewer #3: “This study opens up a very intriguing hypothesis: that EV-D68 2Apro could be directly responsible for motor neuron cell death, mediated by POM121 and possibly Nup98 cleavage, that ultimately results in paralysis known as acute flaccid myelitis. This hypothesis notably does run counter to other published data showing that human neuronal organoids derived from iPSCs can support productive EV-D68 infection for weeks without cell death and that EV-D68-infected mice can have paralysis prevented by depletion of CD8 T cells, still with EV-D68 infection of the spinal cord. However, even if 2Apro is not ultimately responsible for motor neurons dying in human infections, that does not exclude the possibility that cleavage of nups could still disrupt motor neuron function. Notably, most children with AFM have some amount of motor function return after their acute period of paralysis, but most still have some residual paralysis for years to life. It is possible that 2A pro could mediate the acute onset of weakness, while T cells killing neurons could determine the amount of long-term, residual paralysis.”

      The infection of motor neurons is strongly supported not only by the aforementioned autopsy data[1], but also by mouse model data demonstrating replication of EV-D68 within motor neurons in the anterior horn of the spinal cord.[2 ] There are also extensive reports of electromyography and nerve conduction studies from human AFM patients demonstrating that the site of pathology is the spinal motor neuron.[3-10]. By contrast, infection of astrocytes has been demonstrated only in primary murine astrocyte cultures in which no neurons were present.[11] .Therefore, while the available data suggest that EV-D68 infection of astrocytes is possible, in the in vivo context of human and mouse spinal cord, tropism to motor neurons appears to be preferential.  The relative contributions to toxicity of neuron-autonomous vs non-autonomous processes such as glial dysfunction and inflammatory cell infiltration remain to be elucidated, and are not mutually exclusive.

      Our working hypothesis is more in line with that of Reviewer #3.  Motor neuron dysfunction and motor neuron death may ultimately prove to have dissociable causes, each of which may be neuron-autonomous, non-neuron-autonomous, or a mixture thereof.  The infection of motor neurons is likely the initiating event, with multiple downstream consequences.  Much additional work will be required to resolve this controversy.

      Reviewer #1: “Demonstrates a therapeutic effect of telaprevir, with neuroprotection independent of viral replication inhibition, adding translational value to the findings.”

      Reviewer #3: “The authors' claim that the neuroprotective effect of telaprevir is independent of its antiviral effect is not well-founded. Figure 4E (neuroprotection) was done with MOI 5, and Figure 4G (virus growth) was MOI 0.5. Telaprevir neuroprotection is not shown at MOI 0.5, nor is the neuroprotective effect correlated with inhibition of 2A cleavage of Nup98 or POM121.”

      The selection of MOIs for these two experiments was limited by technical considerations.  If the viral growth curve were to be performed at MOI 5, it would be confounded by cell death.  Further, a low MOI is required in order to allow multiple rounds of infection, replication, and spread within the culture, and is therefore more sensitive for assaying the effect of telaprevir on viral replication.  On the other hand, at MOI 0.5 diMN death is very gradual, and in the neuroprotection assay we would have lacked the statistical power to determine whether a rescue of this small magnitude of toxicity is significant.  The EC<sub>50</sub> of telaprevir is not expected to vary significantly at different MOIs.

      References:

      (1) Vogt, M. R. et al. Enterovirus D68 in the Anterior Horn Cells of a Child with Acute Flaccid Myelitis. N Engl J Med 386, 2059-2060 (2022). https://doi.org/10.1056/NEJMc2118155

      (2) Hixon, A. M. et al. A mouse model of paralytic myelitis caused by enterovirus D68. PLoS Pathog 13, e1006199 (2017). https://doi.org/10.1371/journal.ppat.1006199

      (3) Andersen, E. W., Kornberg, A. J., Freeman, J. L., Leventer, R. J. & Ryan, M. M. Acute flaccid myelitis in childhood: a retrospective cohort study. Eur J Neurol 24, 1077-1083 (2017). https://doi.org/10.1111/ene.13345

      (4) Elrick, M. J. et al. Clinical Subpopulations in a Sample of North American Children Diagnosed With Acute Flaccid Myelitis, 2012-2016. JAMA Pediatr 173, 134-139 (2018). https://doi.org/10.1001/jamapediatrics.2018.4890

      (5) Hovden, I. A. & Pfeiffer, H. C. Electrodiagnostic findings in acute flaccid myelitis related to enterovirus D68. Muscle Nerve 52, 909-910 (2015). https://doi.org/10.1002/mus.24738

      (6) Knoester, M. et al. Twenty-Nine Cases of Enterovirus-D68 Associated Acute Flaccid Myelitis in Europe 2016; A Case Series and Epidemiologic Overview. Pediatr Infect Dis J 38, 16-21 (2018). https://doi.org/10.1097/INF.0000000000002188

      (7) Martin, J. A. et al. Outcomes of Colorado children with acute flaccid myelitis at 1 year. Neurology 89, 129-137 (2017). https://doi.org/10.1212/WNL.0000000000004081

      (8) Saltzman, E. B. et al. Nerve Transfers for Enterovirus D68-Associated Acute Flaccid Myelitis: A Case Series. Pediatr Neurol 88, 25-30 (2018). https://doi.org/10.1016/j.pediatrneurol.2018.07.018

      (9) Van Haren, K. et al. Acute Flaccid Myelitis of Unknown Etiology in California, 2012-2015. JAMA 314, 2663-2671 (2015). https://doi.org/10.1001/jama.2015.17275

      (10) Natera-de Benito, D. et al. Acute Flaccid Myelitis With Early, Severe Compound Muscle Action Potential Amplitude Reduction: A 3-Year Follow-up of a Child Patient. J Clin Neuromuscul Dis 20, 100-101 (2018). https://doi.org/10.1097/CND.0000000000000217

      (11) Rosenfeld, A. B., Warren, A. L. & Racaniello, V. R. Neurotropism of Enterovirus D68 Isolates Is Independent of Sialic Acid and Is Not a Recently Acquired Phenotype. Mbio (2019). https://doi.org/10.1128/mBio

    1. Author response:

      Reviewer #1 (Public review):

      For summary:

      Thank you for your insightful and rigorous review. We fully agree with your core concern: establishing a causal link between MORC2 phase separation (PS) and its gene regulatory function is not only a key need in the phase separation field but also essential to elevating the overall utility of our work. To resolve the current gap in causal evidence, we will design experiments that explicitly distinguish the role of phase-separated condensates from soluble MORC2 complexes: We will generate a phase-separation-deficient but dimerization-competent MORC2 mutant by mutating key hydrophobic residues in the IDRa region (critical for IDR-IBD multivalent interactions driving phase separation) without disrupting the CC3 domain’s dimerization interface. In addition, we plan to investigate whether introducing a KS sequence[1] at the C-terminus can effectively attenuate the phase separation propensity of MORC2. These mutants will allow us to decouple “phase separation capacity” from “protein dimerization” (a prerequisite for both soluble complex formation and condensates).

      For strengths:

      We appreciate the reviewer’s recognition of our characterization of MORC2 phase separation and its structural basis. Our understanding of the CW domain’s function remains preliminary. Although we observed that the CW domain can influence condensate size, the IDR, IBD, and CC3 domains constitute the core structural elements driving phase separation. Consequently, the CW domain was not a primary focus of the current study. Nonetheless, investigating its functional contributions represents an interesting avenue for future work.

      For weaknesses:

      (1) We appreciate the reviewer’s rigorous concern. Our RNA-seq data were generated from fully independent transfections performed in triplicate across different time points and cell culture batches, aiming to maximize sample independence. However, for sensitive sequencing experiments, we observed that variability in transfection efficiency and cell culture across batches can introduce experimental differences, resulting in variable regulation of differentially expressed genes across samples. During differential gene analysis, p-value filtering excluded an additional 40 overlapping genes. In total, 61 genes overlapped with those reported in reference 22[2] (ZNF91, ZNF721, ZNF66, ZNF493, ZNF462, ZNF221, ZNF121, VGLL3, TUFT1, TLE4, TGFB2, SYS1-DBNDD2, STXBP6, SPRY2, SAMD9, ROR1, PTGES, PLK2, PLCXD2, PEA15, PDE2A, OLR1, NYAP2, NTN4, NRXN3, NEXN, MYLK, MPP7, MDGA1, MAMDC2, LBH, KRT80, ITGB8, IGFBP3, IGF2BP2, ICAM1, HIVEP3, GRB14, GPRC5A, GLCE, GJB3, GADD45B, GADD45A, FOXE1, FOSL1, FGF2, ETV5, ERBB3, DNAJC22, DIRAS1, DBNDD2, CXCL16, CRB2, COL9A3, CLDN1, BDNF, ATP8A1, AMOTL2, AHNAK2, ADAMTS16, ACSF2). To further enhance reproducibility, we will perform additional sequencing experiments.

      (2).Disease-associated mutants of MORC2

      At the current stage, the results for disease-associated mutations are descriptive. While we observed that certain mutations clustered at the N-terminus can affect MORC2 condensate formation, ATPase activity, and DNA binding, we did not identify a mechanistic explanation for these correlations. Notably, the T424R mutation, previously reported to significantly enhance ATPase activity, also increased both intracellular condensate formation and in vitro DNA binding in our experiments. In contrast, other mutations did not show such consistent effects. Previous studies have established that MORC2’s ATP-binding and DNA-binding activities are independent[2]. Our results further suggest that MORC2’s phase separation behavior is also independent of both ATP and DNA binding, although existing evidence hints at potential cross-regulatory interactions among these three functions.

      We are fully committed to implementing these revisions with strict rigor and plan to complete them within 8–10 weeks. We will submit a comprehensive response letter alongside the revised manuscript, explicitly mapping how each of your concerns has been addressed, and ensuring that our conclusions about MORC2 PS’s functional role are supported by solid, reproducible data. We believe these revisions will transform our study from a strong “mechanism-focused” work to a comprehensive one that bridges PS mechanisms and biological function—aligning with the high standards of the phase separation field. Thank you again for your invaluable guidance in improving our work.

      Reviewer #2 (Public review):

      For summary:

      Thank you for your thorough and constructive review of our manuscript. We fully agree with the key concerns you raised and have developed a detailed revision plan to address each point comprehensively. We will perform additional control and validation experiments to directly link MORC2’s condensate-forming capacity with its gene silencing function. At the current stage, the results for disease-associated mutations are descriptive. While we observed that certain mutations clustered at the N-terminus can affect MORC2 condensate formation, ATPase activity, and DNA binding, we did not identify a mechanistic explanation for these correlations. Notably, the T424R mutation, previously reported to significantly enhance ATPase activity[3], also increased both intracellular condensate formation and in vitro DNA binding in our experiments. In contrast, other mutations did not show such consistent effects. Previous studies have established that MORC2’s ATP-binding and DNA-binding activities are independent[4]. Our results further suggest that MORC2’s phase separation behavior is also independent of both ATP and DNA binding, although existing evidence hints at potential cross-regulatory interactions among these three functions.

      For strengths:

      We thank the reviewer for their appreciation of the key findings presented in this manuscript.

      For weaknesses:

      We thank the reviewer for their careful assessment of MORC2’s DNA-binding properties and its relationship with ATPase and transcriptional activities. We would like to offer the following clarifications to address these concerns, which will also be incorporated into the Discussion section of the revised manuscript.

      (1) Recent work by Tan et al.[4] similarly identified multiple DNA-binding sites in MORC2, consistent with our findings, though there are discrepancies in the precise binding regions. In particular, they reported that isolated CC1 and CC2 domains do not bind 60 bp dsDNA, which contrasts with our observations. We attribute this difference to the types of DNA used in the assays. In our study, we employed 601 DNA, a defined nucleosome-positioning sequence, which differs substantially from randomly designed short dsDNA. For instance, prior work by Christopher H. Douse et al.[3] also confirmed that MORC2’s CC1 domain can bind 601 DNA.

      (2) In the study by Fendler et al.², DNA binding was reported to reduce MORC2’s ATPase activity—an observation that appears inconsistent with the results presented in our Fig. 5j. A critical distinction between the two studies lies in the experimental systems used: Fendler et al. employed a truncated MORC2 construct (residues 1–603) and 35 bp double-stranded DNA (dsDNA), whereas our experiments utilized full-length MORC2 and 601 bp DNA (a sequence with high nucleosome assembly potential). These differences—including the absence of potentially regulatory C-terminal regions in the truncated construct and the varying length/structural properties of the DNA substrates—introduce variables that substantially complicate direct comparative analysis of ATPase activity outcomes.

      Separately, Douse et al.³ demonstrated that the efficiency of HUSH complex-dependent epigenetic silencing decreases as MORC2’s ATP hydrolysis rate increases, implying an inverse relationship between ATPase activity and silencing function. Notably, our current work has not established a direct mechanistic link between MORC2 phase separation and its ATPase activity. Thus, we refrain from inferring that the effect of MORC2 phase separation on transcriptional repression is mediated through modulation of its ATPase function—this remains an important question to address in future studies.

      (3) Finally, we plan to perform additional experiments to rule out the potential effects of CC3 dimerization. We will generate a phase-separation-deficient but dimerization-competent MORC2 mutant by mutating key hydrophobic residues in the IDRa region (critical for IDR-IBD multivalent interactions driving phase separation) without disrupting the CC3 domain’s dimerization interface. In addition, we plan to investigate whether introducing a KS sequence[1] at the C-terminus can effectively attenuate the phase separation propensity of MORC2. These mutants will allow us to decouple “phase separation capacity” from “protein dimerization”.

      We are committed to implementing these revisions with strict rigor and plan to complete them within 8–10 weeks. We will submit a detailed response letter alongside the revised manuscript, explicitly mapping how each of your concerns has been addressed, and ensuring the Discussion section is robust, context-rich, and fully integrates our work with the existing literature. We believe these improvements will significantly enhance the reliability, contextual relevance, and impact of our study, and we sincerely thank you for guiding us to elevate its quality.

      Reviewer #3 (Public review):

      For summary:

      Thank you for your insightful review and constructive suggestions, which have been invaluable in refining our manuscript. We greatly appreciate your recognition of the study’s strengths, including its logical structure, integration of multi-disciplinary approaches (in vitro LLPS assays, cellular studies, NMR, and crystallography), and the establishment of a functional link between MORC2 phase separation, DNA binding, and transcriptional control. Your identification of areas needing stronger evidence has provided clear, actionable directions for improvement, and we are fully committed to addressing each point comprehensively.

      For Major comments:

      To strengthen the manuscript as per your recommendations:

      (1) For the characterization of IDR-IBD interactions in PS: We will perform systematic in vitro assays, including PS turbidity measurements and confocal imaging of MORC2 variants lacking IDR or IBD (ΔIDR, ΔIBD) and truncated constructs (IDR alone, IBD alone). These experiments will quantify how each domain individually or synergistically contributes to phase separation propensity (e.g., critical concentration, condensate size/distribution).

      (2) To assess DNA’s influence on PS: We will generate phase diagrams by testing a range of MORC2 concentrations (0.5–10 μM) or with 601 DNA (147bp) and concentrations (0–2 μM), using turbidity assays and microscopy to map phase boundaries. This will systematically clarify how DNA modulates MORC2 phase separation.

      We plan to complete these experiments within 3–4 weeks, with rigorous quantification and statistical analysis to support our conclusions. The revised manuscript will include a detailed response letter mapping each of your suggestions to specific data additions, ensuring enhanced robustness and conviction. We believe these revisions will significantly strengthen the study’s conclusions, and we sincerely thank you for guiding us to improve its quality.

      Reference:

      [1] Mensah, M. A., Niskanen, H., Magalhaes, A. P., Basu, S., Kircher, M., Sczakiel, H. L., Reiter, A. M. V., Elsner, J., Meinecke, P., Biskup, S., et al. (2023). Aberrant phase separation and nucleolar dysfunction in rare genetic diseases. Nature 614, 564-571. https://doi.org/10.1038/s41586-022-05682-1.

      [2] Fendler, N. L., Ly, J., Welp, L., Lu, D., Schulte, F., Urlaub, H., and Vos, S. M. (2024). Identification and characterization of a human MORC2 DNA binding region that is required for gene silencing. Nucleic Acids Res 53, gkae1273. https://doi.org/10.1093/nar/gkae1273.

      [3] Douse, C. H., Bloor, S., Liu, Y. C., Shamin, M., Tchasovnikarova, I. A., Timms, R. T., Lehner, P. J., and Modis, Y. (2018). Neuropathic MORC2 mutations perturb GHKL ATPase dimerization dynamics and epigenetic silencing by multiple structural mechanisms. Nat Commun 9, 651. https://doi.org/10.1038/s41467-018-03045-x.

      [4] Tan, W., Park, J., Venugopal, H., Lou, J. Q., Dias, P. S., Baldoni, P. L., Moon, K. W., Dite, T. A., Keenan, C. R., Gurzau, A. D., et al. (2025). MORC2 is a phosphorylation-dependent DNA compaction machine. Nat Commun 16, 5606. https://doi.org/10.1038/s41467-025-60751-z.

    2. Reviewer #3 (Public review):

      Summary:

      The manuscript by Zhang et al. demonstrates that MORC2 undergoes liquid-liquid phase separation (LLPS) to form nuclear condensates critical for transcriptional repression. Using a combination of in vitro LLPS assays, cellular studies, NMR spectroscopy, and crystallography, the authors show that a dimeric scaffold formed by CC3 drives phase separation, while multivalent interactions between an intrinsically disordered region (IDR) and a newly defined IDR-binding domain (IBD) further promote condensate formation. Notably, LLPS enhances MORC2 ATPase activity in a DNA-dependent manner and contributes to transcriptional regulation, establishing a functional link between phase separation, DNA binding, and transcriptional control. Overall, the manuscript is well-organized and logically structured, offering mechanistic insights into MORC2 function, and most conclusions are supported by the presented data. Nevertheless, some of the claims are not sufficiently supported by the current data and would benefit from additional evidence to strengthen the conclusions.

      The following suggestions may help strengthen the manuscript:

      Major comments:

      (1) The central model proposes that multivalent interactions between the IDR and IBD promote MORC2 LLPS. However, the characterization of these interactions is currently limited. It is recommended that the authors perform more systematic analyses to investigate the contribution of these interactions to LLPS, for example, by in vitro assays assessing how the IDR or IBD individually influence MORC2 phase separation.

      (2) The authors mention that DNA binding can promote MORC2 LLPS. It is recommended that they generate a phase diagram to systematically assess how DNA influences phase separation.

      (3) The authors use the N39A mutant as a negative control to study the effect of DNA binding on ATP hydrolysis. Given that N39A is defective in DNA binding, it could also be employed to directly test whether DNA binding influences MORC2 phase separation.

      (4) Many of the cellular and in vitro LLPS experiments employ EGFP fusions. The authors should evaluate whether the EGFP tag influences MORC2 phase separation behavior.

    3. Reviewer #1 (Public review):

      Summary:

      This work demonstrates that MORC2 undergoes phase separation (PS) in cells to form nuclear condensates, and the authors demonstrate convincingly the interactions responsible for this phase separation. Specifically, the authors make good use of crystallography and NMR to identify multiple protein:protein interactions and use EMSA to confirm protein:DNA interactions. These interactions work together to promote in vitro and in cell phase separation and boost ATPase activity by the catalytic domain of MORC2.

      However, the authors have very weak evidence supporting their potentially valuable claim that MORC2 PS is important for the appropriate gene regulatory role of MORC2 in cells. Exploring causal links between PS and function is an important need in the phase separation field, particularly as regards the role of condensates in gene regulation, and is a non-trivial matter. Any study with convincing data on this matter will be very important. For this reason, it is crucial to properly explore the alternative possibility that soluble complexes, existing in the same conditions as phase-separated condensates, are the functional species. It is also critical to keep in mind that, while a specific protein domain may be essential for PS, this does not mean its only important function pertains to PS.

      In this study, the authors do not sufficiently explore the role that soluble MORC2 complexes may play alongside MORC2 condensates. Neither do they include enough data to solidly show that domain deletion leads to phenotypes via a loss of phase separation per se, rather than the loss of phase separation being a microscopically visible result, not cause, of an underlying shift in protein function. For these reasons, the authors' conclusions regarding the functional role of MORC2 condensates are based on incomplete data. This also dampens the utility of this work as a whole, since the very nice work detailing the mechanism of MORC2 PS is not paired with strong data showing the importance of this observation.

      Strengths:

      Static light scattering and crystallography are nicely used to demonstrate the dimerization of MORC2FL and to discover the structure of the CC3 domain dimer, presumably responsible for the dimerization of MORC2FL (Figure 1).

      Extensive use of deletion mutants in multiple cell lines is used to identify regions of MORC2 that are important for forming condensates in the nucleus: the IBD, IDR, and CC3 domains are found to be essential for condensate formation, while the CW domain plays an unknown role in condensate morphology (Figure 3). The authors use NMR to further identify that the IBD domain seems to interact with the first third of the centrally located IDR, termed IDRa, but not with the latter two-thirds of the IDR domain (Figure 4). This leads them to propose that phase separation is the product of IDB:IDRa interaction, CC3 dimerization, and an unknown but important role for the CW domain.

      Based on the observation that removal of the NLS resulted in diffuse cytoplasmic localization, they hypothesized that DNA may play an important role in MORC2 PS. EMSA was used to demonstrate interaction between DNA and several MORC2 domains: CC1, CC2, IDR, and TCD-CC3-IBD. Further in vitro microscopy with purified MORC2 showed that DNA addition significantly reduces MORC2 saturation concentration (Figure 5).

      These assays convincingly demonstrate that MORC2 phase separates in cells, and identify the protein domains and interactions responsible for this phenomenon, with the notable caveat that the role of the CW domain here is left unexplored.

      Weaknesses:

      Although the authors demonstrated phase separation of MORC2FL, their evidence that this plays a functional role in the cell is incomplete.

      Firstly, looking at differentially upregulated genes under MORC2FL overexpression, the authors acknowledge that only 10% are shared with differentially regulated genes identified in other MORC2FL overexpression studies (Figure 6c,d). No explanation is given for why this overlap is so low, making it difficult to trust conclusions from this data set.

      Secondly, of the 21 genes shared in this study and in earlier studies, the authors note that the differential regulation is less pronounced when a phase-separation-deficient MORC2 mutant is overexpressed, rather than MORC2FL (Figure 6e). This is taken as evidence that phase separation is important for the proper function of MORC2. However, no consideration is made for the alternative possibility that the mutant, lacking the CC3 dimerization domain, may result in non-functional complexes involving MORC2, eliminating the need for a PS-centric conclusion. To take the overexpression data as solid evidence for a functional role of MORC2 PS, the authors would need to test the alternative, soluble complex hypothesis. Furthermore, there seems to be low replicate consistency for the MORC2 mutant condition (Figure S6a), with replicate 3 being markedly upregulated when compared to replicates 1 and 2.

      Thirdly, the authors close by examining the in-cell PS capabilities and ATPase activity of several disease-associated mutants of MORC2 ( Figure 7). However, the relevance of these mutants to the past 6 figures is unclear. None of these mutations is in regions identified as important for PS. Two of the mutations result in a higher percentage of the cell population being condensate-positive, but this is not seemingly connected to ATPase activity, as only one of these two mutants has increased ATPase activity. Figure 7 does not add any support to the main hypotheses in the paper, and nowhere in the paper do the authors investigate the protein regions where the mutations in Figure 7 are found.

    1. Reviewer #1 (Public review):

      Summary:

      The authors focused on medaka retinal organoids to investigate the mechanism underlying the eye cup morphogenesis. The authors succeeded to induce lens formation in fish retinal organoids using 3D suspension culture with minimal growth factor-containing media containing the Hepes. At day 1, Rx3:H2B-GFP+ cells appear in the surface region of organoids. At day 1.5, Prox1+cells appear in the interface area between the organoid surface and the core of central cell mass, which develops a spherical-shaped lens later. So, Prox1+ cells covers the surface of the internal lens cell core. At day 2, foxe3:GFP+ cells appear in the Prox1+ area, where early lens fiber marker, LFC, starts to be expressed. In addition, foxe3:GFP+ cells show EdU+ incorporation, indicating that foxe3:GFP+ cells have lens epithelial cell-characters. At day 4, cry:EGFP+ cells differentiate inside the spherical lens core, whose surface area consists of LFC+ and Prox1+ cells. Furthermore, at day 4, the lens core moves towards the surface of retinal organoids to form an eyecup like structure, although this morphogenesis "inside out" mechanism is different from in vivo cellular "outside -in" mechanism of eye cup formation. From these data, the authors conclude that optic cup formation, especially the positioning of the lens, is established in retinal organoids though the different mechanism of in vivo morphogenesis.

      Overall, manuscript presentation is nice. However, there are still obscure points to understand background mechanism. My comments are shown below.

      Major comments:

      (1) At the initial stage of retinal organoid morphogenesis, a spherical lens is centrally positioned inside the retinal organoids, by covering a central lens core by the outer cell sheet of retinal precursor cells. I wonder if the formation of this structure may be understood by differential cell adhesive activity or mechanical tension between lens core cells and retinal cell sheet, just like the previous study done by Heisenberg lab on the spatial patterning of endoderm, mesoderm and ectoderm (Nat. Cell Biol. 10, 429 - 436 (2008)). Lens core cells may be integrated inside retinal cell mass by cell sorting through the direct interaction between retinal cells and lens cells, or between lens cells and the culture media. After day 1, it is also possible to understand that lens core moves towards the surface of retinal organoids, if adhesive/tensile force states of lens core cells may be change by secretion of extracellular matrix. I wonder if the authors measure physical property, adhesive activity and solidness, of retinal precursor cells and lens core cells. If retinal organoids at day 1 are dissociated and cultured again, do they show the same patterning of internal lens core covering by the outer retinal cell sheet?

      (2) Optic cup is evaginated from the lateral wall of neuroepithelium of the diencephalon. In zebrafish, cell movement occurs from the pigment epithelium to the neural retina during eye morphogenesis in an FGF-dependent manner. How the medaka optic cup morphogenesis is coordinated? I also wonder if the authors conduct the tracking of cell migration during optic cup morphogenesis to reveal how cell migration and cell division are regulated in lens of the Medaka retinal organoids. It is also interesting to examine how retinal cell movement is coordinated during Medaka retinal organoids.

      (3) The authors showed that blockade of FGF signaling affects lens fiber differentiation in day 1-2, whereas lens formation seems to be intact in the presence of FGF receptor inhibitor in day 0-1. I suggest the authors to examine which tissue is a target of FGF signaling in retinal organoids, using markers such as pea3, which is a downstream target of ERK branch of FGF signaling. Since FGF signaling promotes cell proliferation, is the lens core size normal in SU5402-treated organoids from day 0 to day 1?

      (4) Fig. 3f and 3g indicate that there is some cell population located between foxe3:GFP+ cells and rx2:H2B-RFP+ cells. What kind of cell-type is occupied in the interface area between foxe3:GFP+ cells and rx2:H2B-RFP+ cells?

      (5) Fig. 5e indicates the depth of Rx3 expression at day 1. Is the depth the thickness of Rx3 expressing cell sheet, which covers the central lens core in the organoids? If so, I wonder if total cell number of Rx3 expressing cell sheet may be different in each seeded-cell number, because thickness is the same across each seeded-cell number, but the surface area size may be different depending on underneath the lens core size. Please clarify this point.

      (6) Noggin application inhibits lens formation at day 0-1. BMP signaling regulates formation of lens placode and olfactory placode at the early stage of development. It is interesting to examine whether Noggin-treated organoid expands olfactory placode area. Please check forebrain territory markers.

      Significance:

      Strength: This study is unique. The authors examined eye cup morphogenesis using fish retinal organoids. Eye cup normally consists of the lens, the neural retina, pigment epithelium and optic stalk. However, retinal organoids seem to be simple and consists of two cell types, lens and retina. Interestingly, a similar optic cup-like structure is achieved in both cases; however, underlying mechanism is different. It is interesting to investigate how eye morphogenesis is regulated in retinal organoids, under the unconstrained embryo-free environment.

      Limitation: Description is OK, but analysis is not much profound. It is necessary to apply a bit more molecular and cellular level analysis, such as tracking of cell movement and visualization of FGF signaling in organoid tissues.

      Advancement: The current study is descriptive. Need some conceptual advance, which impact cell biology field or medical science.

      Audience: The target audience of current study are still within ophthalmology and neuroscience community people, maybe translational/clinical rather than basic biology. To beyond specific fields, need to formulate a general principle for cell and developmental biology.

    2. Reviewer #2 (Public review):

      Summary:

      In this study from Stahl et al., the authors demonstrate that medaka pluripotent embryonic cells can self-organise into eye organoids containing both retina and lens tissues. While these organoids can self-organize into an eye structure that resembles the vertebrate eye, they are built from a fundamentally different morphogenetic process - an "inside-out" mechanism where the lens forms centrally and moves outward, rather than the normal "outside-in" embryonic process. This is a very interesting discovery, both for our understanding of developmental biology and the potential for tissue engineering applications. The study would benefit from some additional experiments and a few clarifications. The authors suggest that the lens cells are the ones that move from the central to a more superficial position. Is this an active movement of lens cells or just the passive consequence of the retina cells acquiring a cup shape? Are the retina cells migrating behind the lens or the lens cells pushing outwards? High-resolution imaging of organoid cup formation, tracking retina cells in combination with membrane labeling of all cells would help elucidate the morphogenetic processes occurring in the organoids. Membrane labeling would also be useful as Prox1 positive lens cells appear elongated in embryos while in the organoids, cell shapes seem less organised, less compact and not elongated (for example as shown in Fig 3f,g).

      The organoids could be a useful tool to address how cell fate is linked to cell shape acquisition. In the forming organoids, retinal tissue initially forms on the outside, while non-retinal tissue is located in the centre; this central tissue later expresses lens markers. Do the authors have any insights into why fate acquisition occurs in this pattern? Is there a difference in proliferation rates between the centrally located cells and the external ones? Could it be that highly proliferative cells give rise to neural retina (NR), while lower proliferating cells become lens?

      What happens in organoids that do not form lenses? Do these organoids still generate foxe3 positive cells that fail to develop into a proper lens structure? And in the absence of lens formation, does the retina still acquire a cup shape?

      The author suggest that lens formation occurs even in the absence of Matrigel. Is the process slower in these conditions? Are the resulting organoids smaller? While there are indeed some LFC expressing cells by day2, these cells are not very well organised and the pattern of expression seems dotty. Moreover, LFC staining seems to localise posterior to the LFC negative, lens-like structure (e.g. Fig.S1 3o'clock).

      How do these organoids develop beyond day 4? Do they maintain their structural integrity at later stages?

      The role of HEPES in promoting organoid formation is intriguing. Do the authors have any insights into why it is important in this context? Have the authors tried other culture conditions and does culture condition influence the morphogenetic pathways occurring within the organoids?

      Significance:

      This is a very interesting paper, and it will be important to determine whether this alternative morphogenetic process is specific to medaka or if similar developmental routes can be recapitulated in organoid cultures from other vertebrate species.

    1. Reviewer #3 (Public review):

      Summary:

      Microexons are highly conserved alternative splice variants, the individual functions of which have thus far remained mostly elusive. Inclusion of microexons in mature mRNAs increases during development, specifically in neural tissues, and is regulated by SRRM proteins. Investigation of individual microexon function is a vital avenue of research, since microexon inclusion is disrupted in diseases like autism. This study provides one of the first rigorous screens (using zebrafish larvae) of the functions of individual microexons in neurodevelopment and behavioural control. The authors precisely excise 21 microexons from the genome of zebrafish using CRISPR-Cas9 and assay the downstream impacts on neurite outgrowth, larvae motility and sociality. A small number of mild phenotypes were observed, which contrasts with the more dramatic phenotypes observed when microexon master regulators SRRM3/4 are disrupted. Importantly, this study attempts to address the reasons why mild/few phenotypes are observed and identifies transcriptomic changes in microexon mutants that suggest potential compensatory gene regulatory mechanisms.

      Strengths:

      (1) The manuscript is well written with excellent presentation of the data in the figures.

      (2) The experimental design is rigorous and explained in sufficient detail.

      (3) The identification of a potential microexon compensatory mechanism by transcriptional alterations represents a valued attempt to begin to explain complex genetic interactions.

      Overall this is a study with robust experimental design that addresses a gap in knowledge of the role of microexons in neurodevelopment.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      In this manuscript by Lopez-Blanch and colleagues, 21 microexons are selected for a deep analysis of their impacts on behavior, development, and gene expression. The authors begin with a systematic analysis of microexon inclusion and conservation in zebrafish and use these data to select 21 microexons for further study. The behavioral, transcriptomic, and morphological data presented are for the most part convincing. Furthermore, the discussion of the potential explanations for the subtle impacts of individual microexon deletions versus lossof-function in srrm3 and/or srrm4 is quite comprehensive and thoughtful. One major weakness: data presentation, methods, and jargon at times affect readability / might lead to overstated conclusions. However, overall this manuscript is well-written, easy to follow, and the results are of broad interest.

      We thank the Reviewer for their positive comments on our manuscript. In the revised version, we will try to improve readability, reduce jargon and avoid overstatements.  

      Strengths:

      (1) The study uses a wide variety of techniques to assess the impacts of microexon deletion, ranging from assays of protein function to regulation of behavior and development.

      (2) The authors provide comprehensive analyses of the molecular impact of their microexon deletions, including examining how host-gene and paralog expression is affected.

      Weaknesses:

      Major Points:

      (1) According to the methods, it seems that srrm3 social behavior is tested by pairing a 3mpf srrm3 mutant with a 30dpf srrm3 het. Is this correct? The methods seem to indicate that this decision was made to account for a slower growth rate of homozygous srrm3 mutant fish. However, the difference in age is potentially a major confound that could impact the way that srrm3 mutants interact with hets and the way that srrm3 mutants interact with one another (lower spread for the ratio of neighbour in front value, higher distance to neighbour value). This reviewer suggests testing het-het behavior at 3 months to provide age-matched comparisons for del-del, testing age-matched rather than size-matched het-del behavior, and also suggests mentioning this in the main text / within the figure itself so that readers are aware of the potential confound.

      Thank you for bringing up this point. For the tests shown in Figure 5, we indeed decided to match the pairs involving srrm3 mutant fish by fish size since we reasoned this would be more comparable to the other lines, both biologically and methodologically (in terms of video tracking, etc.). However, we are confident the results would be very similar if matched by age, since the differences in social interactions between the srrm3 homozygous mutants and their control siblings are very dramatic at any age. As an example, this can be appreciated, in line with the Reviewer's suggestion, in Videos S2 and S3, which show groups of five 5 mpf fish that are either srrm3 mutant or wild type. It can be observed that the behavior of 5 mpf WT fish (Video S3) is very similar to those of 1 mpf WT fish pairs, with very small interindividual distances, while the difference with repect to the srrm3 mutant group (Video S2) is dramatic. We nonetheless agree that this decision on the experimental design should be clearly stated in the main text and figure legend and we have done so in the revised version.

      (2) Referring to srrm3+/+; srrm4-/- controls for double mutant behavior as "WT for simplicity" is somewhat misleading. Why do the authors not refer to these as srrm4 single mutants?

      This comment applies to Figure 4 as well as the associated figure supplements. We reasoned that this made the understanding of plots easier, but the Reviewer is correct that it can be misleading. As a middle ground, we have now changed Figure 4 to follow the nomenclature of Figure 3D (WD, HD, DD), which is further explained in the legend, but kept the original format in the figure supplements for consistency with the (many) other plots in those figures.

      (3) It's not completely clear how "neurally regulated" microexons are defined / how they are different from "neural microexons"? Are these terms interchangeable?

      Yes, they are interchangeable. We have now double checked the wording to avoid confusion and for consistency.

      (4) Overexpression experiments driving srrm3 / srrm4 in HEK293 cells are not described in the methods.

      We apologized for this omission. We now briefly describe the data and asscoiated methods in more detail in the revised version; however, please note that the data was obtained from a previous publication (Torres-Mendez et al, 2019), where the detailed methodology is reported.

      (5) Suggest including more information on how neurite length was calculated. In representative images, it appears difficult to determine which neurites arise from which soma, as they cross extensively. How was this addressed in the quantification?

      We have added further details to the revised version. With regards to the specific question, we would like to mention that this has not been a very common issue for the time points used in the manuscript (10 hap and 24 hap). At those stages, it was nearly always evident how to track each individual neurite. Dubious cases were simply ignored and not measured, as we aimed for 100 neurites per well. Of course, such complex cases become much more common at later time points (48 and 72 hap), which were not used in this study.

      Reviewer #2 (Public review):

      Summary:

      This manuscript explores in zebrafish the impact of genetic manipulation of individual microexons and two regulators of microexon inclusion (Srrm3 and Srrm4). The authors compare molecular, anatomical, and behavioral phenotypes in larvae and juvenile fish. The authors test the hypothesis that phenotypes resulting from Srrm3 and 4 mutations might in part be attributable to individual microexon deletions in target genes.

      The authors uncover substantial alterations in in vitro neurite growth, locomotion, and social behavior in Srrm mutants but not any of the individual microexon deletion mutants. The individual mutations are accompanied by broader transcript level changes which may resemble compensatory changes. Ultimately, the authors conclude that the severe Srrm3/4 phenotypes result from additive and/or synergistic effects due to the de-regulation of multiple microexons.

      Strengths:

      The work is carefully planned, well-described, and beautifully displayed in clear, intuitive figures. The overall scope is extensive with a large number of individual mutant strains examined. The analysis bridges from molecular to anatomical and behavioral read-outs. Analysis appears rigorous and most conclusions are well-supported by the data.

      Overall, addressing the function of microexons in an in vivo system is an important and timely question.

      Weaknesses:

      The main weakness of the work is the interpretation of the social behavior phenotypes in the Srrm mutants. It is difficult to conclude that the mutations indeed impact social behavior rather than sensory processing and/or vision which precipitates apparent social alterations as a secondary consequence. Interpreting the phenotypes as "autism-like" is not supported by the data presented.

      The Reviewer is absolutely right. It was not our intention to imply that these social defects should be interpreted simply as autistic-like. It is indeed very likely that the main reason for the social alterations displayed by the srrm3 mutants is their impaired vision. We have now added this discussion point explicitly in the revised version. 

      Reviewer #3 (Public review):

      Summary:

      Microexons are highly conserved alternative splice variants, the individual functions of which have thus far remained mostly elusive. The inclusion of microexons in mature mRNAs increases during development, specifically in neural tissues, and is regulated by SRRM proteins. Investigation of individual microexon function is a vital avenue of research since microexon inclusion is disrupted in diseases like autism. This study provides one of the first rigorous screens (using zebrafish larvae) of the functions of individual microexons in neurodevelopment and behavioural control. The authors precisely excise 21 microexons from the genome of zebrafish using CRISPR-Cas9 and assay the downstream impacts on neurite outgrowth, larvae motility, and sociality. A small number of mild phenotypes were observed, which contrasts with the more dramatic phenotypes observed when microexon master regulators SRRM3/4 are disrupted. Importantly, this study attempts to address the reasons why mild/few phenotypes are observed and identify transcriptomic changes in microexon mutants that suggest potential compensatory gene regulatory mechanisms.

      Strengths:

      (1) The manuscript is well written with excellent presentation of the data in the figures.

      (2) The experimental design is rigorous and explained in sufficient detail.

      (3) The identification of a potential microexon compensatory mechanism by transcriptional alterations represents a valued attempt to begin to explain complex genetic interactions.

      (4) Overall this is a study with a robust experimental design that addresses a gap in knowledge of the role of microexons in neurodevelopment.

      Thank you very much for your positive comments to our manuscript.

      Reviewer #1 (Recommendations for the authors):

      Minor Suggestions

      (1) Axes are often scaled differently even between panels in the same figure. For example in Figure 5 - supplement 10, the srrm3_17 y axis scales from 0-20, while the neighboring panels scale from ~1-2.5. This somewhat underrepresents the finding that srrm3 mutants have much larger inter-individual distances. Similarly, in the panel above (src_1), the y-axis is scaled to include a single point around 17cm. As a result, it appears at first glance that the src_1 trials resulted in much lower inter-individual distance. Suggest scaling all of these the same to improve readability.

      While the Reviewer is certainly correct, after careful consideration we decided to have autoscaled axis to prioritize within-plot visualization (i.e. among genotypes within an experiment) than across plots (i.e. among experiments and lines).

      (2) Attention to italicizing gene names.

      Thanks.

      (3) In many points in the methods, we are instructed to "see below." Suggest directing the reader to a particular section heading.

      We found only one such instance, and we directed the reader to the specific section, as suggested.

      (4) In Methods, remove "in the corpus callosum." This is not an accurate descriptor for the site at which Mauthner axons cross.

      This is absolutely correct, apologies for this mistake.

      Clarify:

      (1) In the results section, "tissue-specific regulation was validated..." - suggest mentioning that this was performed in adult tissues / describe dissection in the methods.

      Added.

      (2) In the results section, the meaning of "no event ortholog" is not clear. Does this mean that a microexon does not have a human homolog? If so, suggest stating more clearly.

      Correct. We have added addition information.

      (3) In the results, the authors state that 78% of microexons are affected by srrm3/4 loss-offunction. Suggest stating the method used here (e.g. RNA-seq in mutants as compared to siblings)

      Added.

      (4) It is not clear what "siblings for the main founders means" for example in 3D. Is this effectively the analysis of microexon knockouts across multiple independent lines? Are the lines pooled for stats, for example in 3C?

      The main founder correspond to that listed as _1 and as default for experiments when only one found is used. We now explicitely state this.  

      For 3C, the lines are not pooled for stats; the stats correspond only to the main founder for each line. However, for each main founder line, multiple experiments are usually analyzed together and the stats are done taking their data structure into account (i.e. not simply pooling the values).

      (5) The purpose and a general description of NanoBRET assays should be included in the results.

      We added the main purpose of the NanoBRET assays (testing protein-protein interactions).

      (6) Specify that baseline behavior is analyzed in the light.

      Added.

      (7) In Figure 4A, adult fish are schematized being placed into a 96-well plate. Suggest using the larval diagram as in Figure 6 for accuracy.

      Done.

      (8) In Figure 4, plot titles could be made more accessible, especially in 4 F. Suggest removing extraneous information / italicizing gene names, etc. In G, suggest writing out Baseline, Dark, and Light to make it more accessible. Same in 4B.

      We have implemented some of the suggestions. In particular, italics were not used, since we are referring to the founder line, not the gene.

      (9) Figure 6 legend B - after (barplots), suggest inserting the word "and", to make clear that barplots indicate host gene *and* closely related paralogs are indicated by dots.

      Done.

      (10) In methods: "To better capture all microexons..." This sentence is difficult to understand. Suggested edit: "we excluded *from our calculation?* tissues with known or expected partial overlap... from comparison (for example, ...).

      Done.

      (11) In the methods, "which were defined with similar parameters but -min_rep 2." Suggest spelling this out, e.g. "with similar parameters, but requiring sufficient read coverage in at least n=2 samples per valid tissue group, whereas we only required one.".

      Done.

      (12) RNA was extracted for event and knockout validations. What does event mean here?

      Event refers to the validation of the exon regulatory pattern in WT tissues. We added this information.

      Provide definitions for abbreviations:

      (1) (Figure 6) Delta corrected VST Expression.

      Done.

      (2) "Mic-hosting genes" paralogs.

      Done.

      (3) In Figure 1F, "emic" is not defined.

      Done.

      Misspellings:

      All corrected.

      (1) Figure 6B (percentile is spelled percentil).

      (2) Figure 6B legend (bottom or top decile*).

      (3) Figure 6D - Schizophrenia* genes.

      (4) In Zebrafish husbandry and genotyping: suggest "srrm3 mutants grew more slowly.".

      (5) In results, "reduced body size at 90pdf" > 90dpf.

      Reviewer #2 (Recommendations for the authors):

      (1) Characterization of microexon mutants (Figure 2): The semi-quantitative PCR with flanking primers (Figure 2, supplement1) is well-suited to assess successful deletion of the exon and enables detection of potential mis-splicing around the alternative segment. However, it does not quantify the impact on total transcript levels. The authors should complement those experiments with qPCR measures of the transcript levels - otherwise, it is difficult to link mutant phenotypes to isoforms (as opposed to alterations in the level of gene expression). This point is somewhat addressed in Figure 6 by the RNA Seq analysis but it might help to add data specifically in Figure 2.

      As the Reviewer says, this point is explicitely addressed in Figure 6, where were show the change in the host gene's expression that follows the the removal of some microexons. We prefer to keep this in Figure 6, for consistency, as we believe this is not a direct (regulatory) consequence of the removal, but more likely a compensation effect.

      (2) Social behavior alterations in juvenile fish: The authors report "increased leadership" in Srrm3 mutant fish. However, these fish have impaired vision. Thus, "increased leadership" may simply reflect the fact that they do not perceive their conspecifics and, thus, do not follow them. The heterozygous conspecific will then mostly follow the Srrm3 mutant which appears as the mutant exhibiting an increase in leadership. Figure 5D suggests that Srrm3 del and het fish have the same ratio of "neighbor in front" which would be consistent with the hypothesis that the change in this metric is a consequence of a loss of following behavior due to a loss of vision. The authors should either adjust the discussion of this point or assess with additional experiments whether this is indeed a "social phenotype" or rather a secondary consequence of a loss of vision.

      The Reviewer is absolutely correct, and we have thus modified the short discussion directly related to these patterns.

      (3) The discussion centers on potential reasons why only mild phenotypes are observed in the single microexon mutants. One caveat of the phenotypic analysis provided in the manuscript is that it does not very deeply explore the phenotypic space of neuronal morphologies or circuit function. The behavioral and anatomical read-outs are rather coarse. There are no experiments exploring fine-structure of neuronal projections in vivo or synapse number, morphology, or function. Moreover, no attempts are made to explore which cell types normally express the microexons to potentially focus the loss-of-function analysis to these specific cell types. Of course, such analysis would substantially expand the scope of a study that already covers a large number of mutant alleles. However, the authors may want to add a discussion of these limitations in the manuscript.

      The Reviewer is correct. We aimed at covering this when referring to "(i) we may not be assessing the traits that these microexons are impacting, (ii) we may not have the sensitivity to robustly measure the magnitude of the changes caused by microexon removal". We have now added some of the specific points raised by the Reviewer as examples.

      (4) Note typos in Figure 6D: "schizoFrenia", "WNT signIalling"

      Done.

      Reviewer #3 (Recommendations for the authors):

      I only have a few minor suggestions for the authors.

      (1) It is interesting that a not insignificant number of microexon deletions (3/21) result in cryptic inclusions of intron fragments, and perhaps alludes to an as yet unreported molecular function of microexons in the regulation of host gene expression. Is it possible that microexon inclusion in these 3 genes could be important for expression? I think this requires some further discussion, as (if I'm not mistaken) microexons have thus far only been hypothesised to act as modulators of protein function, not as gene regulatory units.

      While we see that microexon removal can impact expression of the host gene (Figure 6), this is likely a compensatory mechanism (or so we suggest). We do not think these three cases are related to a putative physiological regulation, since the cryptic exons appear only in the deletion line. On the contrary, we think these are "regulatory artifacts" that originate in the nonWT mutated context. I.e. we removed the exon but some splicing signals remained in the intron, which are then recoginized by the spliceosome that incorrectly includes a different piece of the intron.

      (2) The flow of the text accompanying the molecular investigation of microexon function for evi5b and vav in Figure 3 could be improved. The text currently fades out with a speculative explanation for the lack of evi5b interaction phenotype. This final sentence could be moved to the discussion and replaced with a more general summary of the data.

      We have now swapped the order in which these results are described and leave out the discussion about evi5b's microexon function.

      (3) Is this a co-submission with Calhoun et al? If so, both papers should reference each other in the discussion and discuss the relative contributions of each.

      Done

      (4) "1 × 104 cells" in methods Nanobret paragraph should be superscript.

      Done

    1. Résumé de la vidéo [00:00:06][^1^][1] - [00:24:15][^2^][2]:

      Cette vidéo présente la 8ème Journée Départementale de la Parentalité à Agde en 2022, avec une conférence d'Isabelle Roskam. Elle aborde le burn-out parental et les défis de la parentalité au 21e siècle, en mettant l'accent sur les pressions sociétales et les attentes envers les parents.

      Points forts: + [00:00:06][^3^][3] Introduction d'Isabelle Roskam * Présentation de son parcours professionnel * Expérience en psychologie du développement et recherche sur le burn-out parental * Auteur d'ouvrages sur la parentalité + [00:01:47][^4^][4] La parentalité et les émotions positives * La perception culturelle de la parentalité associée au bonheur * Les défis et le stress liés à l'éducation des enfants * La difficulté d'exprimer les aspects négatifs de la parentalité + [00:07:05][^5^][5] Parentalité comme un travail exigeant * Comparaison de la parentalité à un emploi sans possibilité de démission * L'évolution des rôles de genre et les défis de la coparentalité * L'impact des valeurs individualistes sur la parentalité + [00:10:38][^6^][6] Changements sociétaux affectant la parentalité * L'influence de la contraception et le concept de l'enfant choisi * L'évolution du statut de l'enfant et les droits de l'enfant * Les responsabilités parentales décrites dans la Convention internationale des droits de l'enfant + [00:16:57][^7^][7] Développement des sciences psychologiques et éducation * Pression sur les parents à travers les médias et les professionnels * L'importance de l'engagement parental et les recommandations sur la bonne parentalité * La nouvelle pression historique sur les parents et leurs responsabilités + [00:19:02][^8^][8] Le glissement vers le burn-out parental * La différence entre la pression sociétale et le burn-out parental * Description du burn-out parental et ses symptômes * L'importance de l'investissement parental et le contraste avec le burn-out Résumé de la vidéo [00:24:17][^1^][1] - [00:44:58][^2^][2]:

      La conférence aborde le burn-out parental, ses symptômes, ses causes et ses conséquences sur les parents et les enfants. Elle souligne l'importance de l'équilibre entre les stresseurs et les ressources disponibles pour les parents, et propose des solutions pour prévenir et traiter le burn-out parental.

      Points forts: + [00:24:17][^3^][3] Symptômes du burn-out parental * Témoignage d'une mère épuisée par les demandes constantes de ses enfants * Différenciation entre burn-out parental et dépression + [00:26:39][^4^][4] Causes du stress parental * Impact du stress sur la santé physique des parents * Comparaison des niveaux de cortisol chez les parents en burn-out et d'autres groupes stressés + [00:30:00][^5^][5] Prévalence du burn-out parental * Statistiques montrant une prévalence élevée dans les pays occidentaux * Discussion sur l'importance de s'occuper du burn-out parental comme un problème de santé publique + [00:38:01][^6^][6] Conséquences et traitement * Effets néfastes sur la santé des parents et le bien-être des enfants * Approches de prévention et de traitement efficaces pour réduire le stress parental Résumé de la vidéo [00:45:00][^1^][1] - [01:06:42][^2^][2]:

      La conférence d'Isabelle Roskam aborde les défis de la parentalité moderne, contrastant avec les pratiques des années 80. Elle souligne la pression sur les parents pour répondre aux besoins académiques, émotionnels, nutritionnels et sociaux des enfants, tout en évitant la surstimulation et en favorisant une alimentation saine. Roskam discute de l'isolement croissant des parents dans une société individualiste et plaide pour un retour à la solidarité communautaire, rappelant le proverbe africain selon lequel il faut tout un village pour élever un enfant.

      Points forts: + [00:45:00][^3^][3] Contraste entre la parentalité en 1982 et 2019 * Pression pour répondre à tous les besoins des enfants * Différences dans les attentes et les pratiques éducatives * Humour pour souligner les changements sociétaux + [00:46:25][^4^][4] Parentalité solitaire dans la société moderne * Individualisme et réticence à demander de l'aide * Importance de partager les responsabilités parentales * Nécessité de soutien communautaire et informel + [00:50:11][^5^][5] Réflexion sur les sociétés collectivistes * Comparaison avec les modèles éducatifs où l'enfant est élevé par la communauté * Discussion sur l'adaptation des sociétés occidentales à ces modèles * Soutien formel et informel et leur impact sur la parentalité + [00:59:31][^6^][6] Équilibre personnel et parentalité * Gestion du stress parental et importance de maintenir une identité diversifiée * Rôle du travail et de la carrière dans la prévention de l'épuisement parental * Influence du nombre d'enfants et de la dynamique familiale sur le bien-être parental

    2. Le Burnout Parental : Synthèse de la Conférence d'Isabelle Roskam

      Résumé

      Ce document de synthèse analyse les points clés de la conférence d'Isabelle Roskam, Professeure de psychologie du développement, sur le phénomène du burnout parental.

      La parentalité au 21e siècle est soumise à des pressions sociétales intenses et inédites, transformant une expérience traditionnellement perçue comme joyeuse en une source potentielle de souffrance profonde.

      Le burnout parental est un syndrome clinique spécifique, distinct de la dépression et du burnout professionnel, caractérisé par un épuisement physique et émotionnel extrême, une distanciation affective avec ses enfants, et une perte de plaisir dans le rôle parental.

      Il est la conséquence d'un déséquilibre prolongé entre les stresseurs (personnels, familiaux, situationnels) et les ressources disponibles pour y faire face.

      Avec une prévalence touchant jusqu'à 8 % des parents dans des pays comme la France et la Belgique, ce trouble constitue un problème de santé publique majeur.

      Ses conséquences sont graves, incluant des problèmes de santé pour le parent, des idées suicidaires, ainsi que des actes de négligence et de violence envers les enfants.

      Les solutions proposées sont à la fois individuelles et collectives.

      Au niveau individuel, il s'agit de restaurer l'équilibre en réduisant les stresseurs et en augmentant les ressources, via l'écoute, la prévention et des thérapies ciblées.

      Au niveau collectif, une prise de conscience est nécessaire pour relâcher la pression vers une parentalité parfaite, mieux soutenir les parents et recréer un "village" solidaire pour briser l'isolement parental.

      1. Le Contexte Moderne : Pourquoi Être Parent Est Devenu si Exigeant

      Isabelle Roskam postule que la parentalité contemporaine est fondamentalement différente de celle des générations précédentes. Plusieurs changements sociétaux majeurs survenus dans la seconde moitié du 20e siècle ont intensifié la pression sur les parents.

      Évolution des rôles de genre : Le modèle traditionnel (mère au foyer, père pourvoyeur de ressources) a laissé place à une attente de double performance pour les femmes et à une redéfinition du rôle des pères.

      Cela a introduit de nouveaux défis, notamment celui de la coparentalité, qui exige un ajustement constant entre les parents sur les valeurs et méthodes éducatives.

      Montée de l'individualisme : Les sociétés occidentales valorisent l'épanouissement personnel, les désirs et les aspirations individuelles.

      Devenir parent crée une injonction contradictoire : il faut faire passer les besoins de l'enfant avant les siens.

      Cela génère un conflit interne permanent entre la culpabilité de prendre du temps pour soi et la frustration de se dédier entièrement à ses enfants.

      L'avènement de la contraception : Le concept de l'enfant choisi a transformé la parentalité en un projet de vie conscient.

      Cet engagement volontaire augmente la valeur attribuée à l'enfant et au rôle parental, mais induit aussi une pression sociale forte : "tu les as voulus, tu dois assumer", rendant difficile l'expression de la souffrance.

      Changement du statut de l'enfant : En un siècle, la société est passée d'une relative indifférence envers l'enfant (considéré parfois comme une force de travail) à une préoccupation intense pour l'optimisation de son développement (cognitif, émotionnel, social). Rien n'est trop beau ou trop cher pour l'enfant, qui est devenu une valeur centrale.

      La Convention internationale des droits de l'enfant (1989) : Ce texte a formalisé ce nouveau statut en définissant l'enfant comme un sujet de droits. Cela a bouleversé les dynamiques familiales :

      • ◦ L'enfant a désormais droit au chapitre, peut négocier et décider pour sa vie.

      • ◦ Les parents ont le devoir de lui offrir tout le nécessaire pour atteindre son plein potentiel.

      • ◦ L'État a le rôle d'aider les parents (crèches, allocations) mais aussi de surveiller les familles, mettant fin au "règne du pater familias" et instaurant un monitoring social pouvant aller jusqu'au retrait de l'enfant.

      Développement des sciences psychologiques : La diffusion massive de connaissances sur l'éducation, via la littérature de vulgarisation et les réseaux sociaux, a créé une pression immense pour devenir un "bon parent" et appliquer les principes de la parentalité positive, générant une anxiété de performance et une peur de l'erreur.

      2. Le Burnout Parental : Définition, Symptômes et Marqueurs Biologiques

      Le burnout parental est un syndrome spécifique qui ne doit pas être confondu avec la dépression ou un simple état de fatigue.

      Les Symptômes Clés

      1. Épuisement intense : Un épuisement physique et émotionnel qui se manifeste exclusivement dans la sphère parentale. Le parent n'a plus aucune énergie pour s'occuper de ses enfants, mais peut en conserver pour d'autres activités (travail, amis).

      2. Distanciation émotionnelle : Le parent fonctionne en "pilotage automatique". Il assure les tâches essentielles (conduire à l'école, nourrir) mais n'a plus les ressources pour se connecter émotionnellement à ses enfants.

      3. Perte de plaisir dans le rôle parental : Les interactions avec les enfants, autrefois sources de joie, deviennent une corvée.

      4. Contraste avec le "parent d'avant" : Le parent en burnout a conscience de ce changement radical. Il était souvent un parent très investi, voire perfectionniste, avant de s'effondrer.

      Témoignage marquant : "Ce mot 'maman' je ne le supporte plus. La première fois que votre bébé vous dit maman, c’est le plus beau jour de votre vie et aujourd’hui, ce n’est plus un mot que je suis heureuse d’entendre. Vraiment, c’est devenu un mot de torture."

      Distinction avec la Dépression et le Burnout Professionnel

      • • Contextualisation : Le burnout parental est spécifique à la sphère familiale. Une personne peut être en burnout parental et trouver refuge dans son travail, et inversement.

      • • Transversalité : La dépression est un trouble transversal qui affecte toutes les sphères de la vie. Une personne déprimée n'aura ni l'envie ni l'énergie pour ses enfants, son travail ou ses loisirs.

      La Preuve Biologique : Le Cortisol Des études scientifiques ont mesuré le taux de cortisol (l'hormone du stress) accumulé dans les cheveux des parents. Les résultats démontrent une souffrance physiologique réelle et mesurable.

      Groupe de personnes

      Niveau de stress (mesuré par le cortisol capillaire)

      Parents en burnout

      Très élevé

      Victimes de violences conjugales

      Élevé

      Patients souffrant de douleurs chroniques sévères

      Élevé

      Parents épanouis

      Modéré (plus élevé qu'un non-parent)

      Étudiant en période d'examens

      Modéré

      Cette hiérarchie montre que le stress chronique subi par les parents en burnout est physiologiquement supérieur à celui de populations connues pour leur détresse extrême.

      L'excès de cortisol est toxique et explique de nombreux problèmes de santé physique (migraines, troubles digestifs, douleurs) rapportés par ces parents.

      3. Le Mécanisme du Burnout : Le Modèle de la Balance

      Le burnout parental est le résultat d'un déséquilibre chronique entre les stresseurs et les ressources.

      Il survient lorsque les stresseurs sont trop nombreux ou trop intenses, pendant trop longtemps, sans ressources suffisantes pour les compenser.

      • Stresseurs Parentaux : Tout ce qui augmente la charge et la difficulté d'être parent.

      • Socio-démographiques : Nombre d'enfants, faibles revenus, logement exigu.
      • Situationnels : Avoir un enfant malade ou avec des difficultés particulières.
      • Personnels : Traits de perfectionnisme, histoire personnelle, mode de gestion éducative (ex: inconsistance qui multiplie les sollicitations).
      • Familiaux : Mauvaise coparentalité, conflits conjugaux, absence de routines familiales.

      Ressources Parentales : Tout ce qui aide à faire face aux stresseurs.

      • ◦ Soutien du conjoint, compétences parentales, temps pour soi, soutien social (famille, amis), satisfaction professionnelle, etc.

      Le burnout n'est pas nécessairement causé par un seul gros stresseur, mais souvent par une accumulation de petits stresseurs quotidiens qui font pencher la balance du mauvais côté.

      4. Prévalence et Conséquences Graves

      Le burnout parental doit être pris au sérieux pour deux raisons majeures : sa prévalence élevée et la gravité de ses conséquences.

      Prévalence : Un Problème de Santé Publique

      Une étude menée dans 42 pays révèle que les pays occidentaux sont les plus touchés.

      France et Belgique : Des taux de prévalence de 6 à 8 %.

      À l'échelle de la France : Cela représente environ 900 000 parents en souffrance.

      Conséquences

      Le burnout parental a des répercussions dévastatrices sur l'ensemble de l'écosystème familial.

      • Pour le parent :

      • ◦ Problèmes de santé physique exacerbés par le cortisol.

      • ◦ Idées suicidaires très fréquentes. Contrairement au burnout professionnel où l'on peut démissionner ou se mettre en arrêt maladie, il n'y a pas de porte de sortie à la parentalité. Le suicide est parfois perçu comme la seule issue.

      • Pour l'enfant :

      • Négligence : Le parent n'a plus l'énergie de s'occuper adéquatement de l'enfant (aide aux devoirs, surveillance).

      • Violence : La violence peut être verbale ("ma vie serait tellement plus simple si tu n'étais pas là") ou physique. Le parent, à bout, peut avoir des pulsions violentes qu'il peine à contrôler.

      • Pour le couple :

      • ◦ Flambée des conflits conjugaux.

      • ◦ Idées de divorce ou de séparation. La garde alternée peut être envisagée non pas à cause de la fin de l'amour, mais comme une stratégie de survie pour pouvoir "souffler une semaine sur deux".

      5. Pistes de Solution : Agir à l'Échelle Individuelle et Collective

      Solutions Individuelles

      • 1. Écouter et valider la souffrance : La première étape est de briser le tabou et de permettre au parent d'exprimer sa souffrance sans jugement.

      • 2. Prévention : Des programmes comme "Parents sur le fil" visent à aider les parents à relâcher la pression qu'ils s'imposent.

      • 3. Restaurer l'équilibre de la balance : Identifier les stresseurs pour les réduire et identifier/activer des ressources pour les augmenter.

      • 4. Traitement spécialisé : Pour les cas avancés, des thérapies de groupe ont prouvé leur efficacité, réduisant le taux de cortisol de 52 % en huit semaines et le ramenant à un niveau proche de celui des parents épanouis.

      • Solutions Collectives

      Le burnout parental étant en partie un phénomène de société, la réponse doit aussi être collective.

      • 1. Prendre garde aux pressions normatives : Il faut questionner les injonctions à la perfection véhiculées par les réseaux sociaux et certains professionnels.

      • 2. Adopter la bienveillance envers les parents : Les professionnels (pédiatres, enseignants) doivent considérer le bien-être du parent autant que celui de l'enfant.

      L'analogie de l'avion est parlante : "mettre son propre masque à oxygène avant d'aider son enfant". Un parent qui s'épuise ne peut plus prendre soin de son enfant.

      3. Repenser la parentalité positive : Ce concept doit être vu comme un "phare" qui donne une direction, et non comme un but inatteignable.

      Une parentalité "suffisamment bonne" est plus saine pour le parent et pour l'enfant, qui a besoin de se construire face à des adultes imparfaits.

      4. Combattre l'isolement parental : La parentalité est devenue une activité solitaire. Il est crucial de recréer du lien et de la solidarité.

      • Distinguer le soutien formel et informel : Les pays occidentaux offrent beaucoup de soutien formel (services de l'État, associations), mais ont perdu le soutien informel (famille élargie, voisinage). Or, ce dernier est essentiel pour le soutien émotionnel.

      • Retrouver l'esprit du village : Il faut réhabiliter l'idée qu'il est normal et nécessaire de partager les tâches et les responsabilités parentales au sein d'une communauté.

      Comme le dit le proverbe africain : "Pour élever un enfant, il faut tout un village".

    3. https://www.youtube.com/watch?v=r_cQsQ49uHk

      Résumé de la vidéo [00:00:06][^1^][1] - [00:24:15][^2^][2]:

      Cette vidéo présente la 8ème Journée Départementale de la Parentalité à Agde en 2022, avec une conférence d'Isabelle Roskam. Elle aborde le burn-out parental et les défis de la parentalité au 21e siècle, en mettant l'accent sur les pressions sociétales et les attentes envers les parents.

      Points forts: + [00:00:06][^3^][3] Introduction d'Isabelle Roskam * Présentation de son parcours professionnel * Expérience en psychologie du développement et recherche sur le burn-out parental * Auteur d'ouvrages sur la parentalité + [00:01:47][^4^][4] La parentalité et les émotions positives * La perception culturelle de la parentalité associée au bonheur * Les défis et le stress liés à l'éducation des enfants * La difficulté d'exprimer les aspects négatifs de la parentalité + [00:07:05][^5^][5] Parentalité comme un travail exigeant * Comparaison de la parentalité à un emploi sans possibilité de démission * L'évolution des rôles de genre et les défis de la coparentalité * L'impact des valeurs individualistes sur la parentalité + [00:10:38][^6^][6] Changements sociétaux affectant la parentalité * L'influence de la contraception et le concept de l'enfant choisi * L'évolution du statut de l'enfant et les droits de l'enfant * Les responsabilités parentales décrites dans la Convention internationale des droits de l'enfant + [00:16:57][^7^][7] Développement des sciences psychologiques et éducation * Pression sur les parents à travers les médias et les professionnels * L'importance de l'engagement parental et les recommandations sur la bonne parentalité * La nouvelle pression historique sur les parents et leurs responsabilités + [00:19:02][^8^][8] Le glissement vers le burn-out parental * La différence entre la pression sociétale et le burn-out parental * Description du burn-out parental et ses symptômes * L'importance de l'investissement parental et le contraste avec le burn-out Résumé de la vidéo [00:24:17][^1^][1] - [00:44:58][^2^][2]:

      La conférence aborde le burn-out parental, ses symptômes, ses causes et ses conséquences sur les parents et les enfants. Elle souligne l'importance de l'équilibre entre les stresseurs et les ressources disponibles pour les parents, et propose des solutions pour prévenir et traiter le burn-out parental.

      Points forts: + [00:24:17][^3^][3] Symptômes du burn-out parental * Témoignage d'une mère épuisée par les demandes constantes de ses enfants * Différenciation entre burn-out parental et dépression + [00:26:39][^4^][4] Causes du stress parental * Impact du stress sur la santé physique des parents * Comparaison des niveaux de cortisol chez les parents en burn-out et d'autres groupes stressés + [00:30:00][^5^][5] Prévalence du burn-out parental * Statistiques montrant une prévalence élevée dans les pays occidentaux * Discussion sur l'importance de s'occuper du burn-out parental comme un problème de santé publique + [00:38:01][^6^][6] Conséquences et traitement * Effets néfastes sur la santé des parents et le bien-être des enfants * Approches de prévention et de traitement efficaces pour réduire le stress parental Résumé de la vidéo [00:45:00][^1^][1] - [01:06:42][^2^][2]:

      La conférence d'Isabelle Roskam aborde les défis de la parentalité moderne, contrastant avec les pratiques des années 80. Elle souligne la pression sur les parents pour répondre aux besoins académiques, émotionnels, nutritionnels et sociaux des enfants, tout en évitant la surstimulation et en favorisant une alimentation saine. Roskam discute de l'isolement croissant des parents dans une société individualiste et plaide pour un retour à la solidarité communautaire, rappelant le proverbe africain selon lequel il faut tout un village pour élever un enfant.

      Points forts: + [00:45:00][^3^][3] Contraste entre la parentalité en 1982 et 2019 * Pression pour répondre à tous les besoins des enfants * Différences dans les attentes et les pratiques éducatives * Humour pour souligner les changements sociétaux + [00:46:25][^4^][4] Parentalité solitaire dans la société moderne * Individualisme et réticence à demander de l'aide * Importance de partager les responsabilités parentales * Nécessité de soutien communautaire et informel + [00:50:11][^5^][5] Réflexion sur les sociétés collectivistes * Comparaison avec les modèles éducatifs où l'enfant est élevé par la communauté * Discussion sur l'adaptation des sociétés occidentales à ces modèles * Soutien formel et informel et leur impact sur la parentalité + [00:59:31][^6^][6] Équilibre personnel et parentalité * Gestion du stress parental et importance de maintenir une identité diversifiée * Rôle du travail et de la carrière dans la prévention de l'épuisement parental * Influence du nombre d'enfants et de la dynamique familiale sur le bien-être parental

    1. t's easy to imagine some continuity from the lifestyle of hunters in regions of relative abundance like river valleys, to that of transhumant shepherds. A bit more difficult to connect this lifestyle with that of nomadic pastoralism.

      It's interesting to think about the life of a hunter and Shepard. I am wondering if they adapted to this lifestyle pretty smoothly.

    2. This supports the belief that written records were developed to facilitate the work of the temples that collected, stored, and dispersed the community's agricultural production.

      It's cool that they used writing for this purpose this long ago.

    1. Act with unforced actions in harmony with the natural cycles of the universe. Trying to force something to happen will likely backfire.

      The “unforced actions” mention here are formalized as wu-wei (literally “non-action” or “non-forcing”). It’s important not to confuse wu-wei with passivity, rather, it refers to action that flows naturally, without greed, ambition, or rigid control. For example, a farmer who plants in harmony with the seasons is practicing wu-wei, while a farmer who relies on harmful chemicals to force faster growth is acting against the Tao. <br /> Cite - Joppich, S. (2022, July 4). The 2-Word concept that makes everything you do feel effortless. Stephan Joppich. https://stephanjoppich.com/wu-wei/#:~:text=a%20Simpler%20Life-,The%20Philosophy%20of%20Wu%20Wei,less%20insecure%2C%20and%20less%20serious.

  3. minnstate.pressbooks.pub minnstate.pressbooks.pub
    1. Three great philosophies: 1. Greek (western) philosophy - Socrates, plato; rationality, logic, universal truth 2. Indian philosophy - Hindu, Buddha, Jain; dharma, rebirth, liberation, consciousness 3. Chinese philosophy - Confucianism, Daoism, Buddhism; harmony, relational identity, ethics, and living to the dao

      "Philia" meaning love or appreciation of (a deeper desire to understand) and "sofia" meaning wisdom, or meaning extracted from knowledge. Philosophy literally means a desire to be wise.

      Understanding a philosophy of a culture helps understand the essence of that culture itself, but not all aspects. In order to truly become cultured, you must embrace the traditions, lifestyles, and become knowledgeable in history and traditional values.

      1. superficial equivalence - forcing a comparison between two different ideas equally deserving of their own respect (ignoring deeper meanings behind connection)
      2. Cherry-picking - isolating sayings or passages to paint an assumption of the whole context 3 - exoticism - treating non-western philosophies as being strange or "other worldly" 4 - assuming universality of western categories - asking questions of one branch of philosophy onto traditions that may not frame the concept the same way 5 - oversimplification - reducing entire traditions into a single set of ideas, ignoring diversity 6 - deficit thinking - assuming a tradition is "lacking" because it doesn't focus on certain concepts that other traditions do. Comparative philosophy should be used to learn from differences, rather than erasing or measuring their value and cultural ties.

      Given my last answer, Chinese philosophy aims to live well in a relational world; how to live a prosperous life. Confucianism emphasizes ethical self-improvement through living virtuously (humanely, righteously, and ritualistically). Roles for individuals are clearly defined (or at least individuals seek to find their role in society) in Chinese philosophy, where western philosophy seeks to ask metaphysical questions (what is ultimate reality? What is truth? Happiness? What does it mean to be human?)

    1. Art. 68
      • ADI 4269
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. EDSON FACHIN
      • Julgamento: 18/10/2017
      • Publicação: 01/02/2019

      AÇÃO DIRETA DE INCONSTITUCIONALIDADE. DIREITO CONSTITUCIONAL E ADMINISTRATIVO. REGULARIZAÇÃO FUNDIÁRIA DAS TERRAS DE DOMÍNIO DA UNIÃO NA AMAZÔNIA LEGAL. IMPUGNAÇÃO AOS ARTIGOS 4º, §2º, 13, 15, INCISO I, §§ 2º, 4º E 5º, DA LEI Nº 11.952/2009. PREJUÍZO PARCIAL DA AÇÃO. ALTERAÇÃO SUBSTANCIAL E REVOGAÇÃO DE DISPOSITIVOS PROMOVIDA POR LEI SUPERVENIENTE. ADEQUADA PROTEÇÃO ÀS TERRAS QUILOMBOLAS E DE OUTRAS COMUNIDADES TRADICIONAIS AMAZÔNICAS. INCONSTITUCIONALIDADE DA INTERPRETAÇÃO QUE CONCEDE ESSAS TERRAS A TERCEIROS. INTERPRETAÇÃO CONFORME À CONSTITUIÇÃO. ARTIGOS 216, INCISO II, DO TEXTO CONSTITUCIONAL E 68 DO ADCT. AUSÊNCIA DE VISTORIA PRÉVIA NA REGULARIZAÇÃO DE IMÓVEIS DE ATÉ QUATRO MÓDULOS FISCAIS. PROTEÇÃO DEFICIENTE AO MEIO AMBIENTE SE DESACOMPANHADA DE MEIOS EFICAZES PARA FISCALIZAÇÃO DOS REQUISITOS DE INGRESSO NO PROGRAMA TERRA LEGAL. INTERPRETAÇÃO CONFORME À CONSTITUIÇÃO. RESPEITO AO ARTIGO 225, CAPUT, DA CONSTITUIÇÃO. - 1. Há prejuízo parcial da ação direta de inconstitucionalidade quando lei superveniente promova alteração substancial ou revogue dispositivo impugnado em demanda de controle concentrado, conforme jurisprudência pacífica desta Corte. No caso, a superveniência da Lei nº 13.465, de 11 de julho de 2017, alterou a redação do artigo 15, inciso I e §2º, bem como revogou expressamente seus §§ 4º e 5º, circunstância que impede o conhecimento da ação, no ponto. - 2. O direito ao meio ambiente equilibrado foi assegurado pela Constituição da República, em seu artigo 225, bem como em diversos compromissos internacionais do Estado Brasileiro. A região amazônica, dada a diversidade biológica, cultural, etnográfica e geológica, mereceu tutela especial do constituinte, tornando-se imperiosa a observância do desenvolvimento sustentável na região, conjugando a proteção à natureza e a sobrevivência humana nas áreas objeto de regularização fundiária.

      • 3. Revela-se de importância ímpar a promoção de regularização fundiária nas terras ocupadas de domínio da União na Amazônia Legal, de modo a assegurar a inclusão social das comunidades que ali vivem, por meio da concessão de títulos de propriedade ou concessão de direito real de uso às áreas habitadas, redução da pobreza, acesso aos programas sociais de incentivo à produção sustentável, bem como melhorando as condições de fiscalização ambiental e responsabilização pelas lesões causadas à Floresta Amazônica.
      • 4. O artigo 4º, §2º da Lei nº 11.952/2009 vai de encontro à proteção adequada das terras dos remanescentes de comunidades quilombolas e das demais comunidades tradicionais amazônicas, ao permitir interpretação que possibilite a regularização dessas áreas em desfavor do modo de apropriação de território por esses grupos, sendo necessária interpretação conforme aos artigos 216, I da Constituição e 68 do ADCT, para assegurar a relação específica entre comunidade, identidade e terra que caracteriza os povos tradicionais.

      • 5. Exige interpretação conforme à Constituição a previsão do artigo 13 da Lei nº 11.952/2009, ao dispensar a vistoria prévia nos imóveis rurais de até quatro módulos fiscais, a fim de que essa medida de desburocratização do procedimento seja somada à utilização de todos os meios eficazes de fiscalização do meio ambiente, como forma de tutela à biodiversidade e inclusão social dos pequenos proprietários que exercem cultura efetiva na área.

      • 6. Ação Direta de Inconstitucionalidade conhecida parcialmente e, na parte conhecida, julgada parcialmente procedente.

      Legislação LEG-FED ADCT ANO-1988 ART-00068 ATO DAS DISPOSIÇÕES CONSTITUCIONAIS TRANSITÓRIAS

      Outras ocorrências Doutrina (1)


      • ADI 3239
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. CEZAR PELUSO
      • Redator(a) do acórdão: Min. ROSA WEBER
      • Julgamento: 08/02/2018
      • Publicação: 01/02/2019

      AÇÃO DIRETA DE INCONSTITUCIONALIDADE. DECRETO Nº 4.887/2003. PROCEDIMENTO PARA IDENTIFICAÇÃO, RECONHECIMENTO, DELIMITAÇÃO, DEMARCAÇÃO E TITULAÇÃO DAS TERRAS OCUPADAS POR REMANESCENTES DAS COMUNIDADES DOS QUILOMBOS. ATO NORMATIVO AUTÔNOMO. ART. 68 DO ADCT. DIREITO FUNDAMENTAL. EFICÁCIA PLENA E IMEDIATA. INVASÃO DA ESFERA RESERVADA A LEI. ART. 84, IV E VI, "A", DA CF. INCONSTITUCIONALIDADE FORMAL. INOCORRÊNCIA. CRITÉRIO DE IDENTIFICAÇÃO. AUTOATRIBUIÇÃO. TERRAS OCUPADAS. DESAPROPRIAÇÃO. ART. 2º, CAPUT E §§ 1º, 2º E 3º, E ART. 13, CAPUT E § 2º, DO DECRETO Nº 4.887/2003. INCONSTITUCIONALIDADE MATERIAL. INOCORRÊNCIA. IMPROCEDÊNCIA DA AÇÃO. - 1. Ato normativo autônomo, a retirar diretamente da Constituição da República o seu fundamento de validade, o Decreto nº 4.887/2003 apresenta densidade normativa suficiente a credenciá-lo ao controle abstrato de constitucionalidade. - 2. Inocorrente a invocada ausência de cotejo analítico na petição inicial entre o ato normativo atacado e os preceitos da Constituição tidos como malferidos, uma vez expressamente indicados e esgrimidas as razões da insurgência. - 3. Não obsta a cognição da ação direta a falta de impugnação de ato jurídico revogado pela norma tida como inconstitucional, supostamente padecente do mesmo vício, que se teria por repristinada. Cabe à Corte, ao delimitar a eficácia da sua decisão, se o caso, excluir dos efeitos da decisão declaratória eventual efeito repristinatório quando constatada incompatibilidade com a ordem constitucional. - 4. O art. 68 do ADCT assegura o direito dos remanescentes das comunidades dos quilombos de ver reconhecida pelo Estado a propriedade sobre as terras que histórica e tradicionalmente ocupam – direito fundamental de grupo étnico-racial minoritário dotado de eficácia plena e aplicação imediata. Nele definidos o titular (remanescentes das comunidades dos quilombos), o objeto (terras por eles ocupadas), o conteúdo (direito de propriedade), a condição (ocupação tradicional), o sujeito passivo (Estado) e a obrigação específica (emissão de títulos), mostra-se apto o art. 68 do ADCT a produzir todos os seus efeitos, independentemente de integração legislativa. - 5. Disponíveis à atuação integradora tão-somente os aspectos do art. 68 do ADCT que dizem com a regulamentação do comportamento do Estado na implementação do comando constitucional, não se identifica, na edição do Decreto 4.887/2003 pelo Poder Executivo, mácula aos postulados da legalidade e da reserva de lei. Improcedência do pedido de declaração de inconstitucionalidade formal por ofensa ao art. 84, IV e VI, da Constituição da República. - 6. O compromisso do Constituinte com a construção de uma sociedade livre, justa e solidária e com a redução das desigualdades sociais (art. 3º, I e III, da CF) conduz, no tocante ao reconhecimento da propriedade das terras ocupadas pelos remanescentes das comunidades dos quilombos, à convergência das dimensões da luta pelo reconhecimento – expressa no fator de determinação da identidade distintiva de grupo étnico-cultural – e da demanda por justiça socioeconômica, de caráter redistributivo – compreendida no fator de medição e demarcação das terras. - 7. Incorporada ao direito interno brasileiro, a Convenção 169 da Organização Internacional do Trabalho – OIT sobre Povos Indígenas e Tribais, consagra a "consciência da própria identidade" como critério para determinar os grupos tradicionais aos quais aplicável, enunciando que Estado algum tem o direito de negar a identidade de um povo que se reconheça como tal. - 8. Constitucionalmente legítima, a adoção da autoatribuição como critério de determinação da identidade quilombola, além de consistir em método autorizado pela antropologia contemporânea, cumpre adequadamente a tarefa de trazer à luz os destinatários do art. 68 do ADCT, em absoluto se prestando a inventar novos destinatários ou ampliar indevidamente o universo daqueles a quem a norma é dirigida. O conceito vertido no art. 68 do ADCT não se aparta do fenômeno objetivo nele referido, a alcançar todas as comunidades historicamente vinculadas ao uso linguístico do vocábulo quilombo. Adequação do emprego do termo “quilombo” realizado pela Administração Pública às balizas linguísticas e hermenêuticas impostas pelo texto-norma do art. 68 do ADCT. Improcedência do pedido de declaração de inconstitucionalidade do art. 2°, § 1°, do Decreto 4.887/2003. - 9. Nos casos Moiwana v. Suriname (2005) e Saramaka v. Suriname (2007), a Corte Interamericana de Direitos Humanos reconheceu o direito de propriedade de comunidades formadas por descendentes de escravos fugitivos sobre as terras tradicionais com as quais mantêm relações territoriais, ressaltando o compromisso dos Estados partes (Pacto de San José da Costa Rica, art. 21) de adotar medidas para garantir o seu pleno exercício. - 10. O comando para que sejam levados em consideração, na medição e demarcação das terras, os critérios de territorialidade indicados pelos remanescentes das comunidades quilombolas, longe de submeter o procedimento demarcatório ao arbítrio dos próprios interessados, positiva o devido processo legal na garantia de que as comunidades tenham voz e sejam ouvidas. Improcedência do pedido de declaração de inconstitucionalidade do art. 2º, §§ 2º e 3º, do Decreto 4.887/2003. - 11. Diverso do que ocorre no tocante às terras tradicionalmente ocupadas pelos índios – art. 231, § 6º – a Constituição não reputa nulos ou extintos os títulos de terceiros eventualmente incidentes sobre as terras ocupadas por remanescentes das comunidades dos quilombos, de modo que a regularização do registro <u>exige o necessário o procedimento expropriatório</u>. A exegese sistemática dos arts. 5º, XXIV, 215 e 216 da Carta Política e art. 68 do ADCT impõe, quando incidente título de propriedade particular legítimo sobre as terras ocupadas por quilombolas, seja o processo de transferência da propriedade mediado por regular procedimento de desapropriação. Improcedência do pedido de declaração de inconstitucionalidade material do art. 13 do Decreto 4.887/2003. Ação direta de inconstitucionalidade julgada improcedente.


      • ADI 7008
      • Órgão julgador: Tribunal Pleno
      • Relator(a): Min. ROBERTO BARROSO
      • Julgamento: 22/05/2023
      • Publicação: 06/06/2023

      Direito constitucional e administrativo. Ação direta de inconstitucionalidade. Lei nº 16.260/2016, do Estado de São Paulo. Concessão de áreas estaduais para exploração de atividades de ecoturismo e extração comercial de madeira e subprodutos florestais. - 1. Ação direta de inconstitucionalidade contra a Lei nº 16.260/2016, do Estado de São Paulo, que autoriza a concessão à iniciativa privada de áreas estaduais para exploração de atividades de ecoturismo e extração comercial de madeira e subprodutos florestais. - 2. O ato normativo veicula autorização legislativa dada ao Poder Executivo estadual para a concessão da exploração de serviços ou do uso, total ou parcial, de áreas em próprios estaduais. Ato normativo de caráter genérico que não afasta a incidência de normas editadas pela União em matéria ambiental ou o dever de consulta prévia às comunidades indígenas e tradicionais eventualmente afetadas. Sendo evidente o sentido da norma, revela-se incabível a interpretação conforme à Constituição para essa finalidade. - 3. O art. 231 da Constituição consagrou o caráter originário do direito dos índios às terras por eles “tradicionalmente ocupadas”, reservando-lhes, com exclusividade, o usufruto das riquezas do solo, dos rios e dos lagos nelas existentes. Além disso, essas terras foram incluídas entre os bens da União (art. 20, XI, da CF/88). Trata-se, portanto, de território pertencente à União e de usufruto exclusivo dos povos indígenas, sendo inconstitucional a sua concessão pelo Estado à iniciativa privada. - 4. Também a proteção às terras ocupadas por comunidades tradicionais e de remanescentes quilombolas é essencial à preservação de sua identidade e seus “modos de criar, fazer e viver” (arts. 215 e 216 da Constituição; art. 68 do ADCT e Convenção nº 169 da OIT). É inconstitucional a concessão dessas áreas, pelo Estado, à iniciativa privada, para exploração florestal madeireira e do ecoturismo, independentemente do <u>status de regularização fundiária</u> e da <u>morosidade</u> do Estado em efetivar seu dever de demarcá-las e protegê-las. - 5. Pedido julgado parcialmente procedente, para conferir interpretação conforme a Constituição à Lei nº 16.260/2016, do Estado de São Paulo, de modo a afastar sua incidência relativamente às terras tradicionalmente ocupadas por comunidades indígenas, remanescentes quilombolas e demais comunidades tradicionais. - 6. Fixação da seguinte tese de julgamento: “1.* É constitucional norma estadual que, sem afastar a aplicação da legislação nacional em matéria ambiental (inclusive relatório de impacto ambiental) e o dever de consulta prévia às comunidades indígenas e tradicionais, quando diretamente atingidas por ocuparem zonas contíguas, autoriza a concessão à iniciativa privada da exploração de serviços ou do uso de bens imóveis do Estado; 2.* A concessão pelo Estado não pode incidir sobre áreas tradicionalmente ocupadas por povos indígenas, remanescentes quilombolas e demais comunidades tradicionais”.

      Tese - 1. É constitucional norma estadual que, sem afastar a aplicação da legislação nacional em matéria ambiental (inclusive relatório de impacto ambiental) e o dever de consulta prévia às comunidades indígenas e tradicionais, quando diretamente atingidas por ocuparem <u>zonas contíguas</u>, autoriza a concessão à iniciativa privada da exploração de serviços ou do uso de bens imóveis do Estado;

      • 2. A concessão pelo Estado não pode incidir sobre áreas tradicionalmente ocupadas por povos indígenas, remanescentes quilombolas e demais comunidades tradicionais.
    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility and clarity

      SUMMARY

      In this study, Fernandes and colleagues addressed the question of the role of micro-RNAs in regulating the coupling between organ growth and developmental timing. Using Drosophila, they identified the conserved micro-RNA miR-184 as a regulator of the developmental transition between juvenile larval stages and metamorphosis. This transition is under the control of the steroid hormone Ecdysone, and has been shown to be modulated in case of abnormal tissue growth to adjust the duration of larval growth in response to developmental perturbations. The relaxin-like hormone Dilp8 has been identified as a key secreted factor involved in this coupling. Here, the authors show that miR-184 is involved in the regulation of Dilp8 expression both in physiological conditions and upon growth perturbation. They propose that this function is carried out in imaginal tissues, where miR-184 levels are modulated by tissue stress. While several factors have already been involved in triggering sharp dilp8 induction at the transcriptional level, this study adds another level of complexity to the regulation of Dilp8 by proposing that its expression is fine-tunned post-transcriptionally through repression by miR-184.

      __MAJOR COMMENTS______

      Overall, the manuscript is well organized, and the logics of the experimental plan well presented. The results are clear, and I appreciate the quality of the pupariation curves. However, I believe that two main conclusions of the paper are not fully supported by the results presented in the figures: the direct regulation of dilp8 3'UTR by miR-184, and the specificity of this regulation in imaginal discs. Here I develop in more details these two aspects.

      Comment 1) The strategy of the 3'UTR sensor is not fully optimized. Indeed, in most experiments, qRT-PCR is used to assess dilp8 expression levels, although it reflects both transcriptional and post-transcriptional. Importantly, to show that post-transcriptional regulation is involved in the response to tissue damage, the levels of the 3'UTR sensor should be analyzed in discs expressing RAcs (showing at the same time that the response is cell-autonomous in the discs). The expected upregulation of the sensor should be prevented by simultaneous expression of miR-184. This approach would shed light on the relative contribution of transcriptional versus post-transcriptional regulation of dilp8 in response to growth perturbation.

      Response: We thank the reviewer for this comment. We agree that qRT-PCRs do not distinguish between transcriptional and post-transcriptional changes of dilp8 levels, in response to changes in miR-184 levels and tissue damage. In addition to the qRT-PCR data we have looked at dilp8-3’UTR-GFP reporter in response to overexpression of miR-184 in the wingdisc using patched-Gal4 driver, which show downregulation of the GFP reporter in the ptc domain (Fig 4C-D’). This suggests that dilp8 mRNA is a direct target of miR-184 by post-transcriptional regulation through its 3’UTR. Further, to confirm the specificity of the effect of miR-184 on dilp8-3’UTR, we generated a dilp8-3’UTR mutant in which the single target site for miR-184 was mutated. We show that the mutated dilp8-3’UTR reporter doesn’t show any regulation in response to miR-184 overexpression in the ptc domain of the wingdisc (Fig. 4E, E’, F, F’). This experiment confirms the specificity of the dilp8-3’UTR regulation by miR-184.

      As suggested by the reviewer we analysed dilp8-3’UTR-GFP reporter expression by overexpressing RicinA using ptcGAL4 driver in the wing imaginal disc (Fig. S6F-G’). We observed a slight but consistent increase in the dilp8-3’UTR-GFP reporter expression, indicating post-transcriptional regulation of dilp8 expression in response to tissue damage. However, the increase of reporter GFP levels observed in this experiment in response to tissue damage is mild (Fig. S6F-G’) than expected based on the qRT-PCR results (Fig S6A and B). We have added this new data to the manuscript (Fig. S6F-G’).

      We propose the following reasons to explain this result:

      a) both transcriptional and post-transcriptional regulation of dilp8 mRNA in response to developmental perturbations

      b) the data on 3’UTR reporter GFP is specifically from the ptc domain expression of RicinA, whereas for dilp8 transcript levels we have expressed RicinA in all larval imaginal tissues, or in the entire wing imaginal disc, which could be one of the reasons for the stronger effect seen on dilp8 mRNA levels

      c) we are not certain if the tubulin-promoter driven dilp8-3’UTR GFP reporter reflects post-transcriptional regulation of dilp8 by miR-184 efficiently in comparison to qRT-PCR. This is especially as the reporter-GFP-3’UTR will be expressed at very high levels due to the tubulin promoter, a majority of this reporter-GFP mRNA may not be relieved from degradation due to the moderate suppression of miR-184 in response to RicinA overexpression.

      Thus, our experiments suggest that dilp8 levels are regulated post-transcriptionally by miR-184 which contributes to pupariation delays in response to tissue damage. In support of this, we could rescue pupariation delays and dilp8 induction caused by RicinA expression using overexpression of miR-184 (Figs 5B, C). Thus, we confirm that the effect of post-transcriptional regulation by miR-184 during developmental perturbations also contributes to dilp8 induction and pupariation delays. Unfortunately, due to experimental limitations we could not perform simultaneous expression of RicinA and miR-184 to evaluate the rescue of dilp8-3’UTR-GFP sensor expression. The levels of dilp8-3’UTR sensor GFP is reduced efficiently by miR-184 overexpression (Fig 4D), which prevented us from attempting the rescue of the moderate increase of dilp8-3’UTR GFP levels in response to RicinA.

      Comment 2) In my opinion, the use of a 3'UTR sensor is not sufficient to conclude that the regulation by miR-184 is direct, as miR-184 could also regulate an intermediate factor that acts on dilp8 post-transcriptional regulation. To solve this issue, a common strategy is to generate a 3'UTR sensor with mutated binding sites that should abolish the regulation by miR-184. This mutated 3'UTR might also respond differently to tissue damage, which would strongly support the conclusions of the study.

      Response: We couldn’t agree more with the reviewer, this comment is addressed in the response to comment 1. We have confirmed the specificity of regulation of dilp8-3’UTR by miR-184 using target site mutated dilp8-3’UTR (new figures added to the manuscript Fig. 4E, E’, F, F’). We tested if the changes in dilp8 mRNA levels in response to tissue damage is post-transcriptional mediated by miR-184. We observe that there is a slight, but consistent increase of dilp8-3’UTR GFP reporter levels in the ptc domain of wingdisc in response to RicinA expression, suggesting a role for miR-184 mediated post-translational regulation of dilp8. However, we have not yet tested the mutated dilp8-3’UTR GFP reporter in response to tissue damage.

      Comment 3) Concerning the tissue-specific regulation of Dilp8 by miR-184, these results need to be strengthened. Indeed, this comes mostly from phenotypes observed with rn-GAL4. Although this is a classical tool for driving expression in imaginal discs, rn-GAL4 also drives strong expression in other tissues that could contribute to triggering a delay, such as the CNS and part of the gut (proventriculus). In our hands, some growth phenotypes in the wing obtained with rn-GAL4 could be fully reverted by blocking GAL4 in the CNS indicating that the phenotype was not wing-specific. Importantly, miR-184 seems to be highly expressed in the CNS according to FlyBase, reinforcing the possibility that it plays a role in this organ. Here I propose approaches to confirm that miR-184 mediated regulation of dilp8 and developmental timing indeed occur in the discs:

      - Another driver with less secondary expression sites could be used (pdmR11F02-GAL4), or rn-GAL4 could be combined with an elav-GAL80 to prevent expression in most neurons. - The authors could identify the source of Dilp8 upregulation in miR-184 mutants using tissue-specific qRT-PCR instead of whole larvae expression like in Fig 4A-B. - This tissue-specific upregulation could be functionally tested using a rescue experiment, in which the delay observed in miR-184 mutants could be rescued by disc-specific downregulation of Dilp8 (using pdm2-GAL4 for instance).

      Response: We are thankful to the reviewer, and agree that it is important to show that the effects that we see using rn-Gal4 are specific to imaginal discs, and not due to an effect in CNS. We tested this by expressing miR-184 sponge in the CNS. Though miR-184 is highly expressed in the larval CNS, downregulation of miR-184 specifically in the pan-neuronal background using elav-GAL4 led to no effects on pupariation timepoint. We have added this as supplementary data Figure S4. Therefore, we believe that the miR-184 downregulation phenotype in the rnGAL4 background can be mainly attributed to its role in the imaginal discs. In addition, as suggested by the reviewer we have also demonstrated that downregulation of miR-184 in the imaginal discs using rnGAL4 driver leads to an increase in dilp8 expression (Fig S5B). Thus confirming that dilp8 mRNA is enhanced in the imaginal discs by blocking miR-184.

      OPTIONAL: Because it is known that dilp8 is strongly regulated at the transcriptional level, the relative input from post-transcriptional upregulation is an important question arising from this study. Although it might be a more long-term approach, I believe that generating a Dilp8 mutant lacking its 3'UTR or, even better, with mutated miR-184 binding sites, would shed light on the role of this regulation for the response to growth perturbation and/or developmental stability (fluctuating asymmetry).

      Response: We thank the reviewer for the suggestion. This would have been an interesting experiment to carry out especially in the context of fluctuating asymmetry.

      MINOR COMMENTS

      1. __ I think that a number of results could be moved to SI as they are either controls, or reproduce published data without bringing novelty. For instance, results in Fig 5A-D are similar to data published by Sanchez et al, as stated in the text. Fig6A as well.__

      __Response: __We thank the reviewer for this suggestion, Fig. 5A-D, and F has been moved to Fig. S6A-E. We have also moved data from Fig. 6 to Fig. 5, as a result Fig 6 A-D has become Fig. 5 B-D.

      __ Fig 6D is quite mysterious, as it suggests that basal JNK activation regulates miR-184, which is different from a context of tissue damage. I think that this result could be removed. Alternatively, if the authors want to dig in that direction, more experiments should be provided, such as bskDN expression in an RAcs context and the effects on miR-184 levels and the 3'UTR sensor (since transcript levels are already published).__

      Response: We would like to clarify that our experiments suggest that endogenous JNK signalling negatively regulates miR-184, as blocking basal JNK signalling using bskDN increased the levels of miR-184 (changed to Fig 5D). Enhanced JNK signalling has been reported to be involved in tissue damage responses, and we propose that RicinA mediated increase in JNK signalling leads to the reduction of miR-184 (changed to Fig 5A, S6D-E). However, we are not strongly implying this as we did not co-express RicinA and bskDN to show that JNK signalling is responsible for the drop in miR-184 levels in response to tissue damage. We thank the reviewer for seeking this explanation, we have rewritten the results section to improve clarity.

      __ The references related to Dilp8 should be checked more in detail in the intro and discussion. About Dilp8 and developmental stability: remove the ref to Colombani et al 2012, instead put Boone et al 2016 and add Blanco-Obregon et al 2022 (in addition to Garelli et al 2012 who initially identified this phenotype. About Lgr3 as the receptor for Dilp8: add Colombani et al, Current Biology 2015, and cite here Vallejo et al 2015, Garelli et al 2015. Among the important transcriptional regulators of Dilp8, Xrp1 could be mentioned (Boulan et al 2019, Destefanis et al 2022) as it plays a complementary function to JNK depending on the type of tissue stress.__

      __Response: __We are really sorry for the glaring errors in citing appropriate references. We thank the reviewer for correcting this for us. We have made necessary changes to the text.

      Significance

      GENERAL ASSESSMENT This study provides convincing data showing that the conserved microRNA miR-184 plays a role in regulating developmental timing in Drosophila through modulating the levels of Dilp8, a key factor in the coupling between tissue growth and developmental transitions. The results are convincing, but the general conclusions of the paper need to be strengthened regarding the direct regulation of dilp8 by miR-184 and the tissue-specificity of this interaction.

      ADVANCE Dilp8 is a key factor that modulates growth and timing in response to developmental perturbations and contributes to developmental precision in physiological conditions. As such, its regulation has been studied by different groups in the last decade, leading to the identification of several inputs for its transcriptional regulation. Here, the authors uncover a post-transcriptional regulation by miR-184, adding another level of regulation of Dilp8 that contribute to ensuring proper regulation of developmental timing, and opening the possibility that miR-184 might play similar roles in other species.

      AUDIENCE This study is of interest for researchers in the field of basic science, with a focus on developmental timing, tissue damage and biological function of microRNAs.

      REVIEWER EXPERTISE Drosophila, growth control, developmental timing, Dilp8.

      Reviewer #2

      Evidence, reproducibility and clarity

      Drosophila has helped to characterize the mechanisms that coordinate tissue growth with developmental timing. The insulin/relaxin-like peptide Dilp8 has been identified as a key factor that communicates the abnormal growth status of larval imaginal discs to neuroendocrine neurons responsible for regulating the timing of metamorphosis. Dilp8, derived from imaginal discs, targets four Lgr3-positive neurons in the central nervous system, activating cyclic-AMP signaling in an Lgr3-dependent manner. This signaling pathway reduces the production of the molting hormone, ecdysone, delaying the onset of metamorphosis. Simultaneously, the growth rates of healthy imaginal tissues slow down, enabling the development of proportionate individuals.

      In this manuscript "miR-184 modulates dilp8 to control developmental timing during normal growth conditions and in response to developmental perturbations" by Dr. Varghese and colleagues, the authors identify a new post transcriptional regulator of Dilp8. The authors show that miR-184 plays a pivotal role in tissue damage responses by inducing dilp8 expression, which in turn delays pupariation to allow sufficient time for damage repair mechanisms to take effect.

      Major points:

      Comment 1) In most of the experiments for percentage of pupariation, the 50% pupariation in control is around 110 hours AED in figures 1, 2 and 3. In figures 5 and 6 using the UAS Ricin, the controls are more around 90 hours AED. Why this discrepancy?

      Response: We thank the reviewer for asking for this clarification. The former experiments for Figs 1-3 were carried out at 25oC while the latter experiments with a cold sensitive version of RicinA (UAS-RAcs), Figs 5 and 6 (now changed to Figs. 5 and S6 as suggested by reviewer #1) were carried out at 29oC (permissive temperature). This difference in temperature has led to alterations in pupariation timing. We apologise for not having mentioned this in the text, now we have made necessary corrections to the methods section clearly indicating this.

      Comment 2) What is the mechanism behind the expression of miR-184 in stress conditions? Is miR-184 also implicated in other conditions giving rise to a developmental delay (X-rays irradiation or animal bearing rasV12, scrib-/- tumors)?

      Response: We thank the reviewer for these questions.

      a) In response to developmental perturbations by RicinA, we believe that activation of JNK signalling controls miR-184 expression. We propose this as our experiments show that imaginal disc damage leads to enhancement of JNK signalling and increase in dilp8 mRNA levels (as reported earlier by Colombani et al 2012; Sánchez et al 2019), and a simultaneous reduction of miR-184 (Figs. S6A, D, E). We also have performed new experiments to show that in response to RicinA expression in the wingdisc there is moderate increase in the dilp8-3’UTR-GFP sensor expression (Figs. S6F-G’), indicating a post-transcriptional regulation of dilp8 expression in response to tissue stress. We also show that RicinA induced dilp8 expression and pupariation delay can be rescued by increasing miR-184 levels (Fig 5B and C), suggesting that the reduction of miR-184 in response to tissue damage contributes to the damage responses. In a separate experiment we show that blocking the endogenous JNK pathway by the expression of bskDN enhances miR-184 levels, suggesting that miR-184 is under the regulation of JNK signalling (Fig 5D). Hence, we speculate that during tissue stress, activation of JNK signalling leads to a reduction of miR-184 levels which contributes to regulating the levels of dilp8 post-transcriptionally and resulting in pupariation delays. The text has been modified to explain this better.

      b) In a previous paper by Shu et al., 2017 (https://doi.org/10.18632/oncotarget.22226) decreased expression of miR-184 was observed in a lglRNAi; RasV12 tumor background. Apart from this various studies have shown that dilp8 levels increase in response to tumour, radiation stress, apoptosis, and tissue damage (Yeom et al 2021, Ray et al 2019, Demay et al 2014, Katsuyama et al 2015, Colombani et al 2012, Garelli et al 2012). Whether the regulation of dilp8 by miR-184, occurs in these backgrounds is yet to be tested. We have now discussed this possibility in the manuscript.

      Comment 3) dilp8 mutant animals have also been shown to be more resistant to starvation or desiccation (https://doi.org/10.3389/fendo.2020.00461). Is miR-184 implicated in this answer?

      Response: We thank the reviewer for this question. In our earlier experiments miR-184 has been demonstrated to be regulated by nutrition in the larval stages and lack of miR-184 led to enhanced larval death in response to diet restriction (Fernandes et al., 2022). miR-184 was also demonstrated to play a role in the insulin producing cells (IPCs) in regulating lifespan (Fernandes & Varghese., 2022). In the current work, we propose miR-184 to act upstream of dilp8 in response to stress stimuli. Hence, it is possible that miR-184 might be involved in responses to starvation and desiccation stress in the adult female flies, by regulating dilp8 levels post-transcriptionally. However, it has not been tested yet if the miR-184 regulation of dilp8 plays a role in resistance to starvation or desiccation in adult females, as this was not within the scope of the current study. We have now added this reference in the discussion section.

      Comment 4) dilp8 expression has been also shown to be regulated by Xrp1 in response to ribosome stress (https://doi.org/10.1016/j.devcel.2019.03.016). This paper should be included in the manuscript. Is it possible that the expression levels of miR184 are regulated by Xrp1?

      Response: We thank the reviewer for the suggestion and have incorporated the reference into the paper. During ribosome stress in the larval imaginal discs the stress-response transcription factor Xrp1 acts through dilp8 in regulating systemic growth. We agree with the reviewer, it is possible that expression of miR-184 is regulated by Xrp1. Currently we have not explored this possibility. We have now added this to the discussion section.

      Minor points:

      1. __ Does the overexpression of miR184 induce an increased fluctuating asymmetry?__

      Response: We thank the reviewer for asking this question. The role of dilp8 in the fluctuation asymmetry is only observed in the dilp8 hypomorphic mutant background. To replicate this we would have to overexpress miR-184 in either the whole larvae or in the wing discs. Unfortunately overexpression of miR-184 in the wing discs (using rnGAL4) leads to pupal lethality while as overexpression of miR-184 in the whole larvae leads to embryonic lethality and therefore we were not be able to conclude from our experiments if miR-184 overexpression induces increased fluctuating asymmetry.

      2. There are 2 references Colombani et al. (2012 for Dilp8 and 2015 for Lgr3). Can you double check that they are used accordingly

      Response: We thank the reviewer for pointing these errors out and we have incorporated these changes into the paper.

      Significance

      Altogether, the paper present compiling lines of evidence supporting the proposed model. The experiments are well designed and are convincing. The papers is interesting and relevant for a broad audience.

      __Reviewer #3 __

      Evidence, reproducibility and clarity (Required):

      This is an interesting study demonstrating an interaction between miR-184 and the Drosophila insulin-like peptide 8 (dilp8) in the tissue damage response. The authors show that Dilp8 activity is negatively regulated by miR-184, apparently through direct interaction between miR-184 and the dilp8-3'UTR, which leads to lower dilp8 mRNA transcript levels, via an undetermined mechanism, supposedly its degradation? Furthermore, the authors show that during aberrant tissue growth, miR-184 levels are very slightly downregulated (see comment below), and based on other experiments, imply causation of this with the increased dilp8 mRNA levels that occur in these tissues, again via an unclear mechanism: upregulation or stabilization of dilp8 mRNA. The authors present evidence that the JNK pathway, which had been known to be critical for dilp8 mRNA upregulation upon tissue damage, does so via miR-184.

      Major Comments:

      __Comment 1: The data showing the direct regulation of dilp8-3'UTR by miR-184 are not very strong and would require more controls to strengthen the claim, as described below. __

      Response: We have performed new experiments to validate that dilp8-3’UTR is regulated by miR-184. Please see the detailed responses to comments 10-12 below.

      __Comment 2: The miR-184 effects are also very small (less than 2-fold reduction with tissue damage; or less than 2-fold induction with JNK-pathway inhibition via bskDN). These two points are the weakest part of the manuscript and model. __

      Response: We agree with the reviewers on this point. The reduction in miR-184 levels in response to RicinA expression is modest (25–30%), and the induction of miR-184 in response to bskDN expression is less than two-fold (Figs. 5A and D). In contrast, dilp8 transcript levels increase several-fold in response to RicinA expression (Fig. 5C, S6A and B). Since we measure dilp8 transcript levels by qPCR, we detect both transcriptional and post-transcriptional contributions to dilp8 regulation. In addition, we have performed a new experiment to check the post-transcriptional regulation of dilp8, in response to tissue damage. Though the change in the dilp8-3′UTR GFP reporter upon RicinA expression in the ptc domain of the wingdisc is mild (Figs. S6F-G’), this strongly suggests a post-transcriptional outcome of the reduction of miR-184 levels on dilp8. Hence, we propose that tissue damage induces strong transcriptional activation of dilp8, while the reduction of miR-184, despite its smaller magnitude, contributes to dilp8 upregulation via post-transcriptional regulation. In support of this, our experiments demonstrate direct regulation of the dilp8-3′UTR by miR-184 (Figs. 4C-F’), and show strong dilp8 mRNA upregulation in miR-184 deficient conditions (Fig. 4A and B), suggesting the role of miR-184 in maintaining dilp8 levels. We also show that RicinA induced effects on dilp8 and pupariation delay are reversed by co-expression of miR-184 (Fig. 5C). We do not claim that regulation by miR-184 is the sole mechanism for driving dilp8 induction during tissue damage, but suggest that miR-184-mediated post-transcriptional regulation acts in a complementary manner to transcriptional responses. Furthermore, we believe that the mild effect of JNK signaling on miR-184 (as shown by the bskDN experiment) is sufficient for the moderate reduction of miR-184 in response to tissue damage.

      Comment 3: ____Regarding the expression levels, it does not help that the authors show bar graphs with standard errors of the mean instead of the actual data points to allow reliable appreciation of the data dispersion.

      Response: We have modified our figures and have performed statistical analysis according to the suggestions of the reviewers, please see responses to comments 1-9, and 13-19.

      Comment 4: It is difficult to understand how minute changes in miR-184 levels can lead to over an order of magnitude differences (in some cases) in dilp8 mRNA levels considering that it is a stoichiometric relationship. Maybe ?miR-184-Dicer1? complexes are highly stable and re-used for multiple dilp8 transcripts - the authors could discuss how they understand this occurring in their manuscript.

      On the same line, discussion is also rather weak on what regards the mechanism of control of dilp8 mRNA levels by miR-184. Please discuss eg, the evidence for mRNA degradation induction by microRNAs with this UTR binding profile (imperfect UTR binding Fig S4) and-if appropriate-how other possible regulatory models (direct and indirect) could explain the findings.

      Response: We accept the reviewers comment that 25-30% reduction of miR-184 is low in comparison to the many fold increase in dilp8 levels. We believe that both post-transcriptional and transcriptional changes are responsible for the induction of dilp8 in response to tissue damage. However, our experiments suggest the role of post-transcriptional regulation by miR-184, as pupariation delay is rescued by miR-184 overexpression (also please see the response to the previous comment). We are not ruling out the possibility of transcriptional regulation of dilp8 mRNA, rather we are suggesting the possibility that both transcriptional and post-transcriptional means are responsible for changes in dilp8. Moreover, we have not performed absolute measurement of miR-184 in the imaginal discs (what we show is a comparison between control and RicinA expression), hence we do not have an exact estimate of how many miR-184 molecules are reduced and if they would be greatly equal or more in comparison to the dilp8 mRNA molecules that are upregulated, as again while measuring dilp8 mRNA we are not checking how many molecules of dilp8 exactly are increased. As the reviewer suggests, it is possible that miR-184-RISC could be stable to handle multiple dilp8 molecules one after the other, hence it is not a 1:1 relationship between miR-184:dilp8. We have included this in the manuscript. It is also known that imperfect 3’UTR binding as seen in most animal microRNAs leads to translational repression and mRNA deadenylation, which eventually results in mRNA degradation.

      Comment 5: ____We suggest the authors carefully revise their citations to cite appropriate work that supports the claims, and also to avoid missing the seminal studies that report the claims they cite.

      Response: We are really apologetic for the errors citing the key references. We are grateful to the reviewers for correcting this for us. We have made changes to the text to include and correct the references.

      We have the suggestions below which we hope will help the authors improve their manuscript. If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      Detailed Comments:

      Comment 1. Results 1st paragraph: please describe the screen in more detail. As written, one only discovers it was a miRNA loss-of-function screen when reading the legend of Table S1. Please show the original data of the screen - with dispersion if possible.

      Response: We thank the reviewers for these suggestions, we have now included the data from the screen with SEM, and p-values.

      Comment 2. Results 1st paragraph, Fourth line, "While several miRNAs caused delays in pupariation by 12 hours or more..". Please correct, as actually loss of miRNAs caused delays.

      Response: We thank the reviewer for pointing out this error, we have corrected the text accordingly.

      Comment 3. ____Results (Figure 1) - It says that data from three independent experiments are shown. However there is no dispersion in the data. Could the authors please explain this? Are the results of the three experiments summed and presented as one? or is this one of the three?

      Response: We thank the reviewers for these suggestions and have plotted data with the SEM values.

      Comment 4. It is reported in the legend of Figure S2 that LogRank test was performed to determine statistical significance. However, no statistical data is presented. Please show the results.

      __Response: __We thank the reviewers for these suggestions to improve the data presentation, we have incorporated the p-value as suggested.

      Comment 5. Fig2A and B. Please show the data points in the bar graphs (as in Figure. 2C), or choose another data representation. ____Please consider redoing statistical analysis with a simple t-test. ____It is not clear to me why ANOVA was used to compare two samples. Please state that data are normalized also to control (tub-GAL4>UAS-scramble). Please ____state____ the h post-hatching from which the RNA samples were collected (as in Fig 2C for 20HE quantification).

      __Response: __We thank the reviewers for these suggestions to improve the data presentation, we have incorporated all changes as suggested. Similar changes have been incorporated to the rest of the figures of the manuscript as well. Hours post-hatching information for each figure is now added to the figure legends. __ __

      Comment 6. Fig2C. Fig legend states the bar graphs are "absolute values". Please specify if the bar represents the average, median or something else.

      Response: We thank the reviewer for pointing this out, we have made the suggested changes.

      Comment 7. Throughout the manuscript: please use GAL4 in capital letters or at least standardize it throughout the ms. Currently there are GAL4s and Gal4s.. eg compare Fig 2 and 3 legends.

      Response: We thank the reviewer for pointing this out, we have incorporated all changes as recommended.

      Comment 8. FigS3A and B. Please revise as Fig2A and B above. and apply the same criteria in the respective figure legend.

      __Response: __We thank the reviewer for pointing this out, we have made the changes as recommended.

      Comment 9. Fig. 4 - please indicate on the figures what is whole larvae and what is wing imaginal discs. This will facilitate understanding of the figure.

      __Response: __We thank the reviewers for these suggestions and have included this information in all the figures.

      Comment 10. Fig 4 - Data - Authors do not show that rn-GAL4>miR-184-sponge causes up regulation of dilp8 mRNA levels, hence the model is weakened. Doing this experiment would significantly strengthen the study whatever the result is.

      Response: We thank the reviewer for pointing this out and we have included this in the manuscript (Fig S5B).

      Comment 11. The dilp8-3'UTR experiment is weak especially because its generation is not sufficiently well described in the manuscript. "The dilp8 3'UTR-GFP reporter line was created as described in (Vargheese & Cohen, 2007)" is not sufficient. Please describe the construct generation in sufficient detail so that the experiments can be reproduced by others.

      Response: We thank the reviewer for pointing this out and we have elaborated in the methods section on how we generated the dilp8 3'UTR-GFP reporter and dilp8 3'UTR mutant GFP reporter lines. The plasmid was originally created in Steve Cohen’s lab at EMBL, by modifying pCasper4 plasmid, by introducing a tubulin promoter, EGFP and a multiple cloning site, which allows one to clone 3’UTRs of target genes into this plasmid. Not1 and Xho1 sites were used to clone the dilp8-3’UTR and mut-3’UTR. We hope this explains our strategy sufficiently.

      Comment 12. Making assumptions, if the construct is as described in Vargheese & Cohen, 2007 and contains all of the dilp8 3'UTR - it should be a Tubulin-driven GFP gene with a dilp8-3'UTR "Tub-GFP-(dilp8 3'UTR)". In this case the authors need to rule out the alternative interpretation of the result in Fig. 4D by showing that the expression of miR-184 does not down regulate Tub-GFP expression itself. The best scenario would be to have a mutated dilp8 3'UTR for the miR-184 recognition site. This experiment would significantly strengthen the study and model.

      Response: We thank the reviewer for pointing this out. We agree with the reviewers that this experiment is needed to prove direct regulation of the dilp8-3’UTR by miR-184. We have mutated the sequences complementary to the seed region of miR-184 in the dilp8-3’UTR, and demonstrated that overexpression of miR-184 does not regulate the mutated tub-GFP-(dilp8 3'UTR) expression. This confirms that the dilp8 gene is a direct target of miR-184. This data is added to the manuscript as Figs 4E-F’.

      Comment 13. Figure 4C-D please separate dilp8 from 3'UTR with a space or hyphen.

      Response: We thank the reviewer for pointing this out and have separated dilp8 from 3’UTR with a hyphen.

      Comment 14. Figure 4E. Please name the dilp8 allele as MI00727 as it is not a KO, but rather a hypomorphic mutation (fully WT dilp8 transcripts are still generated, albeit at a much lower level).

      Response: We thank the reviewer for pointing this out and we have made the necessary changes.

      Comment ____15. Figure 6D: please add UAS to bskDN/+. All figures have rn-GAL4 alone or with UAS-GFP as control. This finding would be strengthened with this other control, especially because the size effect is small.____ This being said a general comment for all experiments is that hemi-controls are generally missing for all figures. eg, in Fig 3. One would typically include controls such as A. Phm>+ and +>miR.184; B. aug21>+ and +>miR.184; C. ptth>+ and +>miR.184; D. rn>+ and +>miR.184

      Response: We thank the reviewer for pointing this out. We have added UAS to bskDN, now Fig 5D and have also added the rnGAL4/+ control. We have also performed various hemi-control experiments as suggested by the reviewer to our best capabilities. We have added a separate graph with the hemicontrols in the as a Reviewer Response Figure 1.

      Comment 16. Figure 7: Are IPCs necessary for the model? If not, I suggest removing them and placing the Lgr3 neuron cell bodies much more anterior in this scheme. Their cell bodies are as anterior and rostral as it gets, approximately where the IPCs are depicted in this type of view of the CNS.

      Response: We thank the reviewer for pointing this out and have removed IPCs from the figure, this figure is now labelled as Fig. 6.

      Comment ____17. Table S1- It would be preferable to see the data of these experiments, but if the authors prefer to show this data in a table, please at least add the dispersion analyses (eg standard deviation.. OR median+-quartiles OR Confidence intervals..), N of animals analysed, and statistics against controls.

      Response: We thank the reviewer for pointing this out, we have added the number of larvae analysed, SEM values and statistics against the control condition.

      Comment ____18. In all figures with pupariation time: please also indicate significant findings in the graphs (with an asterisk, for instance) and adjust figure legends accordingly. This could facilitate understanding the data.

      __Response: __Thanks for the suggestion. We have incorporated this information into figure legends.

      Comment ____19. Please revise Figure legends for punctuation.

      __Response: __We have rectified all the errors in punctuation. We thank the reviewers for suggesting this.

      __Comment ____20. __

      a) Abstract:

      Line 10: What is the evidence to call Dilp8 a "paracrine" factor?

      Response: We thank the reviewer for pointing this out, we have changed the text to ‘secreted factor’.

      b) Introduction:

      4th paragraph, 3rd sentence " Dilp8... buffers developmental noise and delays pupariation..." Buffering of developmental noise was first shown in Garelli et al., Science 2012, so this publication should be cited. ____4th paragraph, 5th sentence: please include Jaszczak et al., Genetics 2016. This paper was published together with the 2015 papers, just a matter of timing that it got a 2016 date. Moreover, I do not think Katsuyama et al., 2015 is well cited to back up the statement in this sentence, hence I recommend removing that citation in this sentence.

      Response: We thank the reviewer for pointing this out and have made necessary changes.

      c) 6th paragraph: 5th line "targeting dilp8" : please specify if you mean the gene or the mRNA, or both. Same for line 7.

      Response: We thank the reviewer for pointing this out and have made necessary changes.

      d) Results Page 10, 1st paragraph, 1st sentence: the works cited are not the appropriate studies that demonstrated what is being stated. This was shown in Garelli et al., Science 2012 and Colombani et al., Science 2012. Results Page 10, 1st paragraph, line 11: Please also cite Colombani et al., Science 2012, who first showed that JNK is required for dilp8 regulation.

      Response: We thank the reviewer for pointing this out and are extremely apologetic for this oversight. We have made necessary changes to the manuscript.

      e) Discussion, 2nd paragraph, line 4: again, please indicate the rationale for using "paracrine" to describe Dilp8's activities. The current widely accepted model is that Dilp8 acts on interneurons in the brain ____(eg, reviewed in Juarez-Carreno et al., Cell Stress, 2018; Gontijo and Garelli, Mech Dev, 2018; Mirth and Shingleton, Front Cell Dev Biol, 2019; Texada et al., Genetics 2020; Boulan and Leopold, 2021).____ In order to reach the brain, Dilp8 has to be secreted from the discs and travel to the brain. This is as an endocrine mechanism as it gets for a small larva, considering that some discs can be on the opposite side of the larva (eg, genital discs). While this does not exclude that Dilp8 could also act paracrinally, the only evidence that I am aware of comes from other contexts such as during transdetermination (where Dilp8 has been proposed to work in an autocrine or paracrine fashion, via Drl in imaginal discs (Nemoto et al., Genes to Cells, 2023), however, this is not cited appropriately in this manuscript and is less related to the Lgr3-dependent pathway being studied here.

      Response: We totally agree with the reviewer and appreciate clarifying this for us. We have made necessary changes to the text.

      f) Discussion Page 13, 1st paragraph, This claim is supported by data presented in Garelli et al., Science 2012, not the other two papers. Garelli et al., 2015 shows that the Lgr3 receptor also participates in buffering developmental noise. Other studies have corroborated the Garelli et al., 2012 finding: eg, Colombani et al., Curr Biol 2015; Boone et al., Nat Commun 2016; Blanco-Obregon et al., Nat Commun 2022). Many other studies have shown that Dilp8 promotes developmental stability under tissue stress and challenges.

      Discussion Page 12, 3rd paragraph, 2nd sentence: "The Lgr3 neurons directly interact with ... PTTH ...and insulin-producing neurons" Please cite Colombani et al., 2015 and Vallejo et al., Science 2015. Vallejo et al., propose that circuit with insulin-producing neurons. In the 3rd sentence, only Jaszczak et al., 2016 is cited, whereas this claim/model comes from many studies, such as Halme et al., Curr Biol, 2010; Hackney et al., PLoS One 2012; Garelli et al. Science 2012; Colombani et al., Science, 2012; and the Lgr3 papers from 2015). Jaszczak et al., actually propose that Lgr3 is also required in the ring gland in addition to neurons.

      Discussion page 14 last paragraph,10 line, "In Aedes aegypti ....regulates ilp8 (Ling et al., 2017)". As far as I understand mosquitoes do not have a dilp8 orthologue (see for instance Gontijo and Gontijo, Mech Dev 2018; and Jan Veenstra's work). ilp nomenclature (numbering) does not follow that of Drosophila, so ilp8 is probably a typical Insulin/IGF-like peptide and is NOT an orthologue of Dilp8, a relaxin, so this citation needs to be removed or placed into the broader context of microRNA regulation of ilps.

      Response: We are really sorry for the numerous glaring errors in the references. We thank the reviewers for correcting this for us. We have made necessary changes to the text.

      Thank you for the opportunity to review your interesting work,

      Alisson Gontijo and Rebeca Zanini

      Reviewer #3 (Significance (Required)):

      If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      __Author’s concluding response: __

      We thank all the reviewers for the overall positive comments and suggestions that we believe have helped us to improve our manuscript. We have incorporated all the changes suggested, especially regarding errors in citing key references. We have performed most of the experimental suggestions. Also, we have modified the way in which graphs are presented, including statistical tests as suggested by the reviewers. Several controls have been performed to strengthen the manuscript further. We believe that this review process aided in significantly improving this manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This is an interesting study demonstrating an interaction between miR-184 and the Drosophila insulin-like peptide 8 (dilp8) in the tissue damage response. The authors show that Dilp8 activity is negatively regulated by miR-184, apparently through direct interaction between miR-184 and the dilp8-3'UTR, which leads to lower dilp8 mRNA transcript levels, via an undetermined mechanism, supposedly its degradation? Furthermore, the authors show that during aberrant tissue growth, miR-184 levels are very slightly downregulated (see comment below), and based on other experiments, imply causation of this with the increased dilp8 mRNA levels that occur in these tissues, again via an unclear mechanism: upregulation or stabilization of dilp8 mRNA. The authors present evidence that the JNK pathway, which had been known to be critical for dilp8 mRNA upregulation upon tissue damage, does so via miR-184. The data showing the direct regulation of dilp8-3'UTR by miR-184 are not very strong and would require more controls to strengthen the claim, as described below. The miR-184 effects are also very small (less than 2-fold reduction with tissue damage; or less than 2-fold induction with JNK-pathway inhibition via bsk-DN). These two points are the weakest part of the manuscript and model. Regarding the expression levels, it does not help that the authors show bar graphs with standard errors of the mean instead of the actual datapoints to allow reliable appreciation of the data dispersion. It is difficult to understand how minute changes in miR-184 levels can lead to over an order of magnitude differences (in some cases) in dilp8 mRNA levels considering that it is a stoichiometric relationship. Maybe ?miR-184-Dicer1? complexes are highly stable and re-used for multiple dilp8 transcripts - the authors could discuss how they understand this occurring in their manuscript. On the same line, discussion is also rather weak on what regards the mechanism of control of dilp8 mRNA levels by miR-184. Please discuss eg, the evidence for mRNA degradation induction by microRNAs with this UTR binding profile (imperfect UTR binding Fig S4) and-if appropriate-how other possible regulatory models (direct and indirect) could explain the findings. We suggest the authors carefully revise their citations to cite appropriate work that supports the claims, and also to avoid missing the seminal studies that report the claims they cite. We have the suggestions below which we hope will help the authors improve their manuscript. If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

      Comments:

      Results 1st paragraph: please describe the screen in more detail. As written, one only discovers it was a miRNA loss-of-function screen when reading the legend of Table S1. Please show the original data of the screen - with dispersion if possible.

      Results 1st paragraph, Fourth line, "While several miRNAs caused delays in pupariation by 12 hours or more..". Please correct, as actually loss of miRNAs caused delays.

      Results (Figure 1) - It says that data from three independent experiments are shown. However there is no dispersion in the data. Could the authors please explain this? Are the results of the three experiments summed and presented as one? or is this one of the three?

      It is reported in the legend of Figure S2 that LogRank test was performed to determine statistical significance. However, no statistical data is presented. Please show the results.

      Fig2A and B. Please show the data points in the bar graphs (as in Figure. 2C), or choose another data representation. Please consider redoing statistical analysis with a simple t-test. It is not clear to me why ANOVA was used to compare two samples. Please state that data are normalized also to control (tub-GAL4>UAS-scramble). Please state the h post-hatching from which the RNA samples were collected (as in Fig 2C for 20HE quantification).

      Fig2C. Fig legend states the bar graphs are "absolute values". Please specify if the bar represents the average, median or something else.

      Throughout the manuscript: please use GAL4 in capital letters or at least standardize it throughout the ms. Currently there are GAL4s and Gal4s.. eg compare Fig 2 and 3 legends.

      FigS3A and B. Please revise as Fig2A and B above. and apply the same criteria in the respective figure legend.

      Fig. 4 - please indicate on the figures what is whole larvae and what is wing imaginal discs. This will facilitate understanding of the figure.

      Fig 4 - Data - Authors do not show that rn-GAL4>miR-184-sponge causes up regulation of dilp8 mRNA levels, hence the model is weakened. Doing this experiment would significantly strengthen the study whatever the result is.

      The dilp8-3'UTR experiment is weak especially because its generation is not sufficiently well described in the manuscript. "The dilp8 3'UTR-GFP reporter line was created as described in (Vargheese & Cohen, 2007)" is not sufficient. Please describe the construct generation in sufficient detail so that the experiments can be reproduced by others.

      Making assumptions, if the construct is as described in Vargheese & Cohen, 2007 and contains all of the dilp8 3'UTR - it should be a Tubulin-driven GFP gene with a dilp8-3'UTR "Tub-GFP-(dilp8 3'UTR)". In this case the authors need to rule out the alternative interpretation of the result in Fig. 4D by showing that the expression of miR-184 does not down regulate Tub-GFP expression itself. The best scenario would be to have a mutated dilp8 3'UTR for the miR-184 recognition site. This experiment would significantly strengthen the study and model.

      Figure 4C-D please separate dilp8 from 3'UTR with a space or hyphen.

      Figure 4E. Please name the dilp8 allele as MI00727 as it is not a KO, but rather a hypomorphic mutation (fully WT dilp8 transcripts are still generated, albeit at a much lower level).

      Figure 6D: please add UAS to bskDN/+. All figures have rn-GAL4 alone or with UAS-GFP as control. This finding would be strengthened with this other control, especially because the size effect is small. This being said a general comment for all experiments is that hemi-controls are generally missing for all figures. eg, in Fig 3. One would typically include controls such as A. Phm>+ and +>miR.184; B. aug21>+ and +>miR.184; C. ptth>+ and +>miR.184; D. rn>+ and +>miR.184

      Figure 7: Are IPCs necessary for the model? If not, I suggest removing them and placing the Lgr3 neuron cell bodies much more anterior in this scheme. Their cell bodies are as anterior and rostral as it gets, approximately where the IPCs are depicted in this type of view of the CNS.

      Table S1- It would be preferable to see the data of these experiments, but if the authors prefer to show this data in a table, please at least add the dispersion analyses (eg standard deviation.. OR median+-quartiles OR Confidence intervals..), N of animals analysed, and statistics against controls.

      In all figures with pupariation time: please also indicate significant findings in the graphs (with an asterisk, for instance) and adjust figure legends accordingly. This could facilitate understanding the data.

      Please revise Figure legends for punctuation.

      Abstract: Line 10: What is the evidence to call Dilp8 a "paracrine" factor?

      Introduction:

      4th paragraph, 3rd sentence " Dilp8... buffers developmental noise and delays pupariation..." Buffering of developmental noise was first shown in Garelli et al., Science 2012, so this publication should be cited.

      4th paragraph, 5th sentence: please include Jaszczak et al., Genetics 2016. This paper was published together with the 2015 papers, just a mater of timing that it got a 2016 date. Moreover, I do not think Katsuyama et al., 2015 is well cited to back up the statement in this sentence, hence I recommend removing that citation in this sentence.

      6th paragraph: 5th line "targeting dilp8" : please specify if you mean the gene or the mRNA, or both. Same for line 7.

      Results Page 10, 1st paragraph, 1st sentence: the works cited are not the appropriate studies that demonstrated what is being stated. This was shown in Garelli et al., Science 2012 and Colombani et al., Science 2012.

      Results Page 10, 1st pagragraph, line 11: Please also cite Colombani et al., Science 2012, who first showed that JNK is required for dilp8 regulation.

      Discussion, 2nd paragraph, line 4: again, please indicate the rationale for using "paracrine" to describe Dilp8's activities. The current widely accepted model is that Dilp8 acts on interneurons in the brain (eg, reviewed in Juarez-Carreno et al., Cell Stress, 2018; Gontijo and Garelli, Mech Dev, 2018; Mirth and Shingleton, Front Cell Dev Biol, 2019; Texada et al., Genetics 2020; Boulan and Leopold, 2021). In order to reach the brain, Dilp8 has to be secreted from the discs and travel to the brain. This is as an endocrine mechanism as it gets for a small larva, considering that some discs can be in the opposite side of the larva (eg, genital discs). While this does not exclude that Dilp8 could also act paracrinally, the only evidence that I am aware of comes from other contexts such as during transdetermination (where Dilp8 has been proposed to work in an autocrine or paracrine fashion, via Drl in imaginal discs (Nemoto et al., Genes to Cells, 2023), however, this is not cited appropriately in this manuscript and is less related to the Lgr3-dependent pathway being studied here.

      Discussion Page 13, 1st paragraph, This claim is supported by data presented in Garelli et al., Science 2012, not the other two papers. Garelli et al., 2015 shows that the Lgr3 receptor also participates in buffering developmental noise. Other studies have corroborated the Garelli et al., 2012 finding: eg, Colombani et al., Curr Biol 2015; Boone et al., Nat Commun 2016; Blanco-Obregon et al., Nat Commun 2022). Many other studies have shown that Dilp8 promotes developmental stability under tissue stress and challenges.

      Discussion Page 12, 3rd paragraph, 2nd sentence: "The Lgr3 neurons directly interact with ... PTTH ...and insulin-producing neurons" Please cite Colombani et al., 2015 and Vallejo et al., Science 2015. Vallejo et al., propose that circuit with insulin-producing neurons. In the 3rd sentence, only Jaszczak et al., 2016 is cited, whereas this claim/model comes from many studies, such as Halme et al., Curr Biol, 2010; Hackney et al., PLoS One 2012; Garelli et al. Science 2012; Colombani et al., Science, 2012; and the Lgr3 papers from 2015). Jaszczak et al., actually propose that Lgr3 is also required in the ring gland in addition to neurons.

      Discussion page 14 last paragraph,10 line, "In Aedes aegypti ....regulates ilp8 (Ling et al., 2017)". As far as I understand mosquitoes do not have a dilp8 orthologue (see for instance Gontijo and Gontijo, Mech Dev 2018; and Jan Veenstra's work). ilp nomenclature (numbering) does not follow that of Drosophila, so ilp8 is probably a typical Insulin/IGF-like peptide and is NOT an orthologue of Dilp8, a relaxin, so this citation needs to be removed or placed into the broader context of microRNA regulation of ilps.

      Thank you for the opportunity to review your interesting work, Alisson Gontijo and Rebeca Zanini

      Significance

      If the authors address these points raised above, we believe the manuscript should be a valuable contribution to the field, and help in the understanding of how tissues respond to growth aberrations and the regulation of transcript levels by microRNAs.

    1. Synthèse du webinaire : "Créer une communauté d'action : comment se lancer ?"

      Résumé

      Ce document de synthèse analyse les principaux enseignements du webinaire organisé par le Réseau National des Maisons des Associations (RNMA) et La Fonda, centré sur la création et l'animation de "communautés d'action".

      Fondée sur la méthodologie de la stratégie d'impact collectif, cette approche vise à structurer la coopération entre divers acteurs territoriaux pour répondre de manière concertée et transversale à des enjeux complexes.

      Les points clés sont les suivants :

      1. Une Méthodologie Structurante :

      La démarche s'appuie sur des phases claires (impulsion, structuration, pérennisation) et cinq conditions de réussite, dont la plus cruciale est l'existence d'une "fonction de soutien".

      Cette fonction, véritable colonne vertébrale de la coopération, est chargée d'animer, de coordonner et de faciliter le travail collectif.

      2. Deux Expérimentations Riches d'Enseignements :

      Pendant deux ans, des expérimentations ont été menées à Morlaix (portée par le Résam sur la transition écologique et alimentaire) et à Mulhouse (portée par le Carré des associations sur la jeunesse).

      Ces deux cas pratiques démontrent la capacité de la méthode à fédérer des acteurs hétérogènes (associations, citoyens, collectivités, institutions) et à produire des résultats concrets, allant de l'amélioration des parcours pour les jeunes à la création de projets sur la sécurité sociale de l'alimentation.

      3. Des Résultats Tangibles au-delà des Projets :

      L'impact majeur réside dans la transformation des modes de collaboration.

      Les expérimentations ont permis de renforcer l'interconnaissance, d'instaurer des relations plus horizontales, et d'ancrer la méthode au sein de politiques publiques locales (Contrat de Ville, Contrat Territorial Global de la CAF à Mulhouse).

      4. Le Défi Central du Financement : La pérennisation de ces dynamiques se heurte à un obstacle majeur : le financement de la fonction de soutien.

      Les financeurs publics privilégient traditionnellement les "actions concrètes" au détriment du temps d'animation et de coordination, pourtant essentiel à la réussite et à la durabilité des coopérations.

      5. Des Perspectives d'Essaimage :

      Le webinaire a confirmé un fort intérêt pour la méthode. Les participants ont exprimé des besoins clairs en matière d'outils, de formation et d'échanges entre pairs.

      Le RNMA et La Fonda envisagent la création d'une "communauté d'apprentissage" pour accompagner les territoires désireux de se lancer.

      En conclusion, la création de communautés d'action représente une voie prometteuse pour renforcer l'impact des initiatives locales.

      Sa réussite dépend cependant d'un changement de paradigme, notamment de la part des financeurs, pour reconnaître et soutenir le travail indispensable de mise en lien et d'animation territoriale.

      I. Le Cadre Méthodologique : La Stratégie d'Impact Collectif

      La démarche de création de communautés d'action s'ancre dans l'approche de la stratégie d'impact collectif, présentée comme un cadre permettant de structurer et de pérenniser la coopération sur un territoire.

      A. Définition et Promesse

      L'impact collectif est défini comme la capacité à mettre en cohérence des actions et à favoriser des dynamiques concertées pour apporter des réponses adaptées et transversales à des enjeux complexes et partagés.

      La promesse de cette approche est de générer des réponses de meilleure qualité, car elles sont construites en commun autour d'objectifs partagés.

      Le postulat de départ est que les enjeux territoriaux (du quartier à l'international) sont multifactoriels et appellent à la coopération.

      L'objectif n'est pas de multiplier les projets, mais de relier ce qui existe déjà pour aligner les actions vers un changement souhaité, défini collectivement.

      B. Les Étapes et Conditions de Réussite

      La mise en œuvre d'une stratégie d'impact collectif suit trois grandes phases :

      • 1. Impulsion : Définition et partage de l'enjeu.

      • 2. Structuration : Mise en commun et coordination des moyens et des ressources.

      • 3. Pérennisation : Ancrage de la dynamique dans la durée.

      Cinq conditions sont identifiées comme nécessaires à la réussite de la démarche :

      • • Une vision partagée des enjeux.

      • • Des actions qui se complètent mutuellement autour d'un plan d'action partagé.

      • • Une approche de l'évaluation pensée dès le début et conduite en commun.

      • • Une communication continue entre les acteurs.

      • • L'existence d'une "fonction de soutien" dédiée.

      C. La "Fonction de Soutien" : Clé de Voûte de la Coopération

      La "fonction de soutien" est l'acteur ou la structure en charge d'animer et de faire vivre la coopération.

      Elle est décrite comme la colonne vertébrale de la démarche.

      Missions Clés :

      • Coordonner les engagements des partenaires.
      • Faciliter le dialogue et l'interconnaissance.
      • Assurer la fluidité et la circulation des informations.
      • • Recueillir et analyser les données du terrain (notamment pour l'évaluation).
      • • Proposer des orientations stratégiques à construire collectivement.
      • Rechercher des financements pour la communauté d'action.

      Postures et Rôles :

      • • Écoute : Pour comprendre les besoins et favoriser l'inclusion.

      • • Coordinateur : Pour structurer les échanges et veiller au bon déroulement des actions.

      • • Animateur : Adopter une "posture haute" pour affirmer une position dans certaines décisions, ou une "posture basse" pour favoriser la prise d'initiative des membres.

      Cette fonction assure également un important travail "caché" entre les réunions : préparation des ordres du jour, rédaction des comptes-rendus et des documents stratégiques.

      II. Retours d'Expérience : Les Communautés d'Action de Morlaix et Mulhouse

      Deux territoires ont expérimenté cette méthode pendant deux ans, avec l'accompagnement du RNMA et de La Fonda.

      A. L'Expérimentation de Morlaix (Résam) : La Transition Alimentaire

      • Structure porteuse : Le Résam (Réseau d'échange et de service aux associations du Pays de Morlaix), une maison des associations associative à gouvernance partagée.

      • Genèse du Projet : Le projet est né d'une demande des associations locales autour des transitions écologiques et d'un besoin de l'équipe du Résam de renouveler ses pratiques d'animation territoriale. Le contexte local, marqué par une pénurie d'eau inédite, a renforcé la pertinence du sujet.

      • Déroulement et Acteurs : Sur une thématique large des "transitions écologiques", la communauté d'action a progressivement affiné son objet pour se concentrer sur l'autonomisation alimentaire du territoire respectueuse du vivant.

      La dynamique a rassemblé un groupe hétérogène de citoyens, associations, entrepreneurs, techniciens de collectivités et élus, animé via des journées complètes de travail tous les deux mois. Un comité de pilotage a été formé avec le Résam, le Pôle ESS local et l'Ulamir CPIE.

      • Difficultés Rencontrées :

      • ◦ La gestion d'un groupe très hétérogène, avec des attentes et des capacités d'agir différentes.

      ◦ Le décalage de temporalité avec les politiques publiques locales, notamment le Plan Alimentaire Territorial en cours d'élaboration.

      • Réussites et Résultats :

      • ◦ Une forte coordination et des habitudes de travail communes installées entre les trois structures de la fonction de soutien.

      • ◦ Une démarche perçue comme novatrice sur le territoire, réussissant à structurer un dialogue entre des acteurs variés.

      • ◦ Une forte dimension humaine, avec un plaisir partagé des participants à se retrouver.

      • ◦ La validation d'une charte commune après un travail collectif approfondi.

      • ◦ Le lancement d'une expérimentation sur la sécurité sociale de l'alimentation.

      • ◦ Une reconnaissance croissante de la "communauté d'action" sur le territoire.

      B. L'Expérimentation de Mulhouse (Carré des associations) : L'Accompagnement de la Jeunesse

      Structure porteuse : Le Carré des associations, une maison des associations municipale, intégrée à la Direction cohésion sociale et vie des quartiers de la Ville de Mulhouse.

      Genèse du Projet : Le constat d'un territoire très dynamique mais peinant à pérenniser ses expérimentations et à faire travailler ses acteurs ensemble.

      La ville de Mulhouse, très jeune et marquée par de fortes disparités sociales, a vu dans la méthode un moyen de créer du lien et d'améliorer l'accompagnement des initiatives.

      • Déroulement et Acteurs : Partie d'un enjeu général d'accompagnement de projet, la communauté d'action s'est rapidement focalisée sur la jeunesse.

      Elle a rassemblé des partenaires institutionnels variés : services de la Ville, État (politique de la ville), CAF, Alsace Active, Unicité, Mission Locale, etc.

      Un principe clé a été l'adoption d'une posture horizontale, où la Ville n'est pas un décideur mais un partenaire égal aux autres.

      • Difficultés Rencontrées :

      • ◦ Le départ de certains partenaires (État, France Active) par manque de temps ou d'alignement avec la thématique.

      • ◦ L'émergence récente de la question de la rémunération du temps de travail investi par les partenaires associatifs, posant un défi pour la pérennisation.

      • Réussites et Résultats :

      • ◦ L'amélioration de la connaissance mutuelle et la transformation des relations hiérarchiques en coopération horizontale.

      • ◦ La méthode a nourri les pratiques de chaque structure partenaire (projets sociaux des centres sociaux, par exemple).

      • ◦ Des passerelles concrètes ont été créées, fluidifiant les parcours des jeunes entre les différentes structures (ex: lien entre le budget participatif de la ville et les jeunes suivis par Unicité).

      • ◦ La méthode a été inscrite dans des cadres structurants comme le nouveau Contrat de Ville et le Contrat Territorial Global (CTG) de la CAF.

      • ◦ Un poste pour la fonction de soutien est envisagé dans le cadre du CTG, offrant une voie de pérennisation.

      III. Outils et Méthodes Clés

      Deux outils structurants utilisés lors des expérimentations ont été présentés.

      A. Le Scénario Idéal : Construire une Vision Partagée

      Cet outil de prospective a été utilisé en phase d'impulsion pour définir une vision et une problématique communes.

      • Objectif : Aider le groupe à se projeter dans un futur souhaitable pour identifier les enjeux prioritaires et se mettre d'accord sur un vocabulaire commun.

      • Processus :

      • 1. Identifier des tendances prospectives sur le territoire (ex: érosion de la biodiversité).
      • 2. Prioriser ces tendances en fonction de leur impact et de la capacité du groupe à agir.
      • 3. Formuler des scénarios idéaux à un horizon donné (ex: "En 2035, grâce à nos actions, le territoire est exemplaire...").
      • 4. Décrire ce qui se passe concrètement dans ce futur souhaitable.
      • 5. Formuler des problématiques sous forme de questions ("Comment faire pour...?").
      • 6. Prioriser une "question essentielle" qui devient la problématique centrale de la communauté d'action.

      B. La Cartographie des Acteurs : Rendre Visible et Relier l'Existant

      Cet outil, utilisé en phase de structuration, vise à capitaliser sur l'existant plutôt qu'à créer de nouveaux dispositifs.

      • Objectif : Rendre lisible comment les acteurs présents répondent déjà à l'enjeu partagé, identifier les complémentarités et les "trous dans la raquette".

      • Processus :

        1. Recenser les actions existantes menées par les membres de la communauté.
        1. Catégoriser ces actions en fonction des objectifs intermédiaires et globaux de la communauté (ex: à Mulhouse, les actions ont été classées selon qu'elles relevaient de l'insertion, de l'appui à l'engagement, de l'appui aux initiatives, etc.).
        1. Visualiser sur un schéma qui fait quoi et à quelle étape d'un parcours (ex: le parcours d'engagement d'un jeune).

      • Utilité : La cartographie devient une boussole pour les professionnels, facilitant l'orientation des publics entre les différentes structures et renforçant la coopération opérationnelle.

      IV. Enjeux et Perspectives pour l'Essaimage

      La fin du webinaire a ouvert une discussion sur la transférabilité de la démarche.

      A. Les Besoins des Acteurs de Terrain

      Les participants ont exprimé un vif intérêt et ont formulé plusieurs besoins pour se lancer :

      • Accès aux outils : Le site ressource en préparation par le RNMA est attendu.

      • Formation : Des temps dédiés pour s'approprier la méthode de manière plus approfondie.

      • Échange entre pairs et codéveloppement : Un espace pour partager les expériences, les difficultés et les réussites dans la durée.

      B. Le Défi du Financement de la "Fonction de Soutien" C'est l'enjeu principal qui a émergé. Les financements publics sont souvent fléchés vers des actions visibles et quantifiables, rendant difficile la valorisation du temps d'ingénierie, de coordination et d'animation.

      Or, sans ce temps dédié, les dynamiques coopératives peinent à se maintenir.

      Les intervenants ont qualifié la reconnaissance de ce travail de "combat politique" à mener, tant par les associations que par les collectivités, pour démontrer que l'animation territoriale est une condition essentielle de l'impact des politiques publiques.

      C. Vers une Communauté d'Apprentissage

      En réponse aux besoins exprimés, le RNMA et La Fonda proposent de lancer une communauté d'apprentissage.

      L'objectif serait de créer un espace d'échange de pratiques et de soutien pour les structures souhaitant impulser des communautés d'action sur leur territoire, que ce soit dans le cadre de Guide'Asso, de politiques alimentaires ou d'autres thématiques.

      Des rencontres futures, notamment avec des acteurs de la philanthropie comme la Fondation de France, sont envisagées pour faire avancer la réflexion sur le financement de ces transformations systémiques.

    1. La Démarche d'Observation Locale de la Vie Associative (OLVA)

      Résumé

      La démarche d'Observation Locale de la Vie Associative (OLVA), pilotée par le Réseau National des Maisons des Associations (RNMA), est une initiative structurée visant à combler le manque de données sur le tissu associatif à l'échelle territoriale.

      Face à une statistique publique jugée insuffisante et mal adaptée aux réalités associatives, l'OLVA propose une méthodologie rigoureuse pour objectiver les réalités, mesurer les évolutions et permettre des comparaisons entre territoires.

      La finalité est de "connaître pour agir" : transformer la connaissance produite en actions concrètes, en politiques publiques éclairées et en un accompagnement plus pertinent pour les associations.

      La démarche repose sur deux piliers :

      • une enquête quantitative (l'observation) via un questionnaire standardisé d'une cinquantaine de questions, et

      • la création d'une dynamique partenariale (l'observatoire) associant acteurs associatifs et publics.

      Ce processus se décline en trois phases : la préparation (définition des enjeux locaux, formation), la diffusion de l'enquête (mobilisation des partenaires), et l'analyse des données suivie d'actions post-enquête.

      Le RNMA fournit un accompagnement complet incluant des formations, des outils (questionnaire, plateforme en ligne), l'analyse statistique et la mise en réseau des quelque 50 observatoires actifs.

      Les livrables finaux incluent une infographie synthétique, une analyse des associations employeuses et un rapport d'enquête détaillé, conçus pour aider à la décision, renforcer la reconnaissance du secteur et mettre en débat les enjeux de la vie associative.

      1. Contexte et Origine de la Démarche

      1.1. Le Réseau National des Maisons des Associations (RNMA)

      Le RNMA est une tête de réseau regroupant un peu plus de 100 structures membres réparties sur le territoire français.

      Sa spécificité réside dans l'hybridité de ses membres, qui incluent des Maisons des Associations sous statut associatif et des services "Vie Associative" de collectivités publiques.

      Cette diversité est considérée comme une richesse pour le réseau.

      Les missions principales du RNMA sont :

      • Faire réseau : Créer des synergies entre les membres via des rencontres nationales et des projets communs. • Coordonner des projets : L'OLVA est un exemple de projet né d'un enjeu commun identifié : le besoin de connaissance de la vie associative locale. D'autres chantiers portent sur l'engagement ou le métier d'accompagnateur. • Être un relais : Porter les enjeux et problématiques des membres et des associations qu'ils accompagnent auprès des partenaires publics et privés.

      1.2. La Genèse de l'OLVA

      La démarche OLVA est née d'un constat fondamental : la méconnaissance de la vie associative à l'échelle d'un territoire. La statistique publique est jugée "assez faible" pour décrire ce secteur, étant principalement conçue pour le monde économique.

      Pour répondre à ce besoin, le RNMA s'est inspiré dès 2006 des travaux de Viviane Chardononog, chercheuse au CNRS qui produit régulièrement des "paysages associatifs français".

      La méthodologie d'observation nationale a été adaptée et déclinée à l'échelle locale de manière progressive, par des tests puis la création d'outils.

      Depuis 2021, une évolution majeure a été la mise en place d'un système de lancement d'observations synchronisées, créant des "vagues" d'observatoires. Cette approche facilite l'accompagnement et génère une dynamique d'échange et de partage entre les territoires participants.

      2. Les Objectifs Stratégiques de l'OLVA

      La démarche poursuit un objectif principal : "Connaître pour agir". La connaissance n'est pas une fin en soi, mais un levier pour transformer les réalités et répondre aux besoins identifiés.

      2.1. Objectifs Liés à l'Observation

      1. Objectiver les réalités : Dépasser les "ressentis" sur la structuration du tissu associatif en produisant des chiffres et des données factuelles.

      2. Mesurer les évolutions : Après une première enquête qui établit une "photo statique", les observations suivantes permettent de comparer les "millésimes" et de suivre les dynamiques sur le long terme.

      3. Se comparer : La méthodologie commune appliquée sur une cinquantaine de territoires permet de situer les réalités locales par rapport à d'autres territoires et aux tendances nationales décrites par Viviane Chardononog.

      2.2. Objectifs Liés à l'Action

      • Aider à la décision : Fournir des données pour orienter les politiques publiques des collectivités et les stratégies d'accompagnement des structures d'appui associatives.

      • Communiquer sur le secteur : Lutter contre les préjugés sur le monde associatif en partageant des réalités objectivées.

      • Favoriser la reconnaissance : Mettre en lumière la transversalité et la globalité de la vie associative, souvent cloisonnée par secteurs (sport, culture, etc.) au sein des collectivités. L'enquête permet d'aborder les enjeux de manière transversale.

      • Créer du commun : Les temps de restitution de l'enquête sont des occasions pour des associations de secteurs différents de se rencontrer, de partager des difficultés communes et de créer une "espèce de commun de la vie associative".

      • Mettre en débat : Utiliser les résultats de l'enquête pour identifier les difficultés, se poser les bonnes questions et rassembler les acteurs autour de la table.

      3. Méthodologie et Déroulement

      La démarche OLVA s'articule autour de deux axes complémentaires : l'observation (l'enquête elle-même) et l'observatoire (la dynamique partenariale).

      3.1. L'Enquête (L'Observation) Il s'agit d'une enquête principalement quantitative, avec la possibilité d'y associer un volet qualitatif (par exemple, via un partenariat universitaire).

      Le Questionnaire : * • Format : Environ 50 questions. * • Temps de remplissage : Environ 20 minutes.

      • Structure en 4 parties :

        1. Profil de l'association : Secteur d'activité, âge, périmètre d'action.
        1. Membres de l'association : Profil des dirigeants, bénévoles, volontaires.
        1. Moyens de fonctionnement : Ressources, régime de financement, locaux.
        1. Besoins et perspectives.

      Ce questionnaire "tronc commun" peut être complété par des questions spécifiques définies en fonction des enjeux du territoire.

      3.2. La Dynamique Partenariale (L'Observatoire)

      C'est un aspect essentiel qui consiste à associer d'autres acteurs du territoire concernés par l'enquête (fédérations associatives, acteurs publics, etc.). Cette collaboration vise à : • Garantir une large diffusion du questionnaire au-delà des contacts habituels. • Permettre une action post-enquête en impliquant dès le départ ceux qui pourront répondre aux besoins exprimés. • Favoriser l'interconnexion des acteurs du territoire.

      3.3. Les Trois Phases de la Démarche

      Phase

      Description des Actions

      1. Préparation de l'enquête

        • Définition des objectifs et enjeux spécifiques au territoire.<br>
        • Formulation des questions complémentaires au tronc commun.<br>
        • Mobilisation des données existantes (notamment INSEE pour les associations employeuses).<br>
        • Organisation de formations par le RNMA.

      2. Diffusion et recueil

        • Mobilisation de l'observatoire et des partenaires pour une diffusion large (en ligne, questionnaire papier).<br>
        • Association possible avec des événements forts (ex: Forum des associations).<br>
        • Animation de la collecte : phoning, relances ciblées par secteur pour assurer un échantillon cohérent.<br>
        • Accompagnement du RNMA pour le suivi de la qualité de l'échantillon.

      3. Analyse et post-enquête

        • Analyse statistique des données réalisée par le RNMA (vérification de cohérence, croisements de variables).<br>
        • Partage des premiers résultats avec le porteur de projet pour identifier des axes d'approfondissement.<br>
        • Organisation du travail post-enquête pour répondre aux besoins identifiés et dialoguer avec les pouvoirs publics.

      4. Moyens et Accompagnement du RNMA

      4.1. Moyens Humains Requis

      Le RNMA identifie trois fonctions clés à mobiliser pour mener à bien une démarche OLVA :

      • 1. Pilotage : Gestion de projet (planification, suivi, moyens).

      • 2. Animation : Mobilisation des partenaires, diffusion de l'enquête, communication. Cette fonction peut être assurée en interne ou via des stagiaires ou services civiques.

      • 3. Analyse : Assurée par le RNMA, avec une possibilité de transfert de compétences ("formation-action") vers la structure porteuse pour de futures observations.

      4.2. Appuis Fournis par le RNMA

      • Accompagnement et Formations :

      • ◦ Structurer une démarche d'observation (2 jours) : en amont du projet.
      • ◦ Analyse de données (1 jour) : en fin d'enquête, pour maîtriser les résultats.
      • ◦ Mener une enquête éclair : formation pour des enquêtes plus simples et rapides sur des sujets spécifiques.

      • Outillage : Fourniture du questionnaire tronc commun, mise en ligne, plateforme de collecte de données et outils issus de l'expérience des autres territoires.

      • Analyse de données : Exploitation des données existantes (créations d'associations, associations employeuses) et analyse complète des données collectées.

      • Mise en réseau et valorisation : ◦ Organisation de journées d'échange de pratiques deux fois par an pour les 50 observatoires. ◦ Valorisation des travaux locaux via des colloques, articles, etc.

      4.3. Livrables

      Trois types de documents sont produits à l'issue de l'analyse :

        1. Infographie "Les chiffres clés de la vie associative" : Un document synthétique et visuel de 4 pages, facile à diffuser.
        1. Document sur les associations employeuses : Une analyse courte basée sur les dernières données INSEE Flores.
        1. Rapport d'enquête complet : Un document d'environ 40 pages traitant toutes les questions avec graphiques, commentaires et détail de la méthodologie.

      5. Partenariats et Enjeux Territoriaux (Synthèse des échanges)

      5.1. Articulation avec Guid'Asso

      • • Le RNMA est membre du comité de pilotage national de Guid'Asso et accompagne son déploiement sur les territoires.
      • • Les questions posées par les observatoires (structuration de la vie associative, besoins, articulation des appuis) sont jugées "très connexes" à la structuration de cette politique publique.
      • • L'articulation se fait de manière variable selon les territoires, mais il est noté un développement d'observatoires régionaux (ex: Hauts-de-France) qui permettent une approche différente.

      5.2. Relations avec les Collectivités et la Charte des Engagements Réciproques

      • • Le RNMA et ses membres connaissent et travaillent sur la déclinaison de la Charte des engagements réciproques entre l'État, les associations et les collectivités.
      • • Il est souligné que l'enjeu réside moins dans le document lui-même que dans le processus de co-construction.
      • • La mise en place d'une telle charte peut être une des conséquences concrètes d'une démarche d'observation, en réponse aux défis identifiés.

      5.3. Échelle d'Action et Compétences * • L'observation se fait souvent à l'échelle d'une commune ou d'une communauté de communes. * • Une difficulté est identifiée à l'échelle des communautés de communes : la "vie associative" reste souvent une compétence communale, ce qui peut limiter la capacité des intercommunalités à porter de tels projets. * • Cependant, lorsque le portage est associatif (par une Maison des Associations), le bassin de vie ou la communauté de communes est plus régulièrement l'échelle privilégiée.

      6. Considérations Pratiques

      • • Coût de la démarche : Le coût varie en fonction de la taille du territoire, mais se situe généralement entre 8 000 et 12 000 €. Ce montant est ajustable en fonction du niveau d'analyse et de transfert de compétences souhaité.
      • • Format des formations : Elles sont privilégiées en présentiel pour faciliter les échanges et la connaissance mutuelle.
    1. Synthèse du webinaire : Face à la marchandisation des associations

      Résumé

      Ce document de synthèse analyse les conclusions du webinaire intitulé "Face à la marchandisation des associations", deuxième épisode du cycle "Écrire collectivement le scénario de renforcement du monde associatif".

      Il met en lumière une transformation profonde du secteur associatif français, caractérisée par un glissement d'un modèle de partenariat avec la puissance publique vers une logique de marché concurrentiel.

      Les principaux moteurs de cette "marchandisation" sont doubles : * l'adoption du New Public Management depuis les années 1980, qui impose une culture du résultat et de l'évaluation quantitative, et * les règles de concurrence du marché unique européen, qui considèrent par défaut les associations comme des entreprises.

      Cette dynamique se matérialise par le recul de la subvention de fonctionnement au profit de la commande publique (appels à projet, marchés publics), transformant les associations en prestataires de services et les privant de leur capacité d'initiative et d'interpellation politique.

      Ce phénomène est exacerbé par l'essor de l'entrepreneuriat social, qui brouille les frontières entre lucrativité et non-lucrativité, et par la financiarisation du secteur via des outils comme l'investissement et les contrats à impact social.

      Le cas du secteur de la petite enfance illustre concrètement comment des acteurs lucratifs pénètrent des champs historiquement associatifs, utilisant des montages complexes ("associations écran") pour capter des fonds publics.

      Face à ce constat, plusieurs pistes de "démarchandisation" sont proposées.

      Elles incluent la nécessité de redonner un pouvoir politique à la subvention, d'explorer des modèles de financement alternatifs inspirés de la Sécurité Sociale de l'Alimentation ou du modèle belge de l'éducation permanente, et de "reconscientiser" les élus et techniciens des collectivités à la spécificité du fait associatif.

      La reconquête sémantique, notamment sur la notion d'évaluation (utilité sociale contre mesure d'impact), et le renforcement des réseaux et du plaidoyer collectif apparaissent comme des stratégies cruciales pour défendre un modèle associatif fondé sur l'intérêt général, la citoyenneté et la démocratie.

      1. Introduction et Contexte du Webinaire

      Le webinaire s'inscrit dans un cycle de réflexion organisé en partenariat par le Réseau National des Maisons des Associations (Rnma) et le Collectif des associations citoyennes (CAC).

      Il vise à explorer collectivement les scénarios de renforcement du monde associatif face aux dynamiques d'affaiblissement actuelles.

      Cet épisode se concentre spécifiquement sur le phénomène de marchandisation, ses constats, ses réalités de terrain et les pistes pour y faire face.

      Intervenants principaux :

      • Marianne Langlais : Coordinatrice de l'Observatoire citoyen de la marchandisation des associations (CAC).

      • Frédéric Bilde : Président de l'ACEPP Sud-Ouest (Association des Collectifs Enfants Parents Professionnels).

      • Thomas Lowers : Administrateur du Rnma et Directeur de la Maison des Associations de Roubaix.

      Le webinaire s'est articulé autour de deux temps forts : une analyse du processus de marchandisation et une exploration des pistes de "démarchandisation".

      2. Analyse Approfondie du Phénomène de Marchandisation

      A. Un Cadre Théorique : Les Quatre Vecteurs d'Affaiblissement

      Marianne Langlais a introduit l'analyse en présentant quatre vecteurs interdépendants qui affaiblissent le monde associatif, identifiés par le CAC :

      1. La Marchandisation : La transformation des modes de financement, avec le passage de la subvention à la commande publique et l'augmentation des ressources marchandes.

      2. La Managérialisation : L'adoption de techniques de management issues de l'entreprise lucrative, perçues comme le seul modèle légitime, au détriment des formes d'organisation collective propres aux associations.

      3. L'Instrumentalisation : Les associations sont de plus en plus vues comme de simples prestataires de services chargés de mettre en œuvre des politiques publiques pensées sans elles, perdant leur rôle d'initiative et de création de droits.

      4. La Mise à Mal des Libertés Associatives :

      Conséquence des trois autres vecteurs, le financement public est de plus en plus perçu comme devant s'accompagner d'une "soumission à la politique gouvernementale", rendant illégitime toute parole politique critique de la part d'une association vue comme prestataire.

      B. Origines et Mécanismes de la Marchandisation

      La marchandisation est le résultat de deux tendances de fond :

      • Le New Public Management (années 1980) : Cette approche a introduit dans les services publics, et par ricochet dans les associations, une "culture du résultat, de la performance, de l'évaluation quantitative et du management par objectif", en important les outils du secteur privé lucratif.

      • Les Règles du Marché Unique Européen : Elles imposent le marché comme règle générale. Dans ce cadre, les associations sont considérées comme des entreprises et les subventions comme des aides d'État susceptibles de fausser la concurrence. Bien que des exemptions existent, elles restent l'exception.

      Ce cadre a engendré un mécanisme central : le recul de la subvention au profit de la commande publique. Cette mutation modifie radicalement le rapport aux pouvoirs publics :

      Mode de Financement

      Caractéristiques Clés

      Subvention

        • L'association est à l'origine de l'initiative. <br>
        • Pas de contrepartie directe attendue par le financeur. <br>
        • Logique de partenariat, politique ascendante (partant des besoins).

      Commande Publique

        • Achat d'un service par la puissance publique. <br>
        • L'association doit s'inscrire dans un cadre prédéfini. <br>
        • Logique de prestation, politique descendante (l'État commande).

      C. L'Entrepreneuriat Social et la Financiarisation

      Le tournant néolibéral est également accompagné et nourri par des concepts qui brouillent les frontières entre lucrativité et non-lucrativité. • L'Entrepreneuriat Social : Terme importé du monde anglo-saxon dans les années 1980, il promeut l'utilisation de compétences entrepreneuriales et de techniques de management du privé pour résoudre des problèmes sociaux.

      Marianne Langlais souligne que ce concept, sans statut juridique propre, "invisibilise les associations" et "affaiblit la portée politique que peuvent représenter les associations".

      Elle cite Jean-Marc Borello (Groupe SOS) pour qui le modèle associatif, "arquebouté sur le principe non lucratif apparaît à présent inadapté".

      • La Financiarisation : Ce flou favorise l'arrivée de l'investissement à impact social, qui promet de "faire du bien et du profit en même temps". * ◦ Le Contrat à Impact Social (CIS) est un exemple frappant. Ce mécanisme financier engage un investisseur privé, une association et la puissance publique.

      L'investisseur finance un programme associatif, et la puissance publique le rembourse avec intérêts en fonction de l'atteinte d'indicateurs de performance prédéfinis. * ◦ Conséquence : Les actions associatives sont transformées en produits financiers, et la mesure d'impact social, souvent monétarisée (coûts évités), devient le nouveau mode d'évaluation, supplantant la notion d'utilité sociale.

      D. Illustration Sectorielle : Le Cas de la Petite Enfance Frédéric Bilde a illustré concrètement ce processus dans le secteur de la petite enfance.

      • Historique : Le secteur s'est initialement développé sur un modèle associatif et citoyen, avec des crèches parentales (ACEPP) fondées sur une démarche ascendante, partant des besoins locaux. • Le tournant : Les directives européennes (Bolkestein, 2006) ont ouvert le secteur à la concurrence et aux acteurs privés lucratifs. L'État français a choisi de maintenir les services à la petite enfance dans ce champ concurrentiel.

      • Conséquences directes :

      • Nouveau langage : Les notions de "rentabilité", "taux de remplissage", "optimisation" et "coût de revient" priment sur le "bien-être de l'enfant" et le "soutien aux parents".

      • Pervertissement du modèle associatif : Des groupes privés lucratifs créent des "associations écran" pour rassurer les municipalités et remporter des délégations de service public (DSP).

      Ces structures sont ensuite vidées de leur substance, les bénéfices étant "aspirés" vers le groupe via des centrales d'achat, des frais de gestion ou des loyers versés à des sociétés civiles immobilières (SCI) appartenant au même groupe.

      • Usage de fonds publics : Ces montages bénéficient à la fois des financements de la Caisse d'Allocations Familiales (CAF) et d'une défiscalisation massive via le Crédit Impôt Famille (CIF), créant une "redondance" entre fonds publics et fonds privés défiscalisés.

      3. Pistes de "Démarchandisation" et Stratégies de Renforcement

      Face à ce diagnostic, les intervenants ont proposé plusieurs axes d'action pour inverser la tendance.

      A. Redonner un Pouvoir Politique à la Subvention

      La piste principale est de réaffirmer la légitimité de la subvention de fonctionnement comme mode de financement principal des associations.

      Il est jugé indispensable qu'une partie de l'impôt finance les associations qui contribuent à l'intérêt général. Toutefois, cette subvention doit être repensée pour la "sortir du discrétionnaire politique" afin de garantir la fonction d'interpellation des associations.

      B. Explorer des Modèles de Financement Alternatifs

      Plusieurs modèles inspirants sont explorés :

      1. La Sécurité Sociale de l'Alimentation : Ce projet propose un accès universel à une alimentation de qualité via une allocation et un conventionnement de producteurs, le tout géré par des caisses locales démocratiques. Transposé aux associations, ce modèle pourrait permettre de repenser la redistribution de l'argent public en dehors des logiques de marché.

      2. Le Modèle Belge de l'Éducation Permanente : Un décret en Wallonie garantit un financement pérenne et à long terme pour les associations de ce secteur.

      Crucialement, l'article 1 de ce décret "garantit la fonction d'interpellation des associations". Ce modèle promeut également l'auto-évaluation, une alternative à l'évaluation par la mesure d'impact.

      C. Renforcer le Dialogue et la Coconstruction Une stratégie clé consiste à "reconscientiser" la puissance publique, notamment les élus locaux.

      • Changement de regard : Thomas Lowers observe que beaucoup d'élus locaux récents, issus d'un monde où la logique de marché est une "évidence", n'envisagent plus d'alternative.

      Il est donc crucial de démontrer la valeur et le sérieux du monde associatif, qui n'est pas "un coût à gérer" mais un partenaire.

      • Formation : L'utilisation de dispositifs comme le Certificat de Formation à la Gestion Associative (CFGA) pour former également les élus est une piste pour créer une culture commune.

      • Défendre l'interpellation : Léa Gallois (Institut Alinsky) a insisté sur la nécessité de faire reconnaître la fonction d'interpellation comme "riche pour la démocratie locale" et de créer des "fonds d'interpellation" pour la soutenir matériellement.

      D. S'Organiser Collectivement et Mener la "Bataille des Mots"

      • Force du collectif : Frédéric Bilde a souligné que "seul, on n'y arrivera pas". La mobilisation au sein de fédérations et de réseaux est essentielle pour peser dans les décisions politiques.

      • Bataille sémantique : La discussion a mis en exergue l'importance de déconstruire le langage de la marchandisation. L'évaluation est un terrain de lutte central.

      Il s'agit de défendre une évaluation de l'utilité sociale (qui montre la spécificité associative) contre la mesure de l'impact social (qui réduit l'action à une valeur monétaire et est le "cheval de Troie de la financiarisation").

      4. Points Clés de la Discussion

      • Le Mécénat : Il n'est pas vu comme une solution miracle. Il est souvent ponctuel, orienté vers des "pépites" médiatiques et peut lui-même s'inscrire dans une logique d'investissement ("venture philanthropy") qui s'éloigne du don.

      • Règles Européennes : Le rapport du CESE sur le financement des associations est cité comme une ressource clé pour argumenter en faveur d'une sortie des associations du cadre concurrentiel du marché unique.

      • Statuts et Agréments : La discussion a montré comment les exigences administratives et la standardisation des projets (via les agréments) peuvent dénaturer des initiatives de terrain innovantes, transformant une logique ascendante en une logique descendante.

    1. Synthèse du Soutien de l'État à la Vie Associative : Du National au Local

      Résumé

      Cette note de synthèse analyse les formes et les logiques du soutien de l'État à la vie associative en France, en se basant sur une recherche menée par Mathilde Rtinassi et Emmanuel Porte.

      L'étude révèle une action publique éclatée et peu coordonnée, dépourvue d'une politique unifiée.

      Le soutien financier, bien que stable en volume global de subventions (environ 8,5 milliards d'euros), est perçu comme étant en baisse en raison de sa répartition sur un nombre croissant d'associations, ce qui diminue le montant moyen par structure.

      La recherche identifie quatre grands objectifs poursuivis par l'État : la consolidation des structures, l'articulation du secteur, l'observation du monde associatif, et la reconnaissance de sa légitimité. Ces objectifs sont inégalement poursuivis selon les ministères, menant à une typologie de quatre formes de soutien distinctes :

      • 1. Soutien partiel et institué : Une relation de compagnonnage de longue date, mais ne couvrant qu'une partie des objectifs.

      • 2. Soutien multiforme : Le modèle le plus complet, couvrant les quatre objectifs, porté par des acteurs comme la CNAF ou la DJEPVA.

      • 3. Soutien par la reconnaissance : Limité aux procédures réglementaires (agréments, labels), avec une relation distante.

      • 4. Soutien par relation intéressée : Le modèle le plus répandu, où l'association est principalement un instrument pour le déploiement des politiques publiques, souvent via des appels d'offres.

      Un constat majeur est la corrélation directe entre la qualité du dialogue entre les acteurs publics et les associations et la richesse des formes de soutien.

      Enfin, la production de connaissance sur le secteur associatif reste le "parent pauvre" de l'action publique nationale, et les "têtes de réseau" jouent un rôle opérationnel indispensable que l'État ne peut assumer seul.

      1. Contexte et Méthodologie de la Recherche

      La recherche intitulée "Les soutiens national à la vie associative : enquête exploratoire sur une action publique éclatée" a été initiée mi-2019 pour répondre à deux constats principaux :

      • • L'illisibilité du soutien à la vie associative : Des rapports antérieurs (Inspection Générale, 2009 et 2016) soulignaient la complexité et la confusion des dispositifs, labels et acronymes (CRIB, PAVA, PIVA), rendant l'écosystème difficile à naviguer pour les associations. Une citation d'un rapport illustre ce point : "il existe des crib (...) qui sont des Pavas (...), des cribes non Pava, des Pavas non crib..."

      • • Une connaissance parcellaire des soutiens ministériels : Au-delà du rôle historique du ministère de la Jeunesse, la manière dont les autres ministères soutiennent le secteur associatif restait mal connue, alors que chacun interagit avec lui (ex: fiscalité pour le ministère de l'Économie, gestion du greffe pour l'Intérieur). Méthodologie

      L'étude repose sur une approche purement qualitative :

      • • 42 entretiens semi-directifs ont été réalisés avec :

      • ◦ Des agents de 15 ministères et organismes d'État.

      • ◦ 19 têtes de réseau associatives.

      • ◦ 4 acteurs de l'accompagnement privé.
      • ◦ 3 personnalités qualifiées.
      • ◦ 1 responsable politique.

      • • Une analyse documentaire approfondie (doctrines de financement, appels à projets, rapports d'activité) a été menée pour aller au-delà du discours officiel.

      Le terrain de recherche a été complexe, marqué par des difficultés d'accès aux ministères et une interruption de 14 mois due à la crise du Covid-19.

      2. Analyse du Soutien Financier : Nuances et Réalités

      Contrairement à l'idée reçue d'une baisse généralisée des financements, l'analyse des données (notamment le "Jaune" budgétaire associatif) apporte des nuances importantes :

      • Stabilité des subventions : Le montant global des subventions versées par l'État aux associations est relativement stable, s'élevant à environ 8,5 milliards d'euros.

      • Augmentation du volume global des moyens : Le volume financier total attribué aux associations augmente, mais une part croissante de ces moyens n'est pas versée sous forme de subventions (ex: marchés publics).

      • Étalement sur le tissu associatif : L'État finance un nombre croissant d'associations.

      Par conséquent, même avec un budget global stable, le montant moyen et médian par association est en baisse.

      Ce phénomène d'étalement est le principal facteur expliquant la perception d'une diminution des financements et de la part des subventions.

      3. Les Quatre Grands Objectifs du Soutien de l'État

      L'analyse des entretiens et des documents a permis d'identifier quatre objectifs principaux que l'État poursuit, de manière plus ou moins explicite, à travers son soutien au monde associatif.

      Objectif Description Exemples d'Actions Consolidation

      Vise à pérenniser la structure associative en la rendant robuste sur le long terme pour qu'elle puisse répondre aux besoins sociaux et aux politiques publiques. * - Soutien financier (subventions, etc.)<br>- Accompagnement à la professionnalisation<br>- Renforcement du modèle socio-économique et de la gouvernance<br>- Accueil, information et orientation<br>-

      Formation des membres et salariés

      Articulation / Maillage Concerne la circulation de l'information, le partage de bonnes pratiques et la facilitation des coopérations entre associations, et avec d'autres acteurs (publics, privés). * - Mise en réseau des acteurs<br> * - Organisation d'échanges de pratiques entre pairs<br> * - Facilitation de l'essaimage d'expérimentations

      Observation / Objectivation

      Recouvre la production de connaissances sur le secteur associatif pour éclairer l'action publique. Cet objectif est souvent le moins prioritaire.

        • Financement d'études et de recherches (réalisées par des cabinets, des chercheurs, des têtes de réseau)<br>
        • Création d'espaces de réflexion collective (ex: le COJ pour la jeunesse)

      Reconnaissance / Légitimation Englobe toutes les procédures de reconnaissance officielle des structures, de leurs projets ou de leurs activités, leur conférant une légitimité à agir au nom de l'intérêt général.

        • Délivrance d'agréments<br>
        • Reconnaissance d'utilité publique (RUP)<br>
        • Attribution de labels et de prix

      Ces objectifs sont souvent interconnectés. Par exemple, la reconnaissance (via un agrément) facilite l'accès aux financements, contribuant ainsi à la consolidation de l'association.

      4. Typologie des Formes de Soutien National

      En croisant les objectifs poursuivis, la nature du dialogue et le type de relation entre l'État et les associations, la recherche a établi une typologie de quatre modèles de soutien. Type de Soutien Description et Relation Objectifs Couverts

      Exemples et Caractéristiques

      1. Soutien partiel et institué Relation de "compagnonnage" : Historique, structurante et de longue durée.

      La politique publique est fortement adossée aux têtes de réseau, qui deviennent des partenaires incontournables.

      Consolidation (financière) et Articulation/Maillage.

      • Ministère de la Culture (pratiques amateurs) : Les fédérations sont des "coquilles vides" potentielles sans le soutien de l'État, et l'État ne peut agir sans elles.<br>

      • Délégation à la Sécurité Routière avec l'association Prévention Routière.<br>

      • Tendance à créer un "monopole" d'interlocuteurs.

      2. Soutien multiforme

      Le "champion" du soutien : Le modèle le plus complet et diversifié.

      La relation est basée sur la co-construction et un dialogue riche.

      Il existe une culture "militante" en faveur du monde associatif au sein de ces administrations.

      Les 4 objectifs sont couverts, y compris l'Observation.

      • CNAF, DJEPVA, ANCT (ex-CGET).<br>
      • Utilise une palette d'outils : subventions, CPO, appels à projets (rarement des appels d'offres).<br>
      • Production d'outils, de kits, de formations.<br>
      • Agence Française de Développement (AFD) avec le dispositif "Initiative OSC" qui reconnaît le droit à l'initiative des associations.

      3. Soutien par la reconnaissance

      Relation procédurale et distante : Le soutien est quasi-exclusivement articulé autour d'une procédure réglementaire (agrément, label).

      Principalement la Reconnaissance/Légitimation.

      • Direction Générale de la Santé pour l'agrément des associations représentant les usagers du système de santé.<br>
      • L'administration connaît mal le secteur et a peu de dialogue avec lui.<br>
      • L'objectif est de s'assurer que les associations remplissent les conditions réglementaires, pas de les accompagner dans leur développement.

      4. Soutien par relation intéressée

      Relation instrumentale : Le soutien est secondaire par rapport à l'objectif principal du ministère, qui est le déploiement de sa politique publique. Les associations sont vues comme des prestataires.

      Principalement la Consolidation (uniquement pour qu'elles puissent "tenir" et mettre en œuvre la politique).

        • Le modèle le plus fréquent (près de la moitié des directions rencontrées).<br>
        • Recours massif aux marchés publics (appels d'offres) et appels à projets.<br>
        • Mise en concurrence des associations avec des entreprises lucratives ou des établissements scolaires (ex: Ministère de la Défense).<br>
        • Les dispositifs généraux d'accompagnement (DLA, FDVA) sont méconnus et peu mobilisés par ces directions.

      5. Le Rôle Crucial des Têtes de Réseau

      La recherche souligne que le soutien de l'État ne pourrait exister sans le rôle opérationnel des têtes de réseau, qui agissent comme le bras armé d'un "État stratège" mais souvent démuni de capacités d'action directe.

      Leurs contributions principales sont :

      • Représentation et Plaidoyer : Faire remonter les besoins des territoires et défendre les intérêts du secteur.

      • Identification des problématiques : Aider à l'émergence de nouvelles questions sociales ou d'innovations.

      • Mutualisation d'expertise : Produire de la connaissance utile pour leurs membres et non-membres.

      Les têtes de réseau sont traversées par un débat sur leur approche territoriale :

      • Logique d'équité : Viser un soutien équitable pour toutes les structures membres, souvent via une centralité budgétaire.

      • Logique d'adaptation : Soutenir les structures de manière "ad hoc" en fonction des spécificités territoriales, ce qui implique une gouvernance moins pyramidale mais pose des défis de connaissance des contextes locaux et de risque de concurrence interne.

      6. Perspectives Locales et Divergences

      L'intervention de Luciana complète la perspective nationale en soulignant l'importance du contexte territorial, souvent absent des discours ministériels (à l'exception de l'ANCT et de la DJEPVA).

      • Le territoire comme construction sociale : Au-delà de l'espace administratif, le territoire est façonné par les relations entre les acteurs. Les politiques nationales sont appropriées différemment selon les spécificités locales.

      • Lisibilité des politiques publiques locales : La multiplication des dispositifs et le traitement en silo par secteur d'activité créent un manque de transversalité.

      La présence d'un élu référent, d'un service dédié ou d'un observatoire local de la vie associative (OLVA) peut renforcer le dialogue et la cohérence de l'action publique locale.

      • Rôle des réseaux locaux : Comme au niveau national, les réseaux locaux sont perçus par leurs membres comme des représentants de leurs intérêts, influençant l'appropriation des politiques publiques sur le territoire.

      7. Conclusions et Points Soulevés en Discussion

      Constats Clés

      1. Absence d'une politique unifiée : Il n'existe pas de politique de soutien à la vie associative intégrée au niveau de l'État. La coordination est faible, y compris au sein d'un même ministère.

      2. La qualité du dialogue est déterminante : Plus le dialogue est riche et orienté vers la co-construction, plus le soutien est diversifié et complet.

      3. L'Observation, parent pauvre de l'action publique :

      La production de connaissance sur le secteur associatif est souvent considérée comme secondaire ou un "impensé" au niveau national, alors qu'elle est un levier puissant au niveau local.

      Discussion avec les Participants

      Financements et clientélisme : Les participants ont évoqué un ressenti de clientélisme.

      La recherche met plutôt en évidence un "effet de monopole", où les ministères préfèrent dialoguer avec un interlocuteur unique ou principal.

      La charge administrative : Il a été souligné que les associations passent un temps considérable à remplir des dossiers pour des dispositifs multiples et chronophages, au détriment de l'action de terrain.

      Cela renvoie à la nécessité de simplifier les procédures et de privilégier des financements pluriannuels (CPO) plutôt que des appels à projets annuels.

    1. Document de Synthèse : Webinaire "Décrypter la recherche - Épisode 1"

      Synthèse

      Ce document de synthèse résume les points clés du webinaire "Décrypter la recherche - Épisode 1", organisé par le Réseau National des Maisons des Associations (RNMA).

      La discussion centrale a exploré la dichotomie conceptuelle de l'association, vue soit comme une "entreprise sociale", soit comme une "action collective", dans le contexte des débats actuels sur une potentielle révision de la loi française sur l'Économie Sociale et Solidaire (ESS) de 2014.

      La présentation principale, assurée par Luciana Riero, doctorante au sein du RNMA, a exposé les résultats d'une cartographie scientifique ("science mapping") de la recherche internationale.

      Cette analyse révèle la structuration du champ académique autour de deux traditions dominantes :

      1. L'approche de l'entrepreneuriat social, prédominante dans le monde anglo-saxon, qui valorise l'activité entrepreneuriale et une vision plus individualiste, centrée sur le profil de l'entrepreneur social.

      2. L'approche du secteur à but non lucratif ("non-profit"), plus proche de la tradition européenne et française, qui analyse les associations dans leur relation avec l'action publique et en tant qu'expression de l'action collective.

      Le choix entre ces deux prismes d'analyse n'est pas neutre et a des conséquences directes sur la perception des associations, notamment en matière de gouvernance démocratique, d'économisation du social et de la dimension politique de l'ESS.

      Le débat sur la révision de la loi ESS française cristallise ces tensions, opposant une vision d'une ESS palliative et isomorphique aux modèles d'entreprises capitalistes, à une vision d'une ESS alternative et émancipatrice.

      Enfin, la position du RNMA est de défendre la spécificité de l'association comme action collective et de plaider pour que toute évolution législative renforce de manière opérationnelle la coopération et la co-construction sur les territoires.

      1. Introduction au Webinaire et à la Démarche de Recherche

      Contexte et Objectifs

      Le webinaire constitue le premier épisode d'un cycle intitulé "Décrypter la recherche", visant à créer un dialogue semestriel (tous les 4 à 6 mois) entre le monde de la recherche et les acteurs associatifs.

      L'objectif n'est pas de prendre position pour ou contre la révision de la loi ESS, mais d'utiliser les apports de la recherche pour fournir des "clés de lecture" sur les conséquences des évolutions en cours pour les associations et les territoires.

      Le RNMA et la Recherche

      Le RNMA entretient des liens historiques avec le monde de la recherche, notamment depuis les années 2000 avec des collaborations sur les observatoires locaux de la vie associative (avec des chercheurs comme Vivian Tchernonog ou Lionel Prouto).

      Cette démarche vise à rendre la recherche opérationnelle pour aider les associations à mieux comprendre leur fonctionnement et à faire évoluer leurs pratiques.

      La Thèse de Luciana Riero

      Luciana Riero est doctorante au sein du RNMA via un dispositif CIFRE. Sa thèse s'intitule "La qualification et interprétation des relations entre les associations et le territoire".

      Son objectif est d'identifier, de qualifier et de mesurer ces relations, ainsi que de démontrer les liens de causalité entre les caractéristiques socio-économiques des territoires et les caractéristiques organisationnelles des associations.

      2. Cartographie de la Recherche Internationale sur les Associations et le Territoire

      Méthodologie du "Science Mapping"

      La présentation s'appuie sur une cartographie scientifique, une méthode d'analyse bibliométrique quantitative qui permet de visualiser l'état des connaissances sur un sujet. La démarche est inductive, sans hypothèses a priori.

      • Corpus : 2 857 articles scientifiques issus de la base de données Web of Science.

      • Mots-clés de recherche : Croisement de termes liés aux associations ("nonprofit organization", "social enterprise", "voluntary sector") et de termes liés au territoire ("spatial", "urban", "local", "development").

      • Constat initial : Une augmentation des publications sur ce thème est observée depuis 2011. Les couples de mots-clés les plus fréquents sont "social enterprise + development" et "nonprofit organization + development", suggérant que la notion de développement est plus prégnante dans la littérature internationale que celle de territoire.

      Résultats Clés de la Cartographie L'analyse révèle une structuration de la recherche mondiale autour de deux grands pôles intellectuels :

      Cluster Thématique

      Description

      Pôle 1 : L'Entreprise Sociale Ce courant est centré sur l'entreprise sociale et l'activité entrepreneuriale. Il analyse l'émergence de ce concept, souvent dans une perspective internationale comparative (Europe vs. États-Unis), et son ancrage dans la théorie des organisations hybrides.

      Pôle 2 : Le Secteur à But Non Lucratif Ce courant est centré sur les organisations à but non lucratif, la philanthropie et le bénévolat. L'analyse porte principalement sur les relations entre ces organisations et l'État (l'action publique), questionnant les phénomènes d'institutionnalisation, de banalisation ou de marchandisation du secteur.

      Opposition Conceptuelle : Approches Américaine et Européenne de l'Entreprise Sociale

      • Aux États-Unis, deux écoles de pensée coexistent :

      1. L'école de la recette marchande : Considère comme entreprise sociale toute organisation, quel que soit son statut, qui déploie une activité économique marchande au profit d'une finalité sociale.

      2. L'école de l'innovation sociale : Met l'accent sur la figure de l'entrepreneur social (son dynamisme, sa créativité, son leadership) comme facteur déterminant. * ◦ Conclusion : Une approche à dominante

      • En Europe, les travaux du réseau EMES (Emergence of Social Enterprise) ont fondé le concept sur un idéal-type reposant sur trois dimensions clés : 1. Un projet économique : Activité continue de production de biens ou services. 2. Une mission sociale : Objectif explicite de service à la communauté et distribution limitée des profits. 3. Une gouvernance participative : Association des différentes parties prenantes. * ◦ Conclusion : Une approche à dominante

      3. Conséquences et Enjeux des Approches Théoriques

      Le choix d'analyser les associations par le prisme de l'entreprise sociale ou de l'action collective a des implications profondes.

      • Gouvernance Démocratique : L'économie sociale repose sur le principe "une personne, une voix" et la double qualité des membres (bénéficiaires et sociétaires).

      L'approche par l'entreprise sociale, notamment dans ses formes comme le "social business", rend ces frontières plus floues, le critère de démocratie interne n'étant pas toujours explicite.

      La gouvernance participative constitue une "dimension cruciale des ruptures possibles" entre les deux modèles.

      • Économisation du Social : L'approche par l'entreprise sociale peut accentuer la lecture d'un "déplacement d'une production publique vers une production privée".

      • Lien avec l'Action Publique : La recherche sur le secteur non lucratif met en débat la relation avec la puissance publique, oscillant entre des perspectives d'isomorphisme (tendance des associations à adopter les modes d'organisation des entreprises capitalistes) et de co-construction des politiques publiques.

      Positionnement de la Thèse : Face à ces constats, la recherche de Luciana Riero s'inscrira dans la continuité des approches francophones en termes d'action collective et d'auto-organisation.

      Elle renforcera une vision du territoire comme une construction sociale, en interaction avec l'action collective, se démarquant ainsi d'une vision internationale qui le perçoit souvent comme une simple donnée administrative (urbain/rural).

      4. Mise en Perspective : La Loi ESS et ses Débats

      Avertissement : Les points suivants sont basés sur les travaux d'acteurs comme ESS France, le RTES ou des chercheurs comme Timothée Duverger, et non sur l'expertise directe de la présentatrice.

      La Loi de 2014 : Un Double Projet

      L'article 1 de la loi de 2014 définit l'ESS non seulement comme un "mode d'entreprendre", mais aussi comme un "mode de développement économique", ce qui le distingue d'un simple projet d'entreprise pour en faire un projet politique de société.

      Ce développement est précisé comme étant local et durable (via les PTCE, article 9).

      Points Clés du Débat sur la Révision

      À l'approche des 10 ans de la loi, plusieurs enjeux sont débattus :

      • Agrément ESUS (Entreprise Solidaire d'Utilité Sociale) : Des acteurs proposent de le réviser pour imposer un meilleur partage de la valeur, renforcer le contrôle et ajouter une obligation de reporting d'utilité sociale et environnementale.

      • Moyens financiers : Les fonds créés sont jugés souvent "faibles et mêlés à une politique de soutien de l'économie d'impact aux contours un peu flous".

      • Rapprochement avec la RSE : La loi PACTE de 2019 a créé les "sociétés à mission", invitant les entreprises conventionnelles à se doter d'une "raison d'être". Ce mouvement, auquel des structures de l'ESS ont participé, questionne les frontières et les spécificités de l'ESS.

      • Lien ESS et Territoire : Un axe de développement serait d'assumer pleinement le rôle des régions comme "chefs de file" de l'ESS et de reconnaître celui des départements comme premiers financeurs.

      Enjeux Théoriques du Débat Français

      Le débat académique français gravite autour d'une tension fondamentale :

      • Une ESS alternative et émancipatrice vs. une ESS palliative et d'isomorphisme.

      • La montée en puissance des thématiques de l'entreprise sociale, de l'entrepreneuriat social, voire du "social business", tend à masquer l'hétérogénéité de l'ESS et à réduire son projet politique à une simple finalité sociale.

      Conclusion : L'approche adoptée pour analyser les associations (entreprise sociale ou action collective) influence directement la dimension politique du débat sur l'avenir de l'ESS.

      5. Contributions des Participants et Conclusion

      Précisions Techniques et conceptuelles

      • Distinction entre activité économique et marchande (Colin Blard, avocat) : Une spécificité française cruciale est que toute activité économique n'est pas marchande.

      La "plus-value sociale ajoutée" (liée à la notion d'utilité sociale) permet à une association de développer une activité économique tout en conservant son statut non lucratif et sa non-sujétion aux impôts commerciaux. Cette notion d'utilité sociale est intrinsèquement liée au territoire, notamment via l'analyse de la concurrence.

      • Évolution historique du secteur : Un participant avec 20 ans d'expérience dans l'ESS a rappelé que la raréfaction des financements publics depuis les années 1980 a poussé les associations vers une hybridation de leurs ressources et une professionnalisation, avec l'émergence de compétences issues du monde de l'entreprise.

      Ce mouvement a renforcé leur autonomie économique mais a aussi instauré une logique de mise en concurrence via les appels à projets.

      Positionnement et Plaidoyer du RNMA

      En conclusion, Thomas (RNMA) a exposé la position du réseau :

      • La loi de 2014 a été positive pour la reconnaissance de l'ESS, mais ses effets transformateurs sur les territoires sont restés limités.

      • Le RNMA défend la spécificité de l'association comme "action collective" au sein de l'ESS.

      • Plaidoyer : Si une révision de la loi a lieu, le RNMA plaidera pour qu'elle intègre une traduction opérationnelle forte des principes de coopération et de co-construction multi-acteurs.

      L'enjeu est de reconnaître et d'outiller l'apport des associations aux transitions des territoires et de la société.

    1. La Faculté de Punir : Analyse des Transformations du Châtiment des Enfants

      Résumé

      Ce document de synthèse analyse l'évolution de la "faculté de punir" appliquée aux enfants, en s'appuyant sur les travaux de Didier Fassin.

      Il met en lumière un paradoxe central : alors que les sociétés occidentales sont devenues de plus en plus punitives envers les adultes depuis la seconde moitié du XXe siècle, elles se sont montrées de plus en plus protectrices à l'égard des enfants.

      Cette protection a évolué à travers une série de redéfinitions successives des mauvais traitements, passant d'un cadre légal et familial (la "correction modérée") à des cadres médical (le "syndrome de l'enfant battu"), de santé publique (la "maltraitance infantile") et enfin psychologique et sexuel (les "abus sexuels").

      Cette tendance globale à la protection a conduit à une prohibition croissante des châtiments corporels, bien que de manière inégale à travers le monde, avec des exceptions notables comme les États-Unis.

      Cependant, l'analyse révèle que cette protection est elle-même vulnérable.

      L'État, en revendiquant le monopole de la faculté de punir, restreint le droit de correction dans la sphère privée tout en se réservant le droit de châtier les mineurs dans des cadres spécifiques.

      Cette vulnérabilité se manifeste de deux manières principales :

        1. Dans la justice pénale des mineurs, où le principe de protection établi par l'ordonnance de 1945 en France est progressivement érodé par un durcissement législatif visant à rapprocher le traitement des mineurs de celui des adultes.
        1. Dans le traitement administratif des mineurs étrangers, où l'enjeu devient de leur refuser le statut de minorité pour les soustraire à la protection et les exposer à la répression.

      En définitive, bien que la punition des enfants ait reculé dans la sphère privée, la protection qui leur est accordée par la loi reste fragile et soumise aux pressions politiques qui associent délinquance et immigration, remettant ainsi en cause le statut d'exception du mineur.

      1. La Nature et la Légitimité de la Punition

      Punir, dans son acception la plus générale, consiste à infliger une forme de souffrance (physique ou psychique) à une personne supposée avoir violé une loi, une morale ou une norme.

      Cette action est réputée légitime socialement car elle vise à corriger une infraction et à rétablir l'ordre. La légitimité du châtiment peut revêtir plusieurs dimensions :

      Légale : Lorsqu'elle est prononcée par une autorité judiciaire au terme d'un procès.

      Ce système formel est lui-même historiquement, culturellement et politiquement déterminé.

      Sociale : En dehors du cadre légal, dans des "mondes sociaux" qui définissent leurs propres règles (ex: institution scolaire, milieux mafieux).

      L'analyse se concentre principalement sur une évolution paradoxale : alors que l'appareil punitif de l'État est devenu plus sévère ("punitif"), la sphère familiale l'est devenue de moins en moins.

      La légitimité du châtiment s'est renforcée dans l'espace public tout en se restreignant dans l'espace privé, notamment concernant les enfants.

      2. La Généalogie du Châtiment de l'Enfant : De la Puissance Paternelle à la Protection

      2.1. De la "Patria Potestas" à la "Correction Modérée" L'histoire du châtiment des enfants est marquée par une longue évolution depuis le pouvoir quasi absolu du père dans l'Empire romain.

      La Patria Potestas : Ce pouvoir du père sur ses enfants, allant jusqu'au droit de vie et de mort, était quasi absolu dans le cadre légal romain.

      Il était cependant limité en pratique par des facteurs sociaux (mariage tardif, espérance de vie réduite) et une évolution progressive du droit romain lui-même.

      L'Évolution vers la Protection : Au fil des époques (médiévale, classique, Lumières), une obligation de protection de l'enfant s'est associée à la puissance paternelle, qui a également été étendue à la mère.

      La "Correction Modérée" : Au XIXe siècle, comme le note Philippe Antoine Merlin en 1813, le "droit de vie et de mort fut réduit à une simple correction, à un châtiment modéré".

      Toutefois, les critères de cette modération n'étant pas définis, la législation protégeait peu l'enfant contre la violence parentale.

      2.2. L'Intervention de l'État : Les Lois de la Fin du XIXe Siècle

      La fin du XIXe siècle en France marque un tournant avec l'intervention de l'État dans la sphère familiale pour protéger les enfants.

      Loi du 24 juillet 1889 ("sur la protection des enfants maltraités et abandonnés") :

      Elle prévoit la déchéance de la puissance paternelle pour les parents condamnés pour des crimes ou délits sur leurs enfants, ou dont le comportement (ivrognerie, mauvais traitements) compromet la santé, la sécurité ou la moralité des enfants.

      Loi du 19 avril 1898 ("sur la répression des violences...") :

      Elle précise les infractions (coups, privation d'aliments) et alourdit les peines si les auteurs sont les parents ou une personne ayant autorité sur l'enfant, pouvant aller jusqu'aux travaux forcés à perpétuité.

      Selon l'historien Georges Vigarello, ces lois s'inscrivent dans une triple transformation de la société :

        1. Un recul général de la violence.
        1. Une sensibilité nouvelle à la souffrance et à l'image de l'enfant.
        1. La contestation de l'autorité absolue du père, couplée à une volonté de moraliser les classes pauvres.

      Ces lois participaient ainsi à une double logique : la disciplinarisation des pauvres et la moralisation des enfants perçus comme de potentiels futurs délinquants.

      2.3. La Reconfiguration des Mauvais Traitements : Une Triple Redéfinition

      Au XXe siècle, les mauvais traitements ont connu trois redéfinitions successives, les inscrivant dans de nouveaux champs de savoir et d'action.

      2.3.1. Le Cadre Médical : Le "Syndrome de l'Enfant Battu"

      La médecine a joué un rôle crucial dans l'identification des violences familiales.

      Ambroise Tardieu (1860) : Ce médecin légiste français fut le premier à décrire le tableau clinique de sévices et mauvais traitements (hématomes multiples, séquelles de fractures), mais son travail eut peu d'écho à l'époque.

      Frederic Silverman (1953) : Ce radiologue pédiatrique américain identifie l'association de fractures multiples comme un signe de traumatismes.

      En 1962, avec des collègues, il nomme ce tableau le Battered child syndrome ("syndrome de l'enfant battu"), ce qui initiera des lois imposant le signalement des mauvais traitements.

      Aujourd'hui : Le terme de "traumatisme non accidentel" (non-accidental trauma) est préféré pour inclure d'autres formes comme le "syndrome du bébé secoué".

      2.3.2. Le Cadre de la Santé Publique : La "Maltraitance Infantile"

      Le passage à la santé publique a changé l'échelle d'analyse : du cas individuel à la population, du diagnostic à la prévention.

      Nouveau concept : Le terme "maltraitance infantile" (child maltreatment) apparaît en France dans le dernier quart du XXe siècle, porté par la pédiatrie sociale.

      Changement d'échelle : Les médias se focalisent sur les cas extrêmes (infanticides), invisibilisant la "banalité des mauvais traitements".

      Les données épidémiologiques révèlent une prévalence bien plus élevée que ce que les hospitalisations suggèrent.

      Indicateur de Maltraitance Infantile

      Données (États-Unis)

      Données (France)

      Enfants référés pour suspicion (avant 5 ans)

      13,9 % (enquête en Californie)

      Proportion d'enfants victimes (violence/négligence) 19,2 % (enquête nationale 2010) - Taux basé sur les hospitalisations (formes graves) - 0,11 % (fortement sous-estimé) Taux de violence physique (enquêtes pop.) - Entre 4 % et 16 % (travaux internationaux) Taux de violence psychologique (enquêtes pop.) - Entre 4 % et 10 % (travaux internationaux)

      2.3.3. La Reconnaissance Tardive : Les Abus Sexuels

      Les abus sexuels sur enfants ont été occultés ou niés pendant la majeure partie du XXe siècle par différentes sphères :

      La médecine : A longtemps interprété les infections génitales chez les enfants comme des conséquences de la promiscuité ou du manque d'hygiène, désexualisant les pratiques.

      Le politique : Des parlementaires masculins se sont opposés aux réformes féministes.

      Le judiciaire : Le traitement dubitatif et inquisiteur des plaintes était dissuasif pour les victimes.

      La psychanalyse : A pu être instrumentalisée pour conforter le "négationnisme ambiant" en réduisant les accusations à des fantasmes.

      Ce n'est qu'au début du XXIe siècle que le sujet entre dans l'espace public, révélant une ampleur considérable.

      Données sur les Abus Sexuels

      Méta-analyse (Europe/Amérique du Nord)

      Rapport CIIVISE 2023 (France)

      Garçons/Hommes

      2,6% (sans contact) / 4% (avec contact)

      1,5 million d'hommes victimes (6,4%)

      Filles/Femmes

      6,7% (sans contact) / 12,7% (avec contact)

      3,9 millions de femmes victimes (14,5%)

      Total

      Plus d'un adulte sur 10

      Contexte

      8% des cas avant 5 ans

      Inceste dans plus de 8 cas sur 10

      Âge de début

      8,5 ans en moyenne

      3. La Prohibition Mondiale des Châtiments Corporels : Une Progression Inégale

      Parallèlement à la reconfiguration des violences en mauvais traitements, un mouvement mondial d'interdiction des châtiments corporels a émergé.

      3.1. L'Interdiction dans la Sphère Familiale

      Pionnière : La Suède a été le premier pays au monde à inscrire cette interdiction dans sa législation en 1979.

      Progression : En 2000, 11 pays avaient suivi. En 2022, on en comptait 66.

      La France : Condamnée par le Conseil de l'Europe en 2015, la France est devenue le 56e pays à adopter une telle législation le 2 juillet 2019 avec la loi "relative à l'interdiction des violences éducatives ordinaires".

      Pays Réfractaires : En 2022, 133 pays ne l'avaient pas fait, parmi lesquels les États-Unis, le Canada, l'Australie, la Russie, l'Inde et le Royaume-Uni.

      3.2. L'Exceptionnalisme Américain : La Persistance des Punitions à l'École

      L'interdiction des châtiments corporels en milieu scolaire est plus répandue : 136 pays l'avaient adoptée en 2022. L'exception la plus marquante est celle des États-Unis.

      Légalité : La pratique reste autorisée dans les établissements privés (sauf 4 États) et dans les établissements publics de 17 États, principalement dans le Sud.

      Validation par la Cour Suprême : L'affaire Ingraham v. Wright (1977) a validé cette pratique.

      La Cour a jugé que le 8e amendement de la Constitution (interdisant les "punitions cruelles et inhabituelles") ne s'appliquait pas aux élèves car il avait été conçu pour les criminels.

      La Dimension Raciale Invisibilisée : L'analyse souligne que cette affaire présente une dimension raciale cruciale mais ignorée :

        1. La victime était un élève noir.
        1. La scène se déroule dans un État du Sud marqué par l'héritage des lois de ségrégation Jim Crow.
        1. Le mode de sanction (humiliation physique) rappelle celui utilisé pour punir les esclaves.

      "malgré un corpus considérable de recherche en sciences sociales établissant à la fois l'inefficacité des châtiments corporels et les dommages sociaux très graves qui peuvent en résulter, le système judiciaire s'obstine à récuser toute mise en cause de la constitutionnalité de cette forme de punition." - Dina PoKempner Sacks

      3.3. Au-delà du Châtiment Corporel : Les Nouvelles Formes de Discipline Scolaire

      La suppression des punitions physiques ne doit pas occulter la persistance d'autres formes de sanctions, qui affectent de manière disproportionnée les élèves des catégories défavorisées et des minorités ethnoraciales.

      Sanctions classiques : Heures de retenue, exclusions définitives.

      Sanctions moins visibles : Humiliations, stigmatisations.

      Nouveaux motifs : En France, le principe de laïcité tel que défini par la loi du 15 mars 2004 a créé de nouveaux motifs de sanction.

      Pour l'année 2022-2023, 3 881 signalements ont été transmis, dont la moitié pour des tenues comme des jupes ou robes longues.

      4. La Vulnérabilité de la Protection : Le Mineur face à l'État Punitif

      4.1. Le Double Principe : L'Enfant comme Objet de Protection et Sujet de Droit

      La protection accrue des mineurs repose sur une double argumentation, en apparence contradictoire mais qui se renforce mutuellement :

      1. L'enfant comme objet de protection : En raison de son "manque de maturité physique et intellectuelle", il a besoin d'une protection spéciale (Déclaration des Droits de l'Enfant, 1959).

      2. L'enfant comme sujet de droit : Il possède des droits fondamentaux au même titre qu'un adulte, en vertu de sa "dignité et de la valeur de la personne humaine" (Charte des Nations-Unies, 1945).

      Ce double principe fonde l'abolition des châtiments corporels.

      Cependant, il révèle surtout que l'État devient le maître du jeu, revendiquant le monopole de la faculté de punir et s'autorisant lui-même à infliger des châtiments aux enfants dans des cas précis (ex: tribunaux militaires israéliens jugeant des enfants palestiniens dès 12 ans).

      4.2. La Justice Pénale des Mineurs en France : Du Souci de Protection au Durcissement Sécuritaire

      L'histoire de la justice des mineurs en France illustre une tension permanente entre protection et punition.

      L'Ordonnance du 2 février 1945 : Marque un "âge presque révolu" où la protection primait sur la punition.

      Le Durcissement depuis les années 1990 : Sous l'effet du populisme pénal et de l'instrumentalisation de faits divers, la tendance s'est inversée.

      • Érosion des principes de 1945 : Le législateur a progressivement restreint la présomption de non-discernement (avant 13 ans) et l'excuse de minorité, multiplié les lieux d'enfermement et rapproché la justice des mineurs de celle des adultes.

      • ◦ Exemples de mesures : Création de la rétention judiciaire pour les moins de 13 ans (1994), création des centres éducatifs fermés (2002), abaissement temporaire de l'âge de responsabilité pénale à 10 ans.

      La Réponse des Magistrats : Les données statistiques sur la période 2000-2019 montrent un tableau contrasté.

      • ◦ La délinquance juvénile est stable, voire en diminution.

      • ◦ Le taux de réponse pénale augmente fortement (de 78% à 93%).

      • ◦ Les magistrats privilégient les alternatives aux poursuites.

      • ◦ Lorsqu'ils condamnent à la prison, les peines sont plus longues (quantum moyen passant de 5,5 à 9 mois) et les détentions provisoires également (de 3,4 à 7,1 mois).

      • ◦ Les mesures éducatives sont en recul d'un quart, tandis que les sanctions éducatives sont multipliées par six.

      En somme, les magistrats semblent vouloir moins condamner, mais le faire plus lourdement, tout en privilégiant des sanctions à vocation éducative plutôt que des mesures de pur accompagnement.

      4.3. Les Mineurs Non Accompagnés (MNA) : Entre Protection et Répression

      Le cas des mineurs étrangers non accompagnés (MNA) illustre une autre forme de mise en cause de la protection. Ici, la tension n'est pas entre protection et punition, mais entre protection et répression.

      Cadre Juridique : En principe, les MNA ne peuvent se voir opposer leur absence de titre de séjour et doivent être pris en charge par l'Aide Sociale à l'Enfance (ASE), relevant de la protection.

      L'évaluation de leur minorité par les conseils départementaux est donc cruciale.

      Réalité de Terrain (Enquête dans les Hautes-Alpes) :

      L'enquête montre que le protocole légal n'est souvent pas respecté.

      ◦ Refoulements illégaux à la frontière.

      ◦ Politique de "déminorisation" par les services départementaux, soumise à la pression politique sur les coûts. Dans le département étudié, le taux de reconnaissance est passé de 46% en 2017 à 4% en 2019.

      ◦ Logique de suspicion systématique durant les entretiens d'évaluation, où tout élément du récit peut être retourné contre le jeune pour contester sa minorité.

      • Conséquences : Le rejet de leur minorité condamne ces jeunes à la précarité, l'errance et l'exposition à de multiples violences, voire les pousse vers des activités illicites pour survivre.

      5. Conclusion : La Vulnérabilité de l'Exception Mineure

      L'analyse des transformations du châtiment des enfants révèle une dynamique complexe.

      Si la société a évolué vers une plus grande protection des mineurs dans la sphère privée, cette protection est loin d'être absolue et reste éminemment vulnérable.

      Deux logiques distinctes mais convergentes sont à l'œuvre :

        1. Pour les jeunes délinquants : L'enjeu est de réduire la protection accordée par la minorité pénale (en levant l'excuse de minorité, en avançant la capacité de discernement) pour pouvoir appliquer la punition.
        1. Pour les jeunes étrangers : L'enjeu est de refuser la protection en rejetant la déclaration de minorité pour pouvoir appliquer la répression (obligation de quitter le territoire, placement en centre de rétention).

      Dans les deux cas, l'exception dont les mineurs peuvent théoriquement se prévaloir est remise en cause par des politiques exigeant plus de sévérité.

      Le rapprochement opéré dans le débat public entre criminalité et immigration ne fait que renforcer cette tendance, menaçant de faire converger le traitement des mineurs délinquants et des mineurs étrangers vers un même horizon répressif et punitif.

    2. 40 ans après la Suède 4 ans après avoir été condamné par le Conseil de l'Europe pour ne pas avoir prévu je cite d'interdiction suffisamment clair contraignante et précise des châtiments corporels la France qui qui maintenait 00:26:37 jusqu'à l'ORS dans ces textes un droit de correction a finalement été le 56e pays dans le monde à adopté une législation respectant la charte européenne des droits 00:26:48 sociaux ironiquement appelé antifessé comme pour la tournée en dérision la loi relative à je cite l'interdiction des violences éducatives ordinaires a été 00:27:01 voté au Sénat le 2 juillet 2019 en fait les parlementaires s'étent déjà prononcé sur ce principe dans le cadre de la loi dite égalité citoyenneté en 2016 mais le Conseil constitutionnel en avait censuré 00:27:15 l'article car il s'agissait d'un cavalier législatif c'est-à-dire introduit dans le dans un texte de loi sans rapport avec sa substance
    3. dans le département des Alpes en 2017 1243 jeunes ce disant mineurs ont été évalués et 00:55:03 575 soit 46 % ont effectivement été reconnus tel mais 2 ans plus tard alors qu'il n'y avait pourtant plus que 621 dossiers examinés soit la moitié de 00:55:16 2017 seul 26 soit 4 % se sont vu attribuer le le statut protecteur cette politique drastique de de déminorisation instaurée par le Conseil départemental condamnait presque 00:55:30 systématiquement tous ces garçons à une vie de précarité d'érrance d'exposition à des risques bien documentés de violence d'addiction et de prostitution
    4. Résumé de la vidéo [00:00:00][^1^][1] - [00:25:06][^2^][2]:

      Cette vidéo présente une conférence de Didier Fassin sur la faculté de punir, explorant les aspects légaux, sociaux et historiques de la punition. Fassin discute de la légitimité de la punition dans différents contextes, y compris la justice formelle, l'éducation et la famille.

      Points forts: + [00:00:29][^3^][3] Définition de la punition * Infliger une souffrance pour violation de la loi, morale ou norme * Peut être physique ou psychique, légale ou informelle + [00:04:00][^4^][4] Évolution de la punition des enfants * Tendance vers moins de punition dans la famille malgré un État plus punitif * La légitimité du châtiment parental diminue + [00:07:12][^5^][5] Protection légale des enfants en France * Lois de 1889 et 1898 pour protéger les enfants maltraités et abandonnés * Introduction de la déchéance de la puissance paternelle + [00:12:00][^6^][6] Redéfinitions des mauvais traitements * Inclusion dans la chaîne pénale et reconnaissance médicale et publique * Évolution vers la prévention et la sensibilisation aux abus sexuels Résumé de la vidéo [00:25:08][^1^][1] - [00:49:11][^2^][2]:

      Cette partie de la vidéo aborde l'évolution de la législation sur les châtiments corporels, en particulier dans le contexte familial et scolaire. Elle met en lumière les changements dans les lois et les attitudes envers la discipline physique des enfants, en soulignant les différences entre les pays et les évolutions au fil du temps.

      Points forts: + [00:25:08][^3^][3] L'histoire des châtiments corporels * Suppression progressive des châtiments corporels dans de nombreux pays * La France a adopté une législation interdisant les violences éducatives ordinaires en 2019 * Évolution des attitudes et des lois concernant la discipline physique des enfants + [00:28:00][^4^][4] La situation aux États-Unis et au Royaume-Uni * Les États-Unis maintiennent le droit de corriger physiquement les enfants * Le Royaume-Uni a été condamné pour ne pas avoir respecté les droits de l'homme en matière de châtiments corporels * Discussion sur la dimension raciale de la violence scolaire aux États-Unis + [00:33:04][^5^][5] L'exceptionnalisme américain dans la discipline scolaire * Les punitions physiques restent autorisées dans certains États américains * La Cour suprême des États-Unis a validé ces pratiques en 1977 * Analyse de la persistance de ces pratiques malgré leur inefficacité et leurs dommages sociaux + [00:39:01][^6^][6] Le monopole de l'État sur la faculté de punir * L'État revendique le monopole de l'usage légitime de la faculté de punir * Discussion sur la protection des enfants et la responsabilité pénale des mineurs * Évolution de la justice pénale des mineurs vers plus de sévérité depuis les années 1990 Résumé de la vidéo [00:49:13][^1^][1] - [01:01:00][^2^][2]:

      Cette partie de la vidéo aborde la justice pénale des mineurs en France, en mettant l'accent sur la tension entre protection et punition. Didier Fassin discute de l'augmentation des réponses pénales dans un contexte de délinquance juvénile stable ou en baisse, et de la pression politique et policière sur les magistrats. Il souligne également la situation particulière des mineurs non accompagnés (MNA) et les défis liés à leur évaluation et protection.

      Points forts: + [00:49:13][^3^][3] Justice pénale des mineurs * Augmentation des réponses pénales malgré la stabilité de la délinquance * Préférence pour les alternatives aux poursuites plutôt que l'emprisonnement * Pressions politiques et policières influençant les décisions judiciaires + [00:50:53][^4^][4] Mineurs non accompagnés (MNA) * Difficultés d'évaluation de l'âge et de la minorité * Protection offerte par la convention internationale des droits de l'enfant * Enjeux politiques et administratifs liés à la prise en charge des MNA + [00:52:57][^5^][5] Enquête sur le traitement des MNA * Protocole souvent non suivi pour l'évaluation des MNA * Pression politique sur les personnels évaluant la minorité * Conséquences graves du rejet de la minorité pour les jeunes + [00:59:47][^6^][6] Protection des mineurs et politiques de sévérité * Vulnérabilité de la protection accordée aux mineurs * Tension entre protection et répression pour les MNA * Impact des politiques sur la vie des jeunes rejetés

    1. Résumé de la vidéo [00:00:01][^1^][1] - [00:23:31][^2^][2]:

      Cette vidéo présente le jeu Hexagone utilisé dans un lycée pour discuter de la laïcité.

      L'activité ludique encourage les élèves à débattre et à réfléchir de manière critique sur le sujet.

      Le professeur guide les élèves à travers des discussions constructives, en soulignant l'importance de la laïcité comme cadre permettant la liberté de conscience et de culte.

      Points forts: + [00:00:41][^3^][3] Introduction du jeu Hexagone * Utilisé pour engager les élèves dans le débat sur la laïcité * Encourage l'autonomie et l'esprit critique * Les élèves travaillent en équipe pour répondre à des questions + [00:06:00][^4^][4] La laïcité dans l'éducation * Un concept mal compris par les jeunes * Le débat permet d'explorer différentes perspectives * Importance de discuter pour construire une compréhension commune + [00:11:00][^5^][5] Liberté de culte et espaces neutres * Discussion sur la possibilité d'avoir des salles de prière dans les écoles * La laïcité doit préserver la liberté de culte tout en maintenant la neutralité * Les élèves apprennent à naviguer entre les tensions et les ambiguïtés de la laïcité + [00:14:00][^6^][6] La laïcité constructive vs restrictive * La laïcité ne signifie pas cacher sa religion * Il est essentiel de discuter et de clarifier les termes flous * La laïcité devrait être un cadre pour la liberté plutôt que pour la contrainte Résumé de la vidéo [00:23:33][^1^][1] - [00:31:46][^2^][2]:

      Cette partie de la vidéo discute de l'utilisation du jeu "L'Hexagone" dans les lycées pour aborder le sujet de la laïcité. L'intervenant explique comment il intègre cette activité ludique dans ses cours pour faciliter l'apprentissage des élèves sur des sujets complexes, en utilisant des hexagones pour organiser et relier des concepts.

      Points forts: + [00:23:33][^3^][3] Utilisation en classe * Peut être utilisée comme activité d'introduction ou de conclusion * Aide à réactiver les préacquis et à visualiser les enjeux * Encourage la discussion et la négociation entre élèves + [00:25:08][^4^][4] Préparation du matériel * Les élèves découpent et collent les hexagones eux-mêmes * Activité manuelle qui stimule la réflexion et le débat * Nécessite une gestion du temps et une organisation en classe + [00:29:47][^5^][5] Encouragement de l'innovation pédagogique * Pas de prérequis spécifiques pour proposer cette activité * Importance de la formation des enseignants à de nouvelles méthodes * Partage et mutualisation des ressources entre collègues

    1. L'École et les Valeurs de la République : Synthèse de la Journée d'Études

      Résumé

      Ce document de synthèse analyse les thèmes, arguments et données clés présentés lors de la journée d'études intitulée "L'École, un territoire vivant au cœur des valeurs de la République".

      Organisée par l'INSPÉ de l'Académie de Lille, cette journée s'est déroulée dans un contexte marqué par l'attentat d'Arras, conférant une acuité particulière aux débats.

      Les interventions soulignent unanimement la mission première de l'École, inscrite dans le Code de l'éducation, de faire partager les valeurs de la République.

      Cette mission s'ancre dans un héritage historique profond, allant des Lumières aux lois Jules Ferry, et vise à former des citoyens émancipés par la raison et le savoir.

      Une analyse sémantique et juridique révèle que la notion de "valeurs de la République" est d'usage récent, tant dans le discours public que dans les textes de loi, avec une augmentation significative depuis les années 1980.

      Ces valeurs ne sont pas figées ; elles évoluent et s'enrichissent, comme en témoigne l'intégration de la lutte contre les discriminations.

      Le droit ne leur donne pas de définition constitutionnelle, et leur mention prédomine dans le Code de l'éducation et le droit des étrangers.

      Sur le plan pédagogique, un consensus émerge sur la nécessité de dépasser une "pédagogie de la prescription" pour atteindre une "pédagogie de la conviction".

      Cette "approche citoyenne" refuse l'inculcation et promeut la pensée critique, l'expérimentation des valeurs au quotidien et la coopération.

      L'objectif est de permettre aux élèves non seulement de connaître les valeurs, mais de les "éprouver" et d'en ressentir le bénéfice, transformant l'école en un "écosystème de valeurs".

      Enfin, les discussions mettent en lumière les défis contemporains : le poids croissant qui pèse sur l'institution scolaire, le communautarisme, le relativisme et la nécessité de ne pas nier le réel tout en présentant les valeurs comme un idéal à conquérir.

      L'écart entre la valeur et le réel est présenté non comme un échec, mais comme l'espace même de l'engagement citoyen.

      1. La Mission Fondamentale de l'École dans un Contexte de Crise

      Les propos introductifs des différents intervenants ont unanimement rappelé le rôle central et fondateur de l'École dans la transmission des valeurs républicaines, une mission rendue encore plus cruciale par le contexte contemporain.

      1.1 Un Fondement Juridique et Historique

      La mission de l'École est clairement définie par l'article L111-1 du Code de l'éducation, cité à plusieurs reprises, qui stipule que "la nation fixe comme mission première à l’école de faire partager aux élèves les valeurs de la République".

      Cette mission n'est pas un simple "supplément d'âme" mais une obligation professionnelle qui constitue l'armature du projet républicain.

      Les intervenants ont inscrit cette mission dans une profondeur historique :

      Les Lumières et la Révolution : Alain Frugère a évoqué l'esprit des Lumières (Molière), le projet d'instruction publique de Condorcet (1792) qui établit la primauté des savoirs issus de la recherche sur les opinions et les croyances, et le "pari de la raison émancipatrice".

      Le 19ème siècle : Madame Looher a rappelé le projet des républicains de la Troisième République (Gambetta, Ferdinand Buisson) de stabiliser le régime grâce à l'éducation, aboutissant aux lois Jules Ferry de 1881-82 qui instaurent un enseignement fondé sur la gratuité, l'obligation et la laïcité.

      1.2 Le Poids du Contexte Actuel

      La journée d'études, bien que planifiée de longue date, a été profondément marquée par l'assassinat de Dominique Bernard à Arras.

      Cet événement a donné une "coloration tout à fait particulière" aux réflexions, comme l'a souligné Sébastien Jaibovski.

      Ce contexte met en lumière plusieurs tensions :

      Le Poids sur l'Institution : Sébastien Jaibovski a soulevé la question du "poids qui aujourd'hui est très important, peut-être trop important" que la société fait peser sur l'École et ses enseignants.

      La Conquête Permanente : Il a également insisté sur le fait que "les valeurs ne sont jamais acquises mais elles sont toujours à être conquises et à conquérir".

      Les Défis Sociétaux : Alain Frugère a mentionné "le repli sur soi, le communautarisme, l'intolérance voire la haine" comme des défis quotidiens, tandis que Mathieu Clouet a listé les inégalités sociales, les effets de l'économie médiatique et le relativisme.

      2. Analyse de la Notion de "Valeurs de la République"

      L'intervention d'Ismaël Ferrat, professeur des universités, a offert une analyse lexicale et juridique détaillée, démontrant que la notion de "valeurs de la République" est à la fois complexe, évolutive et d'émergence récente.

      2.1 Une Apparition Récente dans le Discours Public et Juridique

      Contrairement à une idée reçue, l'usage du syntagme "valeurs de la République" est un phénomène récent.

      Dans les publications : Une analyse des corpus de textes numérisés (Google Books) et des archives du journal Le Monde montre une quasi-absence du terme jusqu'aux années 1980, suivie d'une "explosion" de son usage à partir de 1989.

      • Dans le droit : L'occurrence du terme dans les codes juridiques français est très faible au début des années 2000 et connaît une forte poussée à partir de 2016.

      Cette augmentation est principalement due à deux codes :

        1. Le Code de l'éducation.
        1. Le Code de l'entrée et du séjour des étrangers et du droit d'asile.

      2.2 Une Définition Juridique Absente et Évolutive

      L'analyse juridique révèle un paradoxe : bien que la notion soit de plus en plus utilisée, elle reste juridiquement insaisissable.

      Absence de définition constitutionnelle : Aucun texte constitutionnel ne définit précisément ce que sont les valeurs de la République. Le Conseil constitutionnel n'a produit aucune étude sur le sujet.

      L'avis du Conseil d'État : Saisi lors du projet de loi "séparatisme", le Conseil d'État a jugé la notion de "valeurs" trop large pour être un principe de droit généralisable, lui préférant celle de "principes républicains".

      Des valeurs évolutives : La liste des valeurs n'est pas figée. La lutte contre les discriminations, par exemple, est une valeur aujourd'hui considérée comme une évidence, alors que le premier article du Code pénal sur ce sujet ne date que de 1994.

      3. L'Approche Pédagogique : De la Prescription à la Conviction

      Mathieu Clouet, représentant l'équipe académique Valeurs de la République, a développé le concept d'une "approche citoyenne des valeurs à l'école", qui se distingue par son refus de l'inculcation au profit d'une adhésion réfléchie.

      3.1 Refuser l'Inculcation, Viser la Conviction

      L'objectif n'est pas seulement de faire connaître les valeurs, mais de les "faire partager".

      Pédagogie de la conviction : "Nous ne pouvons pas nous contenter d'une pédagogie de la prescription, il nous faut trouver la voix d'une pédagogie de la conviction."

      Appel à la raison : Cette approche repose sur l'éducation à la liberté, fait appel à la pensée critique et apprend aux élèves à interroger les valeurs elles-mêmes.

      Les trois dimensions de la valeur : Elle doit prendre en compte les dimensions

      • intellectuelle (contenus),

      • psycho-affective (ressenti) et

      • conative (action).

      3.2 L'École comme "Écosystème de Valeurs"

      Pour que les valeurs aient du prix aux yeux des élèves, ils doivent les "éprouver", c'est-à-dire en ressentir le bénéfice et en tester la réalité.

      Le rôle des savoirs : La transmission des connaissances participe à l'éducation aux valeurs. Citant Catherine Kintzler, Mathieu Clouet parle de la "puissance libératrice des enseignements" : maîtriser un savoir est une expérience concrète de la liberté.

      L'expérience vécue : L'éducation aux valeurs passe aussi par la coopération, la prise de responsabilité et les pratiques participatives. L'école doit être un lieu où les valeurs sont incarnées au quotidien pour éviter les écarts entre le discours et la réalité.

      Inverser la focale : Il est suggéré de replacer les faits négatifs (discriminations, racisme) dans la perspective plus large de la lutte pour l'égalité.

      L'exemple de l'affaire Dreyfus est utilisé pour montrer que la France de l'époque n'était pas seulement celle de l'antisémitisme, mais aussi le seul pays d'Europe où des intellectuels se sont levés pour défendre un Juif.

      4. Étude de Cas : l'Enseignement de la Laïcité

      Ismaël Ferrat a illustré les enjeux de la transmission des valeurs à travers l'exemple de la laïcité, en analysant son traitement dans les programmes scolaires.

      Période

      Occurrence du mot "Laïcité" dans les programmes (élémentaire/collège)

      Contexte et Enjeux

      Années 1970-1980

      Quasiment absente La laïcité est considérée comme une évidence, une "non-notion" sur le plan pédagogique.

      Années 1990-2000

      Forte augmentation

      L'émergence est liée à la nécessité d'expliquer les règles, notamment suite à l'affaire du voile de Creil (1989) et la circulaire Bayrou (1994) sur les signes religieux ostensibles.

      Depuis 2013 (Loi Peillon) Présence stabilisée à un niveau élevé

      Un élève scolarisé aujourd'hui rencontre la notion environ 13 fois entre le primaire et le collège.

      L'enjeu pédagogique est double :

        1. Expliquer le principe : Donner les clés de compréhension d'une valeur fondamentale.
        1. "Déconflictualiser" : Éviter que le principe soit perçu par certains élèves, notamment de culture musulmane, comme étant dirigé "contre l'islam".

      Les résultats sont probants : une étude du Knesco montre que 90 % des élèves de 3e et 80 % des lycéens en terminale ont déjà abordé la laïcité en cours et maîtrisent globalement bien la notion. Cela démontre l'efficacité du travail mené en classe.

      5. Conclusion : La Valeur comme Engagement et "Refus du Réel" La journée d'études se conclut sur une vision exigeante mais volontariste de la mission de l'École.

      La transmission des valeurs de la République n'est pas l'imposition d'un dogme, mais une invitation à participer à un projet collectif de "perpétuelle réinvention démocratique".

      Comme l'a formulé Mathieu Clouet, il faut se souvenir qu'"une valeur ça n'est pas seulement un reflet du réel, une valeur c'est aussi un refus du réel".

      L'écart entre l'idéal prôné par la valeur (l'égalité, la fraternité) et les imperfections de la société n'est pas un signe d'échec.

      Au contraire, "c'est précisément dans cet écart que nous pouvons trouver les moyens d'apporter aux élèves que nous encadrons la volonté d'agir et de s'engager dans la République française".

      L'approche citoyenne des valeurs est donc, en définitive, une preuve de l'engagement citoyen de l'ensemble de la communauté éducative.

    1. Consider, for example, Black Americans, whose darker skin is often not recognized by hand soap and water dispensers in public spaces. This is not a natural limitation of technology—it is a consequence of designers choosing a sensor technology that must necessarily be calibrated for particular skin tones, and then calibrating it for white skin. Design justice argues, then, that some designs, when they cannot be universal, should simply not be made. And if they can be universal, then they should be made in ways that 1) center power inequalities, 2) center the voices of all directly impacted by the design outcomes, 3) prioritize impact on communities over designers’ intents, 4) view designers as facilitators rather than designers, 5) ensure designs are sustainable and community led, and 6) build upon and amplify the solutions that communities have already found.

      I never thought about the fact that there are specific designs that have the power to exclude people, but given the broad scope of design, this makes sense and I agree with the sentiment of design justice. One example I can think of when it comes to design justice is video games having fewer playable characters of darker skin tones. Although games have slowly been moving towards representing women, many do not have the option to change skin tones. This is many times an afterthought. This widened my perspective on the idea that companies should hire designers of different groups to be inclusive of all.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      1. General Statements

      We thank the reviewer for their positive comments regarding the research article titled "The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish" by Uddin GM and colleagues. We appreciate your input, and we will address these comments as indicated below with specific responses to each point raised by reviewers.

      The main changes in the updated manuscript are as follows:

      We have revised the introduction to now incorporate additional background information on mitochondria, NAD, and mitochondrial dynamics and function. This addition aims to provide readers with a broader understanding of the mitochondrial context in relation to our study.

      Furthermore, we recognize that previous studies have explored mitochondrial function in the context of the ketogenic diet. While our specific investigation centered on mitochondrial morphology, we acknowledge the importance of comprehensively investigating mitochondrial function. To this end, we have added new data showing how BHB impacts mitochondrial oxidative phosphorylation in HeLa cells (Sup Fig 2), and how both BHB and NMN impact oxygen consumption/glycolysis in zebrafish (Fig 7).

      We have also added new behaviour analysis of the zebrafish (Fig 6), and have re-framed the discussion around neurodevelopment generally, rather than ASD specifically.

      Finally, we have now included a section in our manuscript that discusses the limitations of our study. These limitations can be further investigated to explore and characterize the full mechanistic potential behind the effects of the ketogenic diet and/or NMN on mitochondrial dynamics.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      *Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Uddin GM and colleagues presented a research article entitled 'The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish'. Roles of ketogenic diet (KD) and NAD+ precursors in health promotion and longevity, as well as on the alleviation of a broad range of diseases are evident. However, their roles in autism are not well done, which is the novelty of the current study. Addressing below questions will improve the quality of the paper.

      Major concerns 1. In the introduction section, a broad overview of the roles of ketogenic diet (KD) in neurodegenerative disease (and ageing, if possible) should be provided. E.g., the authors should summarize exciting progress on the use of KD to treat Alzheimer's disease in animal models (PMID: 23276384). *

      Response: Thank you for your valuable suggestion. While it is true that the KD appears to be beneficial in neurodegenerative (and other disease) models, our focus in this paper is looking at neurodevelopment, rather than all potential benefits of the KD. Nonetheless, we have addressed this comment by incorporating a brief overview of the roles of the KD in neurodegenerative diseases, including Alzheimer's disease (AD), in the introduction section of the manuscript. Specifically, we have summarized the exciting progress made in utilizing KD to treat AD in animal models, as highlighted in the suggested study. This addition helps to provide a better overview of the potential therapeutic effects of KD in neurodegenerative diseases and strengthens the introduction section of the manuscript.

      • Roles of high fat diet to treat diseases could be extended to rare premature ageing diseases. In such scenario, high fat and NAD+ boosting shared some joint mechanisms (PMID: 25440059 ). *

      Response: This information and the reference are now added to the discussion.

      *In the introduction, a more detailed introduction of NAD+ and its roles in mitochondrial homeostasis (especially mitophagy and the mitochondrial fusion-fission balance) should be included (PMID: 24813611; PMID: 30742114; PMID: 31577933). *

      Response: Although our paper focused primarily on mitochondrial fission and fusion, we have incorporated a new paragraph in the introduction to provide a more detailed introduction detailing NAD+ and its roles in mitochondrial homeostasis, specifically highlighting mitophagy. We have included the suggested references.

      • In regarding to the statement of KD increases NAD+, was it due to increased generation (to check protein levels and activities of different NAD+ synthetic enzymes, such as iNAMPT, NMNAT1-3, and NRK) and/or reduced consumption (in addition to reduced glycolysis, does KD inhibit the activities of CD38 and PARPs? In this paper, Sirtuins' activities is (are increased)). Detailed exploration of the activities of these proteins will unveil a clear molecular mechanisms on how KD affects/regulates NAD+. *

      Response: Thank you for the comment. We agree that exploring the detailed mechanism of how the ketogenic diet (KD) affects NAD+ is an interesting question that will have important implications once answered. However, fully elucidating the mechanism of action would require a more comprehensive investigation, which is beyond the scope of this current project. We have now added this as a future direction in the manuscript.

      *Fig. 1: in the NAD+ field, the normal used NR/NMN concentrations are normally high like to use 500 µM to 2-5 mM (as the NAD+ levels in cells are high). In addition to use 50 µM, the authors are strongly to have a dose-dependent study (50 µM, 500µM, 1, 2, 5 mM), and see changes of mitochondrial funciton and parameters. In this condition, NAD+ levels should be also checked. *

      Response: We have added new supplemental data showing the initial dose response of the effects of BHB and NMN on mitochondrial morphology, which led us to choosing the relevant doses for the remainder of the paper. Our objective was not to investigate the broad impacts of different NMN concentrations on mitochondrial function and parameters, or NAD+ levels. As such, we have only focused on doses where we see effects on mitochondrial morphology.

      *Fig. 2: a comprehensive characterization of mitochondrial fusion-fission should be performed. In addition to the protein evaluated, changes on other key fusion-fission proteins, like Bax, Bak, Mfn-1, Mfn-2, etc should be performed (PMID: 17035996; PMID: 24813611). *

      Response: We agree that looking at other key proteins involved in mediating mitochondrial fission and fusion could provide additional insight. Indeed, given the changes in global acetylation that we see, it is expected that some other proteins may also be regulated in this way. However, there are at least a dozen proteins involved in mediating mitochondrial fusion and fission, not to mention many more proteins that regulate these proteins. Unfortunately, it is not feasible to analyze all the proteins involved in mitochondrial fusion-fission. Moreover, looking only at protein levels, doesn't necessarily inform about the activity of any protein. Instead, we concentrated in this paper on investigating known links between protein acetylation and mitochondrial dynamics, particularly focusing on the proteins that have known links to acetylation (i.e., DRP1, OPA1, MFNs). We have added a note in the discussion acknowledging that other means of regulation could also be occurring in parallel.

      *Figs. 1-5 were focused on mitochondrial morphology, whether KD and NMN changed mitochondrial funciton should be explored, such as to use seahorse to check ECR and OCR. *

      Response: Although our question was focused on morphology, we agree that mitochondrial function is important. We have added new data showing that BHB increases basal oxygen consumption in HeLa cells (Sup Fig 2), as well as new data showing that BHB and NMN influence oxygen consumption and glycolysis in our zebrafish model (Fig 7)

      • Fig. 6: NR/NMN used in animal studies (via gavage or in drinking water in mice, and on plate for worms and flies) are normally high (e.g., in drinking water for mice could be 4-12 mM; for worms and flies are normally 1-5 mM); for zebrafish, while they are swimming in water, this reviewer concerned whether it was true that 50 µM of NMN was sufficient to show the benefit presented.*

      Response: Our data show that these doses are indeed sufficient. We did look at some higher doses for NMN, but these were toxic, leading to poor survival and were not studied further.

      *Minor concerns 1. Line 26: For 'a growing list of neurological disorders, including autism spectrum disorder (ASD)', please add AD in. *

      Response: Line 26 is part of the abstract, which we feel should be focused more on the main message of the paper, which does not involve AD. As addressed above, we have added AD as an example in the introduction.

      *Line 57: For 'with side effects such as gastrointestinal disturbances, nausea/vomiting, diarrhea, constipation, and hypertriglyceridemia being reported', rate of frequency shall be provided if any. *

      Response: We have modified the statement to indicate the relative percent of patients suffering the various side effects.

      *Reviewer #1 (Significance (Required)):

      The novelty of the current study was to investigate effects of KD and NAD+ on autism. This investigation was not performed before and thus is the novelty.

      Weakness, effects of KD and NAD+/NMN on mitochondrial function were not well-investigated and should be done. Introduction was not well done, many key information in the fields were not provided which may mislead the readers an over-evaluation of the novelty of the current study.*

      Response: As outlined above, we have edited the introduction to include additional information requested by the reviewer. Moreover, our focus in this manuscript was to look at the mechanisms underlying changes in mitochondrial morphology, not mitochondrial function per se, though this is clearly important and related. Nonetheless, as discussed above, we have also added new data showing how BHB impacts mitochondrial function.

      *My expertise lies in NAD+, mitochondria, and brain health.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      The study examined the effect of beta-hydroxybutyrate and nicotinamide nucleotide on mitochondrial morphology and the molecular pathways which mitigate this effect as well as the effect of these treatments on behavior in zebrafish. The study is well done and well written. The only thing I think that could be improved are the bar in the graph some the significant comparisons. It is sometimes difficult to see which groups are being compared.*

      Response: We're happy to adjust how the data is displayed in the relevant bar graphs, but it is not clear exactly what changes the reviewer would like. To some degree this will depend on the specific guideline of the final journal where we hope the manuscript will be published. As such, we have not made changes at this point.

      ***Referees cross-commenting**

      The other reviewers do have some fair comments. Multiple doses would be helpful and showing bioenergetic data would complement the morphological measurements. Additionally, behavioral assays showing changes in social behavior in the Zebrafish would provide a stronger link to ASD. *

      Response: As discussed above, we have added new information on doses and mitochondrial bioenergetics. With respect to behaviour, we have added thigmotaxis data and reworked the discussion around behaviour and neurodevelopment so that it is less specific to ASD.

      *Reviewer #2 (Significance (Required)):

      As beta-hydroxybutyrate is an important substrate for the ketogenic diet, this study helps explain the potential mechanisms in which the ketogenic diet may enhance mitochondrial function.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      In this paper, Uddin and colleagues have investigated components of the ketogenic diet to understand changes in both mitochondrial morphology and protein expression, and zebrafish locomotor behaviour. They investigate whether beta-hydroxybutyrate (BHB) or nicotinamide nucleotide (NMN) application can later human mitochondria in HeLA cell lines, and also recue a locomotion defect in shank3b+/- zebrafish larvae that have previously been proposed as a model for autism. This study is strengthened by showing data from two species; however the link between the HeLA cell line data and larval zebrafish is not strong. The study would be improved by assessing zebrafish mitochondrial changes after drug application, and testing more than one concentration of BH and NMN in the behavioural assay. This is an interesting study, and it is nicely written and presented. I have made some comments to strengthen the study below.

      Major comments My expertise is in modelling some aspects of autism in zebrafish. To this end I have focussed on the zebrafish part of this manuscript more fully. I have several comments related to the zebrafish experiments. 1. The changes in mitochondrial morphology, peroxisome number and mitochondrial protein levels were measured in HeLA cells and not comparable data is shown for zebrafish. The same experiments should be repeated using larval zebrafish or a zebrafish cell line. *

      Response: We chose to use HeLa cells for the mechanistic studies due to practical reasons. Cell lines offer a controlled and well-established system for investigating cellular processes and molecular mechanisms. Measuring these parameters in tissues is significantly more challenging and requires different reagents (e.g., antibodies) and methodology (electron microscopy) that are not feasible in the current study.

      On the other hand, zebrafish larvae were employed for the behavior studies, which cannot be conducted using cell lines. By utilizing zebrafish, we were able to examine the effects of beta-hydroxybutyrate (BHB) and nicotinamide nucleotide (NMN) on locomotor behavior, providing valuable insights into potential therapeutic implications for autism.

      While we acknowledge the limitations of not directly measuring mitochondrial morphology, peroxisome number, and mitochondrial protein levels in zebrafish, we believe that our study provides significant contributions to understanding the effects of BHB and NMN in zebrafish behavior. Future studies could certainly consider incorporating zebrafish-specific experiments to complement the findings in HeLa cells.

      • How did you choose the concentration of BHB and NMN to use in behavioural experiments? And the timing of application - I don't really understand why you waited 3 days after drug application to measure locomotion. *

      Response: These doses chosen initially as they were similar the doses that induced mitochondrial elongation in HeLa cells and were tolerated by the fish larvae. As we saw promising effects at these initial doses, we decided to explore them in more detail. While we agree that it would be worth comparing the effects of additional doses, as well as looking at their effects at other timepoints, such work would be a major endeavour and is beyond the scope of our initial investigations, which we feel are worth reporting in their current state.

      With respect to the treatment paradigm, fish larvae were treated 10-48 hours post fertilization, as this is a critical neurogenic developmental timepoint that is often used for exposure studies. Fish do not fully hatch until 3-4 days post fertilization, and display only minimal movement before 5 days, which is why we waited until 5 days to look at movement.

      • Do the shank3b+/- larvae show any morphological deficits? Their decrease in locomotion is striking. Is the morphology also rescued by drug application? Can you tie this to the mitochondrial changes that you observed in HeLA cells?*

      Response: We do not observe any gross changes in fish morphology that might explain a decrease in locomotion. Unfortunately, it is not feasible to look at mitochondrial morphology in the fish at this time. However, based on previous published work showing that the ketogenic diet promotes mitochondrial elongation in mouse brains (PMID:32380723), we would expect mitochondrial morphology also to be changed in the fish. Nonetheless, as we have not examined this directly in fish, we are not making this specific claim in this manuscript.

      • In figure 6A you use time spent swimming as a readout of distance. This doesn't really make sense, because without also showing speed of swimming it is not possible to know whether time and distance correlate in the same way across genotypes. This figure could be improved by showing more detail - speed of swimming, time spent immobile etc. This can easily be extracted from the films that you have already made using the ViewPoint software. *

      Response: As requested, we have reanalyzed the zebrafish movement data for a more refined analysis. In the revised version (Fig 6), we include analysis of both speed and distance travelled within a defined time. Importantly, these findings still support differences between WT and shank3b+/- fish that are restored by BHB and NMN to varying degrees.

      • Showing a change in locomotion is not enough to claim that a model is autism-like. At a minimum I think that you need to show changes in social behaviour - likely using older fish (more than three weeks) that interact with each other. Changes in locomotion can be caused by so many factors, many of which are not indicative of autism. It is important that as a field we do not simply claim that locomotion can be used as a proxy for more complex disease phenotypes. This recent review may help you with this point:* https://www.frontiersin.org/articles/10.3389/fnmol.2020.575575/full.

      Response: The reviewer makes an important point that the movement behaviour phenotypes that we see do not necessarily represent classic ASD phenotypes (i.e., repetitive behaviour, reduced sociability, and reduced communication). To begin to address this issue, we analyzed thigmotaxis, which can be a measure of anxiety. Notably, we also see differences that are reversed by BHB and NMN. However, we cannot model all ASD behaviours in a fish model, and we are not set up to look at social behaviour, especially in the young fish that we were studying. As such, even though Shank3 is a recognized ASD gene, and the shank3b+/- model we are studying is a validated ASD model (PMID: 29619162), we have re-phrased the manuscript in the context of neurodevelopment generally, rather than with respect to ASD specifically. As such, we ascribe the movement and thigmotaxis phenotypes as neurodevelopmental phenotypes that are improved by BHB and NMN.

      *For the statistics, as far as I can tell, all of the data should be analysed by ANOVA or the non-parametric equivalent followed by a post-hoc test. Please check this and add information about normality in. *

      Response: As requested, we have clarified our statistical methodology throughout the manuscript.

      For the mechanistic data, we used t-tests for direct comparisons between two groups (e.g., vehicle vs. treatment). While multiple conditions such as vehicles, NMN, BHB, or etomoxir were tested, statistical comparisons were only conducted comparisons between the vehicle and each treatment group individually. As we are not also making comparisons between treatments this is not a multiple comparison, and ANOVA is not applicable in this context. We have clarified this rationale in the manuscript to avoid any confusion.

      For the zebrafish study, where multiple factors were involved (e.g., treatments across different time points or conditions), we performed a two-way ANOVA followed by Tukey's post-hoc test to identify specific group differences. This approach was appropriate for analyzing these datasets and ensures robust conclusion.

      With respect to normality testing, all datasets were assessed for normality using the Shapiro-Wilk test, and no violations of normality were observed. The updated text now includes these details.

      *Minor comments

      1. Make sure that you refer to the fish line as shank3b+/- throughout - see abstract.*

      This has bee corrected.

      • Please add a space between all numbers and units (e.g. 5 Mm). *

      This has bee corrected.

      • There is a spelling error on line 340 page 16: finings instead of findings. *

      This has bee corrected.

      • In figure 1, if each dot represents a different sample, then there appear to be many fewer samples analysed in 1D compared to 1B. Can you comment upon this please*

      __Response: __A total of 80-150 cells were counted per condition, and the analyses were performed on 3 independent replicates with 2 independent technical replicates for each treatment condition. The quantification of mean mitochondrial branch length in Figure 1B was measured using Image-J and the MiNA plugin. The measurements were taken from three independent replicates using a standard region of interest (ROI) and randomly selected areas from each image.

      In Figure 1D, NAD+ levels were measured 24 hours after treatment of vehicle, βHB, NMN, or Eto+βHB in HeLa cells (n=3-6/group). Each sample lysate represents an independent experimental dish from which coverslips were collected for image analysis.

      The difference in sample numbers between Figure 1B and 1D arises because image analysis involves individual cells fixed and stained on coverslips, whereas the NAD assay requires the whole lysate from the entire cell culture dish. Therefore, the higher cell count in Figure 1B represents the number of cells analyzed on coverslips, while Figure 1D represents NAD levels from the lysate normalized to the protein concentration.

      *Reviewer #3 (Significance (Required)):

      I think that this will be interesting to autism researchers and it could lead to more investigation of the ketogenic diet. Some more work is needed, likely in other model organisms, before this research can be translated to human patients. *

      __Response: __We agree that the findings of our study could be of interest to autism researchers and have implications for further investigation of the ketogenic diet (KD). It is important to note that further work, including studies in other model organisms, would be beneficial before translating this research to human patients.

      Our study aimed to provide mechanistic insights into the effects of the KD on mitochondrial morphology and behavior. We recognize that the translation of research findings to human patients requires rigorous investigation, including preclinical and clinical studies. Our study contributes to the understanding of the underlying mechanisms involved in the KD's effects, laying the groundwork for future research and potential therapeutic avenues.

      We appreciate your perspective and emphasize that our intention is to provide valuable insights into the mechanisms underlying the KD's effects rather than suggesting immediate translation to human patients. Further investigation and validation in diverse models and clinical settings will be necessary before considering clinical applications.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      In this paper, Uddin and colleagues have investigated components of the ketogenic diet to understand changes in both mitochondrial morphology and protein expression, and zebrafish locomotor behaviour. They investigate whether beta-hydroxybutyrate (BHB) or nicotinamide nucleotide (NMN) application can later human mitochondria in HeLA cell lines, and also recue a locomotion defect in shank3b+/- zebrafish larvae that have previously been proposed as a model for autism. This study is strengthened by showing data from two species; however the link between the HeLA cell line data and larval zebrafish is not strong. The study would be improved by assessing zebrafish mitochondrial changes after drug application, and testing more than one concentration of BH and NMN in the behavioural assay.

      This is an interesting study, and it is nicely written and presented. I have made some comments to strengthen the study below.

      Major comments

      My expertise is in modelling some aspects of autism in zebrafish. To this end I have focussed on the zebrafish part of this manuscript more fully. I have several comments related to the zebrafish experiments.

      1. The changes in mitochondrial morphology, peroxisome number and mitochondrial protein levels were measured in HeLA cells and not comparable data is shown for zebrafish. The same experiments should be repeated using larval zebrafish or a zebrafish cell line.
      2. How did you choose the concentration of BHB and NMN to use in behavioural experiments? And the timing of application - I don't really understand why you waited 3 days after drug application to measure locomotion.
      3. Do the shank3b+/- larvae show any morphological deficits? Their decrease in locomotion is striking. Is the morphology also rescued by drug application? Can you tie this to the mitochondrial changes that you observed in HeLA cells?
      4. In figure 6A you use time spent swimming as a readout of distance. This doesn't really make sense, because without also showing speed of swimming it is not possible to know whether time and distance correlate in the same way across genotypes. This figure could be improved by showing more detail - speed of swimming, time spent immobile etc. This can easily be extracted from the films that you have already made using the ViewPoint software.
      5. Showing a change in locomotion is not enough to claim that a model is autism-like. At a minimum I think that you need to show changes in social behaviour - likely using older fish (more than three weeks) that interact with each other. Changes in locomotion can be caused by so many factors, many of which are not indicative of autism. It is important that as a field we do not simply claim that locomotion can be used as a proxy for more complex disease phenotypes. This recent review may help you with this point: https://www.frontiersin.org/articles/10.3389/fnmol.2020.575575/full.
      6. For the statistics, as far as I can tell, all of the data should be analysed by ANOVA or the non-parametric equivalent followed by a post-hoc test. Please check this and add information about normality in.

      Minor comments

      1. Make sure that you refer to the fish line as shank3b+/- throughout - see abstract.
      2. Please add a space between all numbers and units (e.g. 5 Mm).
      3. There is a spelling error on line 340 page 16: finings instead of findings.
      4. In figure 1, if each dot represents a different sample, then there appear to be many fewer samples analysed in 1D compared to 1B. Can you comment upon this please?

      Significance

      I think that this will be interesting to autism researchers and it could lead to more investigation of the ketogenic diet. Some more work is needed, likely in other model organisms, before this research can be translated to human patients.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Uddin GM and colleagues presented a research article entitled 'The Ketogenic Diet Metabolite 1 β-Hydroxybutyrate Promotes Mitochondrial Elongation via Deacetylation and Improves Autism-like Behaviour in Zebrafish'. Roles of ketogenic diet (KD) and NAD+ precursors in health promotion and longevity, as well as on the alleviation of a broad range of diseases are evident. However, their roles in autism are not well done, which is the novelty of the current study. Addressing below questions will improve the quality of the paper.

      Major concerns

      1. In the introduction section, a broad overview of the roles of ketogenic diet (KD) in neurodegenerative disease (and ageing, if possible) should be provided. E.g., the authors should summarize exciting progress on the use of KD to treat Alzheimer's disease in animal models (PMID: 23276384).
      2. Roles of high fat diet to treat diseases could be extended to rare premature ageing diseases. In such scenario, high fat and NAD+ boosting shared some joint mechanisms (PMID: 25440059 ).
      3. In the introduction, a more detailed introduction of NAD+ and its roles in mitochondrial homeostasis (especially mitophagy and the mitochondrial fusion-fission balance) should be included (PMID: 24813611; PMID: 30742114; PMID: 31577933).
      4. In regarding to the statement of KD increases NAD+, was it due to increased generation (to check protein levels and activities of different NAD+ synthetic enzymes, such as iNAMPT, NMNAT1-3, and NRK) and/or reduced consumption (in addition to reduced glycolysis, does KD inhibit the activities of CD38 and PARPs? In this paper, Sirtuins' activities is (are increased)). Detailed exploration of the activities of these proteins will unveil a clear molecular mechanisms on how KD affects/regulates NAD+.
      5. Fig. 1: in the NAD+ field, the normal used NR/NMN concentrations are normally high like to use 500 µM to 2-5 mM (as the NAD+ levels in cells are high). In addition to use 50 µM, the authors are strongly to have a dose-dependent study (50 µM, 500µM, 1, 2, 5 mM), and see changes of mitochondrial funciton and parameters. In this condition, NAD+ levels should be also checked.
      6. Fig. 2: a comprehensive characterization of mitochondrial fusion-fission should be performed. In addition to the protein evaluated, changes on other key fusion-fission proteins, like Bax, Bak, Mfn-1, Mfn-2, etc should be performed (PMID: 17035996; PMID: 24813611).
      7. Figs. 1-5 were focused on mitochondrial morphology, whether KD and NMN changed mitochondrial funciton should be explored, such as to use seahorse to check ECR and OCR.
      8. Fig. 6: NR/NMN used in animal studies (via gavage or in drinking water in mice, and on plate for worms and flies) are normally high (e.g., in drinking water for mice could be 4-12 mM; for worms and flies are normally 1-5 mM); for zebra fish, while they are swimming in water, this reviewer concerned whether it was true that 50 µM of NMN was sufficient to show the benefit presented.

      Minor concerns

      1. Line 26: For 'a growing list of neurological disorders, including autism spectrum disorder (ASD)', please add AD in.
      2. Line 57: For 'with side effects such as gastrointestinal disturbances, nausea/vomiting, diarrhea, constipation, and hypertriglyceridemia being reported', rate of frequency shall be provided if any.

      Significance

      The novelty of the current study was to investigate effects of KD and NAD+ on autism. This investigation was not performed before and thus is the novelty.

      Weakness, effects of KD and NAD+/NMN on mitochondrial function were not well-investigated and should be done. Introduction was not well done, many key information in the fields were not provided which may mislead the readers an over-evaluation of the novelty of the current study.

      My expertise lies in NAD+, mitochondria, and brain health.

    1. =0.04). This is contrary to what one wouldhave expected, and we regard it as a chance finding. Wenote that our meta-regression was based on a subjectiveranking of the possibility of a physiological effect ofplacebo, and that both the subgroup analysis and themeta-regression are observational in nature. However,our findings are similar to that of a randomised trialreporting no difference in analgesic effect betweenthree types of placebo acupuncture: acupunctureconsidered specific for another disease, needle inser-tion at non-acupuncture points, and non-penetrativesimulated acupuncture.18We found a tendency for an increase in the use ofanalgesic drugs in the no acupuncture groups com-pared with the placebo and acupuncture groups, whichwould tend to underestimate the effect of placeboacupuncture. We found no tendency for any differencein use of concomitant treatment between the placebogroups and the acupuncture groups.Our sensitivity analyses of the authors’ primaryoutcomes found slightly larger effects of acupunctureand placebo, as well as more heterogeneous results.However, the trials had very dissimilar primaryoutcomes (such as days with headache and number ofanalgesic doses) and primary outcomes in clinical trialsare often changed retrospectively.14 We excluded onetrial as a clear outlier, but the proportion of excludedpatients was small and had little effect on our effectestimates.Other studiesOur finding of limited, at best, analgesic effects ofacupuncture corresponds with the seven Cochranereviews on acupuncture for various types of pain,which all concluded that no clear evidence existed of ananalgesic effect of acupuncture.19-25 Most stressed themethodological shortcomings of the included trials.Our finding of a moderate difference betweenplacebo acupuncture and no acupuncture (standar-dised mean difference −0.42) agrees fairly well with ourprevious review of the effect of placebo in general.9 10Although we previously found an overall difference instandardised mean difference of −0.25 for pain, we alsosaw a tendency for larger effects when the placebointervention was procedural—for example, a shamacupuncture needle (standardised mean difference−0.33)—and not merely a placebo tablet (standardisedmean difference −0.20).Meaning of our reviewInterpreting a standardised mean difference clinicallymay be challenging. On the basis of the mean standarddeviation from the trials that had used visual analoguescales, the effect of acupuncture (standardised meandifference −0.17, −0.26 to −0.07) corresponds to areduction of 4 (2 to 6) mm on a 100 mm scale. The effectof placebo acupuncture (standardised mean difference−0.42 (−0.60 to −0.23), corresponds to a reduction of 10(6 to 15) mm.Attempts at defining a clinically minimal painimprovement have reached quite different conclusionsand have often reported percentage improvement andnot an absolute effect size as we have.26 27 However, aconsensus report characterised a 10 mm reduction on a100 mm visual analogue scale as representing a“minimal” change or “little change.”27 Thus, theapparent analgesic effect of acupuncture seems to bebelow a clinically relevant pain improvement.Our pooled effect of placebo acupuncture (standar-dised mean difference −0.42) is based on trials witheffects that vary much more than is expected by chance.Some of the large trials report an effect of placeboacupuncture that is of clear clinical relevance—forexample, standardised mean difference −0.95,w2 corre-sponding to 24 mm on a 100 mm visual analogue scale—whereas others find effects that seem to be of limitedRESEARCHpage 6 of 8 BMJ | ONLINE FIRST | bmj.comProtected by copyright, including for uses related to text and data mining, AI training, and similar technologies. .by gueston 26 September 2025https://www.bmj.com/Downloaded from27 January 2009.10.1136/bmj.a3115 onBMJ: first published as

      Results were consistent even in the best-designed trials and when experienced acupuncturists chose points individually, which adds confidence to the findings

    2. Type of placebo acupunctureWe ranked the various placebo acupuncture inter-ventions on a 1-5 scale, where 1 represents a placebotreatment that was most likely to produce physiologicaleffects. We ranked needling at acupuncture pointswithout electrical stimulation but indicator lights on as1w3 w6 ; needling at non-acupuncture points with elec-trical stimulation as 2 w5; superficial needling at non-acupuncture points (20-50 mm) avoiding Qi andmanual stimulation as 3w1 w2 w7 w8 w10-w12; non-penetrat-ing needle as 4w9; and laser turned off, held over thesymptomatic points without using any mechanicalpressure as 5.w4A meta-regression of the 12 trials found nostatistically significant relation between the type ofplacebo intervention and the effect of acupuncture(P=0.60). Supplementary subgroup analyses found astatistically significant difference in effect of acupunc-ture between the two trials using non-penetrative
      1. Needling real acupuncture points without electrical stimulation but with indicator lights.

      2: Needling non-acupuncture points with electrical stimulation.

      3: Superficial needling at non-acupuncture points, avoiding Qi or manual stimulation.

      4: Non-penetrating needle.

      5: Inactive laser held over points without pressure.

    1. Reviewer #1 (Public review):

      Summary:

      Planar cell polarity core proteins Frizzled (Fz)/Dishevelled (Dvl) and Van Gogh-like (Vangl)/Prickle (Pk) are localized on opposite sides of the cell and engage in reciprocal repression to modulate cellular polarity within the plane of static epithelium. In this interesting manuscript, the authors explore how the anterior core proteins (Vangl/Pk) inhibit the posterior core protein (Dvl). The authors propose that Pk assists Vangl2 in sequestering both Dvl2 and Ror2, while Ror2 is essential for Dvl to transition from Vangl to Fz in response to non-canonical Wnt signaling. Nevertheless, there are several major and minor points that affect the strength of the author's proposed model (and are listed below).

      Strengths:

      The strengths of the manuscript are found in the very interesting and new concept along with supportive data for a model of how non-canonical Wnt induces Dvl to transition from Vangl to Fz with an opposing role for PK and Vangl2 to suppress Dvl during convergent extension movements. Ror is key player required for the transition and antagonizes Vangl.

      Weaknesses:

      The weaknesses are in the clarity and resolution of the data that forms the basis of the model. In addition to general whole embryo morphology that is used as evidence for CE defects, two forms of data are presented, co-expression and IP, as well as a strong reliance on IF of exogenously expressed proteins. Thus, it is critical that both forms of evidence be very strong and clear, and this is where there are deficiencies; 1) For vast majority of experiments general morphology and LWR was used as evidence of effects on convergent extension movements rather than keller explants or actual cell movements in the embryo. 2) the microscopy would benefit from super resolution microscopy since in many cases the differences in protein localization are not very pronounced. 3) the IP and Western analysis data often shows very subtle differences, and some cases not apparent.

      Major points.

      (1) Assessment of CE movement

      The authors conducted an analysis of the subcellular localization of PCP core proteins, including Vangl2, Pk, Fz, and Dvl, within animal cap explants (ectodermal explants). The authors primarily used the length-to-width ratio (LWR) to evaluate CE movement as a basis for their model. However, LWR can be influenced by multiple factors and is not sufficient to directly and clearly represent CE defects. While the author showed that Prickle knockdown suppresses animal cap elongation mediated by Activin treatment, they did not test their model using standard assays such as animal cap elongation or dorsal marginal zone (DMZ) Keller explants. Furthermore, although various imaging analyses were performed in Wnt11-overexpressing animal caps and DMZ explants, the Wnt11-overexpressing animal caps did not undergo CE movement. Given that this study focuses on the molecular mechanisms of Vangl2 and Ror2 regulation of Dvl2 during CE, the model should be validated in more appropriate tissues, such as DMZ explants.

      (2) Overexpression conditions

      Another concern is that most analyses were performed with overexpression conditions. PCP core proteins (Vangl2, Pk, Dvl, and Fz receptors) are known to display polarized subcellular localization in both the neural epithelium and DMZ explants (Ref: PCP and Septins govern the polarized organization of the actin cytoskeleton during convergent extension, Current Biology, 2024). However, in this study, overexpressed PCP core proteins failed to show polarized localization. Previous studies, such as those from the Wallingford lab, typically used 10-30 pg of RNA for PCP core proteins, whereas this study injected 100-500 pg, which is likely excessive and may have created artificial conditions that confound the imaging results.

      (3) Subtle and insufficient effects

      Several of the reported results show quite modest changes in imaging and immunoprecipitation analyses, which are not sufficient to strongly support the proposed molecular model. For example, most Dvl2 remained localized with Fz7 even under Vangl2 and Pk overexpression (Fig. 4). Similarly, Wnt11 overexpression only slightly reduced the association between Vangl2 and Dvl2 (Sup. Fig. 8), and the Ror2-related experiments also produced only subtle effects (Fig. 8, Sup. Fig. 15).

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review)

      The weaknesses are in the clarity and resolution of the data that forms the basis of the model. In addition to whole embryo morphology that is used as evidence for convergent extension (CE) defects, two forms of data are presented, co-expression and IP, as well as a strong reliance on IF of exogenously expressed proteins. Thus, it is critical that both forms of evidence be very strong and clear, and this is where there are deficiencies; 1) For vast majority of experiments general morphology and LWR was used as evidence of effects on convergent extension movements rather than Keller explants or actual cell movements in the embryo. 2) The study would benefit from high or super resolution microscopy, since in many cases the differences in protein localization are not very pronounced. 3) The IP and Western analysis data often show subtle differences, and not apparent in some cases. 4) It is not clear how many biological repeats were performed or how and whether statistical analyses were performed. 

      (1) To more objectively assess the convergent extension phenotypes, we developed a Fiji macro to automatically quantify the LWR in various injected Xenopus embryos, as detailed in the Methods section. We acknowledge that a limitation in the current manuscript is how to link our mechanistic model at the molecular level with the actual cellular behavior during convergent extension, and we plan to perform cell biological studies in the future to elucidate the link;

      (2) We have repeated some of the imaging experiments in DMZ explants using a Zeiss LSM 900 confocal equipped with Airyscan2 detector that can increase the resolution to ~100 nm. The new data are in Suppl. Fig. 4, 9, 11, 16;

      (3) We have repeated all IP and western blots at least three times and provided quantification and statistical analyses;

      (4) We have added the information on biological repeats and statistical analyses in all figures and figure legends.

      Reviewer #2 (Public Review):

      The protein localization experiments in animal cap assays are for the most part convincing, but with the caveat that the authors assume that the proteins are acting within the same cell. As Fzd and Vangl2 are thought to localize to opposite cell ends in many contexts, can the authors be sure that the effects they observe are not due to trans interactions? 

      In our previous publication, we provided evidence that Vangl is necessary and sufficient to recruit Dvl to the plasma membrane within the same cell (Figure 3 in 10.1093/hmg/ddx095). In a more recent publication ( 10.1038/s41467-025-57658-0 ), we further elucidated a mechanism through which Dvl oligomerization switches its binding from Vangl to Fz, and determined that Dvl binding to Vangl and Fz are differentially mediated by its PDZ and DEP domain, respectively. In the current manuscript, we also performed co-IP experiment under various conditions to demonstrate binding between Dvl and Vangl. We feel that these evidences together provide a strong argument for our model where Vangl2 acts within the same cell to sequester Dvl from Fz.

      In regards to the Dvl patches induced by Wnt11 (Fig. 3 and Suppl. Fig. 9), we performed separate injection of EGFP- and mSc-tagged Dvl into adjacent blastomeres, and demonstrated that the Wnt11-induced patches arise from symmetrical accumulation of Dvl at contact of two neighboring cells (Suppl. Fig. 9a-c’). This scenario is different from epithelial PCP where Fz/Dvl and Vangl/Pk are asymmetrically accumulated at the contact between two adjacent cells.

      The authors propose a model whereby Vangl2 acts as an adaptor between Dvl and Ror, to first prevent ectopic activation of signaling, and then to relay Dvl to Fzd upon Wnt stimulation. This is based on the observation that Ror2 can be co-IPed with Vangl2 but not Dvl; and secondly that the distribution of Ror2 in membrane patches after Wnt11 stimulation is broader than that of Fzd7/Dvl, while Vangl2 localizes to the edges of these patches. The data for both these points is not wholly convincing. The co-IP of Ror2 and Vangl2 is very weak, and the input of Dvl into the same experiment is very low, so any direct interaction could have been missed. Secondly, the broader distribution of Ror2 in membrane patches is very subtle, and further analysis would be needed to firm up this conclusion. 

      (1) We repeated the co-IP experiment with Myc-tagged Vangl or Dvl. Using the same anti-Myc antibody and experimental condition (including the expression level of Vangl, Dvl and Ror2), we still found that Ror2 could be pulled down by Vangl but not Dvl (Suppl. Fig. 15b). Whereas this data confirms our previous conclusion, we acknowledge that a negative data does not fully exclude the possibility for direct biding between Ror and Dvl.

      (2) We re-analyzed the signal intensity of Dvl and Ror in Wnt11-induced patches. By quantifying the intensity ratio between Ror and Dvl along the patches, we found an increase over two folds at the border of the patches (Fig. 7j, bottom panel). We interpret this data to suggest that Ror is accumulated to a higher level than Dvl at the patch borders.     

      A final caveat to these experiments is that in the animal cap assays, loss of function and gain of function both cause convergence and extension defects, so any genetic interactions need to be treated with caution i.e. two injected factors enhancing a phenotype does not imply they act in the same direction in a pathway, in particular as there are both cis/trans and positive/negative feedbacks between the PCP proteins. 

      We agree with the reviewer that a difficulty in studying PCP/ non-canonical signaling is that both loss and gain of function of any its components can cause convergence and extension defects. Genetic interactions, especially synergistic interactions, should be interpreted with caution. But we do want to point out that, in a number of case, we were also able to demonstrate epistasis. For instance, we found that Dvl2 over-expression induced CE defects can be rescued by Pk over-expression (Fig. 1e and f), whereas Vangl/ Pk co-injection induced severe CE defects can be reciprocally rescued by Dvl2 over-expression (Fig. 1g). Likewise, we showed that Fz2/ Dvl2 co-injection induced CE defects can be rescued by wild-type Vangl2 but not Vangl2 RH mutant (Suppl. Fig. 6b), and Ror2 can rescue Vangl2 overexpression induced CE defect (Suppl. Fig. 14). Collectively, these functional interaction data consistently demonstrate an antagonism between Dvl/ Fz/ Ror2 and Vangl2/ Pk, which is correlated with our imaging and biochemical studies.

      As you can see from the reviews, the referees generally agree that your paper is a potentially valuable contribution to the field. Your observations are important because of the novel model based on the inhibitory feedback regulation between planar cell polarity (PCP) protein complexes. However, the reviewers also stated that the model is only partly supported by data because of insufficient clarity and missing controls in several experiments supporting the proposed model. The paper would be significantly improved if your conclusions are backed up by additional experimentation. Specifically, the referees wanted to see the reproducibility of the results shown in Figures 3, 4, 8, S3, S7, S12. 

      We hope that you are able to revise the paper along the lines suggested by the referees to increase the impact of your study on the current understanding of PCP signaling mechanisms. 

      We thank the reviewers for careful reading of our manuscript and for their constructive critiques and suggestions. We have repeated the animal cap studies in original Figures 3, 4, 8 and S3 with DMZ explants, and the new data are in Supplementary Fig. 9, 11, 16 and 4, respectively. We also repeated the biochemical studies in original Figure S 7and 12, and the new data are in Supplementary Fig. 8 and 15.

      Reviewer #1 (Recommendations For The Authors):

      Major points:(1) The author conducted an analysis of the subcellular localization of PCP core proteins, including Vangl2, Pk, Fz, and Dvl, within animal cap explants (ectodermal explants). To validate the model proposing that 'non-canonical Wnt induces Dvl to transition from Vangl to Fz, while PK inhibits this transition, and they function synergistically with Vangl to suppress Dvl during Convergent Extension (CE),' it is crucial to assess the subcellular localization of PCP core proteins in dorsal marginal zone (DMZ) cells, which are known to undergo CE. Notably, the overexpression of Wnt11 alone, as employed by the author, does not induce animal cap elongation. Therefore, the use of animal cap explants may not be sufficient to substantiate the model during Convergent Extension (CE). Indeed, previous knowledge indicates that Vangl2 and Pk localize to the anterior region in DMZ explants. However, the results presented in this manuscript appear to differ from this established understanding. Consequently, to provide more robust support for the proposed model, it is advisable to replicate the key experiments (Figures 3, 4, 8, and Figure S3) using DMZ explants. 

      We repeated the experiments in Figure 3, 4, 8 and Figure S3 with DMZ explant and the new data are in new Supplementary Fig. 9, 11, 16 and 4, respectively.In regards to “previous knowledge indicates that Vangl2 and Pk localize to the anterior region in DMZ explants”, we are aware Vangl/ Pk localization to the anterior cell cortex in neural epithelium from the studies by the Sokol and Wallingford labs, but are not aware of similar reports in DMZ explants. When we examined the localization of small amount of injected EGFP-mPk2 (0.1 ng mRNA) in DMZ explants, we saw a somewhat uniform distribution on the plasma membrane (Suppl. Fig. 4). In addition, in a related recent publication, we examined endogenous XVangl2 protein localization in activin induced animal cap explants that do undergo CE. What we observed was that whereas low level injected Dvl2 and Fz form clusters on the plasma member, endogenous XVangl2 remains uniformly distributed on the plasma membrane (Suppl. Fig. 3S-Z in 10.1038/s41467-025-57658-0 ). These observations may suggest potential differences of PCP protein localization during neural vs. mesodermal convergence and extension.

      (2) The author suggests that 'Vangl2 and Pk together synergistically disrupt Fz7-Dvl2 patches.' As shown in Figure 4 (panels J' to I'), it is evident that the co-expression of Pk and Vangl2 increases Fz7 endocytosis. Nevertheless, a significant amount of Fz7 still co-localizes with Dvl2. To strengthen the author's hypothesis, additional clear assay is required such as Fluorescence resonance energy transfer (FRET) assay. 

      We appreciate this valuable advice. Since none of the tagged Fz/ Dvl/ Vangl proteins we had were suitable for FRET, we made proteins tagged with mClover and mRuby2, which were reported as optimized FRET pairs. But in our hands mRuby2 seems to require very long time (~2 days) to mature and become detectable at room temperature, and is not suitable for our Xenopus experiments. We are in the process of establishing a luciferase based NanoBiT system to detect Fz-Dvl and Dvl-Vangl interactions in live cells and cell lysates, and will use it in future studies to investigate their interaction dynamics.

      For the current manuscript, we reason that a substantial reduction of Fz7-Dvl2 clusters with Vangl2/ Pk co-injection would still support our idea that Vangl2 and Pk act synergistically to sequester Dvl from Fz to prevent their clustering in response to non-canonical Wnt ligands.

      (3) The IP data is less clear and evident. A couple of examples are: a) Fig 2g where the authors report that the Vangl2 R177H variant reduced Vangl2 interaction with Pk and recruitment of Pk to the plasma membrane, but it appears that the variant interacts slightly better than WT Vangl2 with Pk. In Fig. S7a, the authors state that Pk overexpression can indeed significantly reduce Wnt11-induced dissociation of EGFP-Vangl2 and Flag-Dvl2 in the DMZ. However, there is a minimal impact when compared to the Wnt11 absent control. Based on the results presented in Fig S12a the authors indicate that Wnt11 reduces the association between Vangl2 and Dvl2, which can be discerned, but loss of Ror2 does not change this in any obvious way - but the authors indicate it does. In S12b, the authors have suggested that Ror and Dvl do not form a direct binding interaction. However, the interpretation of Figure S12b is not entirely convincing due to several issues. Notably, the expression levels of each protein appear inconsistent, the bands are not sufficiently clear, and there is the detection of three different tag proteins on a single blot. To strengthen the validity of these findings, it is advisable to repeat this experiment with improved quality. 

      We repeated all the co-IP and western blot analyses pointed out by the reviewer, and performed quantification and statistical analyses.

      Fig 2g had a mistake in the labeling and is replaced with new Figure 2g;

      Fig. S7a is replaced by new data in Supplementary Figure 8a and b;

      Fig. S12a and 12b are replaced by new data in Supplementary Figure 15a, a’ and b, respectively. In 15a and a’, we noticed a consistent decrease of Dvl2-Vangl2 co-IP in Xror2 morphant. The reason for this is not yet clear and will need further study in the future.

      Minor points: (1) In all the whole embryo injection assays examining morphology, no Western analysis is performed to show roughly equivalent and appropriate levels of the various proteins are being expressed. Differences will affect the data. 

      Although we did not do western analyses to examine the protein levels in various functional interaction assays, we did examine how co-expression of Vangl2, mPk2 or Dvl2 may impact each other’s protein levels in Supplementary Fig. 2, which did not reveal any significant change when co-injected in different combination.

      (2) The author's prior publication (Bimodal regulation of Dishevelled function by Vangl2 during morphogenesis, Hum Mol Genet. 2017) presented clear evidence of Vangl2 overexpression inducing Dvl2 membrane localization. However, Figure S4 in the current manuscript did not provide clear evidence of membrane localization. To strengthen the hypothesis that Vangl2-RH mutant also induces Dvl2 membrane localization, further comprehensive imaging analysis is needed. 

      We re-analyzed the imaging data and replaced old Figure S4 with a new Supplementary Fig. 5.

      (3) In Supplementary Figure 9, the authors propose that the overexpression of Vangl2/Pk induces Fz7 endocytosis, as indicated by its co-localization with FM4-64. However, it raises a question: how does the Fz7-GFP protein internalize into the cells without endocytosis, as seen in Figures S9a-c'? To enhance readers' understanding, a discussion addressing this point should be included. 

      We think that this might be a technical issue. As detailed in the Method section, we only incubated the embryos transiently with FM4-64 for 30 minutes, and the embryos were subsequently washed and dissected in 0.1X MMR without the dye. Therefore, only the Fz7-GFP protein endocytosed during the 30 minute-incubation would be labeled by FM-64, whereas that endocytosed before or after the incubation would not. Alternatively, the very few Fz7-GFP puncta occasionally observed in the absence of Vangl2/Pk overexpression could be vesicles trafficking to the plasma membrane.

      (4) Statistical analyses are absent for several results, including those in Figure 2f, Figure S4d, and Figure S7b. 

      We repeated these experiments and included statistical analyses. The new data are in Figure 2f, Supplementary Fig. 5d and Supplementary Fig. 8b.

      (5) This manuscript lacks any results regarding Ck1. Therefore, it is advisable to consider removing the discussion or mention of CK1. 

      We agree, and tune down the discussion on CK1 and removed CK1 from our model in Fig. 9.

      Reviewer #2 (Recommendations For The Authors):

      (1) In all the convergence and extension assays, the authors should report n numbers (i.e. number of animals), what statistical test is used, and what the error bars show. Ideally dot-plots would be used instead of bar charts as they give a better insight into the data distribution. It might be useful to give a section on the statistical analyses used in the M&M, including e.g. any power calculations carried out, as now required by many journals. 

      We have follow the advice to use dot-plots for all the quantification analyses in the manuscript. We include in the figure legends the statistical test used and what the error bars show. The number of embryos analyzed were included in each panel in the figures. We also provided more details in the Methods section on how the LWR quantification was carried out.

      (2) I think Figure 2g is wrongly labelled? FLAG bands are in all three lanes in the western blot, but not labelled as such in the schematic. 

      We corrected the schematic labeling in Figure 2g, and thank the reviewer for catching this mistake.

      (3) In Figure S7, the authors show that co-IP of Dvl and Vangl2 is reduced by Wnt11 and the effects of Wnt are blocked by Pk. Does Pk have any effect in the absence of Wnt? 

      We examined the effect of Pk over-expression on Dvl2-Vangl2 co-IP as advised, and did not see a significant impact in the absence of Wnt11 co-injection. The data is included in the new Supplementary Figure 8a. We interpret the data to suggest that “at least under the condition of our co-IP experiment, Pk may not directly impact the steady-state binding between Vangl and Dvl”.

      (4) In Figure 3, the authors show (as published previously) that Wnt11 induces patches of Dvl at the plasma membrane. It would be useful to see Dvl in the absence of Wnt and Vangl2/Dvl in the absence of Wnt. 

      Dvl is widely known as a cytoplasmic protein and its localization has been published by many labs over the past 20-30 years. In our recent publication (10.1038/s41467-025-57658-0 ), we also re-examined Dvl localization when injected at various dosages. So we did not feel it was necessary to show its localization in the absence of Wnt11 again, but included a reference to our prior publication. In regards to Vangl/Dvl distribution in the absence of Wnt11, the readers can see Suppl. Fig. 5b as an example, in addition to our previous publications referenced in the manuscript.

      (5) In the review figures, the difference in Fz7-GFP patch formation in d' and e' (vs e.g. a') is not very clear. Could the images be improved or (better) quantified in some way? 

      We assume that “review figures” refer to Figure 3 or 4? If so, we felt that Fz7-GFP patch formation was clear in Fig. 3d’, e’ or Fig. 4d’, e’. Nevertheless, we repeated these experiments in DMZ explants as advised by Reviewer 1, and additional examples of Fz7-EGFP patch formation can be seen in the new Suppl. Fig. 9d-f’ and Suppl. Fig. 11d-f’.

      (6) In Figure 6d, I'm concerned that the loss of flag-Dvl2 might occur via dephosphorylation in the IP reaction. Also the M&M don't include methodological details about buffers and whether phosphatase inhibitors were used. A compelling control would be anti-FLAG pulldown showing retention of phosphorylation. Also Figure 6f shows a reduced ratio of fast-to-slow migrating bands of Dvl with Vangl2/Pk - unless I have misunderstood, is this ratio the wrong way round? 

      We added co-IP buffer and protease inhibitor information in Methods.

      We agree that the concern about dephosphorylation during IP reaction is valid, and that direct pull down of Dvl to show the phosphorylated form is a compelling control. We therefore note that in Suppl. Fig. 8a and 15b, direct pull down of Flag-Dvl or Myc-Dvl (with anti-Flag or anti-Myc) did show the slower migrating, phosphorylated form. Additional examples in which Vangl only co-IP the faster migrating unphosphorylated Dvl include Suppl. Fig. 15a, and in a related paper we published recently (Fig. 3R and R’ in 10.1038/s41467-025-57658-0 ).

      Finally, we did wrongly label Figure 6f in the last submission, and the ratio should have been “slow/fast”. We have made the correction, and appreaicte the reviewer for the meticulousness in perusing our manuscript.

      (7) In Figure 7, what does Ror2 look like in the absence of Wnt11? 

      We included new Figure 7a-c to show that without Wnt11 co-injection, Ror2 is uniformly distributed on the plasma membrane.

      (8) Also in Figure 7, Ror2 patches are said to be slightly wider than Dvl2 patches "reminiscent of Vangl2" - I wouldn't describe them as being similar. Vangl2 shows a distinct dip in the center of the Dvl patches, Ror2 does not show a dip, and is only (at best) in a slightly wider patch, and I would want to see further examples to be convinced that the localization domain is reproducibly wider. The merge of many samples in 7d may actually be making the distribution harder to see and if the Xror2 and Dvl2 intensities were normalized I'm not sure how different the curves would appear. (i.e. the Xror2 curve looks like a flattened version of the Dvl2 curve). 

      We have added an additional panel in the new Figure 7j to compare the intensity ratio of Ror/ Dvl2 along the patches, and this analysis reveals an over two folds increase of the ratio at the border region. This quantification may make a more convincing argument that at the patch border region, Dvl is diminished whereas Ror2 accumulate with Vangl2. 

      (9) In Figure S12a, the authors suggest Wnt11 induced dissociation of Dvl from Vangl2 (by co-IP), and this is reduced after Ror2 MO. This would be more convincing with replicates and quantitation. 

      We have repeated this experiment with Vangl2 pull down and added quantification. The data is in the new Suppl. Fig. 15a.

      (10) In Figure S12b, the authors suggest Ror2 can co-IP Vangl2 but not Dvl. This is not very convincing, as the Dvl input band is very weak, and the Vangl2 co-IP band is very weak. 

      We repeated the co-IP experiment with Myc-tagged Vangl or Dvl. Using the same anti-Myc antibody and experimental condition (including the expression level of Vangl, Dvl and Ror2), we still found that Ror2 could be pulled down by Vangl but not Dvl (Suppl. Fig. 15b).

      (11) "Prickle" spelled "Prickel" in the abstract (and abbreviated to "PK" not "Pk" at one place in the abstract and several places in text) 

      We have corrected these typos.

      (12) Quite a lot of interesting observations are in supplemental figures. Normally it might be expected that extra data supporting a conclusion would be in supplemental, but here some of the supplemental data feels like it is more than simply additional evidence. For instance supplemental Figures 2 and 3 feel more than just supplemental (and Supplemental Figure 3 if merged with Figure 2 would make it easier for the reader). Moreover, for example, the description of the results in Figure 2 is punctuated by references to supplemental Figures 4 and 5 that contain key data to support the conclusions, which means the reader has to flick backwards and forwards from place to place in the manuscript to follow the argument. It is of course up to the authors, but in some cases putting supplemental data back into the main figures (for which there is no size or number limit) would increase clarity. 

      These are excellent points; in the resubmitted manuscript we have a total of 24 data figures, and we used 8 as main figures since we felt that they provide the most relevant and conclusive evidence to our model. We will consult the copy editors at eLife on how to arrange the rest as main vs. supporting figures when requesting publication as version of record.

    1. Note de synthèse : Réunion des parents d'élèves des filières technologiques (Première et Terminale)

      À : L'ensemble du personnel enseignant et administratif du Lycée Louis Vincent De : Olivier Palaise, Proviseur Date : 29 septembre 2025 Objet : Alignement stratégique suite à la réunion des parents des filières technologiques

      1. Introduction : Alignement sur notre Vision et nos Exigences Communes

      L'objectif de cette note est de garantir que l'ensemble du personnel, enseignant comme administratif, partage une compréhension unifiée et cohérente des messages clés que nous avons communiqués aux parents d'élèves des classes de Première et Terminale technologiques (STI2D et STL).

      Cet alignement est fondamental : notre crédibilité et l'efficacité de notre accompagnement reposent sur notre capacité à porter une parole collective forte et univoque.

      La réussite de nos élèves dépend directement de cette cohérence entre nos exigences, nos discours et nos actions quotidiennes.

      2. Les Filières Technologiques : Fer de Lance de l'Établissement et Cadre d'Exigence

      J'ai tenu à réaffirmer auprès des parents ma vision stratégique : nos filières technologiques sont le "fer de lance de l'établissement".

      Cette valorisation n'est pas un vain mot ; elle justifie le niveau d'exigence particulièrement élevé que nous appliquons.

      Nous offrons des parcours d'exception qui se méritent par un engagement sans faille. Les règles fondamentales de la vie au lycée, rappelées avec fermeté, constituent le socle de cette ambition.

      Voici les points sur lesquels nous devons maintenir une vigilance collective et absolue :

      • Assiduité et ponctualité : Le respect des horaires est non négociable, y compris lors des interclasses. Nous avons expliqué aux parents que la justification rigoureuse des absences et la ponctualité sont les conditions premières de la réussite.

      L'exemple de l'élève habitant Montini, à qui il a été clairement conseillé de "prendre le bus d'avant", illustre notre refus de la complaisance face aux retards évitables.

      C'est une préparation directe aux exigences du monde supérieur et professionnel.

      • Comportement et tenue : L'anecdote de l'élève reçu dans mon bureau en "claquet de chaussette" a servi à illustrer un point crucial : l'image de soi.

      Dans des filières qui préparent à des carrières d'ingénieur, d'architecte ou de technicien supérieur, une tenue et une posture professionnelles sont indispensables. Le premier regard compte, et nous devons les y former.

      • Usage du téléphone portable : La règle est simple et doit être appliquée par tous : interdiction totale dans les bâtiments, sauf autorisation explicite d'un membre du personnel pour un usage pédagogique.

      Pour garantir le respect de cette règle, la sanction dissuasive de "4 heures de col le samedi" a été rappelée.

      C'est un outil à notre disposition pour maintenir un environnement de travail serein et concentré.

      Cette rigueur n'est pas une fin en soi. Elle est le cadre qui permet à nos élèves de saisir les opportunités exceptionnelles offertes par nos filières et de construire les parcours d'avenir ambitieux que nous leur promettons.

      3. Le Parcours vers la Réussite : Calendrier, Enjeux et Stratégie

      Nous avons insisté sur le fait que les années de Première et de Terminale constituent une période charnière, un véritable sprint où deux objectifs doivent être menés de front : l'obtention du baccalauréat avec la meilleure mention possible et la préparation stratégique du dossier d'orientation post-bac sur Parcoursup.

      3.1. Les Enjeux dès la Classe de Première

      Il a été expliqué aux parents que les élèves de Première doivent "courir deux lièvres à la fois" :

      1. Préparation au Baccalauréat : Au-delà des épreuves anticipées de français, nous avons souligné la nouveauté de l'épreuve de mathématiques qui s'ajoute au calendrier.

      2. Anticipation de l'Orientation : La réflexion sur le projet post-bac n'est plus une option.

      Elle doit commencer dès maintenant, car les trois bulletins de Première pèseront d'un poids considérable dans l'examen de leur dossier Parcoursup l'année suivante.

      3.2. La Double Échéance de la Classe de Terminale

      Pour les élèves de Terminale, l'enjeu est immédiat, car ils sont "de plein fouet dans les deux parties".

      Le calendrier Parcoursup est extrêmement serré, et les vœux doivent être formulés et validés dès mi-janvier.

      L'importance stratégique du Grand Oral a été particulièrement mise en lumière.

      L'exemple de l'élève dont la note est passée de 5/20 à 16/20 après une préparation encadrée de quelques heures est la preuve tangible que le travail et la méthode priment sur le "talent" seul.

      Le coefficient (14 pour les Terminales de cette année, qui passera à 12 pour la promotion suivante) en fait une épreuve décisive qu'il est impératif de préparer avec le plus grand sérieux.

      3.3. Le Dossier Scolaire : Le Facteur Humain dans la Sélection Parcoursup

      Nous avons démystifié Parcoursup : ce n'est pas un algorithme froid.

      Derrière l'interface, des commissions composées d'enseignants examinent les dossiers. Certains éléments, que nous devons tous avoir à l'esprit, entraînent un rejet quasi-systématique :

      1. Absences régulières : La simple mention "élève régulièrement absent" sur un bulletin place immédiatement le dossier sur une "pile d'à côté".

      2. Comportement problématique : De même, une appréciation telle que "comportement inadmissible" disqualifie un candidat avant même l'examen de ses notes.

      3. Appréciations des professeurs : L'arbitrage est clair.

      Entre un élève à 12 de moyenne "au talent" et un autre à 12 dont l'appréciation souligne les efforts ("fait vraiment des efforts"), le second sera toujours préféré.

      Notre rôle, en tant qu'enseignants et membres de l'équipe pédagogique, est donc crucial.

      Les appréciations que nous rédigeons ne sont pas de simples commentaires ; ce sont des facteurs décisifs qui peuvent ouvrir ou fermer les portes de l'avenir de nos élèves. La précision et la justesse de nos mots sont une responsabilité majeure.

      4. L'Excellence des Filières Technologiques : Pédagogie et Débouchés

      Les filières STI2D et STL ont été présentées comme des voies d'excellence, caractérisées par une pédagogie concrète, une forte proportion de travaux pratiques et des perspectives de poursuites d'études ambitieuses, y compris vers les plus grandes écoles.

      4.1. Baccalauréat STI2D : Le concret au service de la polyvalence

      La philosophie de la filière STI2D est de donner du sens aux sciences par le concret, de permettre aux élèves de "comprendre pourquoi ils font des sciences". Ses points forts résident dans :

      • Un volume conséquent de 12 heures d'enseignement scientifique dès la Première.

      • Une approche pédagogique basée sur l'expérimentation et la simulation.

      • Un profil qui reste "généraliste", assurant un "large éventail sur le choix des formations postbac".

      • La nouveauté d'une épreuve pratique au baccalauréat, qui valorise enfin leur cœur de formation.

      4.2. Baccalauréat STL : La pratique en laboratoire comme tremplin

      La spécificité du bac STL repose sur une forte coloration physique-chimie, avec une partie expérimentale "extrêmement importante".

      Les élèves passent plus d'un tiers de leur temps en travaux pratiques, en groupes à effectif réduit. Les débouchés sont réels et attractifs :

      • Les BTS internes de haut niveau (SIRA, Métiers de la chimie).
      • Surtout, la possibilité d'intégrer notre classe préparatoire TSI pour viser des écoles d'ingénieur, une voie royale qui concrétise notre promesse d'excellence.

      Le succès de ces filières, validé par des taux de réussite remarquables (plus de 96 % en STI2D et 93 % en STL), est le fruit direct de cette approche pédagogique exigeante.

      5. Calendrier des Temps Forts de l'Orientation

      Pour accompagner nos élèves dans la construction de leur projet, plusieurs rendez-vous majeurs jalonneront l'année. Il est essentiel que nous les encouragions tous à y participer activement.

      Événement Date(s) Clé(s) Public Cible

      Un Jour à l'Université (Ujalu) Inscriptions dès le 29 sept. Élèves de Terminale

      Forum "Aux réaction" 20, 21 et 22 novembre Première et Terminale (transport non organisé par le lycée cette année)

      Forum des Formations (au lycée) 6 février 2026 Première et Terminale Journée Portes Ouvertes (au lycée) 12 février Première et Terminale

      Ces événements sont des opportunités précieuses que les élèves doivent saisir pour affiner leurs choix.

      6. Conclusion : Notre Responsabilité Collective

      Je compte sur chacun de vous pour porter et incarner ce message de rigueur, d'ambition et d'accompagnement.

      La réussite de nos élèves des filières technologiques est une fierté pour notre établissement, mais elle est avant tout le résultat d'un effort collectif.

      Notre vigilance constante, notre communication cohérente et notre exigence bienveillante sont les clés qui leur permettront de réaliser leur plein potentiel et de transformer leurs ambitions en succès.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewers for their thoughtful comments and overall very supportive feedback.

      Reviewer #1 writes: "The study is very thorough and the experiments contain the appropriate controls. (...) The findings of the study can have relevance for human conditions involving disrupted mitochondrial dynamics, caused for example by mutations in mitofusins." Reviewer #2 writes: "The dataset is rich and the time-resolved approach strong." Reviewer #3 writes: "I admire the philosophy of the research, acknowledging an attempt to control for the many possible confounding influences. (...) This is a powerful and thoughtful study that provides a collection of new mechanistic insights into the link between physical and genetic properties of mitochondria in yeast."

      We address all points below. We have not yet updated our text and figures since we expect substantial additions from new experiments. But we have included Figure R1 with some additional analyses of existing data at the bottom of the manuscript.

      Reviewer1

      1.1 Statistical comparisons are missing throughout the manuscript (with the exception of Fig. 2c). Appropriate statistical tests, along with p-values, should be used and reported where different gorups are compared, for example (but not limited to) Fig. 3d and most panels of Fig. 4.

      We initially decided not to add too many extra labels to the already very busy plots, given that the magnitude of change mostly speaks for itself. However, we will try to find meaningful statistical tests together with a sensible graphical representation for all of the figures. For one example see Figure R1A.

      1.2. I do not agree with the use of Atp6 protein as a direct read-out of mtDNA content. While Atp6 protein levels will decrease with decreasing mtDNA content, the inverse is not necessarily true: decreased Atp6 protein levels do not necessarily indicate decreased mtDNA levels, because they could alternatively or additionally be caused by decreased transcription and/or translation. Therefore, please do not equate Atp6 protein levels to mtDNA levels, and instead rephrase the text referencing the Atp6 experiments in the Results and Discussion sections to measure "mtDNA expression" or "mt-encoded protein" or similar. For example, on p. 14 line 431 should read "mtDNA expression" rather than "decreased synthesis of mtDNA", and line 440 on the same page "mean mtDNA levels" should be "mtDNA expression" or similar.

      All three reviewers agree that using Atp6-NG as a direct proxy for mtDNA requires more validation, or at least rephrasing of the text. We agree that this is the most important point to address. We had previously tried using the mtDNA LacO array (Osman et al. 2015) to directly assess the amount of nucleoids per cell. However, the altered mitochondrial morphology of the Fzo1 depleted cells combined with the LacI-GFP which is still in mitochondria even when mtDNA is gone, increases the noise level to a point that we cannot interpret the signal. However, as this manuscript was in the submission process, the Schmoller lab (co-authors #2 and #7) adapted the HI-NESS system to label mtDNA in live yeast cells(Deng et al. 2025). This system promises much better signal to noise and we expect we can address all concerns regarding the actual count of nucleoids per cell. Should this unexpectedly fail for technical reasons, we will try to calibrate the Atp6-levels with DAPI staining at defined time points and will rephrase the text as the reviewer suggests.

      1.3. In Fig. 3, the authors use the fluorescence intensity of a mitochondrially-targeted mCardinal as a read-out of mitochondrial mass. Please provide evidence that this is not affected by MMP, either with relevant references or by control experiments (e.g. comparing it to N-acridine orange or other MMP-independent dyes or methods).

      Whether or not the import of any mitochondrial protein is dependent on the MMP depends largely on the signal sequence. The preSu9-signaling sequence was previously characterized as largely independent of the MMP compared to other presequences (Martin, Mahlke, and Pfanner 1991), which is why Vowinckel (Vowinckel et al. 2015) and others (Di Bartolomeo et al. 2020; Perić et al. 2016; Ebert et al. 2025) have previously used this as a neutral reference to the strongly MMP-dependent pre-Cox4 signal to estimate MMP. As one control in our own data, we consider that the population-averaged mitochondrial fluorescent signal Figure S3C stays constant in the first few hours, in agreement with the total averaged mitochondrial proteome (Fig R1E). As additional controls, we plan to compare the signal to an MMP independent dye as the reviewer suggests.

      1.4. In Fig. 2e-f, the authors use a promoter reporter with Neongreen to answer whether the reduced levels of the nuclear-encoded mitochondrial proteins Mrps5 and Qcr7 are due to decreased expression or to protein degradation, and find no evidence of degradation of the Neongreen reporter protein. However, subcellular localization might affect the availability of the protein to proteases. Although not absolutely required, it would be relevant to know if the Neongreen fusion protein is found in the same subcellular compartment as Mrps5 and Qcr7 at 0h and 9h after Fzo1 depletion.

      Here, it seems we need to explain the set-up and interpretation of the data better. The key point we are trying to make with the promoter-Neongreen construct is that the regulation is not mainly at the level of transcription. We are showing that the reduction in the levels of the actual protein (orange bars) is not (mainly) explained by a reduction in expression, since the promoter is similarly active at 0 and at 9 hours (grey bars). If expression from the promoter were strongly reduced, the Neongreen would be diluted with growth and would also decrease, but this is not the case. The fluorophore itself is just floating around in the cytosol and is not subject to the same post-translational regulation as Mrps5 and Qcr7, so there is no reason to expect degradation.

      1.5. Fzo1 depletion leads to a very rapid drop in MMP during the first hour of depletion. In the Discussion, can the authors speculate on the possible mechanism of this rapid MMP drop that occurs well before mtDNA or mt-encoded proteins are decreased in level?

      This is indeed an interesting point. We think there are likely three reasons causing this initial drop: Firstly, due to the fragmentation the mixing of mitochondrial content is disturbed and smaller fragments may have suboptimal stoichiometry of components (see also (Khan et al. 2024) who look at this in detail including the Fzo1 deletion); secondly, already fairly early, some mitochondrial fragments may not contain any mtDNA and therefore will be unable to synthesize ETC proteins; thirdly, altered morphological features like changes in the surface-to-volume ratios may play a role. Sadly, mechanistically following up on this is not possible with the tools in our hands and therefore outside of the scope of this manuscript. But we are happy to include these speculations in our discussion.

      1.6. In Fig. 2a, the mtDNA copy number of Fzo1-depleted cells is ca 1.3-fold of the control cells at the 0h timepoint. Why might this be? Is it an impact of one of the inducers? If so, we might be looking at the combination of two different processes when measuring copy number: one that is an induction caused by the inducer(s), and the other a consequence of Fzo1 depletion itself.

      We believe that this 30% increase is within the noise of the experiment rather than an effect of the induction. Since we normalize to t=0 uninduced, the first black data point does not have error bars, emphasizing this difference. None of the protein data suggests that there is an increase in mtDNA encoded proteins (see e.g. 2B, or Atp6 fluorescence data). In the planned HI-NESS experiment, we will see in our single cell data whether there is an actual increase in mtDNA upon TIR induction. Additionally, we will run a qPCR to carefully determine mtDNA levels of untreated wild-type cells, tetracycline treated wild-type cells and tetracycline induced TIR expressing cells to exclude effects of tetracycline as well as the expression of TIR on mtDNA.

      Minor comments:

      1.7. p. 3, line 71: "ten thousands of dividing cells.." should be "tens of thousands of dividing cells".

      Thank you, will correct.

      1.8.-p.4, line 116: please be even more clear with what the "depleted" cells and controls are treated with: are depleted cells treated with both inducers, and controls with neither?

      We will make this more clear. Depleted cells are treated with both inducers, the control cells are not. However, in Figure 1A and in S1 we do controls to show that inducing TIR per se or adding aTC per se does not change growth rate or mitochondrial morphology. We will make this more clear.

      1.9. -p.5, lines 147-148: the authors write "the rate with which the abundance of Cox2 and Var1 proteins decreases was similar to the rate of mtDNA loss" though the actual rate is not shown. Please calculate and show rates for these processes side by side to make comparison possible, or alternatively rephrase the statement.

      Indeed this was not phrased well. We will call it dynamics rather than rates.

      1.10. -Fig. 2d: changing the y-axis numbering to match those in panels a and b would facilitate comparisons.

      Makes sense, we will change this.

      1.11. Fig. 2e: it is recommended to label the western blot panels to indicate what protein is being imaged in each (Neongree,, Mrps5, Qcr7).

      We will adapt the labelling to make it more clear.

      1.12. -p.9, line 262: I suggest referencing Fig. 4e at the end of the first sentence for clarity.

      We will modify the sentence as suggested.

      1.13. -In the sections related to Fig. 3a and Fig. 5a as well as the connected supplemental data, the authors discuss both the median and the mean of mitochondrial mass and Atp6 protein, respectively. For purposes of clarity, I suggest decreasing the focus on the mean (that is provided only in the supplemental data) and focusing the text mainly on the median. The two show differing trends and it is very good that both are shown, but the clarity of the text can be improved by focusing more on the median where possible.

      We will check the phrasing and simplify.

      1.14. -p. 14, line 435: the statement that mt mass is maintained over the first 9h of depletion is only true for the mean mt mass, not for the median. Please make this clear or rephrase.

      We will check phrasing, make it more clear and also point out the extended proteomics data (see Fig R1), which corresponds to the mean of the populations

      1.15.-p.14, line 452: "mitofusions" should be "mitofusins".

      Thanks for catching this.

      Reviewer 2:

      2.1. While inducible TIR is used to reduce background, the manuscript should rigorously exclude auxin/TIR off-targets (growth, mitochondrial phenotypes, gene expression). Please include full matched controls: (plus minus)auxin, (plus minus)TIR, epitope tag alone, and a degron control on an unrelated mitochondrial membrane protein.

      We agree that rigorous controls are crucial for the interpretation of the results. However, we think we have already included most of the controls the reviewer is asking for, but we might have not pointed this out clearly enough. For example, in Fig 1A, we could make it more clear by adding more labels in which samples we added aTC, which is only described in the figure legend.

      Here is a list of all the controls:

      • Each depletion experiment is always matched with an experiment of the same strain without induction. So the genetic background as well as effects such as light exposure, time spent in the microfluidics systems, etc are controlled for.
      • Figure S1D shows that the growth rate is wildtype like in a strain containing either the AID tag or the TIR protein AND upon addition of both chemicals. It also shows that the final genetic background (AID-tag and TIR) also grows like wildtype if the inducers are not added. This conclusively shows that neither the tags/constructs nor the chemicals per se affect growth rate
      • In Figure S1C we show the mitochondrial morphology of the same controls. We will make sure to label them more consistently to match panel D, and include an actual wildtype and a FLAG-AID-Fzo1 strain without TIR treated with both aTC and 5-Ph-IAA as direct comparison
      • In figure 1A we compare the Fzo1 protein levels of a strain with and without TIR. We show that in absence of TIR, adding either aTC or Auxin does not change Fzo1 levels and that the levels are comparable in the strain that is able to deplete Fzo1 directly before addition of 5-Ph-IAA (after 2 h of induction of TIR through addition of tetracycline)
      • Additionally, in Figure S2C we show that two hours after adding aTC, the entire proteome does not change significantly apart from a strong induction of TIR. We can also make this more clear in the figure legend.
      • Additionally, we will run a qPCR to carefully determine mtDNA levels of untreated wild-type cells, tetracycline treated wild-type cells and tetracycline induced TIR expressing cells to exclude effects of tetracycline as well as the expression of TIR on mtDNA. (also in response to 1.6.) In summary, we think we have controlled sufficiently for all confounding parameters and most importantly showed that addition of either aTC or Auxin as well as the FLAG-AID tag per se does not disturb mitochondria or cell growth. We do not see what a degron control on an unrelated protein will tell us. Depending on the nature of the protein, it may or may not have a phenotype that may or may not be related to morphology changes etc.

      2.2. The Mitoloc preSu9 vs Cox4 import ratio is only a proxy of mitochondrial membrane potential (ΔΨm) and itself depends on mitochondrial mass, protein expression, matrix ATP, and import saturation. The authors need to calibrate ΔΨm with orthogonal dyes (TMRE/TMRM) and pharmacologic titrations (FCCP/antimycin/oligomycin) to generate a response curve; show that Mitoloc tracks dye-based ΔΨm across the relevant range and corrects for mass/photobleaching. Report single-cell ΔΨm vs mass residuals.

      We completely agree that the MitoLoc system is only a rough proxy for the actual membrane potential. That is why we make no quantitative claims on the absolute value or absolute difference between groups of cells. We also make very clear in Fig 3B what we are actually measuring and can emphasize again in the text that this is only a proxy. We agree that it is a good idea to compare MitoLoc values to TMRE staining as the reviewer suggests, we will do these experiments in depleted and control cells at different timepoints. Please note though that also dye staining has its caveats, especially in dynamic live cell experiments. TMRM for example is not compatible with the acidic pH 5 medium that is typically used for yeast and subjecting cells to washing steps and higher pH may change both morphology of mitochondria and the MMP, especially in cells that are already “stressed”. We prefer not to complete elaborate pharmacological titration experiments because firstly, this was extensively done in the original MitoLoc paper by the Ralser lab ((Vowinckel et al. 2015), cited 120 times); secondly, the value of the MMP is not the most critical claim of the manuscript. See also 3.12. Please note that in Figure S4D we had already plotted MMP vs mitochondrial concentration.

      2.3. To use Atp6-mNeon as a proxy for mtDNA is an assumption. Interpreting Atp6 intensity as "functional mtDNA" could be confounded by translation, turnover, or assembly. Please (i) report mtDNA copy number time courses (you have qPCR), nucleoid counts (DAPI/PicoGreen or TFAM/Abf2 tagging), and (ii) assess translation (e.g., 35S-labeling or puromycin proxies) and turnover (proteasome/AAA protease inhibition, mitophagy mutants -some data are alluded to- plus mRNA levels for mtDNA-encoded genes). This will support the "reduced synthesis" versus "increased degradation" conclusion.

      We agree with all three reviewers that Atp6 is only a proxy for mtDNA (Jakubke et al. 2021; Roussou et al. 2024) and the correlation should be checked more carefully. We will use the very recently established Hi-NESS system to follow nucleoids/ mtDNA during depletion experiments. See detailed reply to 1.2.

      (ii) in Figure 2C we inhibit mitochondrial translation and show that in this case control and depleted cells have the same level of Cox2, at least suggesting that degradation is not the key mechanism controlling the levels of mtDNA encoded proteins. We cannot do proteasome inhibitor assays since the nature of the AID-TIR systems requires an active proteasome. In figure S5C we show that the Atp6 depletion is similar in an atg32 deletion. This does not completely exclude a contribution of mitophagy to the observed phenotype, but does confirm that mitophagy is not the primary reason for cells becoming petite.

      2.4. The promoter-NeonGreen reporters argue against transcriptional down-regulation of nuclear OXPHOS. Please add mRNA (RT-qPCR/RNA-seq) for representative genes and a pulse-chase or degradation-pathway dependency (e.g., proteasome/mitophagy/autophagy mutants) to firmly assign active degradation. The authors need to normalize proteomics to mitochondrial mass (e.g., citrate synthase/porin) to separate organelle abundance from protein turnover.

      While we are happy to perform qPCR experiments for selected genes, a full RNA-seq experiment seems outside the scope of this study. As explained above, a proteasome inhibitor experiment is not possible in this set-up. Bulk mitophagy/autophagy seems unlikely to be the cause of the decrease of the nuclear-encoded OXPHOS proteins, since most other mitochondrial proteins do not decrease on average on population level in the first hours. This data is now plotted as additional figure (see below) and will be included in the supplementary of the revised manuscript (Fig R1E).

      2.5. Using preSu9-mCardinal intensity as "mitochondrial concentration" is sensitive to expression, import competence, and morphology/segmentation. The authors should provide validation that this metric tracks 3D volume across fragmentation states (e.g., correlation with mito-GFP volumetrics; detergent-free CS activity; TOMM20/Por1 immunoblot per cell).

      We agree that this is an important point and the co-authors discussed this point quite intensively. In figure S3A and B we show (using confocal data) that there is a very strong correlation between the total fluorescence signal and the 3D volume reconstruction. However, the slope of the correlation is different between tubular and fragmented mitochondria (compare panels A and B) and see figure legend. Since we are dealing with diffraction-limited objects it is likely that the 3D reconstruction is sensitive to morphology, especially if mitochondria are “clumping”. We therefore think that the total fluorescence signal is actually a better estimate of mitochondrial mass per cell than the 3D volume reconstruction (especially for our data obtained with a conventional epifluorescence microscope). The mean of the total mitochondrial fluorescence also better matches the population average mitochondrial proteome (Fig R1E). To consolidate this assumption, we will additionally compare our data to a strain with Tom70-Neongreen and to MMP independent dyes.

      Notably, since the morphology is similarly altered in mothers and buds this is of minor impact for our main point – the unequal distribution between mother and buds.

      2.6. The unequal mother-daughter distribution is compelling, but causality remains inferred. Test whether modulating inheritance machinery (actin cables/Myo2, Num1, Mmr1) or altering fission (Dnm1 inhibition) modifies segregation defects and rescues mtDNA/Atp6 decline. Complementation with Fzo1 re-expression at defined times would help order the phenotype cascade.

      We agree that rescue experiments would be very useful. We have some preliminary data for tether experiments, for example with Num1. The general problem is that the fragmented mitochondria clump together. We have not found a method to restore an equal distribution between mother and daughter cells. We will try to optimize the assay, but are not overly confident it will work. Mmr1 deletion aggravates the Fzo1 phenotype, likely also because the distribution becomes even more heterogeneous, but we have not rigorously analyzed this.

      We like the idea of the Fzo1 re-expression and will run such experiments. This will be especially powerful in combination with the new HI-NESS mtDNA reporter. We may be able to track exactly when cells reach the point-of-no return and become petite. This will also help connecting our mathematical model more directly to the data.

      2.7. The model is useful but should include parameter sensitivity (segregation variance, synthesis slopes, initial nucleoid number) and prospective validation (e.g., predict rescue upon partial restoration of synthesis or inheritance, then test experimentally).

      We will refine our model to include the to-be-measured nucleoids/mtDNA values. We will include a parameter sensitivity analysis with the updated model.

      Reviewer 3:

      3.1. About the use of Atp6 as a good proxy for mtDNA content. This is assumed from l285 onwards, based on a previous publication. As the link is fairly central to part of the paper's arguments, and the system in this study is being perturbed in several different ways, a stronger argument or demonstration that this link remains intact (and unchanged, as it is used in comparisons) would seem important.

      We agree, see 1.2.

      3.2. About confounding variables and processes. The study does an admirable job of being transparent and attempting to control for the many different influences involved in the physical-genetic link. But some remain less clearly unpacked, including some I think could be quite important. For example, there is a lot of focus on mito concentration -- but given the phenotypes are changing the sizes of cells, do concentration changes come from volume changes, mito changes, or both? In "ruling out" mitophagy -- a potentially important (and intuitive) influence, the argument is not presented as directly as it could be and it's not completely clear that it can in fact be ruled out in this way. There are a couple of other instances which I've put in the smaller points below.

      Thank you for acknowledging our efforts to show transparent and well-controlled experiments! We address each of the specific points below.

      3.3. full genus name when it first appears

      We will add the full name.

      3.4. I may be wrong here, but I thought the petite phenotype more classically arises from mtDNA deletion mutations, not loss? The way this is phrased implies that mtDNA loss is [always] the cause. Whether I'm wrong on that point or not, the petite phenotype should be described and referenced.

      We can expand the text and cite additional relevant papers. The term “petite” refers to any strain that is respiratory incompetent and leads to small colonies (not necessarily small cells!) (Seel et al. 2023). This can be mutations or gene loss (fragments) on the mtDNA (these are called cytoplasmic petite), or chemically induced loss of mtDNA (e.g. EtBr), or mutations of nuclear genes required for respiration (these are termed nuclear petite; some nuclear petites show loss of mtDNA in addition to the mutation in the nuclear genome) (Contamine and Picard 2000).

      3.5. para starting l59 -- should mention for context that mitochondria in (healthy, wildtype) yeast are generally much more fused than in other organisms

      ok.

      3.6. Fig 1C -- very odd choice of y-axis range! either start at zero or ensure that the data fill as much vertical space of the plot as possible

      True, this was probably some formatting relic. We will adapt the axis to fill the full space. Most of our axes start at 0, but that doesn’t make so much sense here, since we consider the solidity in the control as “baseline”.

      3.7. "wild-type like more tubular mitochondria" reads rather awkwardly. "more tubular mitochondria (as in the wild-type)"?

      Thank you, sounds better.

      3.8. l106 -- imaging artefacts? are mitos fragmenting because of photo stress? -- this is mentioned in l577-8 in the Methods, but the data from the growth rate and MMP comparison isn't given -- an SI figure would be helpful here. It would be reassuring to know that mito morphology wasn't changing in response to phototoxicity too.

      In the methods we just briefly point out that we have done all our “due diligence” controls to check that we do not generate phototoxicity, something that we highlight in the cited review. We do not explicitly have a figure for this, but figure S1A shows that the solidity of the mitochondrial network in control cells stays the same over 9 hours, even though these cells are exposed to the same cultivation and imaging regime as the depleted cells. We will also add a picture of control cells after 9 h. In S1B we show that control cells containing TIR but no AID tag treated with both chemicals imaged over 9 hours also show the same solidity (~mitochondrial morphology) as untreated control. Also, the doubling times of cells grown in our imaging system (Fig R1B) are very similar to the shake flask (Fig R1A). All in all, we are very confident that our imaging settings did not impact our reported phenotypes.

      3.9. para l146 -- so this suggests mtDNA-encoded proteins have a very rapid turnover, O(hours) -- is this known/reasonable?

      Reference (Christiano et al. 2014) suggests that respiratory chain proteins are shorter lived than the average yeast protein. However, based on Figure 2C we think the dynamics mostly speak for a dilution by growth.

      3.10. section l189 -- it's hard to reason fully about these statistics of mitochondrial concentration given that the petite phenotype is fundamentally affecting overall cell volume. can we have details on the cell size distribution in parallel with these results? to put it another way -- how does mitochondrial *amount* per cell change?

      This is a good point. We report mostly on mitochondrial “concentrations” because we think this is what the cell actually cares about (mitochondrial activity in relationship to cytosolic activity). But we will include additional graphs on mitochondrial amount as well as size distributions (Fig R1C, related to Fig 4F). We can already point out that the size distribution of the population does not change much in the first hours. The “petite” phenotype refers to small colonies on growth medium with limited supply of a fermentable carbon source, not to smaller size of single cells.

      3.11. l199 the mean in Fig S3C certainly does change -- it increases, clearly relative both to control and to its initial value. rather than sweeping this under the carpet we should look in more detail to understand it (a consequence of the increased skew of the distribution)?

      This relates somewhat to the previous point. The increase in average concentration is not due to an increased amount in the population, but due to the fact that it is the small buds that get a very high amount of the mitochondria which “exaggerates” the asymmetric/heterogenous distribution. This will be clarified by the figures we mention in the point above.

      3.12. para line 206 -- this doesn't make it clear whether your MMP signal is integrated over all mitochondria in the cell, or normalised by mitochondrial content? this matters quite a lot for the interpretation if the distributions of mitochondrial content are changing. reading on, this is even more important for para line 222. Reading further on, there is an equation on l612 that gives a definition, but it doesn't really clarify (apologies if I'm misunderstanding).

      For each cell, we basically calculate the relative mitochondrial enrichment of the MMP sensitive vs the MMP insensitive pre-sequence.

      So, MMP= (total intensity of mitochondrial pre-Cox4 Neongreen/ total intensity of mitochondrial pre-Su9 Cardinal) / (total cytosolic pre-Cox4 Neongreen/ total cytosolic pre-Su9 Cardinal).

      We calculate this value for each cell, but we do not have the optical resolution to calculate it for individual mitochondrial fragments.

      Both constructs are driven by the same strong promoter, so transcription of the fluorophore should never limit the uptake. Also, in Figure 3D we compare control and depleted cells with similar total mitochondrial concentration, so the difference must be due to a different import of the two fluorophores, see also Fig S4D. The calculated “MMP” value is of course only a crude proxy for the actual membrane potential in millivolts and we do not want to make any claims on absolute values or quantitative differences. But essentially what we are interested in is “mitochondrial health/activity” and we think the system is good at reporting this. See also 2.2.

      3.13. l230 -- a point of personal interest -- low mito concentrations are connected to low "function" (MMP) and give extended division times -- this is interestingly exactly the model needed to reproduce observations in HeLa cells (https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002416). That model went on to predict several aspects of downstream cellular behaviour -- it would be very interesting to see how compatible that picture (parameterised using HeLa observations) is with yeast!

      Thank you for pointing out your interesting paper, which we will include in our discussion. Another recent preprint about fission yeast (Chacko et al. 2025) also fits into this picture. Since you were kind enough to disclose your identity, we would be happy to discuss this further with you in person if we can maybe follow-up on this.

      3.14. l239 "less mitochondria" -- a bit tricky but I'd say "fewer mitochondria" or "less mitochondrial content"

      Thanks, we will think about how to best rephrase this, probably less mitochondrial content.

      3.15. Section l234 So here (and in Fig 4) the focus is on overall distributions of mitochondrial concentration in different cells (mother-to-be, mother, bud; gen 1, gen >1). But we've just seen that one effect of fzo1 is to broader the distribution of mitochondrial concentration across cells. Can't we look in more depth at the implications of this heterogeneity? For example in Fig 4F (which is cool) we look at the distribution of all fzo1 mothers-to-be, mothers, and buds. But this loses information about the provenance. For example, do mothers-to-be with extremely low mito concentrations just push everything to the bud, while mothers-to-be with high mito concentrations distribute things more evenly? It would seem very easy and very interesting to somehow subset the distribution of mothers-to-be by concentration and see how different subsets behave

      This is a good point. When analyzing the data, we pretty much plotted everything against everything and then chose the graphs that we think will best guide the reader through the story-line. We can make additional supplementary plots where we show the starting concentrations/amounts of the mother in relationship to the resulting split ratio at the end of the cycle (Fig R1D).

      3.16. l285 -- experimental design -- do we know that Atp6 will continue to be a good proxy for functional mtDNA in the face of the perturbations provided by Fzo1 depletion? Especially if there is impact on the expression of mitoribosomes, the relationship between mtDNA and Atp6 may look rather different in the mutant?

      This is actually our top-priority experiment now. We will use the HI-NESS system and possibly DAPI staining to make a more direct link to mtDNA/ nucleoid numbers, see 1.2.

      3.17. l290 -- ruled out mitophagy. This message could be much clearer. Comparing Fig S5C and Fig 3A side-by-side is a needlessly difficult task -- put Fig 3A into Fig S5. Then we see that when mitophagy is compromised, the distribution of mitochondrial concentration has a lower median and much lower upper quartile than in the mitophagy-equipped Fzo1 mutant? What is going on here? For a paper motivated by disentangling coupled mechanisms, this should be made clearer!

      Thanks for pointing this out. We can of course easily include the control in the corresponding figure. Compromising mitophagy is likely to generally affect mitochondrial health and turnover a little bit, independent of what is going on with Fzo1. The second evidence that speaks against large-scale mitophagy is the proteomics data: On population level the dynamics of the respiratory chain proteins are very different from those of other (nuclear encoded) mitochondrial proteins. We will add additional supplementary figures to make this more clear, see Fig R1E. Most mitochondrial proteins in the proteomics experiment stay constant in the first few hours, consistent with the imaging data showing that the mean mitochondrial content of the population does not change initially. This again highlights that it is the unequal distribution which is the problem and not massive degradation of mitochondria.

      3.18. With the Atp6 signal, how do we know that fluorescence from different cells is comparable? Buds will be smaller than mother cells for example, potentially leading to less occlusion of the fluorescent signal by other content in the cytoplasm

      This is of course a general problem that anyone faces doing quantitative fluorescence microscopy. From the technical side, we have done the best we could by taking a reasonable amount of z-slices and by choosing fluorophores that are in a range with little cellular background fluorescence (e.g. Neongreen is much better than GFP). From a practical standpoint, we are always comparing to the control, which is subject to the same technical limitations as the depleted cells and the cell sizes are very similar. So, even if we are systematically overestimating the Atp6 concentration in the bud by a few %, the difference to the control would still be qualitatively true. We therefore do not think that any of our conclusions are affected by this.

      3.19. l343 -- maintenance of mtDNA -- here the point about l285 (is the Atp6-mtDNA relationship the same in the Fzo1 mutant) is particularly important, as we're directly tying findings about the protein product to implications about the mtDNA

      We will carefully address this, see above.

      3.20. l367 -- on a first read this description of the model feels like lots of choices have been made without being fully justified. Why a log-normal distribution (when the fit to the data looks rather flawed); why the choice of 5 groups for nucleoid number (why not 3? or 8?); the process used for parameter fitting is very unclear (after reading the methods I think some of these values are read directly from the data, but the shapes of the distributions remain unexplained). l705 -- presumably the ratio was drawn from a log-normal distribution and then the corresponding nucleoid numbers were rounded to integers? the ratio itself wasn't rounded? (also l367) How were the log-normal distributions fitted to experiments (Figs. S7A,B)? Just by eye?

      We will update our model based on measured nucleoid counts and then explain more stringently the choices we make/ parameters we select.

      3.21. l711 by random selection -- just at random? ("selection" could be confusing) Overall, it feels like the model may be too complicated for what it needs to show. Either (a) the model should show qualitatively that unequal inheritance and reduced production leads to rapid loss -- which a much simpler model, probably just involving a couple of lines of algebra, could show. Or (b) the model should quantitatively reproduce the particular numerical observations from the experiments -- it's not totally clear that it does this (do the cell-cycle-based decay timescales in Fig 7 correspond to the hour-based decay timescales in other plots, for example). At the moment the model is at a (b) level of detail but it's only clear that it's reporting the (a) level of results.

      If the HI-NESS and Fzo1 re-addition experiments work as explained above, all parameters will have direct experimental data, and we should get much closer to (a).

      3.22. A lot of the discussion repeats the results; depending on editorial preferences some of this text could probably be pared back to focus on the literature connections and context.

      We will think about streamlining the discussion once some of the additional material alluded to above has been added.

      3.23. Data availability -- it looks like much of the data required to reproduce the results is not going to be made available. Images and proteomic data are promised, but the data associated with mitochondrial concentration and other features are not mentioned. For FAIR purposes all the data (including statistics from analysis of the images) should be published.

      We maybe didn’t phrase this clearly. All data will be made available. Where technically feasible, this will be directly accessible in a repository, otherwise by request to the corresponding author.

      On our OMERO server, we have deposited many TB of raw images as well as all the intermediate steps such as segmentation masks, and the csv files with all the extracted data for each cell (including background corrections etc). Additionally, we can include csvs with the data grouped in a way that we used to generate all the box blots etc. As of now, the OMERO data is unfortunately only available by requesting a personal guest login from our bioinformatics facility, but we were promised that with the next technical update there will be a public link available. The proteomics data and the model are already fully accessible. The raw western blot images with corresponding ponceau staining will be included with the final publication either as additional supplementary material or in whatever format matches the journal requirements.

      3.24 l660 -- can an overview of the EM protocol be given, to avoid having to buy the Mayer 2024 article?

      The cited paper is open access. But we can also include more details in our method section.

      References:

      Chacko, L. A., H. Nakaoka, R. Morris, W. Marshall, and V. Ananthanarayanan. 2025. 'Mitochondrial function regulates cell growth kinetics to actively maintain mitochondrial homeostasis', bioRxiv.

      Christiano, R., N. Nagaraj, F. Frohlich, and T. C. Walther. 2014. 'Global proteome turnover analyses of the Yeasts S. cerevisiae and S. pombe', Cell Rep, 9: 1959-65.

      Contamine, V., and M. Picard. 2000. 'Maintenance and integrity of the mitochondrial genome: a plethora of nuclear genes in the budding yeast', Microbiol Mol Biol Rev, 64: 281-315.

      Deng, Jingti, Lucy Swift, Mashiat Zaman, Fatemeh Shahhosseini, Abhishek Sharma, Daniela Bureik, Francesco Padovani, Alissa Benedikt, Amit Jaiswal, Craig Brideau, Savraj Grewal, Kurt M. Schmoller, Pina Colarusso, and Timothy E. Shutt. 2025. 'A novel genetic fluorescent reporter to visualize mitochondrial nucleoids', bioRxiv: 2023.10.23.563667.

      Di Bartolomeo, F., C. Malina, K. Campbell, M. Mormino, J. Fuchs, E. Vorontsov, C. M. Gustafsson, and J. Nielsen. 2020. 'Absolute yeast mitochondrial proteome quantification reveals trade-off between biosynthesis and energy generation during diauxic shift', Proc Natl Acad Sci U S A, 117: 7524-35.

      Ebert, A. C., N. L. Hepowit, T. A. Martinez, H. Vollmer, H. L. Singkhek, K. D. Frazier, S. A. Kantejeva, M. R. Patel, and J. A. MacGurn. 2025. 'Sphingolipid metabolism drives mitochondria remodeling during aging and oxidative stress', bioRxiv.

      Jakubke, C., R. Roussou, A. Maiser, C. Schug, F. Thoma, R. Bunk, D. Horl, H. Leonhardt, P. Walter, T. Klecker, and C. Osman. 2021. 'Cristae-dependent quality control of the mitochondrial genome', Sci Adv, 7: eabi8886.

      Khan, Abdul Haseeb, Xuefang Gu, Rutvik J. Patel, Prabha Chuphal, Matheus P. Viana, Aidan I. Brown, Brian M. Zid, and Tatsuhisa Tsuboi. 2024. 'Mitochondrial protein heterogeneity stems from the stochastic nature of co-translational protein targeting in cell senescence', Nature Communications, 15: 8274.

      Martin, J., K. Mahlke, and N. Pfanner. 1991. 'Role of an energized inner membrane in mitochondrial protein import. Delta psi drives the movement of presequences', J Biol Chem, 266: 18051-7.

      Osman, C., T. R. Noriega, V. Okreglak, J. C. Fung, and P. Walter. 2015. 'Integrity of the yeast mitochondrial genome, but not its distribution and inheritance, relies on mitochondrial fission and fusion', Proc Natl Acad Sci U S A, 112: E947-56.

      Perić, Matea, Peter Bou Dib, Sven Dennerlein, Marina Musa, Marina Rudan, Anita Lovrić, Andrea Nikolić, Ana Šarić, Sandra Sobočanec, Željka Mačak, Nuno Raimundo, and Anita Kriško. 2016. 'Crosstalk between cellular compartments protects against proteotoxicity and extends lifespan', Scientific Reports, 6: 28751.

      Roussou, Rodaria, Dirk Metzler, Francesco Padovani, Felix Thoma, Rebecca Schwarz, Boris Shraiman, Kurt M. Schmoller, and Christof Osman. 2024. 'Real-time assessment of mitochondrial DNA heteroplasmy dynamics at the single-cell level', The EMBO Journal, 43: 5340-59-59.

      Seel, A., F. Padovani, M. Mayer, A. Finster, D. Bureik, F. Thoma, C. Osman, T. Klecker, and K. M. Schmoller. 2023. 'Regulation with cell size ensures mitochondrial DNA homeostasis during cell growth', Nat Struct Mol Biol, 30: 1549-60.

      Vowinckel, J., J. Hartl, R. Butler, and M. Ralser. 2015. 'MitoLoc: A method for the simultaneous quantification of mitochondrial network morphology and membrane potential in single cells', Mitochondrion, 24: 77-86.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      This article addresses the connection between perturbed mitochondrial structure and genetics in yeast. When mitochondrial fusion is compromised, what is the chain of causality -- the mechanism -- that leads to mtDNA populations becoming depleted? This is a fascinating question, linking physical cell biology to population genetics. I admire the philosophy of the research, acknowledging and attempt to control for the many possible confounding influences. The manuscript describes the context and the research tightly and digestibly; the figures illustrate the results in a clear and natural way.

      For transparency, I am Iain Johnston and I am happy for this review to be treated as public domain. To my eyes my most important shortcoming as a review is my relative lack of familiarity with the yeast fzo1 mutant; while I am familiar with analysis of yeast mito morphology and mtDNA segregation, a reviewer familiar with the nuances of this strain and its culture would be a useful complement.

      I have a few more general points and a collection of smaller points below that I believe might help make the story more robust.

      General points

      1. About the use of Atp6 as a good proxy for mtDNA content. This is assumed from l285 onwards, based on a previous publication. As the link is fairly central to part of the paper's arguments, and the system in this study is being perturbed in several different ways, a stronger argument or demonstration that this link remains intact (and unchanged, as it is used in comparisons) would seem important.
      2. About confounding variables and processes. The study does an admirable job of being transparent and attempting to control for the many different influences involved in the physical-genetic link. But some remain less clearly unpacked, including some I think could be quite important. For example, there is a lot of focus on mito concentration -- but given the phenotypes are changing the sizes of cells, do concentration changes come from volume changes, mito changes, or both? In "ruling out" mitophagy -- a potentially important (and intuitive) influence, the argument is not presented as directly as it could be and it's not completely clear that it can in fact be ruled out in this way. There are a couple of other instances which I've put in the smaller points below.

      Smaller points

      l47 full genus name when it first appears

      l58 I may be wrong here, but I thought the petite phenotype more classically arises from mtDNA deletion mutations, not loss? The way this is phrased implies that mtDNA loss is [always] the cause. Whether I'm wrong on that point or not, the petite phenotype should be described and referenced.

      para starting l59 -- should mention for context that mitochondria in (healthy, wildtype) yeast are generally much more fused than in other organisms

      Fig 1C -- very odd choice of y-axis range! either start at zero or ensure that the data fill as much vertical space of the plot as possible

      l105 "wild-type like more tubular mitochondria" reads rather awkwardly. "more tubular mitochondria (as in the wild-type)"?

      l106 -- imaging artefacts? are mitos fragmenting because of photo stress? -- this is mentioned in l577-8 in the Methods, but the data from the growth rate and MMP comparison isn't given -- an SI figure would be helpful here. It would be reassuring to know that mito morphology wasn't changing in response to phototoxicity too.

      para l146 -- so this suggests mtDNA-encoded proteins have a very rapid turnover, O(hours) -- is this known/reasonable?

      section l189 -- it's hard to reason fully about these statistics of mitochondrial concentration given that the petite phenotype is fundamentally affecting overall cell volume. can we have details on the cell size distribution in parallel with these results? to put it another way -- how does mitochondrial amount per cell change?

      l199 the mean in Fig S3C certainly does change -- it increases, clearly relative both to control and to its initial value. rather than sweeping this under the carpet we should look in more detail to understand it (a consequence of the increased skew of the distribution)?

      para line 206 -- this doesn't make it clear whether your MMP signal is integrated over all mitochondria in the cell, or normalised by mitochondrial content? this matters quite a lot for the intepretation if the distributions of mitochondrial content are changing. reading on, this is even more important for para line 222. Reading further on, there is an equation on l612 that gives a definition, but it doesn't really clarify (apologies if I'm misunderstanding).

      l230 -- a point of personal interest -- low mito concentrations are connected to low "function" (MMP) and give extended division times -- this is interestingly exactly the model needed to reproduce observations in HeLa cells (https://journals.plos.org/ploscompbiol/article?id=10.1371/journal.pcbi.1002416). That model went on to predict several aspects of downstream cellular behaviour -- it would be very interesting to see how compatible that picture (parameterised using HeLa observations) is with yeast!

      l239 "less mitochondria" -- a bit tricky but I'd say "fewer mitochondria" or "less mitochondrial content"

      Section l234 So here (and in Fig 4) the focus is on overall distributions of mitochondrial concentration in different cells (mother-to-be, mother, bud; gen 1, gen >1). But we've just seen that one effect of fzo1 is to broader the distribution of mitochondrial concentration across cells. Can't we look in more depth at the implications of this heterogeneity? For example in Fig 4F (which is cool) we look at the distribution of all fzo1 mothers-to-be, mothers, and buds. But this loses information about the provenance. For example, do mothers-to-be with extremely low mito concentrations just push everything to the bud, while mothers-to-be with high mito concentrations distribute things more evenly? It would seem very easy and very interesting to somehow subset the distribution of mothers-to-be by concentration and see how different subsets behave

      l285 -- experimental design -- do we know that Atp6 will continue to be a good proxy for functional mtDNA in the face of the perturbations provided by Fzo1 depletion? Especially if there is impact on the expression of mitoribosomes, the relationship between mtDNA and Atp6 may look rather different in the mutant?

      l290 -- ruled out mitophagy. This message could be much clearer. Comparing Fig S5C and Fig 3A side-by-side is a needlessly difficult task -- put Fig 3A into Fig S5. Then we see that when mitophagy is compromised, the distribution of mitochondrial concentration has a lower median and much lower upper quartile than in the mitophagy-equipped Fzo1 mutant? What is going on here? For a paper motivated by disentagling coupled mechanisms, this should be made clearer!

      With the Atp6 signal, how do we know that fluorescence from different cells is comparable? Buds will be smaller than mother cells for example, potentially leading to less occlusion of the fluorescent signal by other content in the cytoplasm

      l336 -- similar to the Jajoo et al. mechanism in fission yeast -- but are you talking about feedback control of the mtDNA or the protein (or mRNA) product?

      l343 -- maintenance of mtDNA -- here the point about l285 (is the Atp6-mtDNA relationship the same in the Fzo1 mutant) is particularly important, as we're directly tying findings about the protein product to implications about the mtDNA

      l367 -- on a first read this description of the model feels like lots of choices have been made without being fully justified. Why a log-normal distribution (when the fit to the data looks rather flawed); why the choice of 5 groups for nucleoid number (why not 3? or 8?); the process used for parameter fitting is very unclear (after reading the methods I think some of these values are read directly from the data, but the shapes of the distributions remain unexplained). l705 -- presumably the ratio was drawn from a log-normal distribution and then the corresponding nucleoid numbers were rounded to integers? the ratio itself wasn't rounded? (also l367) How were the log-normal distributions fitted to experiments (Figs. S7A,B)? Just by eye? l711 by random selection -- just at random? ("selection" could be confusing) Overall, it feels like the model may be too complicated for what it needs to show. Either (a) the model should show qualitatively that unequal inheritance and reduced production leads to rapid loss -- which a much simpler model, probably just involving a couple of lines of algebra, could show. Or (b) the model should quantitatively reproduce the particular numerical observations from the experiments -- it's not totally clear that it does this (do the cell-cycle-based decay timescales in Fig 7 correspond to the hour-based decay timescales in other plots, for example). At the moment the model is at a (b) level of detail but it's only clear that it's reporting the (a) level of results.

      A lot of the discussion repeats the results; depending on editorial preferences some of this text could probably be pared back to focus on the literature connections and context.

      Data availability -- it looks like much of the data required to reproduce the results is not going to be made available. Images and proteomic data are promised, but the data associated with mitochondrial concentration and other features are not mentioned. For FAIR purposes all the data (including statistics from analysis of the images) should be published.

      l660 -- can an overview of the EM protocol be given, to avoid having to buy the Mayer 2024 article?

      Significance

      This is a powerful and thoughtful study that provides a collection of new mechanistic insights into the link between physical and genetic properties of mitochondria in yeast. Cell biologists, geneticists, and the mitochondrial field will find this of potentially deep interest. Because of the mode and dynamics of inheritance in budding yeast, findings here may not be directly transferrable to other eukaryotes, but these insights are still of interest for researchers outside of yeast for their insight into how this well-studied system manages its mitochondrial populations.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      Dengler and colleagues use an AID-based acute depletion of Fzo1 in budding yeast, coupling microfluidics live imaging, single-cell quantification (>30k cells), proteomics, an mtDNA-encoded Atp6 reporter, and simple modeling to argue that fusion loss causes (i) rapid fragmentation and ΔΨm decline, (ii) progressive mtDNA/RC depletion, and (iii) unequal mother-daughter mitochondrial inheritance; together with a failure of compensatory synthesis, these changes drive petite formation. The time-resolved design is valuable, but several readouts are indirect, and some claims (particularly those regarding membrane potential, synthesis "failure," and causality) appear over-interpreted without additional controls.

      Major points

      1. While inducible TIR is used to reduce background, the manuscript should rigorously exclude auxin/TIR off-targets (growth, mitochondrial phenotypes, gene expression). Please include full matched controls: {plus minus}auxin, {plus minus}TIR, epitope tag alone, and a degron control on an unrelated mitochondrial membrane protein.
      2. The Mitoloc preSu9 vs Cox4 import ratio is only a proxy of mitochondrial membrane potential (ΔΨm) and itself depends on mitochondrial mass, protein expression, matrix ATP, and import saturation. The authors need to calibrate ΔΨm with orthogonal dyes (TMRE/TMRM) and pharmacologic titrations (FCCP/antimycin/oligomycin) to generate a response curve; show that Mitoloc tracks dye-based ΔΨm across the relevant range and corrects for mass/photobleaching. Report single-cell ΔΨm vs mass residuals.
      3. To use Atp6-mNeon as a proxy for mtDNA is an assumption. Interpreting Atp6 intensity as "functional mtDNA" could be confounded by translation, turnover, or assembly. Please (i) report mtDNA copy number time courses (you have qPCR), nucleoid counts (DAPI/PicoGreen or TFAM/Abf2 tagging), and (ii) assess translation (e.g., 35S-labeling or puromycin proxies) and turnover (proteasome/AAA protease inhibition, mitophagy mutants -some data are alluded to- plus mRNA levels for mtDNA-encoded genes). This will support the "reduced synthesis" versus "increased degradation" conclusion.
      4. The promoter-NeonGreen reporters argue against transcriptional down-regulation of nuclear OXPHOS. Please add mRNA (RT-qPCR/RNA-seq) for representative genes and a pulse-chase or degradation-pathway dependency (e.g., proteasome/mitophagy/autophagy mutants) to firmly assign active degradation. The authors need to normalize proteomics to mitochondrial mass (e.g., citrate synthase/porin) to separate organelle abundance from protein turnover.
      5. Using preSu9-mCardinal intensity as "mitochondrial concentration" is sensitive to expression, import competence, and morphology/segmentation. The authors should provide validation that this metric tracks 3D volume across fragmentation states (e.g., correlation with mito-GFP volumetrics; detergent-free CS activity; TOMM20/Por1 immunoblot per cell).
      6. The unequal mother-daughter distribution is compelling, but causality remains inferred. Test whether modulating inheritance machinery (actin cables/Myo2, Num1, Mmr1) or altering fission (Dnm1 inhibition) modifies segregation defects and rescues mtDNA/Atp6 decline. Complementation with Fzo1 re-expression at defined times would help order the phenotype cascade.
      7. The model is useful but should include parameter sensitivity (segregation variance, synthesis slopes, initial nucleoid number) and prospective validation (e.g., predict rescue upon partial restoration of synthesis or inheritance, then test experimentally).

      Significance

      The dataset is rich and the time-resolved approach strong, but key conclusions rely on indirect proxies and need orthogonal validation and at least one causal rescue experiment to avoid over-interpretation.

    4. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      This manuscript by Dengler et al examines the mechanisms underlying the mtDNA depletion observed in cells where mitochondrial fusion is disrupted by depletion of the fusion factor Fzo1. In Saccharomyces cerevisiae, the authors deplete Fzo1 and use live-cell imaging of thousands of cells to follow the effects and their dynamic following Fzo1 depletion. They find that Fzo1-depleted cells show very rapid mitochondrial fragmentation (within 1h of Fzo1 depletion), and also an immediate drop in mitochondrial membrane potential (MMP). MtDNA is lost by 15h, and along with it the expression of mitochondrially-encoded proteins. Nuclear-encoded mitochondrial proteins are also decreased though somewhat later, and the authors find that this is largely due to their degradation (probably a consequence of lack of mitochondrial import into low-MMP cells). Most importantly, the study identifies two separate mechanisms that together contribute to the loss of mt-encoded proteins in Fzo1-depleted cells: unequal distribution of mitochondria during cell division and the reduction of a fusion-dependent compensatory synthesis of mt-encoded proteins. Unexpectedly, Fzo1-depleted cells end up passing an increased (rather than decreased) amount of mitochondria and mitochondria-encoded proteins to their daughters. Over several generations, and combined with the loss of the compensatory synthesis of more mt-encoded proteins, this leads to the progressive loss of mtDNA and mtDNA-encoded proteins in the population.

      The study is very thorough and the experiments contain the appropriate controls. The conclusions are convincing and largely supported by the experimental data that has been appropriately replicated. The data presentation is generally clear although the text could benefit from some streamlining.

      However, addressing the following major comments is required:

      1. Statistical comparisons are missing throughout the manuscript (with the exception of Fig. 2c). Appropriate statistical tests, along with p-values, should be used and reported where different gorups are compared, for example (but not limited to) Fig. 3d and most panels of Fig. 4.
      2. I do not agree with the use of Atp6 protein as a direct read-out of mtDNA content. While Atp6 protein levels will decrease with decreasing mtDNA content, the inverse is not necessarily true: decreased Atp6 protein levels do not necessarily indicate decreased mtDNA levels, because they could alternatively or additionally be caused by decreased transcription and/or translation. Therefore, please do not equate Atp6 protein levels to mtDNA levels, and instead rephrase the text referencing the Atp6 experiments in the Results and Discussion sections to measure "mtDNA expression" or "mt-encoded protein" or similar. For example, on p. 14 line 431 should read "mtDNA expression" rather than "decreased synthesis of mtDNA", and line 440 on the same page "mean mtDNA levels" should be "mtDNA expression" or similar.
      3. In Fig. 3, the authors use the fluorescence intensity of a mitochondrially-targeted mCardinal as a read-out of mitochondrial mass. Please provide evidence that this is not affected by MMP, either with relevant references or by control experiments (e.g. comparing it to N-acridine orange or other MMP-independent dyes or methods).
      4. In Fig. 2e-f, the authors use a promoter reporter with Neongreen to answer whether the reduced levels of the nuclear-encoded mitochondrial proteins Mrps5 and Qcr7 are due to decreased expression or to protein degradation, and find no evidence of degradation of the Neongreen reporter protein. However, subcellular localization might affect the availability of the protein to proteases. Although not absolutely required, it would be relevant to know if the Neongreen fusion protein is found in the same subcellular compartment as Mrps5 and Qcr7 at 0h and 9h after Fzo1 depletion.
      5. Fzo1 depletion leads to a very rapid drop in MMP during the first hour of depletion. In the Discussion, can the authors speculate on the possible mechanism of this rapid MMP drop that occurs well before mtDNA or mt-encoded proteins are decreased in level?
      6. In Fig. 2a, the mtDNA copy number of Fzo1-depleted cells is ca 1.3-fold of the control cells at the 0h timepoint. Why might this be? Is it an impact of one of the inducers? If so, we might be looking at the combination of two different processes when measuring copy number: one that is an induction caused by the inducer(s), and the other a consequence of Fzo1 depletion itself.

      Minor comments:

      • p. 3, line 71: "ten thousands of dividing cells.." should be "tens of thousands of dividing cells".
      • p.4, line 116: please be even more clear with what the "depleted" cells and controls are treated with: are depleted cells treated with both inducers, and controls with neither?
      • p.5, lines 147-148: the authors write "the rate with which the abundance of Cox2 and Var1 proteins decreases was similar to the rate of mtDNA loss" though the actual rate is not shown. Please calculate and show rates for these processes side by side to make comparison possible, or alternatively rephrase the statement.
      • Fig. 2d: changing the y-axis numbering to match those in panels a and b would facilitate comparisons.
      • Fig. 2e: it is recommended to label the western blot panels to indicate what protein is being imaged in each (Neongree,, Mrps5, Qcr7).
      • p.9, line 262: I suggest referencing Fig. 4e at the end of the first sentence for clarity.
      • In the sections related to Fig. 3a and Fig. 5a as well as the connected supplemental data, the authors discuss both the median and the mean of mitochondrial mass and Atp6 protein, respectively. For purposes of clarity, I suggest decreasing the focus on the mean (that is provided only in the supplemental data) and focusing the text mainly on the median. The two show differing trends and it is very good that both are shown, but the clarity of the text can be improved by focusing more on the median where possible.
      • p. 14, line 435: the statement that mt mass is maintained over the first 9h of depletion is only true for the mean mt mass, not for the median. Please make this clear or rephrase.
      • p.14, line 452: "mitofusions" should be "mitofusins".

      Referees cross-commenting

      I think that the reviews of the other two reviewers are both insightful and constructive. Especially the rescue experiment suggested by Reviewer 2 could provide strong support for the interpretations of the study. Note that all three reviewers ask for validation of the use of Atp6p as a read-out of mtDNA function, and that all agree the data is powerful and the study of value to the field.

      Significance

      The fact that disruption of mt fusion leads to mtDNA loss has been known for some time, but the mechanism behind this phenomenon has remained unknown to date. This thorough and precise study by Dengler et al uses state-of-the-art single-cell analysis to dissect the mechanisms underlying the mtDNA loss following the disruption of mt fusion, and convincingly reveal that it is caused by two different mechanisms: i) the inequal inheritance of mitochondria between mother and bud, and ii) the loss of a compensatory mechanism that normally maintains homeostatic mt protein levels. In the process, the authors shed light on the dynamics of the events following Fzo1 depletion, revealing dramatically fast mt fragmentation and a loss of MMP, which in turn can be expected to act as a stress signal and influence a number of cellular processes.

      The findings of the study can have relevance for human conditions involving disrupted mitochondrial dynamics, caused for example by mutations in mitofusins. The study will be of interest to researchers in mitochondrial biology ranging from dynamics and mtDNA maintenance to mitochondrial medicine.

      The field of expertise of this reviewer: mtDNA maintenance. I am not able to properly evaluate the modelling in Fig. 7.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1(Public Review):

      The correlation between rebound excitation and song structure (e.g., harmonic stack duration) may depend on outliers, such as birds with harmonic stacks >150ms.

      If in wild zebra finch, or even if in domesticated zebra finch including our birds and the birds from the other labs that we evaluated, the distribution of durations of longest harmonic stacks has a long tail, it is not apparent that birds with long duration harmonic stacks are properly considered as outliers. Examining the distribution of motif durations (a less derived statistic) in 33 birds (Fig. 2C) does not support the idea that birds with longer duration songs are outliers. Thus, we view the reviewer question as addressing whether there are different mechanisms operating in birds with long harmonic stacks than for other birds. Unfortunately, the numbers of long-duration harmonic stack birds are too small to give confidence in any statistical analysis of that group. Thus, we limited our re-analysis to the data excluding birds with harmonic stacks >150ms (which is arbitrary), examining how these birds influence our conclusions. We conclude that the influence of the excluded birds on the overall result is modest. The updated results are presented in Supplemental Figure 6, and the Results section has been revised to state:

      “We found that while some of the p values increased above 0.05 (p = 0.058 for rebound area vs. longest harmonic stack and p = 0.082 for sag ratio and longest harmonic stack), it remained significant for firing frequency and longest stack (Pearson’s R, p = 0.0017) and for sag ratio and motif duration (p = 0.024). However, when sag ratio was compared against the duration of the motif excluding the longest harmonic stack, there was no relationship (p = 0.85).”

      There is a disconnect between the physiological measurements and the HH model presented.

      We acknowledge that addressing this limitation would involve additional experimental and modeling assumptions. Rather than overextending our interpretations, we have clarified the limitations of the current study in the Discussion:

      “While this HH model provides a plausible framework for linking intrinsic properties to sequence propagation, it does not fully account for the observed relationship between IPs and song structure. A principal limitation constraining the current model is the absence of information for the same neurons combining characterization of both IPs and network activity during singing (or song playback), when HVC<sub>X</sub> express activity related to song features. Addressing this gap would requires additional and challenging experiments and is beyond the scope of this study.”

      Although disynaptic inhibition between HVC<sub>X</sub> neurons and between HVC<sub>RA</sub> and HVC<sub>X</sub> neurons is well established, I am not aware of any data indicating direct synaptic connections between HVC<sub>X</sub> neurons.

      This is an important theoretical point about the reliance of the intervaldetecting network model on HVC<sub>X</sub> neurons and about how the model would change if many of the HVC<sub>X</sub> were swapped for HVC<sub>RA</sub> neurons. Connections between HVC<sub>RA</sub> neurons to HVC<sub>X</sub> neurons are established, whereas there is relative paucity of evidence for HVC<sub>X</sub> to HVC<sub>X</sub> connectivity. This is based on work from Prather and Mooney, 2005 (among others) which performed paired sharp electrode recordings to characterized connections in HVC. This work found very few HVC<sub>X</sub> - HVC<sub>X</sub> connections. However, if connected HVC<sub>X</sub> neurons are physically more distant from each other than are connected HVC<sub>RA</sub> – HVC<sub>X</sub> neurons, they would more likely be missed in blind paired recordings. Using different approaches, recent results from the Roberts lab (Trusel et al.,eLife,  2025) supports the existence of robust HVC<sub>X</sub>  - HVC<sub>X</sub>  connections.

      Reviewer #2(Public Review):

      The interpretation of p-values is rigid, and near-significant results (e.g., p = 0.06) are dismissed without discussion.

      We revised the text to reflect a more nuanced and consistent interpretation of p-values and updated the reporting to include exact values. For example, the Results section now states:

      "Nonetheless, the longest syllable duration was not significantly correlated with the average sag ratio for each bird (Pearson’s R: R<sup>2</sup> = 0.12, p = 0.065, Supplemental Fig. 2, top left panel), though it is trending toward significance (see Discussion)”

      The conclusion that harmonic stacks influence intrinsic properties lacks necessary controls.

      We have attempted to further clarify that harmonic stacks were used as a representative feature of temporal song structure rather than a unique determinant of intrinsic properties. The Discussion now states:

      “Although harmonic stacks provide a useful test case for studying temporal integration, our findings suggest that IPs are broadly linked to song duration and structure, rather than specific syllable types. This is also consistent with prior results that found all HVC<sub>X</sub> ion currents that were modeled were influenced by song learning[31].”

      The relationship between rebound area and experimentally tutored birds was not fully explored.

      We expanded the analysis to include rebound area in instrumentally tutored birds, which has now been incorporated into Figure 4C. These additional analyses also robustly support our hypotheses. The Results section has been updated to state:

      “We then evaluated the IPs of HVC<sub>X</sub> in the birds from the two groups. HVC<sub>X</sub> neurons from birds who sang unmodified songs (N = 5 birds, 31 neurons), which had shorter harmonic stacks and shorter overall duration, had lower sag ratios (Mann-Whitney: p = 0.025), firing frequency (Mann-Whitney, p = 0.0051) and rebound area (Mann-Whitney: p = 0.0003)”

      Reviewer #3 (Public Review):

      Limited data supports the claim that intrinsic properties influence temporal integration windows.

      While we agree that further data could strengthen this claim, we show that this can happen in principle (Figure 5) but believe that the appropriate experiment to test this requires further experiments in-vivo. We emphasize in the Discussion:

      “Our findings suggest that post-inhibitory rebound excitation in HVC<sub>X</sub> could expand temporal integration. Ultimately, experiments combining in vitro with in vivo recordings can directly quantify this effect. We hope our results motivate such experiments.”

      Technical Corrections

      (1) Fixed typographical errors (e.g., Line 177: corrected "r2 = 4" to "r2 = 0.4").

      (2) Revised figure legends for clarity (e.g., Figure 4E now includes tutoring design details).

      (3) Updated methods to specify how motifs were defined and measured.

      Revised Figures

      Figure 4: Updated to include analysis of rebound area in instrumentally tutored birds, reflecting the relationship between experimental tutoring and intrinsic properties.

      Supplemental Figure 6: Correlation analysis excluding outliers

    1. Author response:

      The following is the authors’ response to the previous reviews

      Reviewer #1 (Public review):

      This report addresses a compelling topic. However, I have significant concerns, which necessitate a reassessment of the report's overall value.

      Anatomical Specificity and Stimulation Site:

      While the authors clarify that the ventral MGB (MGv) was the intended stimulation target, the electrode track (Fig. 1A) and viral spread (Fig. 2E) suggest possible involvement of the dorsal MGB (MGd) and broader area. Given that MGv-AI and MGd-AC pathways have distinct-and sometimes opposing-effects on plasticity, the reported LTP values (with unusually small standard deviations) raise concerns about the specificity of the findings. Additional anatomical verification would help resolve this issue.

      We thank the reviewer for highlighting the importance of anatomical specificity in MGv targeting. In the revised manuscript, we have taken several steps to address these issues:

      (1) Higher-magnification histology has been added to Figure 1A, clearly identifying the electrode tip localized within the MGv.

      (2) Figure 2E has been replaced with a new image showing viral expression largely confined to MGB, with minimal spread to surrounding structures.

      (3) In the Discussion, we explicitly acknowledge that although targeting was guided by stereotaxic coordinates and histological confirmation, some viral spread throughout the MGB occurred. We also discuss the possibility that both MGv-A1 and MGd-AC pathways may contribute to the recorded responses, which could influence the observed plasticity, as previously suggested by the reviewer.

      These additions and acknowledgments are now incorporated to ensure the reader can interpret the data with full consideration of anatomical targeting limitations.

      Results section:

      “Higher-magnification histology confirmed accurate MGv targeting (Figure 1A, lower-middle panel)’”

      Discussion section:

      “Although our experiment targeting the MGv was guided by stereotaxic coordinates and verified post hoc, we acknowledge potential contributions from non-lemniscal medial geniculate nucleus dorsal (MGd) projections. Anatomical and physiological evidence indicates that MGv-AC projections provide rapid, frequency‑specific, tonotopically organized excitation, whereas MGd pathways target higher‑order auditory cortex with broader tuning, less precise tonotopy, longer response latencies, and greater context‑dependence, features that can differentially shape cortical sensory integration and plasticity (Lee and Sherman, 2010; Smith et al., 2012; Ohga et al., 2018; Lee, 2015; Hu, 2003). While the co-recruitment of lemniscal and non-lemniscal inputs may enhance the generality of our CCK-dependent mechanism, the differing response characteristics of these pathways suggest subtle differences in their relative engagement in the observed plasticity. Future pathway-specific manipulations will help clarify their respective contributions”

      Lee, C.C., and Sherman, S.M. (2010). Topography and physiology of ascending streams in the auditory tectothalamic pathway. Proceedings of the National Academy of Sciences 107, 372-377. doi:10.1073/pnas.0907873107.

      Smith, P.H., Uhlrich, D.J., Manning, K.A., and Banks, M.I. (2012). Thalamocortical projections to rat auditory cortex from the ventral and dorsal divisions of the medial geniculate nucleus. Journal of Comparative Neurology 520, 34-51.

      Ohga, S., Tsukano, H., Horie, M., Terashima, H., Nishio, N., Kubota, Y., Takahashi, K., Hishida, R., Takebayashi, H., and Shibuki, K. (2018). Direct Relay Pathways from Lemniscal Auditory Thalamus to Secondary Auditory Field in Mice. Cerebral Cortex 28, 4424-4439. 10.1093/cercor/bhy234.

      Lee, C.C. (2015). Exploring functions for the non-lemniscal auditory thalamus. Frontiers in Neural Circuits 9, 69.

      Hu, B. (2003). Functional organization of lemniscal and nonlemniscal auditory thalamus. Experimental Brain Research 153, 543-549. 10.1007/s00221-003-1611-5.

      Figure legend section:

      “Post-hoc histology at higher magnification (lower-middle) shows the electrode tip confined within the MGv. White lines delineate the MGv/MGd border based on cytoarchitectonic landmarks.”

      Statistical Rigor and Data Variability:

      The remarkably low standard deviations in LTP measurements are unexpected based on established variability in thalamocortical plasticity. The authors' response confirms these values are accurate, but further justification, such as methodological controls or replication-would bolster confidence in these results. Additionally, the comparison of in vivo vs. in vitro LTP variability requires more substantive support.

      We appreciate the reviewer's concern regarding the unusually small variability. We would like to clarify that the error bars in our figures represent Standard Error of the Mean (SEM) rather than Standard Deviations (SD). As SEM is derived from the SD while incorporating sample size, it is inherently smaller than SD, which may have led to the impression of unrealistically low variability. This has now been explicitly clarified in the figure legends and Methods.

      To illustrate the raw variability, we have added Supplementary Figure S1E showing unaveraged fEPSP slopes compare to SEM, corresponding to Figure S1C. This addition ensures transparency and allows readers to directly assess the quality and consistency of our recordings.

      Regarding the comparison between in vivo and in vitro LTP variability:

      We agree that clarifying the basis of our in vivo vs. in vitro variability comparison is important. For example, in Chen et al., 2019, using identical LTP induction protocols (Fig. J), the SED of in vitro slice measurements (Fig. K) was substantially larger than that of in vivo recordings (Fig. L).

      This difference likely reflects:

      (1) In vitro: neighboring data points within a single experiment are highly correlated; variability across experiments is large due to heterogeneous sensitivity to LTP induction (10–200% increasement).

      (2) In vivo: lower correlation between neighboring data points, but each is averaged from 12 recordings over 2 min, reducing cross-trial variability; sensitivity to LTP induction is less variable across experiments (5–60% changes).

      We hope that these clarifications and additional data address the reviewer’s concerns regarding statistical rigor and data variability.

      Methods section:

      “The slopes of the evoked fEPSPs were calculated and normalized using a customized MATLAB script, and the group data were plotted as mean ± Standard Error of the Mean (SEM).”

      “All data are presented as mean ± SEM. Error bars and shaded areas represent SEM. Here, n represents the number of stimulation-recording sites or and N represents the number of animals in each experiment. At each time point, fEPSPs were averaged across 12 consecutive trials (2 min) to reduce within-experiment fluctuation. Normalized time courses were then used for repeated-measures analyses.”

      Figure legend section:

      “Data are mean ± SEM; error bars indicate SEM.”

      “(E) Unaveraged fEPSP slopes are shown for each time point, with individual data points corresponding to all sites included in Fig. 1C; mean ± SEM overlays are shown in black. Note that all individual data points are displayed in this figure, whereas in Figure S1C, only the averaged values are shown.”

      Viral Targeting and Specificity:

      The manuscript does not clearly address whether cortical neurons were inadvertently infected by AAV9. Given the potential for off-target effects, explicit confirmation (e.g., microphotograph of stimulation site) would strengthen the study's conclusions.

      We appreciate the request for quantitative confirmation of off-target cortical infection. We clarify that our histological verification was conducted by systematic sampling rather than exhaustive quantification. Under the same sampling procedure, we did not detect tdTomato-positive cortical somata after AAV9‑Syn‑ChrimsonR‑tdTomato injections into the MGB, whereas we observed rare EYFP-positive cortical somata after AAV9‑EF1a‑DIO‑ChETA‑EYFP (median < 1 cell per 0.4 × 0.4 mm² section, Supplementary Figure S1E). Although these observations do not constitute a formal statistical estimate, they were consistent across sampled sections and are in line with the low-level trans-synaptic transfer reported for AAV9. We have discussed their potential implications for data interpretation in the Discussion.

      We hope these clarifications and the newly presented histological evidence address the reviewer’s concerns and further strengthen the rigor of our study.

      Discussion section:

      “Another potential limitation of our study is the trans-synaptic transfer property of AAV9 (Figure S1F). To mitigate this risk, we carefully control the injection volume, rate, and viral expression time, while also verifying expression post-hoc. Systematic sampling histological analysis detected no tdTomato-positive cortical somata in the ACx (Figure 2E lower panel), whereas rare EYFP-positive cortical somata were observed after AAV9-EF1a-DIO-ChETA-EYFP injections (median < 1 cell in 0.4 × 0.4 mm2 section, Figure S1F, corresponds to Figure 2A upper-middle panel). These construct‑dependent observations align with occasional low‑level trans‑synaptic transfer reported for AAV9 (Zingg et al., 2017) and indicate that off‑target cortical infection was negligible for ChrimsonR and exceedingly rare for ChETA under our experimental conditions.”

      Zingg, B., Chou, X.L., Zhang, Z.G., Mesik, L., Liang, F., Tao, H.W., and Zhang, L.I. (2017). AAV-Mediated Anterograde Transsynaptic Tagging: Mapping Corticocollicular Input-Defined Neural Pathways for Defense Behaviors. Neuron 93, 33-47. 10.1016/j.neuron.2016.11.045.

      Figure legend:

      “Representative histological images demonstrating low-level transsynaptic spread following AAV9-EF1a-DIO-ChETA-EYFP injection into the MGv. Rare EYFP-positive cortical neurons were observed (median < 1 cell per 0.4 × 0.4 mm² section). Scale bar: 100 µm.”

      Integration of Prior Literature:

      The discussion of existing work is adequate but could be more comprehensive. A deeper engagement with contrasting findings would provide better context for the study's contributions.

      We appreciate the reviewer’s suggestion to engage more deeply with contrasting findings. In the revised Introduction and Discussion, we have:

      (1) Refocused the historical context toward adult auditory thalamocortical plasticity and explicitly contrasted it with visual and somatosensory cortices, while adult ACx exhibits weaker and more gated NMDAR dependence.

      (2) Positioned CCK–CCKBR signaling as a permissive/gating mechanism that can complement or partially compensate for postsynaptic NMDAR signaling, potentially reconciling variability across cortical areas and life stages.

      (3) Clarified the potential differential contributions of lemniscal (MGv) and non‑lemniscal (MGd) streams to plasticity expression and variability, acknowledging pathway-specific response properties.

      These additions are now integrated in the Introduction (paragraphs 2–3) and Discussion (sections “CCK Dependence of Thalamocortical Neuroplasticity in the ACx” and “Developmental and Age‑Dependent CCK‑Mediated Plasticity”), providing a more comprehensive and balanced context for our findings.

      Introduction section:

      “However, converging evidence shows that thalamocortical inputs retain a capacity for experience-dependent modification in adulthood. Sensory enrichment or deprivation can gate or reinstate thalamocortical plasticity. In the adult ACx, pairing sounds with neuromodulatory drive can reshape cortical representations. In vivo high-frequency stimulation (HFS) of dorsal lateral geniculate nucleus (LGN) or medial geniculate body (MGB) induces LTP in sensory cortices and has been linked to perceptual learning beyond the critical period. Notably, auditory thalamocortical plasticity appears less dependent on NMDA receptors compared to other cortical regions. The mechanisms underlying thalamocortical plasticity in the mature brain remain poorly understood.

      Cholecystokinin (CCK) and its receptor CCK-B receptor (CCKBR) are well positioned to influence thalamocortical transmission: Cck mRNA is abundant in MGB neurons and CCKBR is enriched in layer IV of ACx, the principal thalamorecipient layer.”

      Discussion section:

      “These findings suggest a potential involvement of CCK in thalamocortical plasticity. Our data extend this framework by identifying CCK–CCKBR signaling as a permissive modulator of adult thalamocortical LTP.”

      “We propose that CCKBR activation may trigger intracellular calcium release and AMPAR recruitment in parallel to, or partially compensating for,independently of postsynaptic NMDAR signaling, while the complementarity of CCKBR and NMDARs may contribute to robust thalamocortical plasticity. This complementary arrangement may reconcile differences across developmental stages and cortical areas, and highlights neuropeptidergic signaling as a lever to re-enable adult thalamocortical plasticity.

      Notably, exogenous CCK alone failed to induce LTP in the absence of accompanying stimulation (Figure S2A and S2B), emphasizing that CCK function as a modulator rather than a direct initiator of LTP. Activation of the thalamocortical pathway is also essential for LTP induction. Although our experiment targeting the MGv was guided by stereotaxic coordinates and verified post hoc, we acknowledge potential contributions from non-lemniscal medial geniculate nucleus dorsal (MGd) projections. Anatomical and physiological evidence indicates that MGv-AC projections provide rapid, frequency‑specific, tonotopically organized excitation, whereas MGd pathways target higher‑order auditory cortex with broader tuning, less precise tonotopy, longer response latencies, and greater context‑dependence, features that can differentially shape cortical sensory integration and plasticity. While the co-recruitment of lemniscal and non-lemniscal inputs may enhance the generality of our CCK-dependent mechanism, the differing response characteristics of these pathways suggest subtle differences in their relative engagement in the observed plasticity. Future pathway-specific manipulations will help clarify their respective contributions. Another potential limitation of our study is the trans-synaptic transfer property of AAV9 (Figure S1F). To mitigate this, we carefully controlled the injection volume, rate, and viral expression time, and conducted post-hoc histological analyses to minimize off-target effects, thereby reducing the likelihood of trans-synaptic transfer confounding the interpretation of our findings.”

      Therapeutic Implications:

      The authors' discussion of therapeutic potential is now appropriately cautious and well-reasoned.

      Conclusion:

      While the study presents intriguing findings, the concerns outlined above must be addressed to fully establish the validity and impact of the results. I appreciate the authors' efforts thus far and hope they can provide additional data or clarification to resolve these issues. With these revisions, the manuscript could make a valuable contribution to the field.

      Reviewer #2 (Public review):

      Summary:

      This work used multiple approaches to show that CCK is critical for long-term potentiation (LTP) in the auditory thalamocortical pathway. They also showed that the CCK mediation of LTP is age-dependent and supports frequency discrimination. This work is important because is opens up a new avenue of investigation of the roles of neuropeptides in sensory plasticity.

      Strengths:

      The main strength is the multiple approaches used to comprehensively examine the role of CCK in auditory thalamocortical LTP. Thus, the authors do provide a compelling set of data that CCK mediates thalamocortical LTP in an age-dependent manner.

      Weaknesses:

      There are some details that should be addressed, primarily regarding potential baseline differences in comparison groups. The behavioral assessment is relatively limited, but may be fleshed out in future work.

      We appreciate the reviewer’s suggestion regarding potential baseline differences. In our study, all groups underwent harmonized procedures, including identical exposure, timing, and acquisition parameters. Group allocation and data collection were performed under standardized conditions. For electrophysiology, baseline fEPSP measures and stimulation intensities were calibrated per site using consistent input-output procedures, with analyses based on normalized slopes relative to each site’s own baseline. For behavior, animals from the same litter served as both experimental and control groups, matched for handling conditions; startle/PPI data were acquired using identical hardware and timing settings. While no additional post hoc re-processing was performed, we have clarified these controls in the Methods to enhance transparency.

      We agree that the behavioral assessment is intentionally focused and does not encompass broader auditory perceptual functions (e.g., temporal processing). We now explicitly state this limitation and propose future studies to examine temporal acuity and cell-type-specific manipulations. These experiments will clarify how CCK-dependent thalamocortical plasticity generalizes to other perceptual domains.

      Reviewer #3 (Public review):

      Summary:

      Cholecystokinin (CCK) is highly expressed in auditory thalamocortical (MGB) neurons and CCK has been found to shape cortical plasticity dynamics. In order to understand how CCK shapes synaptic plasticity in the auditory thalamocortical pathway, they assessed the role of CCK signaling across multiple mechanisms of LTP induction with the auditory thalamocortical (MGB - layer IV Auditory Cortex) circuit in mice. In these physiology experiments that leverage multiple mechanisms of LTP induction and a rigorous manipulation of CCK and CCK-dependent signaling, they establish an essential role of auditory thalamocortical LTP on the co-release of CCK from auditory thalamic neurons. By carefully assessing the development of this plasticity over time and CCK expression, they go on to identify a window of time that CCK is produced throughout early and middle adulthood in auditory thalamocortical neurons to establish a window for plasticity from 3 weeks to 1.5 years in mice, with limited LTP occurring outside of this window. The authors go on to show that CCK signaling and its effect on LTP in the auditory cortex is also capable of modifying frequency discrimination accuracy in an auditory PPI task. In evaluating the impact of CCK on modulating PPI task performance, it also seems that in mice <1.5 years old CCK-dependent effects on cortical plasticity is almost saturated. While exogenous CCK can modestly improve discrimination of only very similar tones, exogenous focal delivery of CCK in older mice can significantly improve learning in a PPI task to bring their discrimination ability in line with those from young adult mice.

      Strengths:

      (1) The clarity of the results, along with the rigor multi-angled approach, provide significant support for the claim that CCK is essential for auditory thalamocortical synaptic LTP. This approach uses a combination of electrical, acoustic, and optogenetic pathway stimulation alongside conditional expression approaches, germline knockout, viral RNA downregulation and pharmacological blockade. Through the combination of these experimental configures the authors demonstrate that high-frequency stimulation-induced LTP is reliant on co-release of CCK from glutamatergic MGB terminals projecting to the auditory cortex.

      (2) The careful analysis of the CCK, CCKB receptor, and LTP expression is also a strength that puts the finding into the context of mechanistic causes and potential therapies for age-dependent sensory/auditory processing changes. Similarly, not only do these data identify a fundamental biological mechanism, but they also provide support for the idea that exogenous asynchronous stimulation of the CCKBR is capable of restoring an age-dependent loss in plasticity.

      (3) Although experiments to simultaneously relate LTP and behavioral change or identify a causal relationship between LTP and frequency discrimination are not made, there is still convincing evidence that CCK signaling in the auditory cortex (known to determine synaptic LTP) is important for auditory processing/frequency discrimination. These experiments are key for establishing the relevance of this mechanism.

      Weaknesses:

      (1) Given the magnitude of the evoked responses, one expects that pyramidal neurons in layer IV are primarily those that undergo CCK-dependent plasticity, but the degree to which PV-interneurons and pyramidal neurons participate in this process differently is unclear.

      We agree with the reviewer that the relative contributions of pyramidal neurons and PV-interneurons to CCK-dependent thalamocortical plasticity remain to be determined. Our recordings primarily reflected excitatory postsynaptic activity from layer IV pyramidal neurons, given the fEPSP metrics used. As PV-interneurons are essential in shaping cortical inhibition and temporal precision, they may also be modulated by CCK release from thalamocortical inputs. We have explicitly acknowledged this limitation in the Discussion section of the manuscript and propose that future studies should employ cell-type-specific recording or manipulation approaches to dissect the respective roles of inhibitory and excitatory neuronal populations in CCK-dependent thalamocortical plasticity. We appreciate the reviewer’s suggestion and believe this is a valuable direction for ongoing research.

      (2) While these data support an important role for CCK in synaptic LTP in the auditory thalamocortical pathway, perhaps temporal processing of acoustic stimuli is as or more important than frequency discrimination. Given the enhanced responsivity of the system, it is unclear whether this mechanism would improve or reduce the fidelity of temporal processing in this circuit. Understanding this dynamic may also require consideration of cell type as raised in weakness #1.

      We acknowledge that the current study primarily examined frequency discrimination and did not directly assess temporal processing. Enhanced network responsivity could have variable effects on temporal precision, depending on the balance between excitation and inhibition. PV-interneurons, in particular, are known to support temporal fidelity in auditory processing (Nocon et al., 2023; Cai et al., 2018). We discussion that future work should investigate how CCK modulation influences temporal coding at both the circuit and single-cell level, and whether such changes align with or diverge from the mechanisms underlying frequency discrimination improvements.

      (3) In Figure 1, an example of increased spontaneous and evoked firing activity of single neurons after HFS is provided. Yet it is surprising that the group data are analyzed only for the fEPSP. It seems that single neuron data would also be useful at this point to provide insight into how CCK and HFS affect temporal processing and spontaneous activity/excitability, especially given the example in 1F.

      We appreciate the reviewer’s suggestion. While we recorded single-unit activity during HFS protocols, long-term stability over >1.5 hours was less consistent compared to fEPSP measurements, leading to higher variability in spike-based metrics. We therefore used fEPSPs as our primary quantitative measure for robustness. We agree, however, that single-neuron data could yield valuable complementary insights. In future experiments combining stable single-unit recording with synaptic measurements will be conducted to better link cellular excitability and network plasticity.

      (4) The circuitry that determines PPI requires multiple brain areas, including the auditory cortex. Given the complicated dynamics of this process, it may be helpful to consider what, if anything, is known specifically about how layer IV synaptic plasticity in the auditory cortex may shape this behavior.

      We agree that PPI involves multiple cortical and subcortical nodes. In our paradigm, layer IV neurons receive segregated MGv inputs, high-frequency activation of thalamocortical projections induces robust synaptic plasticity in layer IV. The potentiation at these synapses could amplify the cortical representation of weak prepulses, facilitating their detection and enhancing PPI performance. This interpretation is consistent with prior work showing that local CCK infusion combined with auditory stimuli can augment cortical responses (Li et al., 2014). We have expanded the Discussion to highlight that in aged animals, where baseline PPI performance is often reduced due to degraded auditory inputs (Ouagazzal et al., 2006; Young et al., 2010), restoring thalamocortical plasticity via CCK may partially compensate for sensory gating deficits. We further note that the exact contribution of layer IV to PPI circuitry warrants future investigation using pathway-specific perturbations.

      Comments on revisions:

      The manuscript is much improved and many of the issues or questions have been addressed. Ideally, evidence for the degree of transsynaptic spread for AAV9-Syn-ChrimsonR-tdTomato would also be provided in some form since in the authors' response in sounds like some was observed, as expected.

      We thank the reviewer for this important point and for the opportunity to clarify. As requested, we have carefully examined the possibility of transsynaptic spread in our experiments:

      We clarify that our histological verification was conducted by systematic sampling rather than exhaustive quantification. Under the same sampling procedure, we did not detect tdTomato-positive cortical somata after AAV9‑Syn‑ChrimsonR‑tdTomato injections into the MGB, whereas we observed rare EYFP-positive cortical somata after AAV9‑EF1a‑DIO‑ChETA‑EYFP (median < 1 cell per 0.4 × 0.4 mm² section, see Figure 2A and Figure S1F), consistent with occasional low-level transsynaptic spread reported in the literature.

      We have updated the Discussion sections to clearly report these findings, and to emphasize the potential for vector- and construct-dependent variability in transsynaptic spread. We also explicitly acknowledge this technical limitation and discuss its implications for data interpretation.

      We hope these clarifications and additions address the reviewer’s concern regarding viral specificity and transsynaptic spread.

      Discussion section:

      “Another potential limitation of our study is the trans-synaptic transfer property of AAV9 (Figure S1F). To mitigate this risk, we carefully control the injection volume, rate, and viral expression time, while also verifying expression post-hoc. Systematic sampling histological analysis detected no tdTomato-positive cortical somata in the ACx (Figure 2E lower panel), whereas rare EYFP-positive cortical somata were observed after AAV9-EF1a-DIO-ChETA-EYFP injections (median < 1 cell in 0.4 × 0.4 mm2 section, Figure S1F, corresponds to Figure 2A upper-middle panel). These construct‑dependent observations align with occasional low‑level trans‑synaptic transfer reported for AAV9 (Zingg et al., 2017) and indicate that off‑target cortical infection was negligible for ChrimsonR and exceedingly rare for ChETA under our experimental conditions.”

      Zingg, B., Chou, X.L., Zhang, Z.G., Mesik, L., Liang, F., Tao, H.W., and Zhang, L.I. (2017). AAV-Mediated Anterograde Transsynaptic Tagging: Mapping Corticocollicular Input-Defined Neural Pathways for Defense Behaviors. Neuron 93, 33-47. 10.1016/j.neuron.2016.11.045.

      Figure legend:

      " Representative histological images demonstrating low-level transsynaptic spread following AAV9-EF1a-DIO-ChETA-EYFP injection into the MGv. Rare EYFP-positive cortical neurons were observed (median < 1 cell per 0.4 × 0.4 mm² section). Scale bar: 100 µm."

      Reviewer #1 (Recommendations for the authors):

      Thank you for your efforts in revising the manuscript. While progress has been made, I have a few remaining concerns that I hope you can address to further strengthen the study.

      Focus of the Introduction:

      Auditory thalamocortical plasticity is known to be NMDA-dependent, albeit with weaker dependence during early development. Given that this work examines thalamocortical LTP in young adult and aged mice, I recommend refining the Introduction to place greater emphasis on auditory thalamocortical plasticity in the adult brain. The current discussion of somatosensory plasticity during early development, while interesting, seems less directly relevant to the present study. A sharper focus on the auditory system would better frame your research questions.

      We thank the reviewer for this constructive suggestion. We have revised the Introduction to emphasize adult auditory thalamocortical plasticity and to streamline content less directly related to our study. Specifically:

      (1) We now foreground evidence that thalamocortical inputs retain experience-dependent plasticity beyond the critical period in adult ACx, including neuromodulatory pairing, HFS-induced LTP, and experience-dependent reinstatement.

      (2) We explicitly note that adult auditory thalamocortical plasticity is more weakly NMDAR-dependent than in other cortices, thereby motivating our focus on CCK–CCKBR signaling as a permissive mechanism for adult LTP.

      (3) We have condensed the discussion of somatosensory plasticity during early development to a brief background and shifted the focus to adult auditory mechanisms and knowledge gaps that directly frame our research questions.

      These changes appear in the revised Introduction (paragraphs 2–3), which now provide a sharper rationale for investigating CCK‑dependent thalamocortical LTP in young adult and aged mice.

      Introduction section:

      “However, converging evidence shows that thalamocortical inputs retain a capacity for experience-dependent modification in adulthood. Sensory enrichment or deprivation can gate or reinstate thalamocortical plasticity. In the adult ACx, pairing sounds with neuromodulatory drive can reshape cortical representations. In vivo high-frequency stimulation (HFS) of dorsal lateral geniculate nucleus (LGN) or medial geniculate body (MGB) induces LTP in sensory cortices and has been linked to perceptual learning beyond the critical period. Notably, auditory thalamocortical plasticity appears less dependent on NMDA receptors compared to other cortical regions. The mechanisms underlying thalamocortical plasticity in the mature brain remain poorly understood.

      Cholecystokinin (CCK) and its receptor CCK-B receptor (CCKBR) are well positioned to influence thalamocortical transmission: Cck mRNA is abundant in MGB neurons and CCKBR is enriched in layer IV of ACx, the principal thalamorecipient layer.”

      Anatomical Specificity of MGv Targeting:

      The mouse MGv is a small and deep structure, and precise targeting is critical given the functional differences between MGv and MGd pathways. In the current figures:

      Fig. 1A suggests the electrode track may have approached the MGd.

      Fig. 2E indicates some viral spread beyond the MGB.

      Since MGv-AI and MGd-AC pathways exhibit distinct (and sometimes opposing) effects on plasticity, I encourage you to provide additional clarification or verification of the stimulated/infected regions. This would greatly enhance the interpretability of your LTP data.

      Please see above.

      Data Variability and Transparency:

      The reported thalamocortical LTP values exhibit remarkably small standard deviations, which is somewhat unexpected given typical experimental variability in such measurements. To address this concern, it would be helpful to include example raw traces of the recorded LTP (e.g., in a supplementary figure). This would allow readers to better evaluate the data quality and consistency.

      Please see above.

      Reviewer #2 (Recommendations for the authors):

      Overall, the authors did an excellent job of responding to our critiques, both in their direct responses and in the modified text. The modified text is also more readable than before. Two issues that the authors should consider addressing;

      (1) Unless I missed it, there is no commentary stated about the impact of using aged C57 mice, which lose their hearing, such that the effects seen in the older mice could be related to hearing loss rather than aging alone. Some discussion of this point should be made.

      We thank the reviewer for raising this important point. C57BL/6 mice are known to develop age-related hearing loss, which could potentially affect PPI performance in older animals. We note that in our internal screening we observed markedly reduced startle amplitudes and frequent negative PPI values in many mice >20 months, indicating severe auditory impairment. To minimize this confound a priori, we excluded mice older than 20 months and restricted the aged cohort to 17–19 months, which consistently exhibited robust startle responses and reliable PPI. While some degree of presbycusis may still be present in this age range in C57BL/6 mice, the improvement of PPI following CCK administration combined with acoustic exposure indicates that the auditory pathways remained sufficiently functional to support sensorimotor gating. In fact, the presence of partial hearing loss in these aged mice may have allowed us to better detect the beneficial effects of CCK, further highlighting its therapeutic potential for age-related deficits. The greater improvement in PPI observed in older mice —as compared to younger mice, whose PPI in control group is already high—likely reflect the combined effects of age-related hearing loss and CCK deficiency, with CCK-induced restoration of thalamocortical plasticity being the primary focus of our study. We have now added a discussion of this point in the revised manuscript.

      Discussion section:

      “In aged mice, PPI deficits are commonly observed due to impaired auditory processing. Notably, C57BL/6 mice exhibit age-related hearing loss (Johnson et al., 1997). Both age-associated changes in auditory function and CCK deficiency contribute to impaired sensory gating. The presence of partial hearing loss in aged mice may have facilitated the detection of CCK’s beneficial effects, further highlighting its therapeutic potential for age-related deficits. Our results suggest that enhanced thalamocortical plasticity mediated by CCK might partially compensate for these deficits by amplifying residual auditory signals in aged mice.”

      Johnson, K.R., Erway, L.C., Cook, S.A., Willott, J.F., and Zheng, Q.Y. (1997). A major gene affecting age-related hearing loss in C57BL/6J mice. Hearing Research 114, 83-92. https://doi.org/10.1016/S0378-5955(97)00155-X.

      (2) Minor point - I do not agree with the use of the term "ventral to bregma" to describe where the craniotomies were placed (e.g., line 599). The direction being described is more typically referred to as "lateral." If the authors prefer to use the term "ventral," perhaps additional clarification can be added.

      We thank the reviewer for pointing out this issue and apologize for any confusion. We agree that “ventral to bregma” is not the standard terminology and have revised the Methods section to use “below the temporal ridge”. We have also clarified that the craniotomy for accessing the auditory cortex was performed on the lateral aspect of the skull in rodents, just below the temporal ridge. We hope this revision resolves the ambiguity.

      Method section:

      “A craniotomy was performed over the temporal bone, as the auditory cortex is located on the lateral surface of the brain (coordinates: 1.5 to 3.0 mm below the temporal ridge and 2.0 to 4.0 mm posterior to bregma for mice; 2.5 to 6.5 mm below the temporal ridge and 3.0 to 5.0 mm posterior to bregma for rats) to access the auditory cortex.”

      “Six-week after CCK-sensor virus injection, a craniotomy was performed to access the auditory cortex at the temporal bone (1.5 to 3.0 mm below the temporal ridge and 2.0 to 4.0 mm posterior to bregma), and the dura mater was opened.”

    1. Reviewer #1 (Public review):

      Summary

      The manuscript by K.H. Lee et al. presents Spyglass, a new open-source framework for building reproducible pipelines in systems neuroscience. The framework integrates the NWB (Neurodata Without Borders) data standard with the DataJoint relational database system to organize and manage analysis workflows. It enables the construction of complete pipelines, from raw data acquisition to final figures. The authors demonstrate their capabilities through examples, including spike sorting, LFP filtering, and sharp-wave ripple (SWR) detection. Additionally, the framework supports interactive visualizations via integration with Figurl, a platform for sharing neuroscience figures online.

      Strengths:

      Reproducibility in data analysis remains a significant challenge within the neuroscience community, posing a barrier to scientific progress. While many journals now require authors to share their data and code upon publication, this alone does not ensure that the code will execute properly or reproduce the original results. Recognizing this gap, the authors aim to address the community's need for a robust tool to build reproducible pipelines in systems neuroscience.

      Weaknesses:

      The issues identified here may serve as a foundation for future development efforts.

      (1) User-friendliness:

      The primary concern is usability. The manuscript does not clearly define the intended user base within a modern systems neuroscience lab. Improving user experience and lowering the barrier to entry would significantly enhance the framework's potential for broad adoption. The authors provide an online example notebook and a local setup notebook. However, the local setup process is overly complex, with many restrictive steps that could discourage new users. A more streamlined and clearly documented onboarding process is essential. Additionally, the lack of Windows support represents a practical limitation, particularly if the goal is widespread adoption across diverse research environments.

      (2) Dependency management and long-term sustainability:

      The framework depends on numerous external libraries and tools for data processing. This raises concerns about long-term maintainability, especially given the short lifespan of many academic software projects and the instability often associated with Python's backward compatibility. It would be helpful for the authors to clarify how flexible and modular the pipeline is, and whether it can remain functional if upstream dependencies become deprecated or change substantially.

      (3) Extensibility for custom pipelines:

      A further limitation is the insufficient documentation regarding the creation of custom pipelines. It is unclear how a user could adapt Spyglass to implement their own analysis workflows, especially if these differ from the provided examples (e.g., spike sorting, LFP analysis that are very specific to the hippocampal field). A clearer explanation or example of how to extend the framework for unrelated or novel analyses would greatly improve its utility and encourage community contributions.

      (4) Flexibility vs. Standardization:

      The authors may benefit from more explicitly defining the intended role of the framework: is Spyglass designed as a flexible, general-purpose tool for developing custom data analysis pipelines, or is its primary goal to provide a standardized framework for freezing and preserving pipelines post-publication to ensure reproducibility? While both goals are valuable, attempting to fully support both may introduce unnecessary complexity and result in a tool that is not well-suited for either purpose. The manuscript briefly touches on this tradeoff in the introduction, and the latter-pipeline preservation-may be the more natural fit for the package. If so, this intended use should be clearly communicated in the documentation to help users understand its scope and strengths.

      Impact:

      This work represents a significant milestone in advancing reproducible data analysis pipelines in neuroscience. Beyond reproducibility, the integration of cloud-based execution and shareable, interactive figures has the potential to transform how scientific collaboration and data dissemination are conducted. The authors are at the forefront of this shift, contributing valuable tools that push the field toward more transparent and accessible research practices.

    2. Reviewer #2 (Public review):

      Summary:

      This valuable paper presents Spyglass, a comprehensive software framework designed to address the critical challenges of reproducibility and data sharing in neuroscience. The authors have developed a robust ecosystem built on community standards such as NWB and DataJoint, and demonstrate its utility by applying it to datasets from two independent labs, successfully validating the framework's ability to reproduce and extend published findings. While the framework offers a powerful blueprint for modern, reproducible research, its immediate broad impact may be tempered by the significant upfront investment required for adoption and its current focus on electrophysiological data. Nevertheless, Spyglass stands as an important and practical contribution, providing a well-documented and thoughtfully designed path toward more transparent and collaborative science.

      Strengths:

      (1) Principled solution to a foundational challenge:

      The work offers a concrete and comprehensive framework for reproducibility in neuroscience, moving beyond abstract principles to provide an implemented, end-to-end ecosystem.

      (2) Pragmatic and robust architectural design:

      Features such as the "cyclic iteration" motif for spike-sorting curation and the "merge" motif for pipeline consolidation demonstrate deep, practical experience with neurophysiological analysis and address real-world challenges.

      (3) Cross-laboratory validation:

      The successful replication and extension of published hippocampal decoding findings across independent datasets strongly support the framework's utility and underscore its potential for enabling reproducible science.

      (4) Accessibility through documentation and demos:

      Extensive tutorials and the availability of a public demo environment lower some of the barriers to adoption.

      Weaknesses:

      (1) High barrier to adoption:

      The requirement to convert all data into NWB, maintain a relational database, and train users in structured workflows is a significant hurdle, particularly for smaller labs.

      (2) Limited tool integration:

      The current pipelines, while useful, still resemble proof-of-principle demonstrations. Closer integration with established analysis libraries such as Pynapple and others could broaden the toolkit and reduce duplication of effort.

      (3) Experimental metadata support:

      While NWB provides a solid foundation for storing neurophysiology data streams, it still lacks broad and standardized support for experimental metadata, including descriptions of conditions, subject details, and procedures, as well as links across datasets. This limitation constrains one of Spyglass's key promises: enabling reproducible, cross-laboratory science. The authors should clarify how Spyglass plans to address or mitigate this gap - for example, by adopting or contributing to metadata extensions, providing templates for experimental conditions, or integrating with complementary systems that manage metadata across datasets.

      (4) Cross-laboratory interoperability:

      While demonstrated across two datasets, the manuscript does not fully address how Spyglass will handle the diversity of metadata standards, acquisition systems, and lab-specific practices that remain major obstacles to reproducibility.

      (5) Visualization limitations:

      Beyond the export system and Figurl, NWB offers relatively few options for interactive data exploration. The ability to explore data flexibly and discover new phenomena remains limited, which constrains one of the potential strengths of standardized pipelines.

      Spyglass is well-positioned to become a community framework for reproducible neuroscience workflows, with the potential to set new standards for transparency and data sharing. With expanded modality coverage, tighter integration of existing community tools, stronger solutions for cross-lab interoperability, and richer visualization capabilities, it could have a transformative impact on the field.

    1. Reviewer #1 (Public review):

      Summary:

      This paper aims to characterise the physiological and computational underpinnings of the accumulation of intermittent glimpses of sensory evidence.

      Strengths:

      (1) Elegant combination of electroencephalography and computational modelling.

      (2) The authors describe results of two separate experiments, with very similar results, in effect providing an internal replication.

      (3) Innovative task design, including different gap durations.

      Weaknesses:

      (1) The authors introduce the CPP as tracking an intermediary (motor-independent) evidence integration process, and the MBL as motor preparation that maintains a sustained representation of the decision variable. It would help if the authors could more directly and quantitatively assess whether their current data are in line with this. That is, do these signals exhibit key features of evidence accumulation (slope proportional to evidence strength, terminating at a common amplitude that reflects the bound)? Additionally, plotting these signals report locked (to the button press) would help here. What do the results mean for the narrative of this paper?

      (2) The novelty of this work lies partly in the aim to characterize how the CPP and MBL interact (page 5, line 3-5). However, this analysis seems to be missing. E.g., at the single-trial level, do relatively strong CPP pulses predict faster/larger MBL? The simulations in Figure 5 are interesting, but more could be done with the measured physiology.

      (3) The focus on CPP and MBL is hypothesis-driven but also narrow. Since we know only a little about the physiology during this "gaps" task, have the authors considered computing TFRs from different sensor groupings (perhaps in a supplementary figure?).

      (4) The idea of a potential bound crossing during P1 is elegant, albeit a little simplistic. I wonder if the authors could more directly show a physiological signature of this. For example, by focusing on the MBL or occipital alpha split by the LL, LH, HL and HH conditions, and showing this pulse- as well as report-locked. Related, a primacy effect can also be achieved by modelling (i) self-excitation of the current one-dimensional accumulator, or (ii) two competing accumulators that produce winner-take-all dynamics. Is it possible to distinguish between these models, either with formal model comparison or with diagnostic physiological signatures?

      (5) The way the authors specify the random effects of the structure of their mixed linear models should be specified in more detail. Now, they write: "Where possible, we included all main effects of interest as random effects to control for interindividual variability." This sounds as if they started with a model with a full random effect structure and dropped random components when the model would not converge. This might not be sufficiently principled, as random components could be dropped in many different orders and would affect the results. Do all main results hold when using classical random effects statistics on subject-wise regression coefficients?

    1. Reviewer #2 (Public review):

      Summary:

      This paper investigates putative networks associated with prediction errors in task-based and resting-state fMRI. It attempts to test the idea that prediction errors minimisation includes abstract cognitive functions, referred to as the global prediction error hypothesis, by establishing a parallel between networks found in task-based fMRI where prediction errors are elicited in a controlled manner and those networks that emerge during "resting state".

      Strengths:

      Clearly, a lot of work and data went into this paper, including 2 task-based fMRI experiments and the resting state data for the same participants, as well as a third EEG-fMRI dataset. Overall, well written with a couple of exceptions on clarity, as per below, and the methodology appears overall sound, with a couple of exceptions listed below that require further justification. It does a good job of acknowledging its own weakness.

      Weaknesses:

      (1) The paper does a good job of acknowledging its greatest weakness, the fact that it relies heavily on reverse inference, but cannot quite resolve it. As the authors put it, "finding the same networks during a prediction error task and during rest does not mean that the networks' engagement during rest reflects prediction error processing". Again, the authors acknowledge the speculative nature of their claims in the discussion, but given that this is the key claim and essence of the paper, it is hard to see how the evidence is compelling to support that claim.

      (2) Given how uncontrolled cognition is during "resting-state" experiments, the parallel made with prediction errors elicited during a task designed for that effect is a little difficult to make. How often are people really surprised when their brains are "at rest", likely replaying a previously experienced event or planning future actions under their control? It seems to be more likely a very low prediction error scenario, if at all surprising.

      (3) The quantitative comparison between networks under task and rest was done on a small subset of the ROIs rather than on the full network - why? Noting how small the correlation between task and rest is (r=0.021) and that's only for part of the networks, the evidence is a little tenuous. Running the analysis for the full networks could strengthen the argument.

      (4) Looking at the results in Figure 2C, the four-quadrant description of the networks labelled for low and high PE appears a little simplistic. The authors state that this four-quadrant description omits some ROIs as motivated by prior knowledge. This would benefit from a more comprehensive justification. Which ROIs are excluded, and what is the evidence for exclusion?

      (5) The EEG-fMRI analysis claiming 3-6Hz fluctuations for PE is hard to reconcile with the fact that fMRI captures activity that is a lot slower, while some PEs are as fast as 150 ms. The discussion acknowledges this but doesn't seem to resolve it - would benefit from a more comprehensive argument.

    2. Reviewer #3 (Public review):

      Bogdan et al. present an intriguing and timely investigation into the intrinsic dynamics of prediction error (PE)-related brain states. The manuscript is grounded in an intuitive and compelling theoretical idea: that the brain alternates between high and low PE states even at rest, potentially reflecting an intrinsic drive toward predictive minimization. The authors employ a creative analytic framework combining different prediction tasks and imaging modalities. They shared open code, which will be valuable for future work.

      However, the current manuscript would benefit from further clarification and empirical grounding, especially with regard to its theoretical framing (that PE-like state fluctuations are intrinsic and help us minimize PE), interpretation of results, and broader functional significance. Below, I outline a few major comments and suggestions that I think would strengthen the contribution.

      (1) Consistency in Theoretical Framing

      The title, abstract, and introduction suggest inconsistent theoretical goals of the study.

      The title suggests that the goal is to test whether there are intrinsic fluctuations in high and low PE states at rest. The abstract and introduction suggest that the goal is to test whether the brain intrinsically minimizes PE and whether this minimization recruits global brain networks. My comments here are that a) these are fundamentally different claims, and b) both are challenging to falsify. For one, task-like recurrence of PE states during resting might reflect the wiring and geometry of the functional organization of the brain emerging from neurobiological constraints or developmental processes (e.g., experience), but showing that mirroring exists because of the need to minimize PE requires establishing a robust relationship with behavior or showing a causal effect (e.g., that interrupting intrinsic PE state fluctuations affects prediction).

      The global PE hypothesis-"PE minimization is a principle that broadly coordinates brain functions of all sorts, including abstract cognitive functions"-is more suitable for discussion rather than the main claim in the abstract, introduction, and all throughout the paper.

      Given the above, I recommend that the authors clarify and align their core theoretical goals across the title, abstract, introduction, and results. If the focus is on identifying fluctuations that resemble task-defined PE states at rest, the language should reflect that more narrowly, and save broader claims about global PE minimization for the discussion. This hypothesis also needs to be contextualized within prior work. I'd like to see if there is similar evidence in the literature using animal models.

      (2) Interpretation of PE-Related Fluctuations at Rest and Its Functional Relevance

      It would strengthen the paper to clarify what is meant by "intrinsic" state fluctuations. Intrinsic might mean task-independent, trait-like, or spontaneously generated. Which do the authors mean here? Is the key prediction that these fluctuations will persist in the absence of a prediction task?

      Regardless of the intrinsic argument, I find it challenging to interpret the results as evidence of PE fluctuations at rest. What the authors show directly is that the degree to which a subset of regions within a PE network discriminates high vs. low PE during task correlates with the magnitude of separation between high and low PE states during rest. While this is an interesting relationship, it does not establish that the resting-state brain spontaneously alternates between high and low PE states, nor that it does so in a functionally meaningful way that is related to behavior. How can we rule out brain dynamics of other processes, such as arousal, that also rise and fall with PE? I understand the authors' intention to address the reverse inference concern by testing whether "a participant's unique connectivity response to PE in the reward-processing task should match their specific patterns of resting-state fluctuation". However, I'm not fully convinced that this analysis establishes the functional role of the identified modules to PE because of the following:

      Theoretically, relating the activities of the identified modules directly to behavior would demonstrate a stronger functional role.

      a) Across participants: Do individuals who exhibit stronger or more distinct PE-related fluctuations at rest also perform better on tasks that require prediction or inference? This could be assessed using the HCP prediction task, though if individual variability is limited (e.g., due to ceiling effects), I would suggest exploring a dataset with a prediction task that has greater behavioral variance.

      Or even more broadly, does this variability in resting state PE state fluctuations predict general cognitive abilities like WM and attention (which the HCP dataset also provides)? I appreciate the inclusion of the win-loss control, and I can see the intention to address specificity. This would test whether PE state fluctuations reflect something about general cognition, but also above and beyond these attentional or WM processes that we know are fluctuating.

      b) Within participants: Do momentary increases in PE-network expression during tasks relate to better or faster prediction? In other words, is there evidence that stronger expression of PE-related states is associated with better behavioral outcomes?

      (3) Apriori Hypothesis for EEG Frequency Analysis

      It's unclear how to interpret the finding that fMRI fluctuations in the defined modules correlate with frontal Delta/Theta power, specifically in the 3-6 Hz range. However, in the EEG literature, this frequency band is most commonly associated with low arousal, drowsiness, and mind wandering in resting, awake adults, not uniquely with prediction error processing. An a priori hypothesis is lacking here: what specific frequency band would we expect to track spontaneous PE signals at rest, and why? Without this, it is difficult to separate a PE-based interpretation from more general arousal or vigilance fluctuations.

      (4) Significance Assessment

      The significance of the correlation above and all other correlation analyses should be assessed through a permutation test rather than a single parametric t-test against zero. There are a few reasons: a) EEG and fMRI time series are autocorrelated, violating the independence assumption of parametric tests;<br /> b) Standard t-tests can underestimate the true null distribution's variance, because EEG-fMRI correlations often involve shared slow drifts or noise sources, which can yield spurious correlations and inflating false positives unless tested against an appropriate null.

      Building a null distribution that preserves the slow drifts, for example, would help us understand how likely it is for the two time series to be correlated when the slow drifts are still present, and how much better the current correlation is, compared to this more conservative null. You can perform this by phase randomizing one of the two time courses N times (e.g., N=1000), which maintains the autocorrelation structure while breaking any true co-occurrence in patterns between the two time series, and compute a non-parametric p-value. I suggest using this approach in all correlation analyses between two time series.

      (5) Analysis choices

      If I'm understanding correctly, the algorithm used to identify modules does so by assigning nodes to communities, but it does not itself restrict what edges can be formed from these modules. This makes me wonder whether the decision to focus only on connections between adjacent modules, rather than considering the full connectivity, was an analytic choice by the authors. If so, could you clarify the rationale? In particular, what justifies assuming that the gradient of PE states should be captured by edges formed only between nearby modules (as shown in Figure 2E and Figure 4), rather than by the full connectivity matrix? If this restriction is instead a by-product of the algorithm, please explain why this outcome is appropriate for detecting a global signature of PE states in both task and rest.

      When assessing the correspondence across task-fMRI and rs-fMRI in section 2.2.2, why was the pattern during task calculated from selecting a pair of bilateral ROIs (resulting in a group of eight ROIs), and the resting state pattern calculated from posterior-anterior/ventral-dorsal fluctuation modules? Doesn't it make more sense to align the two measures? For example, calculating task effects on these same modules during task and rest?

    1. BACKUP

      1. mysqldump -u root -p msys140_activity > dump_1.sql

      • Dumps a single database (msys140_activity).
      • Output is written to dump_1.sql.
      • ⚠️ No CREATE DATABASE or USE statements (because no --databases flag).

      2. mysqldump -u root -p --all-databases > alldb.sql

      • Dumps all databases on the server.
      • Output goes to alldb.sql.
      • Includes CREATE DATABASE and USE statements for each database.

      3. mysqldump -u root -p --databases msys140_activity csci40_db > multipledb.sql

      • Dumps multiple databases (msys140_activity and csci40_db).
      • Output goes to multipledb.sql.
      • Includes CREATE DATABASE and USE statements for each database (because of --databases).

      4. [mysqldump -u root -p msys140_activity > dump_1.sql VS mysqldump -u root -p --databases msys140_activity > dump_2.sql]

      • Without --databases: dump_1.sql has only table structures + data, but no CREATE DATABASE or USE.
      • With --databases: dump_2.sql includes the CREATE DATABASE and USE statements, so you can recreate the DB automatically when restoring.

      5. mysqldump -u root -p msys140_activity tblCustomers > dump_tables.sql

      • Dumps a single table (tblCustomers) from the msys140_activity database.
      • Output is written to dump_tables.sql.
      • Only contains the structure + data for that table.

      👉 Shortcut to remember:

      • No --databases → Only schema + data.
      • With --databases / --all-databases → Schema + data + CREATE DATABASE + USE statements.

    Annotators

    1. Several decades ago, a new way of thinking about learning became very prominent in education. It was based on the concept that each person has a preferred way to learn. It was thought that these preferences had to do with each person’s natural tendencies toward one of their senses.

      I think about this a lot. I could have been born in a time where people knew nothing about autism, and I think I would have hated school more then anything. Its insane that this development was only treated as a huge deal a couple of decades ago. Glad I was born today!

    2. Several decades ago, a new way of thinking about learning became very prominent in education. It was based on the concept that each person has a preferred way to learn. It was thought that these preferences had to do with each person’s natural tendencies toward one of their senses.

      I think this passage will be important to me because I don't think I ever really figured out what my learning type is. Hopefully this course will help me find out!

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Response to the reviewers

      We thank the reviewers for recognizing the importance of study, and how it “addresses a long-standing question in the heterogeneity of cellular responses to stressors”, “makes a conceptual advance by identifying transcription factors as the limiting determinant of IFN-β induction in KSHV-infected cells”, and “serves as a crucial starting point for understanding cellular heterogeneity”. We agree that our findings appeal to a broad audience interested in virology, immunology, cell biology, and gene transcription.

      We also thank the reviewers for their insightful suggestions that will greatly strengthen our study. Below we detail how we plan to address their comments experimentally and how we have already edited the text to respond to them.

      Referee #1

      One experiment that may provide some insight into the selective RelA activation is to quantify viral genomes within the high and low IFN producing cells. Perhaps, the genome as a PAMP, is more abundant in the inducing cells.

      We have added a note in the Discussion section (line 417) that we have evidence that the cGAS PAMP in our system is mitochondrial DNA, not viral DNA. Moreover, our results suggest that the variation in PAMP levels are not the source of heterogeneity, as this would cause heterogeneous activation of the cGAS-STING-TBK1-IRF3 axis. Instead, we have discovered that TBK1 and IRF3 are activated even in cells without interferon-β induction.

      Referee #2

      1) While the study presents intriguing evidence for AP-1 involvement in regulating IFN-β responses, the reliance on total c-Jun levels as a readout is limiting. Because c-Jun activity depends on phosphorylation and promoter binding, additional experiments (i.e., phospho-c-Jun analysis or ChIP at the IFNB1 promoter) would strengthen the link between AP-1 activity and the observed reporter outcomes.

      We agree that that a stronger link between AP-1 activity and IFN induction would improve our study, so we have cloned interferon-β reporter constructs that contain mutations in the AP-1 binding sites. We plan to use this reporter, as well as IFN-β reporter constructs that contain mutations in either the AP-1, IRF3, or NF-κB binding sites, to mechanistically test the connection between AP-1 and activation of the IFN promoter. As a control, we will test that the mutations block reporter induction after stimulation with a well characterized agonist of the IFN induction pathway such as poly(I:C). We have previously investigated c-Jun and ATF2 phosphorylation during KSHV reactivation and caspase inhibition. Surprisingly, in preliminary experiments we did not detect phosphorylation of either AP-1 subunit. We will confirm this result and add these data to the manuscript.

      2) The data presented demonstrating that Serine 386 phosphorylation does not distinguish first responder cells is strong. Including complementary data on Ser396 phosphorylation would strengthen the conclusion, as this well-established activation marker is readily detectable with available reagents and would help confirm that the potentiation of IRF3 activity is not the driver of the observed heterogeneity.

      We will complement the Ser386 results with Ser396 staining.

      3) Consider updating the title to more directly reflect the findings (e.g. "Interferon-β induction heterogeneity during KSHV infection is correlated to expression of ATF2 and RelA")

      We have updated the title to “Interferon-β induction heterogeneity during KSHV infection is correlated to levels and activation of the transcription factors ATF2 and RelA, and not IRF3”

      *4) To ease the interpretation of data, indicate what the black and white circles indicate in the figure legends. *

      We have updated the figures to be more intuitive, using + and -.

      5) IE ORF50 is used to show no differences between first responders and non-responders, but showing early and late genes across tdTomato positive and negative cells would rule out potential differences in progression through reactivation.

      We added a clarification in the Results section (line 195), explaining that we have examined the progression of viral reactivation through single-cell transcriptomics in our previous publication, and that the results indicate that viral gene expression plays a small role in interferon-β heterogeneity. We favor the scRNA-seq dataset for this conclusion, because the tdTomato negative cell population represents a mix of non-reactivating cells, which would not be expected to make IFN, and reactivating cells that fail to induce IFN expression.

      6) The data in Figure 5D (quantified in E and F) show a compelling trend. This could be further clarified by plotting a trend line that connects the results of independent experiments, rather than only showing individual data points. Such visualization would make the consistency of the observed trend across experiments more apparent.

      We have added lines in the graphs in Figure 5 to ease visualization.

      Referee #3

      A major worry comes from using lentiviral transduction to insert the reporter promoter into cells without selecting for clones. Lentiviral transduction introduces heterogeneity due to random insertion of their vector. This results in different copy numbers for the reporter construct, leading to heterogeneity in the reporter expression. Additionally, the expression of foreign proteins, particularly in immune cells, can be perceived as danger signals (10.1007/s12015-016-9670-8) and occasionally trigger p65 activation. To control for this, the authors could validate their reporter results by including a non-IFNb promoter (e.g., constitutive) expressing tdTomato and verifying that these cell populations do not also express endogenous IFNb mRNAs.

      We did not select clonal cell lines because different cells may have different reactivation propensity. Moreover, we did not want the tdTomato signal to reflect specific regulation of a single genomic region. We have now added an explanation as to why we did not clonally select that cell lines in the Results section (line 157). Our control conditions that do not result in IFN-β induction show that lentiviral insertion is not sufficient to cause IFN induction, as we did not detect IFN-β mRNA in the untreated reporter cells (first bar in Figure 1C). We also clarified in the Results section (line 184) that the selective enrichment of both IFN-β and tdTomato mRNA in the sorted tdTomato+ cells demonstrates that tdTomato is a faithful reporter for rare IFN-β expression, regardless of heterogeneous lentiviral transduction in the population. To further verify that lentiviral transduction does not play a role in introducing heterogeneity in induction of our tdTomato reporter and of IFN-β, we will measure IFN-β levels in BC-3 cells constitutively expressing tdTomato, which we have already created. We may also sort BC-3 cells constitutively expressing tdTomato and check that the tdTomato signal is not predictive of IFN expression in these cells. However, the expectation is that all or most cells will be tdTomato positive, which may make sorting for tdTomato negative cells impossible.

      Regarding AP1 and NF-kB activation, the authors could investigate downstream genes such as GADD45B, HSPA1A, and ATF-3 (for AP1), and IL-6, TNFAIP3 (A20) (for both AP1 and NFkB). It would be interesting to determine if these genes are exclusively expressed in tdTomato-expressing cells.

      We will quantify the mRNA levels of these genes by performing qPCR on our cDNA from sort experiments. So far, we have detected IL-6 induction but no enrichment of this transcript in the sorted tdTomato+ samples.

      While the authors observed no direct correlation between c-Jun alone and IFN-b production, it is conceivable that TPA-induced c-Jun primes the cells that become fully transcriptionally active upon a stimulus like viral reactivation. I propose that the authors attempt to inhibit c-Jun activation during KSHV reactivation (TPA + caspase inhibitor) using inhibitors like SP600125 and subsequently assess whether this blockade reduces the proportion of IFNb+ cells.

      We have tried using the suggested inhibitor (SP600125), but found that it inhibits KSHV reactivation, making any result on IFN levels difficult to interpret. Currently, we are testing a dual AP-1 and NF-κB inhibitor (SP100030) and may add these data to the results if we do not encounter similar issues.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      In the manuscript "Interferon-β induction heterogeneity during KSHV infection is correlated to expression and activation of enhanceosome transcription factors other than IRF3", Kaku and Gaglia address a long-standing question in the initiation of host antiviral responses: what drives the heterogeneity in the initiation of IFN responses within a presumably homogenous population of cells. In this study, the authors focus on host factors that contribute to the heterogeneity in IFN induction. They use a KSHV lytic reactivation model (TPA + Caspase inhibitor treated BC-3 cells) and FACS-based reporter assays to enhance the resolution in the detection of molecular drivers of "first responder" cells that make IFN. They find that IRF3 activation alone does not predict IFN expression; rather, the expression of ATF2 and RelA is predictive of IFN-β induction. The authors carefully control for off-target effects of TPA treatment in BJAB cells and paracrine signaling through the inclusion of IFN-neutralizing antibodies. Overall, the manuscript is well-written and easy to follow, and the data compellingly support their conclusion that cell-specific transcription factor activity limits IFN production to cell subsets. Demonstrating coordinated occupancy or functional interplay of these factors would increase confidence in the proposed model and broaden the impact for readers interested in virology, immunology, and transcriptional regulation.

      Comments:

      1. While the study presents intriguing evidence for AP-1 involvement in regulating IFN-β responses, the reliance on total c-Jun levels as a readout is limiting. Because c-Jun activity depends on phosphorylation and promoter binding, additional experiments (i.e., phospho-c-Jun analysis or ChIP at the IFNB1 promoter) would strengthen the link between AP-1 activity and the observed reporter outcomes.
      2. The data presented demonstrating that Serine 386 phosphorylation does not distinguish first responder cells is strong. Including complementary data on Ser396 phosphorylation would strengthen the conclusion, as this well-established activation marker is readily detectable with available reagents and would help confirm that the potentiation of IRF3 activity is not the driver of the observed heterogeneity.

      Minor Comments:

      1. Consider updating the title to more directly reflect the findings (e.g. "Interferon-β induction heterogeneity during KSHV infection is correlated to expression of ATF2 and RelA".
      2. To ease the interpretation of data, indicate what the black and white circles indicate in the figure legends.
      3. The authors predominantly rely on the IE gene ORF50 as a marker of KSHV reactivation and show no differences in expression between first responder cells and those that don't. Measurement of early and late genes across TdTomato-positive and negative cells would rule out potential differences in progression through reactivation that might influence IFN production.
      4. The data in Figure 5D (quantified in E and F) show a compelling trend. This could be further clarified by plotting a trend line that connects the results of independent experiments, rather than only showing individual data points. Such visualization would make the consistency of the observed trend across experiments more apparent.

      Significance

      This study addresses an important and long-stading question in the heterogeneity of cellular responses to stressors, such as viruses. The study is well designed and presented, making it appealling to a broad audience interested in virology, immunology, cell biology, and gene transcription.

    1. yet there the nightingale 100 Filled all the desert with inviolable voice And still she cried, and still the world pursues, 'Jug Jug' to dirty ears.

      Voice continues to be a significant motif in "The Waste Land" and the myths that it draws from. In Ovid's story of Tereus, Procne, and Philomela, Tereus cuts off Philomela's tongue so she is unable to speak the crime of her rape to others. Even though Philomela is eventually able to reveal the truth by weaving purple letters into cloth, she cannot explain herself to her sister, for "she has no voice--just gestures" (13). However, once she is turned into a nightingale, her voice returns to her, as described in TWL: "yet there the nightingale / Filled all the desert with inviolable voice / And still she cried, and still the world pursues, / 'Jug Jug' to dirty ears" (lines 100-4). Here, Philomena is able to voice her suffering and lament; however, her speech is distorted to a bird's cry, stripping her message of its clarity and language. This circles back to a broader theme in The Waste Land: the struggle of communication. While communication is attempted by some, it is rarely successful. Voices merge, overlap, vanish, or are taken over, but they never seem to establish mutual recognition or continuity; they speak, but never converse with one another. In this way, Philomela's loss and succeeding transformation of voice also represents the disappearance of communication in the modern world, leaving people to suffer alone and in silence.

    1. Reviewer #2 (Public review):

      Summary:

      This study establishes a platform for studying mosquito flight activity over the course of several weeks and demonstrates key applications of such a paradigm: the comparison of daily activity profiles across different Aedes aegypti populations and the quantification of responses to physiological and environmental perturbations.

      Strengths:

      (1) Overall, the authors succeed in setting up a low-cost, scalable tracking system that stably records mosquito flight activity for several weeks and uses it to demonstrate compelling use cases.

      (2) The text is organized well, is easy to read, and is understandable for a broad audience.

      (3) Instructions for constructing housing and for performing tracking with a dedicated GUI are available on an accompanying website, with open-source (and well-organized) code.

      (4) A complementary pair of methods (one testing for activity signals at specific times of the day, and the other capturing broader daily patterns) is used effectively.

      Weaknesses:

      (1) In the interval-based GLMM results, since each time interval is tested independently, p-values should be corrected for multiple hypotheses (for instance, through controlling the false discovery rate).

      (2) The accompanying GUI application needs some modifications to fully work out of the box on a sample video.

    2. Reviewer #3 (Public review):

      Summary:

      The authors in this paper introduce BuzzWatch, an open-source, low-cost (200-300 Euros) platform for long-term monitoring of mosquito flight and behavior. They use a Raspberry Pi with a Noirv2 Camera set up under laboratory conditions to observe 3 different species of mosquitoes. The system captures a variety of multimodal data, like flight activity, sugar feeding, and host-seeking responses, with the help of external modules like CO2 and fructose-soaked cottons. They also release a GUI in addition to automated tracking and behaviour analysis, which doesn't run on Pi but rather on a personal laptop.

      Four main use cases are demonstrated:

      (1) Characterizing diel rhythms in various Aedes aegypti populations.

      (2) Differentiating behaviors of native African vs. invasive human-adapted subspecies.

      (3) Assessing physiological (blood-feeding) and environmental (light regime) perturbations.

      (4) Testing time-of-day variation in responses to host-associated cues like CO₂ and heat.

      Description (Strengths):

      (1) The authors introduce a low-cost, scalable system that uses flight tracking in 2D as an alternative to 3D multi-camera systems.

      (2) Due to the low pixel quality required by the system, they can record for weeks at a time, capturing long temporal and behavioral activities.

      (3) They also integrate external modules such as lights, CO2, and heat as a way to measure responses to a variety of stimuli.

      (4) They also introduce a wiki as a guide for building replication and a help in using the GUI module.

      (5) They implement both GLMM hourly and PCA of behavior data.

      Limitations - Major Comments:

      (1) Most experiments are only done with single replicates per colony. If the setup is claimed to be cheap and replicable, there should be clearer replicates across experiments.

      (2) No external validation for the flight tracking algorithm using manual annotation or comparison with field data. The authors focus early on biological proof of principle, but the validity of the tracking algorithm is not presented. How accurate is the algorithm at classifying behaviours (e.g., vs human ground truth)? How reliable is tracking?

      (3) Why develop a custom GUI instead of using established packages such as rethomics (https://rethomics.github.io/) that are already available for behavioral analysis?

      (4) Why use RGB light strips when perceptual white light for humans is not relevant for mosquitoes? The choice of lighting should be based on the mosquito's visual perception. - https://pmc.ncbi.nlm.nih.gov/articles/PMC12077400/ .

      (5) Why use GLMMs instead of GAMs (with explicit periodic components)? With GLMMs, you do not account for temporal structure, which is highly relevant and autocorrelated in behavioral time series data.

      (6) What is the proportion of mosquitoes that stay alive throughout the experiments? How do you address dead animals in tracking? No data are available on whether all mosquitoes made it through the monitoring period. No survival data is mentioned in the paper, and in the wiki, it is not clear how it is used or how it affects the analyses - https://theomaire.github.io/buzzwatch/analyze.html#diff-cond .

      (7 )The sugar feeding behavior is not manually validated.

      (8) Figure 4d is difficult to understand - how did you align time? Why is ZT4 aligning with ZT0? Should you "warp" the time series to compare them (e.g., from dawn to dusk)?

      (9) No video recordings are made available for demonstration or validation purposes.

      Appraisal

      (1) The core conclusions---that BuzzWatch can capture multiscale mosquito behavioral rhythms and quantify the effect of genetic, environmental, and physiological variation - show promise but require stronger validation.

      (2) Statistical approaches (GLMM, PCA) are chosen but may not be optimal for temporal data with autocorrelation.

      (3) The host-seeking module shows a differential response, which is a potentially valuable feature.

    1. Reviewer #2 (Public review):

      Summary:

      The manuscript by Kawadkar et al investigates the role of Nup107 in developmental progression via regulation of ecdysone signaling. The authors identify an interesting phenotype of Nup107 whole body RNAi depletion in Drosophila development - developmental arrest at the late larval stage. Nup107-depleted larvae exhibit mis-localization of the Ecdysone receptor (EcR) from the nucleus to the cytoplasm and reduced expression of EcR taret genes in salivary glands, indicative of compromised ecdysone signaling. This mis-localization of EcR in salivary glands was phenocopied when Nup107 was depleted only in the prothoracic gland (PG), suggesting that it is not nuclear transport of EcR but presence of ecdysone (normally secreted from PG) that is affected. Consistently, whole body levels of ecdysone were shown to be reduced in Nup107 KD, particularly at the late third instar stage when a spike in ecdysone normally occurs. Importantly, the authors could rescue the developmental arrest and EcR mis-localization phenotypes of Nup107 KD by adding exogenous ecdysone, supporting the notion that Nup107 depletion disrupts biosynthesis of ecdysone, which arrests normal development. Additionally, they found that rescue of Nup107 KD phenotype can also be achieved by over-expression of the receptor tyrosine kinase torso, which is thought to be the upstream regulator of ecdysone synthesis in the PG. Transcript levels of torso are also shown to be downregulated in the Nup107KD, as are transcript levels of multiple ecdysone biosynthesis genes. Together, these experiments reveal a new role of Nup107 or nuclear pore levels in hormone-driven developmental progression, likely via regulation of levels of torso and torso-stimulated ecdysone biosynthesis.

      Strengths:

      The developmental phenotypes of an NPC component presented in the manuscript are striking and novel, and the data appears to be of high quality. The rescue experiments are particularly significant, providing strong evidence that Nup107 functions upstream of torso and ecdysone levels in regulation of developmental timing and progression.

      Weaknesses:

      The underlying mechanism is however not clear, and any insight into how Nup107 may regulate these pathways would greatly strengthen the manuscript. Some suggestions to address this are detailed below.

      Major questions:

      (1) Determining how specific this phenotype is to Nup107 vs. to reduced NPC levels overall would give some mechanistic insight. Does knocking down other components of the Nup107 subcomplex (the Y-complex) lead to similar phenotypes? Given the published gene regulatory function of Nup107, do other gene regulatory Nups such as Nup98 or Nup153 produce these phenotypes?

      (2) In a related issue, does this level of Nup107 KD produce lower NPC levels? It is expected to, but actual quantification of nuclear pores in Nup107-depleted tissues should be added. These and above experiments would help address a key mechanistic question - is this phenotype the result of lower numbers of nuclear pores or specifically of Nup107?

      (3) Additional experiments on how Nup107 regulates torso would provide further insight. Does Nup107 regulate transcription of torso or perhaps its mRNA export? Looking at nascent levels of the torso transcript and the localization of its mRNA can help answer this question. Or alternatively, does Nup107 physically bind torso?

      (4) The depletion level of Nup107 RNAi specifically in the salivary gland vs. the prothoracic gland should be compared by RT-qPCR or western blotting.

      (5) The UAS-torso rescue experiment should also include the control of an additional UAS construct - so Nup107; UAS-control vs Nup107; UAS-torso should be compared in the context of rescue to make sure the Gal4 driver is functioning at similar levels in the rescue experiment.

      Minor:

      (6) Figures and figure legends can stand to be more explicit and detailed, respectively.

      Comments on revisions:

      The revised manuscript addresses several outstanding issues, most importantly the question of whether the developmental delay phenotype of Nup107 is exhibited by other Nups.

      I recommend that the authors include the data they provide in the rebuttal letter on Nup153 KD not showing the delay phenotype (Figure R1) into the actual manuscript. It's an important mechanistic question raised by multiple reviewers, and would strengthen the authors' conclusions. Ideally, knock downs of other Nups of the Nup107 complex should be investigated, especially given that all those RNAi lines are publicly available.

      Figure 6B should also specify whether the torso transcript being measured is mRNA or nascent, as it would help understand whether it's transcription or mRNA stability that is affected by Nup107 KD.

    1. Reviewer #2 (Public review):

      In their study, Avraham-Davidi et al. combined scRNA-seq and spatial mapping studies to profile two preclinical mouse models of colorectal cancer: Apcfl/fl VilincreERT2 (AV) and Apcfl/fl LSL-KrasG12D Trp53fl/fl Rosa26LSL-tdTomato/+ VillinCreERT2 (AKPV). In the first part of the manuscript, the authors describe the analysis of the normal colon and dysplastic lesions induced in these models following tamoxifen injection. They highlight broad variations in immune and stromal cell composition within dysplastic lesions, emphasizing the infiltration of monocytes and granulocytes, the accumulation of IL-17+gdT cells and the presence of a distinct group of endothelial cells. A major focus the study is the remodeling of the epithelial compartment, where most significant changes are observed. Using no-negative matrix factorization, the authors identify molecular programs of epithelial cell functions, emphasizing stemness, Wnt signaling, angiogenesis and inflammation as majors features associated with dysplastic cells. They conclude that findings from scRNA-seq analyses in mouse models are transposable to human CRC. In the second part of the manuscript, the authors aim to provide the spatial contexture for their scRNA-seq findings using Slide-seq and TACCO. They demonstrate that dysplastic lesions are disorganized and contain tumor-specific regions, which contextualize the spatial proximity between specific cell states and gene programs. Finally, they claim that these spatial organizations are conserved in human tumors and associate region-based gene signatures with patient outcome in public datasets. Overall, the data were collected and analyzed using solid and validated methodology to offer a useful resource to the community.

      Main comments:

      (1) Clarity. The manuscript would benefit from a substantial reorganization to improve clarity and accessibility for a broad readership. The text could be shortened and the number of figure panels reduced to emphasize the novel contributions of this work while minimizing extensive discussions on general and expected findings, such as tissue disorganization in dysplastic lesions. Additionally, figure panels are not consistently introduced in the correct order, and some are not discussed at all (e.g., Fig. S1D; Fig. 3C is introduced before Fig. 3A; several panels in Fig. 4 are not discussed). The annotation of scRNA-seq cell states is insufficiently explained, with no corresponding information about associated genes provided in the figures or tables. Multiple annotations are used to describe cell groups (e.g., TKN01 = γδ T and CD8 T, TKN05 = γδT_IL17+), but these are not jointly accessible in the figures, making the manuscript challenging to follow. It is also not clear what is the respective value of the two mouse models and timepoints of tissue collection in the analysis.

      (2) Novelty. While the study is of interest, it does not present major findings that significantly advance the field or motivate new directions and hypotheses. Many conclusions related to tissue composition and patient outcomes, such as the epithelial programs of Wnt signaling, angiogenesis, and stem cells, are well-established and not particularly novel. Greater exploration of the scRNA-seq data beyond cell type composition could enhance the novelty of the findings. For instance, several tumor microenvironment clusters uniquely detected in dysplastic lesions (e.g., Mono2, Mono3, Gran01, Gran02) are identified, but no further investigation is conducted to understand their biological programs, such as applying nNMF as was done for epithelial cells. Additional efforts to explore precise tissue localization and cellular interactions within tissue niches would provide deeper insights and go beyond the limited analyses currently displayed in the manuscript.

      (3) Validation. Several statements made by the authors are insufficiently supported by the data presented in the manuscript and should be nuanced in the absence of proper validation. For example: 1.) RNA velocity analyses: The conclusions drawn from these analyses are speculative and need further support. 2.) Annotations of epithelial clusters as dysplastic: These annotations could have been validated through morphological analyses and staining on FFPE slides. 3.) Conservation of mouse epithelial programs in human tumors: The data in Figure S5B does not convincingly demonstrate enrichment of stem cell program 16 in human samples. This should be more explicitly stated in the text, given the emphasis placed on this program by the authors. 4.) Figure S6E: Cluster Epi06 is significantly overrepresented in spatial data compared to scRNA-seq, yet the authors claim that cell type composition is largely recapitulated without further discussion, which reduces confidence in other conclusions drawn.<br /> Furthermore, stronger validation of key dysplastic regions (regions 6, 8, and 11) in mouse and human tissues using antibody-based imaging with markers identified in the analyses would have considerably strengthened the study. Such validation would better contextualize the distribution, composition, and relative abundance of these regions within human tumors, increasing the significance of the findings and aiding the generation of new pathophysiological hypotheses.

      Comments on revisions:

      The authors have improved the clarity of the manuscript and responded adequately to all my initial comments.<br /> I don't have any other comments. Congratulations to the authors on this work.

    1. Reviewer #1 (Public review):

      Summary:

      Taber et al report the biochemical characterization of 7 mutations in PHD2 that induce erythrocytosis. Their goal is to provide a mechanism for how these mutations cause the disease. PHD2 hydroxylates HIF1a in the presence of oxygen at two distinct proline residues (P564 and P402) in the "oxygen degradation domain" (ODD). This leads to the ubiquitylation of HIF1a by the VHL E3 ligase and its subsequent degradation. Multiple mutations have been reported in the EGLN1 gene (coding for PHD2), which are associated with pseudohypoxic diseases that include erythrocytosis. Furthermore, 3 mutations in PHD2 also cause pheochromocytoma and paraganglioma (PPGL), a neuroendocrine tumour. These mutations likely cause elevated levels of HIF1a, but their mechanisms are unclear. Here, the authors analyze mutations from 152 case reports and map them on the crystal structure. They then focus on 7 mutations, which they clone in a plasmid and transfect into PHD2-KO to monitor HIF1a transcriptional activity via a luciferase assay. All mutants show impaired activation. Some mutants also impaired stability in pulse chase turnover assays (except A228S, P317R, and F366L). In vitro purified PHD2 mutants display a minor loss in thermal stability and some propensity to aggregate. Using MST technology, they show that P317R is strongly impaired in binding to HIF1a and HIF2a, whereas other mutants are only slightly affected. Using NMR, they show that the PHD2 P317R mutation greatly reduces hydroxylation of P402 (HIF1a NODD), as well as P562 (HIF1a CODD), but to a lesser extent. Finally, BLI shows that the P317R mutation reduces affinity for CODD by 3-fold, but not NODD.

      Strengths:

      (1) Simple, easy-to-follow manuscript. Generally well-written.

      (2) Disease-relevant mutations are studied in PHD2 that provide insights into its mechanism of action.

      (3) Good, well-researched background section.

      Weaknesses:

      (1) Poor use of existing structural data on the complexes of PHD2 with HIF1a peptides and various metals and substrates. A quick survey of the impact of these mutations (as well as analysis by Chowdhury et al, 2016) on the structure and interactions between PHD2 peptides of HIF1a shows that the P317R mutation interferes with peptide binding. By contrast, F366L will affect the hydrophobic core, and A228S is on the surface, and it's not obvious how it would interfere with the stability of the protein.

      (2) To determine aggregation and monodispersity of the PHD2 mutants using size-exclusion chromatography (SEC), equal quantities of the protein must be loaded on the column. This is not what was done. As an aside, the colors used for the SEC are very similar and nearly indistinguishable.

      (3) The interpretation of some mutants remains incomplete. For A228S, what is the explanation for its reduced activity? It is not substantially less stable than WT and does not seem to affect peptide hydroxylation.

      (4) The interpretation of the NMR prolyl hydroxylation is tainted by the high concentrations used here. First of all, there is a likely a typo in the method section; the final concentration of ODD is likely 0.18 mM, and not 0.18 uM (PNAS paper by the same group in 2024 reports using a final concentration of 230 uM). Here, I will assume the concentration is 180 uM. Flashman et al (JBC 2008) showed that the affinity of the NODD site (P402; around 10 uM) for PHD2 is 10-fold weaker than CODD (P564, around 1 uM). This likely explains the much faster kinetics of hydroxylation towards the latter. Now, using the MST data, let's say the P317R mutation reduces the affinity by 40-fold; the affinity becomes 400 uM for NODD (above the protein concentration) and 40 uM for CODD (below the protein concentration). Thus, CODD would still be hydroxylated by the P317R mutant, but not NODD.

      (5) The discrepancy between the MST and BLI results does not make sense, especially regarding the P317R mutant. Based on the crystal structures of PHD2 in complex with the ODD peptides, the P317R mutation should have a major impact on the affinity, which is what is reported by MST. This suggests that the MST is more likely to be valid than BLI, and the latter is subject to some kind of artefact. Furthermore, the BLI results are inconsistent with previous results showing that PHD2 has a 10-fold lower affinity for NODD compared to CODD.

      (6) Overall, the study provides some insights into mutants inducing erythrocytosis, but the impact is limited. Most insights are provided on the P317R mutant, but this mutant had already been characterized by Chowdhury et al (2016). Some mutants affect the stability of the protein in cells, but then no mechanism is provided for A228S or F366L, which have stabilities similar to WT, yet have impaired HIF1a activation.

      Comments on revision:

      While the authors have addressed my concerns regarding the SEC experiments and the structural interpretation of most mutants, I remain unconvinced by their interpretation of the P317R mutant and affinity measurements. The BLI and MST data remain inconsistent for P317R binding to CODD, and the authors' response is essentially that the fluorescent labeling of P317R (but not other mutants) uniquely interferes with binding to the NODD/CODD peptides, which does not make a lot of sense. The fluorescent labeling target lysine residues; while there are lysine in PHD2 in proximity to the peptide binding site, labeling these sites would affect binding to all mutants, not only P317R (which does not introduce any new labeling site). Furthermore, the authors did not really address the discrepancy with the observations by Flashman et al (2008) that NODD binds more weakly than CODD, which is inconsistent with their BLI results. Another point that makes me doubt the validity of the BLI results is the poor fit of the sensorgrams and the slow dissociation kinetics, which is inconsistent with the relatively low affinity in the 2-6 uM range.

    2. Reviewer #2 (Public review):

      Summary:

      Mutations in the prolyl hydroxylase, PHD2, cause erythrocytosis and, in some cases, can result in tumorigenesis. Taber and colleagues test the structural and functional consequences of seven patient-derived missense mutations in PHD2 using cell-based reporter and stability assays, and multiple biophysical assays, and find that most mutations are destabilizing. Interestingly, they discover a PHD2 mutant that can hydroxylate the C-terminal ODD, but not the N-terminal ODD, which suggests the importance of N-terminal ODD for biology. A major strength of the manuscript is the multidisciplinary approach used by the authors to characterize the functional and structural consequences of the mutations. However, the manuscript had several major weaknesses, such as an incomplete description of how the NMR was performed, a justification for using neighboring residues as a surrogate for looking at prolyl hydroxylation directly, or a reference to the clinical case studies describing the phenotypes of patient mutations. Additionally, the experimental descriptions for several experiments are missing descriptions of controls or validation, which limits their strength in supporting the claims of the authors.

      Strengths:

      (1) This manuscript is well-written and clear.

      (2) The authors use multiple assays to look at the effects of several disease-associated mutations, which support the claims.

      (3) The identification of P317R as a mutant that loses activity specifically against NODD, which could be a useful tool for further studies in cells.

      Weaknesses:

      Major:

      (1) The source data for the patient mutations (Figure 1) in PHD2 is not referenced, and it's not clear where this data came from or if it's publicly available. There is no section describing this in the methods.

      (2) The NMR hydroxylation assay.

      A. The description of these experiments is really confusing. The authors have published a recent paper describing a method using 13C-NMR to directly detect proly-hydroxylation over time, and they refer to this manuscript multiple times as the method used for the studies under review. However, it appears the current study is using 15N-HSQC-based experiments to track the CSP of neighboring residues to the target prolines, so not the target prolines themselves. The authors should make this clear in the text, especially on page 9, 5th line, where they describe proline cross-peaks and refer to the 15N-HSQC data in Figure 5B.<br /> B. The authors are using neighboring residues as reporters for proline hydroxylation, without validating this approach. How well do CSPs of A403 and I566 track with proline hydroxylation? Have the authors confirmed this using their 13C-NMR data or mass spec?<br /> C. Peak intensities. In some cases, the peak intensities of the end point residue look weaker than the peak intensities of the starting residue (5B, PHD2 WT I566, 6 ct lines vs. 4 ct lines). Is this because of sample dilution (i.e., should happen globally)? Can the authors comment on this?

      (3) Data validating the CRISPR KO HEK293A cells is missing.

      (4) The interpretation of the SEC data for the PHD2 mutants is a little problematic. Subtle alterations in the elution profiles may hint at different hydrodynamic radii, but as the samples were not loaded at equal concentrations or volumes, these data seem more anecdotal, rather than definitive. Repeating this multiple times, using matched samples, followed by comparison with standards loaded under identical buffer conditions, would significantly strengthen the conclusions one could make from the data.

      Minor:

      (1) Justification for picking the seven residues is not clearly articulated. The authors say they picked 7 mutants with "distinct residue changes", but no further rationale is provided.

      (2) A major finding of the paper is that a disease-associated mutation, P317R, can differentially affect HIF1 prolyhydroxylation, however, additional follow-up studies have not been performed to test this in cells or to validate the mutant in another method. Is it the position of the proline within the catalytic core, or the identity of the mutation that accounts for the selectivity?

      Comments on revision:

      The revised manuscript addresses most of my concerns, i.e performing SEC experiments under matched sample concentrations, and incorporating additional data to justify the use of surrogate residues to monitor proline hydroxylation. I appreciate the improvements in the text to clarify the NMR experiments, but I still find their description confusing. Although the authors are using neighboring residues to monitor proline hydroxylation (which they justify convincingly using supplementary data), the language in the text suggests they are (and can?) monitor them directly (i.e. referring to proline cross-peaks in an 15N-HSQC spectrum). The axis labels in Figure 5B also seem to have become mislabeled in this revised version.

    3. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      Taber et al report the biochemical characterization of 7 mutations in PHD2 that induce erythrocytosis. Their goal is to provide a mechanism for how these mutations cause the disease. PHD2 hydroxylates HIF1a in the presence of oxygen at two distinct proline residues (P564 and P402) in the "oxygen degradation domain" (ODD). This leads to the ubiquitylation of HIF1a by the VHL E3 ligase and its subsequent degradation. Multiple mutations have been reported in the EGLN1 gene (coding for PHD2), which are associated with pseudohypoxic diseases that include erythrocytosis. Furthermore, 3 mutations in PHD2 also cause pheochromocytoma and paraganglioma (PPGL), a neuroendocrine tumour. These mutations likely cause elevated levels of HIF1a, but their mechanisms are unclear. Here, the authors analyze mutations from 152 case reports and map them on the crystal structure. They then focus on 7 mutations, which they clone in a plasmid and transfect into PHD2-KO to monitor HIF1a transcriptional activity via a luciferase assay. All mutants show impaired activation. Some mutants also impaired stability in pulse chase turnover assays (except A228S, P317R, and F366L). In vitro purified PHD2 mutants display a minor loss in thermal stability and some propensity to aggregate. Using MST technology, they show that P317R is strongly impaired in binding to HIF1a and HIF2a, whereas other mutants are only slightly affected. Using NMR, they show that the PHD2 P317R mutation greatly reduces hydroxylation of P402 (HIF1a NODD), as well as P562 (HIF1a CODD), but to a lesser extent. Finally, BLI shows that the P317R mutation reduces affinity for CODD by 3-fold, but not NODD.  

      Strengths: 

      (1) Simple, easy-to-follow manuscript. Generally well-written. 

      (2) Disease-relevant mutations are studied in PHD2 that provide insights into its mechanism of action. 

      (3) Good, well-researched background section. 

      Weaknesses: 

      (1) Poor use of existing structural data on the complexes of PHD2 with HIF1a peptides and various metals and substrates. A quick survey of the impact of these mutations (as well as analysis by Chowdhury et al, 2016) on the structure and interactions between PHD2 peptides of HIF1a shows that the P317R mutation interferes with peptide binding. By contrast, F366L will affect the hydrophobic core, and A228S is on the surface, and it's not obvious how it would interfere with the stability of the protein. 

      Thank you for the comment.  We have further analyzed the mutations on the available PHD2 crystal structures in complex with HIFα to discern how these substitution mutations may impact PHD2 structure and function.  This analysis has been added into the discussion.

      (2) To determine aggregation and monodispersity of the PHD2 mutants using size-exclusion chromatography (SEC), equal quantities of the protein must be loaded on the column. This is not what was done. As an aside, the colors used for the SEC are very similar and nearly indistinguishable. 

      Agreed. We have performed an additional experiment as suggested by the reviewer to further assess aggregation and hydrodynamic size.  The colors used in the graph were changed for clearer differentiation between samples.

      (3) The interpretation of some mutants remains incomplete. For A228S, what is the explanation for its reduced activity? It is not substantially less stable than WT and does not seem to affect peptide hydroxylation. 

      We agree with the reviewer that the causal mechanism for some of the tested disease-causing mutants remain unclear.  The negative findings also raise the notion, perhaps considered controversial, that there may be other substrates of PHD2 that are impacted by certain mutations, which contribute to disease pathogenesis.  A brief paragraph discussing this has been included in the discussion.

      (4) The interpretation of the NMR prolyl hydroxylation is tainted by the high concentrations used here. First of all, there is a likely a typo in the method section; the final concentration of ODD is likely 0.18 mM, and not 0.18 uM (PNAS paper by the same group in 2024 reports using a final concentration of 230 uM). Here, I will assume the concentration is 180 uM. Flashman et al (JBC 2008) showed that the affinity of the NODD site (P402; around 10 uM) for PHD2 is 10-fold weaker than CODD (P564, around 1 uM). This likely explains the much faster kinetics of hydroxylation towards the latter. Now, using the MST data, let's say the P317R mutation reduces the affinity by 40-fold; the affinity becomes 400 uM for NODD (above the protein concentration) and 40 uM for CODD (below the protein concentration). Thus, CODD would still be hydroxylated by the P317R mutant, but not NODD. 

      The HIF1α concentration was indeed an oversight, which will be corrected to 0.18 mM.  The study by Flashman et al.[1] showing PHD2 having a lower affinity to the NODD than CODD likely contributes to the differential hydroxylation rates via PHD2 WT.  We showed here via MST that PHD2 P317R had K[d] of 320 ± 20 uM for HIF1αCODD, which should have led to a severe enzymatic defect, even at the high concentrations used for NMR (180 uM).  However, we observed only a subtle reduction in hydroxylation efficiency in comparison to PHD2 WT.  Thus, we performed another binding method using BLI that showed a mild binding defect on CODD by PHD2 P317R, consistent with NMR data.  The perplexing result is the WT-like binding to the NODD by PHD2 P317R, which appears inconsistent with the severe defect in NODD hydroxylation via PHD2 P317R as measured via NMR.  These results suggest that there are supporting residues within the PHD2/NODD interface that help maintain binding to NODD but compromise the efficiency of NODD hydroxylation upon PHD2 P317R mutation. 

      (5) The discrepancy between the MST and BLI results does not make sense, especially regarding the P317R mutant. Based on the crystal structures of PHD2 in complex with the ODD peptides, the P317R mutation should have a major impact on the affinity, which is what is reported by MST. This suggests that the MST is more likely to be valid than BLI, and the latter is subject to some kind of artefact. Furthermore, the BLI results are inconsistent with previous results showing that PHD2 has a 10-fold lower affinity for NODD compared to CODD. 

      The reviewer’s structural prediction that P317R mutation should cause a major binding defect, while agreeable with our MST data, is incongruent with our NMR and the data from Chowdhury et al.[2] that showed efficient hydroxylation of CODD via PHD2 P317R.  Moreover, we have attempted to model NODD and CODD on apo PHD2 P317R structure and found that the mutation had no major impact on CODD while the mutated residue could clash with NODD, causing a shifting of peptide positioning on the protein.  However, these modeling predictions, like any in silico projections, would need experimental validation.  As mentioned in our preceding response, we also performed BLI, which showed that PHD2 P317R had a minor binding defect for CODD, consistent with the NMR results and findings by Chowdhury et al[2].  NODD binding was also measured with BLI as purified NODD peptides were not amenable for soluble-based MST assay, which showed similar K[d]’s for PHD2 WT and P317R.  Considering the absence of NODD hydroxylation via PHD2 P317R as measured by NMR and modeling on apo PHD2 P317R, we posit that P317R causes deviation of NODD from its original orientation that may not affect binding due to the other interactions from the surrounding elements but unfortunately disallows NODD from turnover.  Further study would be required to validate such notion, which we feel is beyond the scope of this manuscript.  

      (6) Overall, the study provides some insights into mutants inducing erythrocytosis, but the impact is limited. Most insights are provided on the P317R mutant, but this mutant had already been characterized by Chowdhury et al (2016). Some mutants affect the stability of the protein in cells, but then no mechanism is provided for A228S or F366L, which have stabilities similar to WT, yet have impaired HIF1a activation. 

      We thank the reviewer for raising these and other limitations.  We have expanded on the shortcomings of the present study but would like to underscore that the current work using the recently described NMR assay along with other biophysical analyses suggests a previously under-appreciated role of NODD hydroxylation in the normal oxygen-sensing pathway.  

      Reviewer #2 (Public review): 

      Summary: 

      Mutations in the prolyl hydroxylase, PHD2, cause erythrocytosis and, in some cases, can result in tumorigenesis. Taber and colleagues test the structural and functional consequences of seven patientderived missense mutations in PHD2 using cell-based reporter and stability assays, and multiple biophysical assays, and find that most mutations are destabilizing. Interestingly, they discover a PHD2 mutant that can hydroxylate the C-terminal ODD, but not the N-terminal ODD, which suggests the importance of N-terminal ODD for biology. A major strength of the manuscript is the multidisciplinary approach used by the authors to characterize the functional and structural consequences of the mutations. However, the manuscript had several major weaknesses, such as an incomplete description of how the NMR was performed, a justification for using neighboring residues as a surrogate for looking at prolyl hydroxylation directly, or a reference to the clinical case studies describing the phenotypes of patient mutations. Additionally, the experimental descriptions for several experiments are missing descriptions of controls or validation, which limits their strength in supporting the claims of the authors. 

      Strengths: 

      (1) This manuscript is well-written and clear. 

      (2) The authors use multiple assays to look at the effects of several disease-associated mutations, which support the claims. 

      (3) The identification of P317R as a mutant that loses activity specifically against NODD, which could be a useful tool for further studies in cells. 

      Weaknesses: 

      Major: 

      (1) The source data for the patient mutations (Figure 1) in PHD2 is not referenced, and it's not clear where this data came from or if it's publicly available. There is no section describing this in the methods. 

      Clinical and patient information on disease-causing PHD2 mutants was compiled from various case reports and summarized in an excel sheet found in the Supplementary Information.  The case reports are cited in this excel file.  A reference to the supplementary data has been added to the Figure 1 legend and in the introduction.

      (2) The NMR hydroxylation assay. 

      A. The description of these experiments is really confusing. The authors have published a recent paper describing a method using 13C-NMR to directly detect proly-hydroxylation over time, and they refer to this manuscript multiple times as the method used for the studies under review. However, it appears the current study is using 15N-HSQC-based experiments to track the CSP of neighboring residues to the target prolines, so not the target prolines themselves. The authors should make this clear in the text, especially on page 9, 5th line, where they describe proline cross-peaks and refer to the 15N-HSQC data in Figure 5B. 

      As the reviewer mentioned, the assay that we developed directly measures the target proline residues.  This assay is ideal when mutations near the prolines are studied, such as A403, Y565 (He et al[3]).  In this previous work, we observed that the shifting of the target proline cross-peaks due to change in electronegativity on the pyrrolidine ring of proline in turn impacted the neighboring residues[3], which meant that the neighboring residues can be used as reporter residues for certain purposes.  In this study, we focused on investigating the mutations on PHD2 while leaving the sequence of the HIF-1α unchanged by using solely 15N-HSQC-based experiments without the need for double-labeled samples.  Nonetheless, we thank the reviewer for pointing out the confusion in the text and we have corrected and clarified our description of this assay.

      B. The authors are using neighboring residues as reporters for proline hydroxylation, without validating this approach. How well do CSPs of A403 and I566 track with proline hydroxylation? Have the authors confirmed this using their 13C-NMR data or mass spec? 

      For previous studies, we performed intercalated 15N-HSQC and 13C-CON experiments for the kinetic measurements of wild-type HIF-1α and mutants.  We observed that the shifting pattern of A403 and I566 in the 15N-HSQC spectra aligned well with the ones of P402 and P564, respectively, in the 13C-CON spectra.  Representative data has been added to Supplemental Data.

      C. Peak intensities. In some cases, the peak intensities of the end point residue look weaker than the peak intensities of the starting residue (5B, PHD2 WT I566, 6 ct lines vs. 4 ct lines). Is this because of sample dilution (i.e., should happen globally)? Can the authors comment on this? 

      This is an astute observation by the reviewer.  We checked and confirmed that for all kinetic datasets, the peak intensities of the end point residue are always slightly lower than the ones of the starting.  This includes the cases for PHD2 A228S and P317R in 5B, although not as obvious as the one of PHD2 WT.  We agree with the reviewer that the sample dilution is a factor as a total volume of 16 microliters of reaction components was added to the solution to trigger the reaction after the first spectrum was acquired.  It is also likely that rate of prolyl hydroxylation becomes extremely slow with only a low amount of substrate available in the system.  Therefore, the reaction would not be 100% complete which was detected by the sensitive NMR experimentation.

      (3) Data validating the CRISPR KO HEK293A cells is missing. 

      We thank the reviewer for noting this oversight.  Western blots validating PHD2 KO in HEK293A cells have been added to the Supplementary Data file.

      (4) The interpretation of the SEC data for the PHD2 mutants is a little problematic. Subtle alterations in the elution profiles may hint at different hydrodynamic radii, but as the samples were not loaded at equal concentrations or volumes, these data seem more anecdotal, rather than definitive. Repeating this multiple times, using matched samples, followed by comparison with standards loaded under identical buffer conditions, would significantly strengthen the conclusions one could make from the data. 

      Agreed.  We have performed an additional experiment as suggested with equal volume and concentration of each PHD2 construct loaded onto the SEC column for better assessment of aggregation.  Notably, our conclusion remained unchanged.

      Minor: 

      (1) Justification for picking the seven residues is not clearly articulated. The authors say they picked 7 mutants with "distinct residue changes", but no further rationale is provided. 

      Additional justification for the selection of the mutants has been added to the ‘Mutations across the PHD2 enzyme induce erythrocytosis’ section.  Briefly, some mutants were chosen based on their frequency in the clinical data and their presence in potential mutational hot spots.  Various mutations were noted at W334 and R371, while F366L was identified in multiple individuals.  Additionally, 9 cases of PHD2-driven disease were reported to be caused from mutations located between residues 200 to 210 while 13 cases were reported between residues 369-379, so G206C and R371H were chosen to represent potential hot spots.  To examine a potential genotype-phenotype relationship, two of the mutants responsible for neuroendocrine tumor development, A228S and H374R, were also selected.  Finally, mutations located close or on catalytic core residues (P317R, R371H, and H374R) were chosen to test for suspected defects.   

      (2) A major finding of the paper is that a disease-associated mutation, P317R, can differentially affect HIF1 prolyhydroxylation, however, additional follow-up studies have not been performed to test this in cells or to validate the mutant in another method. Is it the position of the proline within the catalytic core, or the identity of the mutation that accounts for the selectivity? 

      This is the very question that we are currently addressing but as a part of a follow-up study.  Indeed, one thought is that the preferential defect observed could be the result of the loss of proline, an exceptionally rigid amino acid that makes contact with the backbone twice, or the addition of a specific amino acid, namely arginine, a flexible amino acid with an added charge at this site.  Although beyond the scope of this manuscript, we will investigate whether such and other characteristics in this region of PHD2/HIF1α interface contribute to the differential hydroxylation. 

      Reviewer #3 (Public review): 

      Summary: 

      This is an interesting and clinically relevant in vitro study by Taber et al., exploring how mutations in PHD2 contribute to erythrocytosis and/or neuroendocrine tumors. PHD2 regulates HIFα degradation through prolyl-hydroxylation, a key step in the cellular oxygen-sensing pathway. 

      Using a time-resolved NMR-based assay, the authors systematically analyze seven patient-derived PHD2 mutants and demonstrate that all exhibit structural and/or catalytic defects. Strikingly, the P317R variant retains normal activity toward the C-terminal proline but fails to hydroxylate the N-terminal site. This provides the first direct evidence that N-terminal prolyl-hydroxylation is not dispensable, as previously thought. 

      The findings offer valuable mechanistic insight into PHD2-driven effects and refine our understanding of HIF regulation in hypoxia-related diseases. 

      Strengths: 

      The manuscript has several notable strengths. By applying a novel time-resolved NMR approach, the authors directly assess hydroxylation at both HIF1α ODD sites, offering a clear functional readout. This method allows them to identify the P317R variant as uniquely defective in NODD hydroxylation, despite retaining normal activity toward CODD, thereby challenging the long-held view that the N-terminal proline is biologically dispensable. The work significantly advances our understanding of PHD2 function and its role in oxygen sensing, and might help in the future interpretation and clinical management of associated erythrocytosis. 

      Weaknesses: 

      (1) There is a lack of in vivo/ex vivo validation. This is actually required to confirm whether the observed defects in hydroxylation-especially the selective NODD impairment in P317R-are sufficient to drive disease phenotypes such as erythrocytosis.

      We thank the reviewer for this comment, and while we agree with this statement, the objective of this study per se was to elucidate the structural and/or functional defect caused by the various diseaseassociated mutations on PHD2.  The subsequent study would be to validate whether the identified defects, in particular the selective NODD impairment, would lead to erythrocytosis in vivo.  However, we feel that such study would be beyond the scope of this manuscript.

      (2) The reliance on HRE-luciferase reporter assays may not reliably reflect the PHD2 function and highlights a limitation in the assessment of downstream hypoxic signaling. 

      Agreed.  All experimental assays and systems have limitations.  The HRE-luciferase assay used in the present manuscript also has limitations such as the continuous expression of exogenous PHD2 mutants driven via CMV promoter.  Thus, we performed several additional biophysical methodologies to interrogate the disease-causing PHD2 mutants.  The limitations of the luciferase assay have been expanded in the revised manuscript. 

      (3) The study clearly documents the selective defect of the P317R mutant, but the structural basis for this selectivity is not addressed through high-resolution structural analysis (e.g., cryo-EM). 

      We thank the reviewer for the comment.  While solving the structure of PHD2 P317R in complex with HIFα substrate is beyond the scope for this study, a structure of PHD2 P317R in complex with a clinically used inhibitor has been solved (PDB:5LAT).  In analyzing this structure and that of PHD2 WT in complex with NODD, Chowdhury et al[2] stated that P317 makes hydrophobic contacts with LXXLAP motif on HIFα and R317 is predicted to interact differently with this motif.  While this analysis does not directly elucidate the reason for the preferential NODD defect, it supports the possibility that P317R substitution may be more detrimental for enzymatic activity on NODD than CODD.  We have discussed this notion in the revised manuscript. 

      (4) Given the proposed central role of HIF2α in erythrocytosis, direct assessment of HIF2α hydroxylation by the mutants would have strengthened the conclusions. 

      We thank the reviewer for this comment, but we feel that such study would be beyond the scope of the present study.  We observed that the PHD2 binding patterns to HIF1α and HIF2α were similar, and we have previously assigned >95% of the amino acids in HIF1α ODD for NMR study[3]. Thus, we first focused on the elucidation of possible defects on disease-associated PHD2 mutants using HIF1α as the substrate with the supposition that an identified deregulation on HIF1α could be extended to HIF2α paralog.  However, we agree with the reviewer that future studies should examine the impact of PHD2 mutants directly on HIF2α.  

      References:

      (1) Flashman, E. et al. Kinetic rationale for selectivity toward N- and C-terminal oxygen-dependent degradation domain substrates mediated by a loop region of hypoxia-inducible factor prolyl hydroxylases. J Biol Chem 283, 3808-3815 (2008).

      (2) Chowdhury, R. et al. Structural basis for oxygen degradation domain selectivity of the HIF prolyl hydroxylases. Nat Commun 7, 12673 (2016).

      (3) He, W., Gasmi-Seabrook, G.M.C., Ikura, M., Lee, J.E. & Ohh, M. Time-resolved NMR detection of prolyl-hydroxylation in intrinsically disordered region of HIF-1alpha. Proc Natl Acad Sci U S A 121, e2408104121 (2024).

      Reviewer #1 (Recommendations for the authors): 

      (1) To increase the impact and significance of this work, I would recommend determining the mechanism by which A228S and F366L impair PHD2. Are these mutations affecting interactions with proteins other than HIF1a? Furthermore, does the F366L mutation affect the hydroxylation rate? This should be measured. The authors should also perform a more in-depth structural analysis of these mutations and perhaps use AlphaFold to identify how these sites may be involved in other interactions. 

      We thank the reviewer for the recommendations.  A paragraph discussing the quandary of A228S and F366L has been added to the discussion as well as an in-depth structural analysis of each selected mutant.  While AlphaFold is excellent at predicting protein structures overall, its capability to predict the effect of single point mutation, such as those in this study, is limited.  Therefore, it was not utilized for this paper.

      (2) For the aggregation assay, I recommended injecting the same quantity of protein on the SEC. If the aggregation-prone mutants' yields were too low, then reduced amounts of the other mutants should be injected. 

      Agreed.  An additional experiment was performed in which similar concentrations of each mutant protein was loaded onto the SEC column and chromatograms was normalized according to the molecular concentration.  Results from this experiment have been added to replace the previously performed aggregation assay.  Notably, the data from the revised experiment did not change the outcome or conclusion of the study.

      (3) For the NMR kinetics data, the authors should discuss the impact of affinities and concentrations on the reaction rate and incorporate this analysis framework to interpret their data. 

      Done.  As discussed in depth in response to Public Reviewer 1’s fourth comment, we observed only a subtle reduction in hydroxylation efficiency of HIF1aCODD by PHD2 P317R in comparison to PHD2 WT.  Upon performing BLI, we found PHD2 P317R displays only a mild binding defect on the CODD and NODD.  The WT-like binding to the NODD by PHD2 P317R appears to be inconsistent with the severe defect in NODD hydroxylation via PHD2 P317R as measured via NMR.   These results suggest that there are supporting residues within the PHD2/NODD interface that help maintain binding to NODD but compromise the efficiency of NODD hydroxylation upon PHD2 P317R mutation.

      Reviewer #2 (Recommendations for the authors): 

      It is unclear where the source data came from describing the patient mutations, or if it is publicly available. Several minor issues were noted with several of the figures or methods: 

      (1) Figure 2C. It is not clear what data are being compared for significance. The lines don't seem to clearly distinguish this. 

      Done.  The significance lines have been adjusted in the figure to better convey which data are being compared.

      (2) Please incorporate the calculated biophysical constants (KD, TM, etc, average +/- std dev) from the tables into the figures or figure legends that show the data from which they are calculated.  

      Done.  References to the corresponding tables have been added to the appropriate figure legends.

      (3) Figure 3C, the data for F366L do not appear normalized in the same way as the other constructs. 

      CD melt values for F366L were normalized in the same way as other constructs but due to noisier data acquired between 25-37°C, the top value of the sigmoidal curve is slightly higher than the other constructs (F366L: 1.066, WT: 1.007, A228S: 1.000, P317R: 1.015, R371H: 1.005). 

      (4) For Figure 1B, it would be helpful to highlight the mutants characterized in the current study with a different color/symbol to help show the number of cases. 

      Done.  Dots representing the selected mutants have been highlighted in red in Figure 1B.

      (5) A description of the isotopic labeling of PHD2 is missing from the methods.

      Due to the nature of the NMR assay, no isotopic labeling was required for PHD2.

      Reviewer #3 (Recommendations for the authors): 

      (1) To further strengthen the manuscript, the authors could consider exploring the relevance of their in vitro findings in a more physiological context. 

      We thank the reviewer for the suggestion, and we will certainly consider furthering our investigation in a more physiological context for future studies.

      (2) If technically feasible, integrating direct analyses of HIF2α regulation by the PHD2 mutants would better reflect the clinical phenotype, given the known importance of HIF2α in erythrocytosis. 

      We agree that HIF2α is important in the context of erythrocytosis, but through MST we observed no difference in binding pattern between HIF1 and HIF2 and the selected PHD2 mutants.  As we had previously assigned >95% of residues for HIF1α ODD for NMR assay, we analyzed HIF1 with the supposition that any defects observed would likely apply to HIF2.  However, we agree that future studies on the impact of PHD2 mutants directly on HIF2 would be beneficial to supplement our understanding of pseudohypoxic disease.

      (3) Additionally, although perhaps more suitable for future work or discussion, structural modeling or highresolution structural studies of the P317R variant could offer valuable insight into the observed NODD selectivity defect. 

      We thank the reviewer for the suggestion. While solving the structure of PHD2 P317R in complex with NODD is beyond the scope of this manuscript, a crystal structure of PHD2 P317R in complex with an inhibitor has been solved and insights from this structure have been added to the discussion. 

      (4) Finally, a brief clarification or discussion of the limitations of the luciferase reporter assay-especially in the context of aggregation-prone mutants-would help readers better interpret the functional data. 

      We thank the reviewer for the suggestion.  The limitations of the luciferase reporter assay in regard to its inability to detect defects with aggregation-prone mutants have been elaborated on in the discussion.

    1. Author response:

      The following is the authors’ response to the previous reviews

      Reviewer #2 (Public Review):

      Summary:

      The authors show that a combination of arginine methyltransferase inhibitors synergize with PARP inhibitors to kill ovarian and triple negative cancer cell lines in vitro and in vivo using preclinical mouse models.

      Strengths and weaknesses

      The experiments are well-performed, convincing and have the appropriate controls (using inhibitors and genetic deletions) and use statistics.

      They identify the DNA damage protein ERCC1 to be reduced in expression with PRMT inhibitors. As ERCC1 is known to be synthetic lethal with PARPi, this provides a mechanism for the synergy. They use cell lines only for their study in 2D as well as xenograph models.

      We sincerely thank Reviewer #2 for the insightful and constructive feedback, as well as for the kind recognition of the scientific quality of our work: “The experiments are well-performed, convincing and have the appropriate controls (using inhibitors and genetic deletions) and use statistics.” We sincerely thank Reviewer #2 for their thoughtful and constructive comments during both rounds of review, which have significantly improved the quality of our manuscript. In response, we have incorporated new results from additional experiments into the figures (Figures 6M and 6N) and made comprehensive revisions throughout the text, figures, and supplementary materials. Following the reviewer’s valuable suggestions, we also revised the Discussion section. In the “Recommendations for the authors” sections, we have provided detailed point-by-point responses to each comment, which were instrumental in guiding our revisions. We believe these updates have substantially strengthened the manuscript and fully addressed all reviewer concerns.

      Reviewer #2 (Recommendations for the authors): 

      Although the authors have addressed each recommendation from the reviewer, further revision of the manuscript are still necessary, as outlined below.

      Add these additional comments in the text to further enhance the comprehension and clarity of the data.

      (1) If the authors kept the tumors of various sizes in Figure 7I, it would be important to assess the protein and/or mRNA level of ERCC1 to further support their mechanism.

      Question (1): Please add the figures of new experiments (treatment diagram, curves for tumor volume and qRT-PCR data) to Figure 6.

      We thank the reviewers for their constructive suggestions. In response to the reviewers’ comments, we have added the treatment diagram and qPCR results to Figure 6. In this experiment, we shortened the treatment duration to seven days to assess early molecular responses to therapy rather than downstream effects. As expected, such short-term treatment did not result in significant differences in tumor growth among groups. The new results are now presented in Figure 6, panels M and N. The corresponding results and figure legends will also be included in the revised version of the manuscript

      (2) Figure 2G: please explain why two bands remain for sgPRMT1.

      Question (2): In the answer, the authors stated, "Upon knockdown of the major isoforms by CRISPR/Cas9, expression of this minor isoform may have increased as part of a compensatory feedback mechanism, rendering it detectable by immunoblotting." Please put the statement into the discussion section.

      We sincerely thank the reviewers for their thoughtful and constructive suggestions. In response to these comments, we have carefully revised the manuscript and incorporated the corresponding information into the Discussion section to provide greater clarity and context for our findings.

      (3) (Previously point 5) What is the link with ERCC1 splicing because reduced overall ERCC1 expression is clear?

      Question (5): Please add the explanation you provide of links between ERCC1 splicing and PRMTi into the discussion section.

      "Furthermore, as shown in Figure 4G, we observed a reduction in the total ERCC1 mRNA reads following PRMTi treatment. This decrease may be attributed, at least in part, to the instability of the alternatively spliced ERCC1 transcripts, which could be more prone to degradation. In combination with the transcriptional downregulation of ERCC1 induced by PRMT inhibition, these alternative splicing events may lead to a further reduction in functional ERCC1 protein levels. This dual impact on ERCC1 expression, through both decreased transcription and the generation of unstable or nonfunctional isoforms, likely contributes to the enhanced cellular sensitivity to PARP inhibitors observed in our study."

      We sincerely thank the reviewers for their thoughtful and constructive suggestions. In response to these comments, we have carefully revised the manuscript and incorporated the corresponding information into the Discussion section to provide greater clarity and context for our findings.

      (4) (Previously 6) Figure 7J: From the graph, it seems like Olaparib+G715 and G715+G025 have a similar effect on tumor volume (two curves overlap). Please discuss.

      Question (6): In the answer, the authors stated, "Our in vitro and in vivo findings, together with previously published data, consistently demonstrate that GSK715 is more potent than both GSK025 and Olaparib. Notably, treatment with GSK715 alone led to significantly greater inhibition of tumor growth compared to either GSK025 or Olaparib administered individually. This higher potency of GSK715 also explains the comparable levels of tumor suppression observed in the combination groups, including GSK715 plus Olaparib and GSK715 plus GSK025. These results suggest that GSK715 is likely the primary driver of efficacy in the two drug combination settings." Please put the statement in the corresponding result section for Figure 6J.

      We sincerely thank the reviewers for their thoughtful and constructive suggestions. In response to these comments, we have carefully revised the manuscript and incorporated the corresponding information into the result section for Figure 6J to provide greater clarity and context for our findings.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      In this manuscript entitled "Molecular dynamics of the matrisome across sea anemone life history", Bergheim and colleagues report the prediction, using an established sequence analysis pipeline, of the "matrisome" - that is, the compendium of genes encoding constituents of the extracellular matrix - of the starlet sea anemone Nematostella vectensis. Re-analysis of an existing scRNA-Seq dataset allowed the authors to identify the cell types expressing matrisome components and different developmental stages. Last, the authors apply time-resolved proteomics to provide experimental evidence of the presence of the extracellular matrix proteins at three different stages of the life cycle of the sea anemone (larva, primary polyp, adult) and show that different subsets of matrisome components are present in the ECM at different life stages with, for example, basement membrane components accompanying the transition from larva to primary polyp and elastic fiber components and matricellular proteins accompanying the transition from primary polyp to the adult stage. 

      Strengths: 

      The ECM is a structure that has evolved to support the emergence of multicellularity and different transitions that have accompanied the complexification of multicellular organisms. Understanding the molecular makeup of structures that are conserved throughout evolution is thus of paramount importance. 

      The in-silico predicted matrisome of the sea anemone has the potential to become an essential resource for the scientific community to support big data annotation efforts and understand better the evolution of the matrisome and of ECM proteins, an important endeavor to better understand structure/function relationships. This study is also an excellent example of how integrating datasets generated using different -omic modalities can shed light on various aspects of ECM metabolism, from identifying the cell types of origins of matrisome components using scRNA-Seq to studying ECM dynamics using proteomics. 

      We greatly appreciate the positive feedback regarding the design of our study and the evolutionary significance of our findings.

      Weaknesses: 

      My concerns pertain to the three following areas of the manuscript: 

      (1) In-silico definition of the anemone matrisome using sequence analysis: 

      a) While a similar computational pipeline has been applied to predict the matrisome of several model organisms, the authors fail to provide a comprehensive definition of the anemone matrisome: In the text, the authors state the anemone matrisome is composed of "551 proteins, constituting approximately 3% of its proteome (see page 6, line 14), but Figure 1 lists 829 entries as part of the "curated" matrisome, Supplementary Table S1 lists the same 829 entries and the authors state that "Here, we identified 829 ECM proteins that comprise the matrisome of the sea anemone Nematostella vectensis" (see page 17, line 10). Is the sea anemone matrisome composed of 551 or 829 genes? If we refer to the text, the additional 278 entries should not be considered as part of the matrisome, but what is confusing is that some are listed as glycoproteins and the "new_manual_annotation" proposed by the authors and that refer to the protein domains found in these additional proteins suggest that in fact, some could or should be classified as matrisome proteins. For example, shouldn't the two lectins encoded by NV2.3951 and NV2.3157 be classified as matrisome-affiliated proteins? Based on what has been done for other model organisms, receptors have typically been excluded from the "matrisome" but included as part of the "adhesome" for consistency with previously published matrisome; the reviewer is left wondering whether the components classified as "Other" / "Receptor" should not be excluded from the matrisome and moved to a separate "adhesome" list. 

      In addition to receptors, the authors identify nearly 70 glycoproteins classified as "Other". Here, does other mean "non-matrisome" or "another matrisome division" that is not core or associated? If the latter, could the authors try to propose a unifying term for these proteins? Unfortunately, since the authors do not provide the reasons for excluding these entries from the bona fide matrisome (list of excluding domains present, localization data), the reader is left wondering how to treat these entries. 

      Overall, the study would gain in strength if the authors could be more definitive and, if needed, even propose novel additional matrisome annotations to include the components for now listed as "Other" (as was done, for example, for the Drosophila or C. elegans matrisomes). 

      The reviewer is correct to point out the confusing terminology used throughout our manuscript, where both the total of 829 proteins constituting the curated list of ECM domain proteins and the actual matrisome (excluding "others") were referred to as "matrisomes". In general, we followed the example set by Naba & Hynes in their 2012 paper (Mol Cell Proteomics. 2012 Apr;11(4):M111.014647. doi: 10.1074/mcp.M111.014647), where they define the "matrisome" as encompassing all components of the extracellular matrix ("core matrisome") and those associated with it ("matrisome-associated" proteins). This corresponds to our group of 551 proteins, comprising both core matrisome and matrisomeassociated proteins. The Naba & Hynes paper also contains the inclusive and exclusive domain lists for the matrisome that we applied for our dataset. In the revised manuscript, we have now labelled the group of 829 proteins as "curated ECM domain proteins/genes", which includes all proteins positively selected for containing a bona fide ECM domain. After excluding non-matrisomal proteins such as receptors, we arrive at the 551 proteins that constitute the "Nematostella matrisome". We have maintained this terminology throughout the revised manuscript and have revised Figures 1B and 4B accordingly.

      Regarding the category of "other" proteins, which by definition are not part of the matrisome although containing ECM domains, we have taken the reviewer's advice and classified these in more detail. We categorized all receptors as "adhesome" (202 proteins).  The remaining group of “other” secreted ECM domain proteins were then further subcategorized. Those exhibiting significant matches in the ToxProt database were subclassified as "putative venoms" (15 proteins). This group also includes the two lectins (NV2.3951 and NV2.3157), which had been originally shifted to the “other” category due to their classification as venoms. We categorized as “adhesive proteins” (28 proteins) factors such as coadhesins that due to their domain architecture resemble bioadhesive proteins described in proteomic studies of other invertebrate species, such as corals or sponges (see also https://doi.org/10.1016/j.jprot.2022.104506). Further sub-categories are stress/injury response proteins (9 proteins) and ion channels (6 proteins). The remaining 17 proteins were categorized as “uncharacterized ECM domain proteins”. These include highly diverse proteins possessing either single ECM domains or novel domain combinations. We decided to retain those in our dataset as candidates for future functional characterization.

      b) It is surprising that the authors are not providing the full currently accepted protein names to the entries listed in Supplementary Table S1 and have used instead "new_manual_annotation" that resembles formal protein names. This liberty is misleading. In fact, the "new_manual_annotation" seems biased toward describing the reason the proteins were positively screened for through sequence analysis, but many are misleading because there is, in fact, more known about them, including evidence that they are not ECM proteins. The authors should at least provide the current protein names in addition to their "new_manual_annotations". 

      c) To truly serve as a resource, the Table should provide links to each gene entry in the Stowers Institute for Medical Research genome database used and some sort of versioning (this could be added to columns A, B, or D). Such enhancements would facilitate the assessment of the rigor of the list beyond the manual QC of just a few entries. 

      d) Since UniProt is the reference protein knowledge database, providing the UniProt IDs associated with the predicted matrisome entries would also be helpful, giving easy access to information on protein domains, protein structures, orthology information, etc. 

      e) In conclusion, at present, the study only provides a preliminary draft that should be more rigorously curated and enriched with more comprehensive and authoritative annotations if the authors aspire the list to become the reference anemone matrisome and serve the community. 

      Table S1 has been updated to include links to the respective Stowers Institute IDs (first two columns), as well as SwissProt IDs and current descriptions from both the Stowers Institute (SI) and Swissprot.

      In our manual annotations, we prioritized these over automated ones due to the considerable effort invested in examining each sequence individually. The cnidaria-specific minicollagens and NOWA proteins might serve as an example. According to the SI descriptions, the minicollagens are annotated as “keratin-associated protein, predicted or hypothetical protein, collagen-like protein and pericardin”. We classified these as minicollagens on the basis of overall domain architecture and of signature domains and sequence motifs, such as minicollagen cysteine-rich domains (CRDs) and polyproline stretches (doi: 10.1016/j.tig.2008.07.001). NOWA is a CTLD/CRD-containing protein that is part of nematocyst tubules (doi:10.1016/j.isci.2023.106291). The first two NOWA isoforms, according to Si descriptions, were annotated as aggrecan and brevican core proteins, which is very misleading. We therefore feel that our manual annotations better serve the cnidarian research community in classifying these proteins.

      Automated annotations of ECM proteins often rely on similarities between individual domains, neglecting overall domain composition. For example, Swissprot descriptions annotate 31 TSP1 domain-containing proteins in our list as "Hemicentin-1", but closer inspection reveals that only one sequence (NV2.24790) qualifies as Hemicentin-1 due to its characteristic vWFA, Ig-like, TSP1, G2 nidogen, and EGF-like domain architecture. Regarding novel protein annotations, NV2.650 might serve as an example. While SI descriptions annotate this protein as "epidermal growth factor" based on the presence of several EGF-like domains, our analysis reveals two integrin alpha N-terminal domains that classify this sequence as integrin-related. We have therefore assigned a description (Secreted integrin-N-related protein) that references this defining domain and avoids misclassification within the EGF family.

      In cases where the automated annotation (including those in Genbank) matched our own findings, we adopted the existing description, as seen with netrin-1 (NV2.7734). We acknowledge that our manual annotations are not flawless and will be refined by future research. Nonetheless, we offer them as an approximation to a more accurate definition of the identified protein list.

      (2) Proteomic analysis of the composition of the mesoglea during the sea anemone life cycle: 

      a) The product of 287 of the 829 genes proposed to encode matrisome components was detected by proteomics. What about the other ~550 matrisome genes? When and where are they expressed? The wording employed by the authors (see line 11, page 13) implies that only these 287 components are "validated" matrisome components. Is that to say that the other ~550 predicted genes do not encode components of the ECM? This should be discussed. 

      Obviously, our wording was not sufficiently accurate here. In the revised Fig. 1B we indicated that 210 of the 551 matrisome (core and associated) proteins were confirmed by mass spectrometry. In total, 287 proteins were identified by mass spectrometry, meaning that 77 of those are non-matrisomal proteins belonging to the “adhesome” (47) and “other” (30) groups. The fact that the remaining 542 proteins of the matrisome predicted by our in silico analysis could not be identified has two major reasons: (1) Our study was focussed on the molecular dynamics of the mesoglea. Therefore, only mesogleas were isolated for the mass spectrometry analysis and nematocysts were mostly excluded by extensive washing steps. As nematocysts contribute significantly to the predicted matrisome, this group of proteins is underrepresented in the mass spectrometry analysis. (2) A significant fraction of the predicted ECM proteins constitutes soluble factors and transmembrane receptors. These might not be necessarily part of the mesoglea isolates. In addition, the isolation and solubilization method we applied might have technical limitations. Although we used harsh conditions for solubilizing the mesoglea samples (90°C and high DTT concentrations), we cannot exclude that we missed proteins which resisted solubilization and thus trypsinization. We confirmed that all genes predicted by the in silico analysis have transcriptomic profiles as demonstrated in supplementary table S4. We have clarified these points in the revised results part (p.6) and also revised the statement in line 16, page 13.

      b) Can the authors comment on how they have treated zero TMT values or proteins for which a TMT ratio could not be calculated because unique to one life stage, for example? 

      We did not include these proteins in the analysis of the respective statistical comparison. This involved only very few proteins (about 10).  

      c) Could the authors provide a plot showing the distribution of protein abundances for each matrisome category in the main figure 4? In mammals, the bulk of the ECM is composed of collagens, followed by fibrillar ECM glycoproteins, the other matrisome components being more minor. Is a similar distribution observed in the sea anemone mesoglea? 

      We have included such a plot showing protein abundances across life stages and protein categories (Fig. 4A). Collagens and basement membrane proteoglycans (perlecan) are the most abundant protein categories in the core matrisome while secreted factors dominate in the matrisome-associated group.

      d) Prior proteomic studies on the ECM of vertebrate organisms have shown the importance of allowing certain post-translational modifications during database search to ensure maximizing peptide-to-spectrum matching. Such PTMs include the hydroxylation of lysines and prolines that are collagen-specific PTMs. Multiple reports have shown that omitting these PTMs while analyzing LC-MS/MS data would lead to underestimating the abundance of collagens and the misidentification of certain collagens. The authors may want to reanalyze their dataset and include these PTMs as part of their search criteria to ensure capturing all collagen-derived peptides. 

      Thank you for this suggestion. We have re-analyzed our dataset including lysine and proline hydroxylation as PTM. While we obtained in total 70 more proteins using this approach, this additional group did not contain any large collagen or minicollagen we had not detected before. We only obtained two additional collagen-like proteins with very short triple helical domains (V2t013973001.1, NV2t024002001.1), one being a fragment. We don’t feel this justifies implementing a re-analysis of the proteome in our study.

      e) The authors should ensure that reviewers are provided with access to the private PRIDE repository so the data deposited can also be evaluated. They should also ensure that sufficient meta-data is provided using the SRDF format to allow the re-use of their LCMS/MS datasets. 

      We apologize for not providing the reviewer access in our initial submission and have asked the editorial office to forward the PRIDE repository link to all reviewers immediately after receiving the reviews. We did upload a metadata.csv file with the proteomics dataset. This file contains an annotation of all TMT labels to the samples and conditions and replicates used in the manuscript. It contains similar information as an SRDF format file. In addition, the search output files on protein and psm level have been provided. So, from our point of view, we provided all necessary information to reproduce the analysis.

      (3) Supplementary tables: 

      The supplementary tables are very difficult to navigate. They would become more accessible to readers and non-specialists if they were accompanied by brief legends or "README" tabs and if the headers were more detailed (see, for example, Table S2, what does "ctrl.ratio_Larvae_rep2" exactly refer to? Or Table S6 whose column headers using extensive abbreviations are quite obscure). Similarly, what do columns K to BX in Supplementary Table S1 correspond to? Without more substantial explanations, readers have no way of assessing these data points. 

      We have revised the tables and removed any redundant data columns. We also included detailed explanations of the used abbreviations, both in the headers and in a separate README file. Some of the information was apparently lost during the conversion to pdf files. We will therefore upload the original .xls files when submitting the revised manuscript.

      Reviewer #2 (Public review): 

      This work set out to identify all extracellular matrix proteins and associated factors present within the starlet sea anemone Nematostella vectensis at different life stages. Combining existing genomic and transcriptomic datasets, alongside new mass spectometry data, the authors provide a comprehensive description of the Nematostella matrisome. In addition, immunohistochemistry and electron microscopy were used to image whole mount and decellularized mesoglea from all life stages. This served to validate the de-cellularization methods used for proteomic analyses, but also resulted in a very nice description of mesoglea structure at different life stages. A previously published developmental cell type atlas was used to identify the cell type specificity of the matrisome, indicating that the core matrisome is predominantly expressed in the gastrodermis, as well as cnidocytes. The analyses performed were rigorous and the results were clear, supporting the conclusions made by the authors. 

      Thank you. We greatly appreciate the positive assessment of our study.

      Reviewer #3 (Public review): 

      Summary: 

      This manuscript by Bergheim et al investigates the molecular and developmental dynamics of the matrisome, a set of gene products that comprise the extracellular matrix, in the sea anemone Nematostella vectensis using transcriptomic and proteomic approaches. Previous work has examined the matrisome of the hydra, a medusozoan, but this is the first study to characterize the matrisome in an anthozoan. The major finding of this work is a description of the components of the matrisome in Nematostella, which turns out to be more complex than that previously observed in hydra. The authors also describe the remodeling of the extracellular matrix that occurs in the transition from larva to primary polyp, and from primary polyp to adult. The authors interpret these data to support previously proposed (Steinmetz et al. 2017) homology between the cnidarian endoderm with the bilaterian mesoderm. 

      Strengths: 

      The data described in this work are robust, combining both transcriptome and proteomic interrogation of key stages in the life history of Nematostella, and are of value to the community. 

      Thank you for your positive assessment of our dataset. 

      Weaknesses: 

      The authors offer numerous evolutionary interpretations of their results that I believe are unfounded. The main problem with extending these results, together with previous results from hydra, into an evolutionary synthesis that aims to reconstruct the matrisome of the ancestral cnidarian is that we are considering data from only two species. I agree with the authors' depiction of hydra as "derived" relative to other medusozoans and see it as potentially misleading to consider the hydra matrisome as an exemplar for the medusozoan matrisome. Given the organismal and morphological diversity of the phylum, a more thorough comparative study that compares matrisome components across a selection of anthozoan and medusozoan species using formal comparative methods to examine hypotheses is required. 

      Specifically, I question the author's interpretation of the evolutionary events depicted in this statement: 

      "The observation that in Hydra both germ layers contribute to the synthesis of core matrisome proteins (Epp et al. 1986; Zhang et al. 2007) might be related to a secondary loss of the anthozoan-specific mesenteries, which represent extensions of the mesoglea into the body cavity sandwiched by two endodermal layers." 

      Anthozoans and medusozoans are evolutionary sisters. Therefore, the secondary loss of "anthozoan-like mesenteries" in hydrozoans is at least as likely as the gain of this character state in anthozoans. By extension, there is no reason to prefer the hypothesis that the state observed in Nematostella, where gastroderm is responsible for the synthesis of the core matrisome components, is the ancestral state of the phylum. Moreover, the fossil evidence provided in support of this hypothesis (Ou et al. 2022) is not relevant here because the material described in that work is of a crown group anthozoan, which diversified well after the origin of Anthozoa. The phylogenetic structure of Cnidaria has been extensively studied using phylogenomic approaches and is generally well supported (Kayal et al. 2018; DeBiasse et al. 2024). Based on these analyses, anthozoans are not on a "basal" branch, as the authors suggest. The structure of cnidarian phylogeny bifurcates with Anthozoa forming one clade and Medusozoa forming the other. From the data reported by Bergheim and coworkers, it is not possible to infer the evolutionary events that gave rise to the different matrisome states observed in Nematostella (an anthozoan) and hydra (a medusozoan). Furthermore, I take the observation in Fig 5 that anthozoan matrisomes generally exhibit a higher complexity than other cnidarian species to be more supportive of a lineage-specific expansion of matrisome components in the Anthozoa, rather than those components being representative of an ancestral state for Cnidaria. Whatever the implication, I take strong issue with the statement that "the acquisition of complex life cycles in medusozoa, that are distinguished by the pelagic medusa stage, led to a secondary reduction in the matrisome repertoire." There is no causal link in any of the data or analyses reported by Bergheim and co-workers to support this statement and, as stated above, while we are dealing with limited data, insufficient to address this question, it seems more likely to me that the matrisome expanded in anthozoans, contrasting with the authors' conclusions. While the discussion raises many interesting evolutionary hypotheses related to the origin of the cnidarian matrisome, which is of vital interest if we are to understand the origin of the bilaterian matrisome, a more thorough comparative analysis, inclusive of a much greater cnidarian species diversity, is required if we are to evaluate these hypotheses. 

      DeBiasse MB, Buckenmeyer A, Macrander J, Babonis LS, Bentlage B, Cartwright P, Prada C, Reitzel AM, Stampar SN, Collins A, et al. 2024. A Cnidarian Phylogenomic Tree Fitted With Hundreds of 18S Leaves. Bulletin of the Society of Systematic Biologists [Internet] 3. Available from: https://ssbbulletin.org/index.php/bssb/article/view/9267

      Epp L, Smid I, Tardent P. 1986. Synthesis of the mesoglea by ectoderm and endoderm in reassembled hydra. J Morphol [Internet] 189:271-279. Available from: https://pubmed.ncbi.nlm.nih.gov/29954165/ 

      Kayal E, Bentlage B, Sabrina Pankey M, Ohdera AH, Medina M, Plachetzki DC, Collins AG, Ryan JF. 2018. Phylogenomics provides a robust topology of the major cnidarian lineages and insights on the origins of key organismal traits. BMC Evol Biol [Internet] 18:1-18. Available from: https://bmcecolevol.biomedcentral.com/articles/10.1186/s12862-018-1142-0

      Ou Q, Shu D, Zhang Z, Han J, Van Iten H, Cheng M, Sun J, Yao X, Wang R, Mayer G. 2022. Dawn of complex animal food webs: A new predatory anthozoan (Cnidaria) from Cambrian. The Innovation 3:100195 

      Steinmetz PRH, Aman A, Kraus JEM, Technau U. 2017. Gut-like ectodermal tissue in a sea anemone challenges germ layer homology. Nature Ecology & Evolution 2017 1:10 [Internet] 1:1535-1542. Available from: https://www.nature.com/articles/s41559-017-0285-5

      Zhang X, Boot-Handford RP, Huxley-Jones J, Forse LN, Mould AP, Robertson DL, Li L, Athiyal M, Sarras MP. 2007. The collagens of hydra provide insight into the evolution of metazoan extracellular matrices. J Biol Chem [Internet] 282:6792-6802. Available from: https://pubmed.ncbi.nlm.nih.gov/17204477/ 

      We agree with the reviewer that only the analysis of several additional anthozoan and medusozoan representatives will yield a valid basis for a reconstruction of the ancestral cnidarian matrisome and allow statements about ancestral or novel features within the phylum. We have therefore revised our statements in the discussion part of the manuscript by implementing the cited literature and also findings from medusozoan genome analysis (e.g. Gold et al., 2018) demonstrating that changes in gene content are as common in the anthozoans as in medusozoans, which questioned the previously stated “basal” state of Nematostella or of anthozoans in general.

      Reviewer #1 (Recommendations for the authors): 

      (1) In Figure 2A, an "o" is missing in the labeling of the "developing cnidcytes" population. 

      Thank you, we have corrected the typo.

      (2) It would be helpful to have the different life stages indicated as headers of the heat maps presented in Figure 4. 

      We have included symbolic representations for the different life stages on top of the heat maps in addition to the respective labels at the bottom.

      Reviewer #2 (Recommendations for the authors): 

      Important changes: 

      (1) Figure 2B The x-axis tissue names should be changed to something more easily readable/understandable - some are clear, but others are not. Perhaps abbreviations could be expanded in the legend. 

      We have expanded the legend in Fig. 2B to render it more easily readable. We have also rotated the maps in A to have them aligned with the ones in Fig.3B.

      (2) Figure 3B This figure would be improved by the inclusion of cluster names, to understand better the mapping. 

      We have added relevant cluster names to Fig. 3B and as stated above aligned the orientation of the maps in Fig. 2B and Fig. 3B.

      (3) Figure 3C As with 2B, I find the y-axis cnidocyte cell state names to be unclear at times. Perhaps abbreviations could be expanded in the legend. 

      All abbreviations were expanded in Fig.3C axis labels.

      (4) Many of the supplementary tables are not well exported or easily readable as is (gene names are truncated, headers truncated, etc), which means that they may not be easily usable by researchers in the field interested in following up on this work in other contexts. Indeed, to be more usable, please consider sharing these supplementary data as .csv files, for example, instead of as .pdfs. 

      We are sorry for this inconvenience, which was obviously caused by the conversion to pdf files. We will upload the original csv files when submitting the revised manuscript.

      Smaller nitpicky comments: 

      (5) Page 2 line 4 & page 3 line 7: Please consider a term other than "pre-bilaterian". The drawing/ordering of a phylogeny of extant species is not meaningful in terms of more or less ancestral. e.g. if the tips are flipped in the drawing of the tree, can we say that bilaterians are pre-cnidarians? What does that mean? 

      We have used that term on the basis that cnidarians existed before the appearance of bilaterians according to the fossil record and molecular phylogenies (McFadden et al., 2021; Adoutte et al., 2000;Cavalier-Smith et al., 1996; Collins, 1998; Kim et al., 1999; Medina et al., 2001; Wainright et al., 1993). To acknowledge remaining uncertainties in the timing of origin of animals, we will use the term “early-diverging metazoans” instead, which is widely accepted in the cnidarian community. 

      (6) Page 3 line 9 I was confused by the use of "gastrula-shaped body" to describe cnidarians, which are on the whole very morphologically diverse and don't all resemble gastrulae (that can also be quite diverse). 

      This term is sometimes used to refer to the diploblastic cnidarian body plan (outer ectoderm, inner endoderm) with a mouth that corresponds to the blastopore. To avoid misunderstandings, we changed it in the revised manuscript to “Cnidarians, the sister group to bilaterians, are characterized by a simple body plan with a central body cavity and a mouth opening surrounded by tentacles.”

      Reviewer #3 (Recommendations for the authors): 

      (1) In general, I felt there was a lot of discussion about protein structure and diversity that is difficult to follow without a figure. I think some of the information in Supplementary Figures S5, S9, and S11 should be in the main figures. 

      Following the reviewer’s suggestion, we have integrated Fig. S5 (collagens) into the main Fig. 2 and Fig. S9 (polydoms) into Fig. 4. As metalloproteases are not extensively discussed in the manuscript (and also due to the large size of the figure) we have kept Fig. S11 as a supplementary figure.

      (2) Page 3, Line 7: The use of the term "pre-bilaterian" is inappropriate. Cnidarians and bilaterians are evolutionary sisters. Therefore, each lineage derives from the same split and is the same age. The cnidarian lineage is not older than the bilaterian lineage. 

      Following a similar request by reviewer 2 we have replaced this term by “early diverging metazoans”.

      (3) Page 5, Line 10. How were in silico matrisomes from early-branching metazoan species predicted? 

      We applied the same bioinformatic pipeline as for the Nematostella matrisome. We clarified this in the respective methods part.

      (4) Page 16, Line 8: This should be Thus. 

      Obviously, the wording of this sentence was ambiguous. We changed it to ”In contrast, the adult mesoglea is significantly enriched in elastic fiber components, such as fibrillins and fibulin. This compositional shift likely adds to the visco-elastic properties (Gosline 1971a, b) of the growing body column (Fig. 4B,D, supplementary table S7).”

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the editor and the reviewers for their positive and constructive comments. Below is our point-by-point responses.

      Reviewer #1 (Evidence, reproducibility and clarity (Required)):

      Metabolic dysfunction-associated steatotic liver disease (MASLD) ranges from simple steatosis, steatohepatitis, fibrosis/cirrhosis, and hepatocellular carcinoma. In the current study, the authors aimed to determine the early molecular signatures differentiating patients with MASLD associated fibrosis from those patients with early MASLD but no symptoms. The authors recruited 109 obese individuals before bariatric surgery. They separated the cohorts as no MASLD (without histological abnormalities) and MASLD. The liver samples were then subjected to transcriptomic and metabolomic analysis. The serum samples were subjected to metabolomic analysis. The authors identified dysregulated lipid metabolism, including glyceride lipids, in the liver samples of MASLD patients compared to the no MASLD ones. Circulating metabolomic changes in lipid profiles slightly correlated with MASLD, possibly due to the no MASLD samples derived from obese patients. Several genes involved in lipid droplet formation were also found elevated in MASLD patients. Besides, elevated levels of amino acids, which are possibly related to collagen synthesis, were observed in MASLD patients. Several antioxidant metabolites were increased in MASLD patients. Furthermore, dysregulated genes involved in mitochondrial function and autophagy were identified in MASLD patients, likely linking oxidative stress to MASLD progression. The authors then determined the representative gene signatures in the development of fibrosis by comparing this cohort with the other two published cohorts. Top enriched pathways in fibrotic patients included GTPase signaling and innate immune responses, suggesting the involvement of GTPase in MASLD progression to fibrosis. The authors then challenged human patient derived 3D spheroid system with a dual PPARa/d agonist and found that this treatment restored the expression levels of GTPase-related genes in MASLD 3D spheroids. In conclusion, the authors suggested the involvement of upregulated GTPase-related genes during fibrosis initiation. Overall, the current study might provide some resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. However, several concerns should be carefully addressed.

      1. A recent study, via proteomic and transcriptomic analysis, revealed that four proteins (ADAMTSL2, AKR1B10, CFHR4 and TREM2) could be used to identify MASLD patients at risk of steatohepatitis (PMID: 37037945). It is not clear why the authors did not include this study in their comparison. Thank you for the suggestion. The RNA sequencing dataset (GSE135251) from study PMID 37037945 is the same dataset we used as an external benchmark in our study, referred to as the EU cohort on page 4 in the manuscript. In addition to PMID 37037945, we have cited the original transcriptomic study (PMID 33268509) for the EU cohort. In the revised manuscript, we discussed this proteome-transcriptome paper in the Discussion section and highlighted the potential of AKR1B10 as a biomarker in early MASLD.

      The authors recruited 109 patients but only performed transcriptomic and metabolomic analysis in 94 liver samples. Why did the authors exclude other samples?

      We thank the reviewer for their question and we understand the confusion. The discrepancy in sample size between liver and plasma cohorts is due to the fact that, for certain cases, we were unable to get sufficient liver tissue slices (“Exclusion criteria included: age The authors mentioned clinical data in Table 1 but did not present the table in this manuscript.

      Table 1 (key patient characteristics) was included in the main document after the Methods section, and Table S1 (additional patient characteristics) was provided as a supplemental file in our original submission.

      The generated metabolomic data could be a very useful resource to the MASLD community. However, it is very confusing how the data was generated in those supplemental tables. There is no clear labeling of human clinical information in those tables. Also, what do those values mean in columns 47-154? This reviewer assumed that they are the raw data of metabolomic analysis in plasma samples. However, without clear clinical information in these patients, it is impossible that any scientist can use the data to reproduce the authors' findings.

      We appreciate this suggestion. To ensure accessibility of the data resources, we created a GitHub repository for both data and code, available at https://github.com/SLINGhub/MASLD_dual_omics____.

      The GitHub repository includes clinical data for all 109 participants with patient characteristics and histological gradings, as well as processed omics data (log₂-transformed). We have generated artificial IDs for each patient so that we can include all the requested data in an organized manner. A code template is also provided to replicate the main statistical results from this study. In addition, for readers interested in conducting analyses from the raw data, we have deposited the raw sequencing files and mass spectrometry data in GEO and Zenodo, as detailed in the ‘Data Availability’ section.

      In Fig. 5B, the authors excluded the steatosis and fibrosis overlapped genes. Steatosis and fibrosis specific genes could simply reflect the outcomes rather than causes. In this case, the obtained results might not identify the gene signatures related to fibrosis initiation.

      We appreciate this comment, but we do not fully understand the reviewer’s point since we did not exclude overlapped genes in our analysis, and it was unclear to us whether excluding overlapping genes has anything to do with causality of both processes.

      In Figure 5B, we identified the gene signatures associated with steatosis and fibrosis after adjusting for potential confounders such as age, sex, BMI and diabetes status. Our results showed that these signatures were relatively independent, sharing a limited number of genes. We then examined genes uniquely associated with each process by additional adjustment (e.g., adjusting steatosis models for fibrosis grades). To us this was not an unreasonable approach, given that steatosis precedes fibrosis in most cases, especially in morbid obesity.

      We nevertheless agree with the reviewer’s point that the gene expression changes we identified represent statistical associations without warranting causality. To specifically address fibrosis initiation mechanisms within the limitation of the current study design, we performed a separate comparative analysis between patients with fibrosis+steatosis versus those with steatosis alone (Table S11), which still identified GTPase regulation as a potential key mechanism in fibrosis initiation (Figure 6B).

      In Fig. 6D, the authors used 3D liver spheroid to validate their findings. However, there is no images showing the 3D liver spheroid formation before and after PPARa/d agonist treatment. It is not clear whether the 3D liver spheroid was successfully established.

      There is extensive literature (>40 papers) from the Lauschke lab on 3D liver spheroid culture, including but not limited to PMIDs 27143246, 28264975, 32775153, 37870288 and 39605182. Images of the spheroids can be seen in Figure 1c of Adv. Sci. 2024, 2407572 and elafibrinor treatment did not affect the morphology of the spheroids.

      The authors suggested that targeting LX-2 cells with Rac1 and Cdc42 inhibitors could reduce collagen production. Did the authors observe these two genes upregulated in mRNA and protein expression levels in their cohort when compared MASLD patients with and without fibrosis? Did the authors observe that the expression levels of Rac1 and Cdc42 are correlated with fibrosis progression in MASLD patients?

      Regarding comments 7 and 8, we targeted Rac1 and Cdc42 in the LX-2 cell experiment as they are common and major GTPases. Protein-level data are not available in our dataset, but we examined their transcript-level expression. RAC1 and CDC42 expression levels were positively associated with fibrosis progression, with coefficients of 0.362 (q = 0.027) and 0.342 (q = 0.031), respectively. These results are presented in Table S5, and the corresponding boxplots are shown here.

      Figure R1. RAC1 and CDC42 expression levels in individuals with different fibrosis *levels. *

      Other studies have revealed several metabolite changes related to MASLD progression (PMID: 35434590, PMID: 22364559). However, the authors did not discuss the discrepancies between their findings with the previous studies.

      Thank you for the suggestion. We have incorporated a discussion of the two studies into the Discussion section, highlighting the consistencies and discrepancies between our plasma metabolomic results and previous findings. The main differences may stem from variations in MASLD spectrum and the degree of obesity in the cohorts.

      Reviewer #1 (Significance (Required)):

      Overall, the current study might provide some new resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. The MASLD research community will be interested in the resource data.

      We thank this reviewer for the positive and constructive evaluation of our manuscript.

      Reviewer #2 (Evidence, reproducibility and clarity (Required)):

      Summary:

      In this paper, Kaldis and collaborators investigate the molecular heterogeneity of a 109 morbidly obese patient cohort, focusing on liver transcriptomics and metabolomics analysis from liver and serum. The main finding (i.e. upregulation of GTPase-coding genes) was validated in spheroids and a human HSC cell line. As these proteins are involved in critical cellular functions related to metabolism and cytoskeleton dynamics, these findings shed light on their involvement in human liver pathology which so far has been poorly (or even not) documented to date. This is an interesting addition to the current knowledge about chronic liver pathology. However the manuscript suffers from the lack of a clear-cut definition of patient subgroups and the seemingly indistinct use of generic (MASLD, NAS score) and more granular terms (MASH, fibrosis) across the various analysis they performed.

      We thank this reviewer of highlighting the novelty of our manuscript. We agree that mixing generic and granular terms can be confusing and we tried to use of terms consistently throughout, which has been further improved in the revised version.

      Figure 1 and Table 1 provide comprehensive information regarding histological phenotypes, NAS scores, and patient characteristics. From Figure 2 onward, we specifically focused on steatosis and fibrosis as distinct histological features, identifying molecular signatures associated with each process.

      The term ‘MASH’ was used only when referring to the ex vivo 3D spheroids derived from histologically confirmed MASH patients for validation purposes. As our primary cohort represents early disease stages, we did not characterize molecular features of MASH in that data set.

      In this cohort, the term 'NAS' was mentioned only in Section 1 to characterize the disease spectrum. Additionally, in Figures 3A and 6A, we illustrated the association between gene expression levels and NAS in two external cohorts. This was due to the absence of steatosis grades in the two datasets. NAS is an additive measure of multiple scores (steatosis, inflammation and ballooning), but does not account for fibrosis grades.

      Our study focuses on the molecular features of steatosis grades and fibrosis grades as the main histological processes, with all terminology aligned with this stated objective. This allows us to map the transcriptome and metabolome to pathologist-defined steatosis/fibrosis severity (i.e., 0,1,2,3) and identify genes/metabolites that are correlated with increasing steatosis/fibrosis score.

      Major comments:

      • Are the key conclusions convincing?

      The conclusions are generally consistent with findings from numerous previous studies, as many of the genes identified and their associations with disease states have been previously reported. However, I found it difficult to discern which specific disease stages the authors are referring to throughout the manuscript. Terms such as MASLD (Fig. 1F), steatosis (Fig. 4A), MASH, fibrosis (Fig. 6), and the composite NAS score (Fig. 1G) are used interchangeably, without clearly explaining whether or how the patient cohort was stratified to distinguish between isolated steatosis, MASH, and MASH with or without fibrosis. It is also unclear whether subgroups were propensity score-matched.

      As explained in our previous point, we believe that we did not carelessly use the terms interchangeably, but rather used them as they were available or pertinent to the comparisons in discussion. We have provided a comprehensive cohort description in the first section (Table 1, including all histological features and NAS scores), then focused specifically on steatosis and fibrosis in subsequent analyses. We identified distinct molecular processes underlying these two histological features and validated key fibrosis-related pathways.

      Regarding the comment of ‘propensity score-matched subgroups’, we would like to clarify that the only “sub”-group analysis performed in this paper is the transition from steatosis to steatosis with fibrosis. We have consistently used linear regression as the association analysis framework, without binarization of outcomes. We recall that this is a cross-sectional study with challenging recruitment situation from a bariatric surgery clinic that naturally represents the spectrum of MASLD in obesity. We acknowledge that the sampling can always be biased in such a study. However, given the invasiveness of liver resection, the study is also limited by the reality that not all patients would agree to the study, nor it is feasible to form a perfect subgroup meeting 1:1 ratio as in large-scale epidemiology studies based on plasma samples.

      In a related point, the authors mention that 76% of patients are non-fibrotic, introducing a marked imbalance between fibrotic (n=26) and non-fibrotic (n=83) samples. Given this disparity and potential inter-individual variability, it would be helpful to include observed fold changes or effect sizes to give readers a sense of the magnitude of the biological dysregulations being reported.

      As explained in our previous response, our study design examines associations between histological and molecular features rather than using a case-control approach. For effect size quantification, we report standardized linear regression coefficients, i.e. the change in gene expression Z-score per one-point increase in steatosis or fibrosis grade. We also provided fold changes in our comparative analysis of steatosis+fibrosis versus fibrosis-free steatosis. These effect sizes were fully documented in the Supplemental Tables.

      • Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?

      • The authors seem pretty enthusiastic about elafibranor, despite a failed phase 3 clinical trial. I would qualify elafibranor as a useful tool in preclinical model. We agree with the reviewer and indeed used elafibranor as a research tool for PPARa/d modulation rather than a clinically promising prospect. Discussion regarding elafibranor has been updated.

      • The authors should make clearly the pronounced sex bias in their study, which includes mostly women (and btw refer to sex and not gender in the manuscript). Thank you for this important point. We added "Notably, the cohort was predominantly female (76.1%)" to the 'Overview of the study' section in the manuscript. We also replaced all 'gender' with 'sex' throughout the manuscript. In this cohorts, individuals with previous gender reassignment were excluded (see Materials and Methods).

      • The "MASH" status of the spheroid model is overstated. As described in the text it is much closer to a lipotoxicity model (and even glucotoxicity as Glc concentration is 2g/L). The 3D cultures were established from cells isolated from patients with histologically confirmed MASH. Besides steatosis, we observe increased secretion of pro-inflammatory cytokines, activation of hepatic stellate cells and increased deposition of collagen, thus phenocopying the critical disease hallmarks. Additionally, unbiased omics profiling (transcriptomics, proteomics and lipidomics) reveals significant increases in collagen biosynthesis, inflammatory signaling and cholesterol biosynthesis in MASH patient-derived cultures compared to controls. These differences largely overlapped with the results from analyses of six MASH case-control cohort studies. All of these results have been published previously (PMID 39605182).

      This is confusing with panel D in which the authors establish a relationship between fibrotic patients (F2/F3 vs F0/S0, so I guess "no MASLD liver?) and this model. Is the relationship maintained for steatotic-only patients?

      In Figure 6D, we compared GTPase-related gene expression between patients with fibrosis grade 2/3 (n = 26) and those without fibrosis and steatosis (n = 24). Principal component regression resulted in a positive correlation (β = 9.97) between log2 fold changes in 3D spheroids and human fibrosis samples, indicating consistent directional changes in both systems.

      To answer the question from the reviewer, we compared the expression levels of GTPase-related genes in patients with steatosis but no fibrosis (n = 18) to those without fibrosis and steatosis (n = 24), we observed a negative correlation (β = -10.91). This indicates that GTPase-related gene changes in our 3D spheroids do not align with steatosis-related changes in humans.

      Therefore, under the assumption that fibrosis follows steatosis in the majority of the cases of MASLD progression, the result indicates that the alterations in GTPase-related gene expression in the 3D spheroid model specifically is reflective of fibrosis rather than steatosis.

      Figure R2. Comparison of expression level changes in GTPase-related genes between this human cohort and an independent 3D spheroid system: (A) positive correlation with fibrosis grade 2/3 patients versus controls (left), and (B) negative correlation with steatosis-only patients versus controls (right).

      • Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.

      I am not convinced that HSC and LX2 cells express significant levels of PPARα. However, did the authors check for this parameter in their LX2 cell line and assessed whether PPARα/b activation by elafibranor (and/or pemafibrate as it is PPARα selective) alter GTPase expression? Whether negative or positive, this could give a clue about possible intercellular crosstalk in the spheroid model.

      We thank this reviewer to point this out. In response, we analysed the mRNA expression of all PPARs in LX-2 cells with and without Elafibranor treatment, respectively (see Figure R3, same as Figure S8G in the Supplemental Material). We confirmed PPARs are expressed in LX-2 cells at the mRNA level (Figure R3A). Elafibranor does not affect their mRNA levels, which is consistent with previous reports that its primary mechanism is through binding and altering the activity of PPAR proteins, not gene expression (PMID 33326461 and PMID 37627519).

      *Figure R3. Gene signatures in LX-2 cells with and without Elafibranor treatment (n = 3). *

      In addition, we assessed mRNA levels of selected GTPase-related genes in LX-2 cells with and without Elafibranor treatment (Figure R3B). Although statistical power was limited, we observed a consistent trend toward reduced RHOU, DOCK2, and RAC1 expression with Elafibranor. this preliminary signal suggests that Elafibranor may counter the elevated GTPase levels seen in MASH patient spheroids, potentially via crosstalk among hepatic cell types, including HSCs.

      To further investigate intercellular crosstalk in GTPase regulation among hepatic cell types, we evaluated signature GTPase-related genes in LX-2 cells, spheroid co-cultures (hepatocytes, HSCs, Kupffer cells), and hepatocyte monocultures. As shown in Figure R4 (same as Figure S10 in the supplemental material), TGFB1 served as a positive control, exhibiting the most pronounced induction upon TGF-β1 treatment in hepatocytes. Despite varied alterations across the selected GTPase-related genes, TGF-β1 treatment produced a trend toward increased VAV1 and DOCK2 expression in co-culture, hepatocytes, and LX-2 cells, and this was reversed by the TGF-β inhibitor in co-culture and hepatocytes. Other GTPase genes, including RAC1, RAB32, and RHOU, displayed cell type–specific responses to TGF-β1. These observations suggest that the regulation of GTPases is mediated by multiple hepatic cell types, supporting the importance of intercellular crosstalk.

      Figure R4. Expression of GTPase-related genes in spheroid co-culture, hepatocyte monoculture, and LX-2 cells (n = 3). Controls for each gene and experiment were normalized to 1 to enable comparison across treatment groups.

      • Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.

      The experiment mentioned above is cheap (cell culture, RT-QPCR) and can be performed within a couple of weeks.

      • Are the data and the methods presented in such a way that they can be reproduced? Yes

      • Are the experiments adequately replicated and statistical analysis adequate? There is no indication of group size, number of replicates for in vitro experiments

      Thank you for this suggestion. We have added the sample sizes to all relevant sections: ‘n = 4’ in the figure legends for 3D spheroid experiments and ‘n = 8–10’ for the LX-2 experiments. This information has also been incorporated into the corresponding experimental descriptions in the Methods section.

      **Referees cross-commenting**

      I believe there is a general consensus on this potentially interesting contribution to the field, with three main points: (1) the need for a careful group-by-group comparison that accounts for potential confounders, (2) a more rigorous exploitation/characterization of the spheroid system, and (3) the need to benchmark the authors' findings against the available literature.

      Thank you for summarizing the main points. Our responses are as follows:

      • We adjusted for key confounders (sex, gender, age, BMI, diabetes) in all statistical analysis to minimize potential bias, mostly using linear regression (rather than group-to-group comparison). In response to Reviewer 3, comment 1, we also conducted additional statistical analyses exploring molecular changes in diabetic vs. non-diabetic individuals.
      • We provided detailed characterization of the spheroid model (response to Reviewer 3, comment 3) and we have done additional experiments in LX-2 cells.
      • We benchmarked our findings using external human cohorts, mouse models, and single cell spheroid systems:
      • Compared our liver transcriptomics data with two published liver RNA-seq datasets (EU cohort, PMID 31467298; VA cohort, PMID 33268509) as shown in Figure 1G. In Figures 3A and 6A, we also included sidebars indicating gene alterations in these cohorts, showing consistent trends. Moreover, we examined the expression alterations of GTPase-related genes in these datasets in response to Reviewer 3’s comment 2.
      • Assessed genes linked to fibrosis progression in hepatic stellate cells from a murine liver fibrosis model (PMID 34839349), confirming differential expression of GTPases and their regulators during fibrosis initiation (Figure S9A).
      • Examined GTPase-related genes in an independent single-cell human spheroid system (PMID 37962490). This enabled cell-type-specific information of GTPase regulation in response to TGF-β (Figure S9C). We also expanded the discussion section on both the consistencies and discrepancies between our findings and previously published studies.

      Reviewer #2 (Significance (Required)):

      The authors identified GTPases as players in the progression of MASLD. This is an interesting preliminary report warranting further molecular investigations (in which liver cell types, which GTPase pathway(s) are involved, which functions are controlled through this pathway...)

      • State what audience might be interested in and influenced by the reported findings.

      This paper will have an impact in the hepatology field

      • Define your field of expertise with a few keywords to help the authors contextualize your point of view. Indicate if there are any parts of the paper that you do not have sufficient expertise to evaluate.

      I have expertise in the analysis of "MASLD" human cohorts and in the molecular biology of chronic liver diseases.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      Metabolic dysfunction associated liver disease (MASLD) describes a spectrum of progressive liver pathologies linked to life style-associated metabolic alterations (such as increased body weight and elevated blood sugar levels), reaching from steatosis over steatohepatitis to fibrosis and finally end stage complications, such as liver failure and hepatocellular carcinoma. Treatment options for MASLD include diet adjustments, weight loss, and the receptor-β (THR-β) agonist resmetirom, but remain limited at this stage, motivating further studies to elucidate molecular disease mechanisms to identify novel therapeutic targets. In their present study, the authors aim to identify early molecular changes in MASLD linked to obesity. To this end, they study a cohort of 109 obese individuals with no or early-stage MASLD combining measurements from two anatomic sides: 1. bulk RNA-sequencing and metabolomics of liver biopsies, and 2. metabolomics from patient blood. Their major finding is that GTPase-related genes are transcriptionally altered in livers of individuals with steatosis with fibrosis compared to steatosis without fibrosis.

      Major comments:

      1. Confounders (such as (pre-)diabetes) The patient table shows significant differences in non-MASLD vs. MASLD individuals, with the latter suffering more often from diabetes or hypertriglyceridemia.

      Rather than just stating corrections, subgroup analyses should be performed (accompanied with designated statistical power analyses) to infer the degree to which these conditions contribute to the observations. I.e., major findings stating MASLD-associated changes should hold true in the subgroup of MASLD patients without diabetes/of female sex and so forth (testing for each of the significant differences between groups).

      Our original statistical analysis employed linear regression to examine associations between molecular variables (genes/metabolites) and histological progression (steatosis and fibrosis), with adjustment for potential confounders including diabetic status, age, sex, and BMI. We specifically focused on these two histological features to elucidate the disturbed molecular processes during their progression. Regression coefficients represent the expected change in abundance levels (in units of standard deviation of the corresponding molecule) per one-unit increase in histological grades.

      To address the reviewer's question, we conducted additional subgroup analyses to determine whether our major findings remain consistent in individuals with and without diabetes. We assessed linear associations between gene signatures and histological features separately in non-diabetic (n = 71) and diabetic individuals (n = 23). Statistical power was estimated by comparing the variance explained by the full regression model (y ~ x + a + b + c) against the reduced model (y ~ a + b + c), converting the incremental for x into Cohen's , and applying pwr.f2.test with the corresponding degrees of freedom and sample size at α = 0.05.

      For both steatosis and fibrosis, the results in the non-diabetic subgroup (n = 71) showed high consistency with findings in our original analysis (n = 94, adjusted for diabetes), indicating that our originally reported gene signatures, after correction for diabetic status, remain valid in non-diabetic individuals.

      In contrast, for diabetic individuals (n = 23), associations between genes and histological features did not closely replicate our original findings. Notably, we observed larger estimate effects for fibrosis-associated genes in diabetic individuals, suggesting a potential interaction between diabetes and fibrosis progression.

      Figure R5. Subgroup analysis of the association between gene expressions and steatosis grades

      Figure R6. Subgroup analysis of the association between gene expressions and fibrosis grades

      On the comment "degree to which these conditions contribute to the observations," our original analysis adjusted for diabetes status to identify molecular signatures independently associated with fibrosis without the confounding of diabetes status. Consequently, the reported gene signatures in the original analysis more closely reflect patterns in the non-diabetes group, as demonstrated in our subgroup analysis plots. We also comment that, unfortunately, we did not adjust for the interaction of fibrosis and diabetes in the original analysis.

      Furthermore, our additional analyses revealed a close relationship between diabetes and liver fibrosis. Consistent with Figure 1C, hepatic fibrosis is significantly correlated with insulin resistance parameters in clinical assays, including blood insulin levels and HOMA2-IR. To explore this association further, we compared gene expression profiles between diabetic MASLD patients (n = 21) and non-diabetic MASLD patients (n = 43). Although few genes reached significance after multiple testing correction, 166 genes showed differential expression (p 0.32) between these groups.

      We identified 55 genes as potential "diabetic markers" that both showed differential expression between diabetic and non-diabetic MASLD patients and were significantly associated with steatosis or fibrosis progression. These genes are predominantly downregulated metabolic genes (e.g., BAAT, G6PC1, SULT2A1, MAT1A), suggesting that diabetes may exacerbate metabolic suppression as fibrosis advances. Given the high prevalence of diabetes in the MASLD population, our analysis supports the hypothesis that diabetes worsens MASLD outcomes, likely through impaired metabolic capability during fibrosis progression.

      Regarding the comment on the "subgroup of female sex," our original analysis also adjusted for sex as a potential confounder. Since our cohort is predominantly female (>76%), the majority of our findings likely holds true in the female sub-population, similar to what we observed in our diabetes subgroup analysis.

      External validation

      Additionally, to back up the major GTPase signature findings, it would be desirable to analyze an external dataset of (pre)diabetes patients (other biased groups) for alternations in these genes. It would be important to know if this signature also shows in non-MASLD diabetic patients vs. healthy patients or is a feature specific to MASLD. Also, could the matched metabolic data be used to validate metabolite alterations that would be expected under GTPase-associated protein dysregulation?

      We appreciate the comments regarding the validation GTPase as a unique MASLD signature by external datasets. As shown in our previous analysis, after adjusting for diabetes status, the gene signatures remained largely preserved in the non-diabetes subgroup. Before we respond further, we also preface that publicly available liver tissue data, with appropriate and full-scale clinical metadata and sufficient sample sizes, are extremely rare. To the best of our knowledge, the public data sets we brought into our paper were the most prominent data of reliable quality.

      In the paper, we benchmarked our RNAseq dataset against two datasets: the VA cohort and EU cohort (Figure 1). Our cohort focused primarily on early MASLD patients with obesity, which aligns more closely with the disease spectrum represented in the VA cohort (Figure 1G). Notably, in the published paper for the VA cohort, Hoang et al. highlighted Rho GTPase signaling as one of the top pathways in the fibrosis PPI network (Figure 1B from publication PMID 31467298).

      We interrogated GTPase-related genes in both the VA and EU cohorts. As shown in Figure R7 (below), GTPase-related genes demonstrated a strong association with fibrosis grades in the VA cohort, as expected. The EU cohort comprises more advanced MASLD cases with higher fibrosis grades, and our re-analysis in this cohort specifically focused on MASH patients (as designated by the authors). In those MASH patients, GTPase-related genes did not show significant positive associations with fibrosis progression. This finding is consistent with our hypothesis that GTPase regulation is triggered more prominent during the early progression of fibrosis rather than at later stages.

      Unfortunately, diabetes status was not available in the GEO repository for the VA cohort. Available liver tissue sequencing datasets with balanced representation of diabetic and nondiabetic patients are rare, especially those derived from obese individuals and reflecting the early-to-middle stages of MASLD. In our own cohort, for instance, only two diabetic patients without MASLD were recruited (Table 1). While we cannot rule out a role for insulin resistance in GTPase regulation, we will plan future experiments using mouse models to examine GTPase-mediated fibrosis under diabetic and nondiabetic conditions.

      Regarding the comment ‘validate metabolite alterations that would be expected under GTPase-associated protein dysregulation,’ we note that GTPases are primarily involved in cytoskeletal organization, vesicle trafficking, and other cellular processes, with few well-established links to specific metabolite signatures. Nevertheless, in our partial correlation network integrating hepatic genes and metabolites, we observed co-regulated metabolites associated with GTPase-related genes (Figure R8). These included palmitoleoyl ethanolamide (N-acylethanolamine, an anti-inflammatory metabolite and PPARα ligand), phenylacetic acid (a phenylalanine metabolite), biotin (a coenzyme), arginine, lysine, melatonin (a tryptophan metabolite), and several lipid species such as PC 32:0 and CAR 20:1. While causal relationships cannot be inferred from this dataset, our integrative network highlights potential connections related to the trafficking of these metabolites that warrant further investigation.

      Figure R7. Associations between GTPase-related genes to fibrosis in this study and two external cohorts. Asterisks denote significant associations with q value Figure R8. Integrative subnetwork of GTPase-related genes. Blue squares represent GTPase-related genes, red circles indicate metabolites connected to these genes, and the purple diamond denotes fibrosis, which is connected to RHOU.*

      3D liver spheroid MASH model, Fig. 6D/E

      This 3D experiment is technically not an external validation of GTPase-related genes being involved in MASLD, since patient-derived cells may only retain changes that have happened in vivo. To demonstrate that the GTPase expression signature is specifically invoked by fibrosis the LX-2 set up is more convincing, however, the up-regulation of the GTPase-related genes upon fibrosis induction with TGF-beta, in concordance with the patient data, needs to be shown first (qPCR or RNA-seq).

      We agree with the reviewer that experiments in LX-2 (HSC) cells are important and as we have described under ‘Reviewer #2’ we have done this (Figure R3 and Figure R4). Because HSCs only comprise a minor cell population of liver cells, the signals observed in patient bulk RNA data are likely driven primarily by hepatocytes. Nevertheless, we have highlighted the importance of hepatic cell crosstalk in Figure R4 and in our response to Reviewer #2. Additionally, in Supplementary Figure S9B, we identify the potential cell types of origin for the GTPase signals (predominantly hepatocytes and HSCs) using a single-cell dataset from an independent study (PMID: 37962490).

      Additionally, the description of the 3D model is too uncritical. The maintenance of functional human PHHs in 3D has only become available this year (PMID: 40240606) marking a break-through in the field. Since the authors did not use this system, I would strongly assume their findings are largely attributable to the mesenchymal cells in the 3D culture, and these limitations need to be stated.

      We humbly disagree with the reviewer on the 3D liver spheroids. The paper that the reviewer is referencing is related to the proliferation of hepatocytes in organoids, not – at least not directly – their functional maintenance. Here, we use a spheroid model of mature fully differentiated cells, which is conceptually different from the organoid approach. Maintenance of such functional human hepatocytes for multiple weeks in culture has been possible for close to a decade (PMID 27143246). Moreover, particularly for the modeling of chronic liver disease, such as MASH, it is important to use directly patient-derived cells as short induction cycles (typically 1-2 weeks) of disease phenotypes in organoid models do not faithfully reproduce the molecular signatures that stem from chronic exposures in vivo.

      The 3D liver spheroid model we used here is derived from livers from patients with a histologically confirmed diagnosis of MASH. The isolated cells are fully mature and thus do not require in vitro differentiation. There are no MSCs in the 3D cultures; rather the spheroids contain hepatocytes, stellate cells, Kupffer cells as well as various other immune cell types present in the liver at the time of isolation (T cells, B cells, NK cells). Furthermore, the model is extensively characterized at the transcriptomic, proteomic and lipidomic level (PMID 39605182).

      Novelty / references

      Similar studies that also combined liver and blood lipidomics/metabolomics in obese individuals with and without MASLD (e.g. PMID 39731853, 39653777) should be cited. Additionally, it would benefit the quality of the discussion to state how findings in this study add new insights over previous studies, if their findings/insights differ, and if so, why.

      Thank you for the suggestion. We added the two papers into the discussion section. Specifically, we discussed the consistent findings (such as AKR1B10 in PMID 37037945 and mitochondrial dysfunction in PMID 39731853) and discrepancies (such as limited plasma metabolomic changes and circulating sphingolipid alterations in multiple human and mouse models) in comparison with previously published omics studies in MASLD patients. Also, we thoroughly discussed our findings (e.g., lipid dysregulation, dysregulated tryptophan metabolism, GTPase regulation) and potential mechanisms with extensive literature supports from of human, animal, and cell studies.

      Minor comments:

      1. The quality of Supplementary Figures (e.g. S7) makes is impossible to read the labels Thank you for this feedback. The resolution of the figures was impaired in the initial upload. We will provide all supplementary figures with high resolution in our revised submission and ensure all labels are clearly readable.

      For Figure S7C, we presented the correlation matrix of more than 200 GTPase-related genes along with the TGF-β genes TGFB1 and TGFB3. This illustrates the overall co-expression patterns of GTPase-related genes rather than displaying individual gene labels, with arrows now included to highlight TGFB1 and TGFB3.

      Reviewer #3 (Significance (Required)):

      The authors provide an overall sound study on the hepatic transcriptomic and metabolomic signatures in an Australian cohort of 109 obese non-to-early stage MASLD patients. They perform thorough analyses of metabolome and transcriptome in liver biopsies and metabolome in blood, using standard technologies such as RNA sequencing and mass spectrometry. Their key finding is a GTPase-associated gene signature related to fibrosis onset. Limitations of the study include potential cohort confounders (raising the need for expanded control experiments), limited discussion of similar studies, and limits in cell-type resolution, the latter of which is related to the molecular read out, and has in parts been started to be addressed by in vitro experiments in an immortalized HSC lines. Taken together, given additional control analyses will be performed, the results could be of interest to an expert community in the field of molecular hepatology and, while still descriptive, hold the potential to prompt mechanistic follow-up studies.

      We thank this reviewer for a balanced, positive, and constructive evaluation of our manuscript.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #3

      Evidence, reproducibility and clarity

      Summary:

      Metabolic dysfunction associated liver disease (MASLD) describes a spectrum of progressive liver pathologies linked to life style-associated metabolic alterations (such as increased body weight and elevated blood sugar levels), reaching from steatosis over steatohepatitis to fibrosis and finally end stage complications, such as liver failure and hepatocellular carcinoma. Treatment options for MASLD include diet adjustments, weight loss, and the receptor-β (THR-β) agonist resmetirom, but remain limited at this stage, motivating further studies to elucidate molecular disease mechanisms to identify novel therapeutic targets.

      In their present study, the authors aim to identify early molecular changes in MASLD linked to obesity. To this end, they study a cohort of 109 obese individuals with no or early-stage MASLD combining measurements from two anatomic sides: 1. bulk RNA-sequencing and metabolomics of liver biopsies, and 2. metabolomics from patient blood. Their major finding is that GTPase-related genes are transcriptionally altered in livers of individuals with steatosis with fibrosis compared to steatosis without fibrosis.

      Major comments:

      1. Confounders (such as (pre-)diabetes) The patient table shows significant differences in non-MASLD vs. MASLD individuals, with the latter suffering more often from diabetes or hypertriglyceridemia. Rather than just stating corrections, subgroup analyses should be performed (accompanied with designated statistical power analyses) to infer the degree to which these conditions contribute to the observations. I.e., major findings stating MASLD-associated changes should hold true in the subgroup of MASLD patients without diabetes/of female sex and so forth (testing for each of the significant differences between groups).
      2. External validation Additionally, to back up the major GTPase signature findings, it would be desirable to analyze an external dataset of (pre)diabetes patients (other biased groups) for alternations in these genes. It would be important to know if this signature also shows in non-MASLD diabetic patients vs. healthy patients or is a feature specific to MASLD. Also, could the matched metabolic data be used to validate metabolite alterations that would be expected under GTPase-associated protein dysregulation?
      3. 3D liver spheroid MASH model, Fig. 6D/E This 3D experiment is technically not an external validation of GTPase-related genes being involved in MASLD, since patient-derived cells may only retain changes that have happened in vivo. To demonstrate that the GTPase expression signature is specifically invoked by fibrosis the LX-2 set up is more convincing, however, the up-regulation of the GTPase-related genes upon fibrosis induction with TGF-beta, in concordance with the patient data, needs to be shown first (qPCR or RNA-seq). Additionally, the description of the 3D model is too uncritical. The maintenance of functional human PHHs in 3D has only become available this year (PMID: 40240606) marking a break-through in the field. Since the authors did not use this system, I would strongly assume their findings are largely attributable to the mesenchymal cells in the 3D culture, and these limitations need to be stated.
      4. Novelty / references Similar studies that also combined liver and blood lipidomics/metabolomics in obese individuals with and without MASLD (e.g. PMID 39731853, 39653777) should becited. Additionally, it would benefit the quality of the discussion to state how findings in this study add new insights over previous studies, if their findings/insights differ, and if so, why.

      Minor comments:

      1. The quality of Supplementary Figures (e.g. S7) makes is impossible to read the labels

      Significance

      The authors provide an overall sound study on the hepatic transcriptomic and metabolomic signatures in an Australian cohort of 109 obese non-to-early stage MASLD patients. They perform thorough analyses of metabolome and transcriptome in liver biopsies and metabolome in blood, using standard technologies such as RNA-sequencing and mass spectrometry. Their key finding is a GTPase-associated gene signature related to fibrosis onset. Limitations of the study include potential cohort confounders (raising the need for expanded control experiments), limited discussion of similar studies, and limits in cell-type resolution, the latter of which is related to the molecular read out, and has in parts been started to be addressed by in vitro experiments in an immortalized HSC lines. Taken together, given additional control analyses will be performed, the results could be of interest to an expert community in the field of molecular hepatology and, while still descriptive, hold the potential to prompt mechanistic follow-up studies.

    3. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      Metabolic dysfunction-associated steatotic liver disease (MASLD) ranges from simple steatosis, steatohepatitis, fibrosis/cirrhosis, and hepatocellular carcinoma. In the current study, the authors aimed to determine the early molecular signatures differentiating patients with MASLD associated fibrosis from those patients with early MASLD but no symptoms. The authors recruited 109 obese individuals before bariatric surgery. They separated the cohorts as no MASLD (without histological abnormalities) and MASLD. The liver samples were then subjected to transcriptomic and metabolomic analysis. The serum samples were subjected to metabolomic analysis. The authors identified dysregulated lipid metabolism, including glyceride lipids, in the liver samples of MASLD patients compared to the no MASLD ones. Circulating metabolomic changes in lipid profiles slightly correlated with MASLD, possibly due to the no MASLD samples derived from obese patients. Several genes involved in lipid droplet formation were also found elevated in MASLD patients. Besides, elevated levels of amino acids, which are possibly related to collagen synthesis, were observed in MASLD patients. Several antioxidant metabolites were increased in MASLD patients. Furthermore, dysregulated genes involved in mitochondrial function and autophagy were identified in MASLD patients, likely linking oxidative stress to MASLD progression. The authors then determined the representative gene signatures in the development of fibrosis by comparing this cohort with the other two published cohorts. Top enriched pathways in fibrotic patients included GTPas signaling and innate immune responses, suggesting the involvement of GTPas in MASLD progression to fibrosis. The authors then challenged human patient derived 3D spheroid system with a dual PPARa/d agonist and found that this treatment restored the expression levels of GTPase-related genes in MASLD 3D spheroids. In conclusion, the authors suggested the involvement of upregulated GTPase-related genes during fibrosis initiation. Overall, the current study might provide some resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. However, several concerns should be carefully addressed.

      1. A recent study, via proteomic and transcriptomic analysis, revealed that four proteins (ADAMTSL2, AKR1B10, CFHR4 and TREM2) could be used to identify MASLD patients at risk of steatohepatitis (PMID: 37037945). It is not clear why the authors did not include this study in their comparison.
      2. The authors recruited 109 patients but only performed transcriptomic and metabolomic analysis in 94 liver samples. Why did the authors exclude other samples?
      3. The authors mentioned clinical data in Table 1 but did not present the table in this manuscript.
      4. The generated metabolomic data could be a very useful resource to the MASLD community. However, it is very confusing how the data was generated in those supplemental tables. There is no clear labeling of human clinical information in those tables. Also, what do those values mean in columns 47-154? This reviewer assumed that they are the raw data of metabolomic analysis in plasma samples. However, without clear clinical information in these patients, it is impossible that any scientist can use the data to reproduce the authors' findings.
      5. In Fig. 5B, the authors excluded the steatosis and fibrosis overlapped genes. Steatosis and fibrosis specific genes could simply reflect the outcomes rather than causes. In this case, the obtained results might not identify the gene signatures related to fibrosis initiation.
      6. In Fig. 6D, the authors used 3D liver spheroid to validate their findings. However, there is no images showing the 3D liver spheroid formation before and after PPARa/d agonist treatment. It is not clear whether the 3D liver spheroid was successfully established.
      7. The authors suggested that targeting LX-2 cells with Rac1 and Cdc42 inhibitors could reduce collagen production. Did the authors observe these two genes upregulated in mRNA and protein expression levels in their cohort when compared MASLD patients with and without fibrosis?
      8. Did the authors observe that the expression levels of Rac1 and Cdc42 are correlated with fibrosis progression in MASLD patients?
      9. Other studies have revealed several metabolite changes related to MASLD progression (PMID: 35434590, PMID: 22364559). However, the authors did not discuss the discrepancies between their findings with the previous studies.

      Significance

      Overall, the current study might provide some new resources regarding transcriptomic and metabolomic data derived from obese patients with and without MASLD. The MASLD research community will be interested in the resource data.

    1. Reviewer #3 (Public review):

      Summary:

      In this manuscript, the authors describe the results of a high-throughput screen for small-molecule activators of GCN2. Ultimately, they find 3 promising compounds. One of these three, compound 20 (C20), is of the most interest both for its potency and specificity. The major new finding is that this molecule appears to activate GCN2 independent of GCN1, which suggests that it works by a potentially novel mechanism. Biochemical analysis suggests that each binds in the ATP-binding pocket of GCN2, and that at least in vitro, C20 is a potent agonist. Structural modeling provides insight into how the three compounds might dock in the pocket and generates testable hypotheses as to why C20 perhaps acts through a different mechanism than other molecules.

      Strengths:

      Of the 3 compounds identified by the authors, C20 is the most interesting, not just for its intriguing mechanistic distinction as being GCN1-independent (shown genetically in two distinct cell lines, CHO and 293T in Figure 4, and in contrast to other GCN2 activators) but also for its potency. In in-cellulo assays, compound 21 appears as more of an ISR enhancer than an activator per se, and although compound 18 and compound 21 lead to upregulation of the ISR targets (Figure 2), that degree of upregulation is probably not significantly different from that induced by those compounds in Gcn2-/- cells. For C20, the effect appears stronger (although it is unclear whether the authors performed statistical analysis comparing the two genotypes in Figure 2D). In Figure 3, only C20 activates the ISR robustly in both CHO and 293T. Ultimately, C20 might be a tool for providing mechanistic insight into the details of GCN2 activation and regulation, and could be exploited therapeutically.

      Weaknesses:

      There are some limitations to the existing work. As the authors acknowledge, they do not use any of the compounds in animals; their in vivo efficacy, toxicity, and pharmacokinetics are unknown. But even in the context of the in cellulo experiments, it is puzzling that none of the three compounds, including C20, has any effects in HeLa cells when Neratinib does. It's beyond the scope of this paper to address definitively why that is, but it would at least be reassuring to know that C20 activates the ISR in a wider range of cells, including ideally some primary, non-immortalized cells. In addition, the ISR is a complex, feedback-regulated response whose output varies depending on the time point examined. The in cellulo analysis in this paper is limited to reporter assays at 18 hours and qRT-PCR assays at 4 and 8 hours. A more extensive examination of the behavior of the relevant ISR mRNAs and proteins (eIF2, ATF4, CHOP, cell viability, etc.) for C20 across a more extensive time course would give the reader a clearer sense of how this molecule affects ISR output. I also find it a bit strange that the authors describe C20 as "demonstrat(ing) weak inhibition of ... PKR"-the measured IC50 is ~4 μM, which is right around its EC50 for GCN2 activation. This raises the confounding possibility that C20 would simultaneously activate GCN2 while inhibiting PKR. While perhaps inhibition of PKR is not relevant under the conditions when GCN2 would be activated either experimentally or therapeutically, examining in cells the effects of C20 on GCN2 and PKR across a dose range would shed light on whether this cross-reactivity is likely to be of concern.

    1. Reviewer #1 (Public review):

      Summary:

      The authors demonstrate the stereoselective role of D-serine in 1C metabolism, showing that D-serine competes with L-serine and inhibits mitochondrial L-serine transport. They observe expression of 1C metabolites in their metabolomics approach in primary cortical neurons treated with L-serine, D-serine, and a mixture of both. Their conclusions are based on the reduction in levels of glycine, polyamines, and their intermediates and formate. Single-cell RNA sequencing of N2a cells showed that cells treated with D-serine enhanced expression of genes associated with mitochondrial functions, such as respiratory chain complex assembly, and mitochondrial functions, with downregulation of genes related to amino acid transport, cellular growth, and neuron projection extension. Their work demonstrates that D-serine inhibits tumor cell proliferation and induces apoptosis in neural progenitor cells, highlighting the importance of D-serine in neurodevelopment.

      Strengths:

      D-amino acids are a marvel of nature. It is fascinating that nature decided to make two versions of the same molecule, in this case, an amino acid. While the L-stereoisomer plays well-known roles in biology, the D-stereoisomer seems to function in obscurity. Research into these novel signaling molecules is gathering momentum, with newer stereoisomers being discovered. D-serine has been the most well-studied among the different stereoisomers, and we still continue to learn about this novel neurotransmitter. The roles of these molecules in the context of metabolism is not well studied. The authors aim to elucidate the metabolic role of D-serine in the context of neuronal maturation with implications for 1C metabolism and in cell proliferation. The metabolic role of these molecules is just beginning to be uncovered, especially in the context of mammalian biology. This is the strength of the manuscript. The authors have done important work in prior publications elucidating the role of D-amino acids. The advancement of the field of D-amino acids in mammalian biology is significant, as not much is known. The presentation of RNA seq data is a valuable resource to the community, however, with caveats as mentioned below.

      Weaknesses:

      The following are some of the issues that come out in a critical reading of the manuscript. Addressing these would only strengthen and clarify the work.

      (1) Kinetic assessment of D-serine versus L-serine: While the authors mention that D-serine is not a good substrate for SHMT2 compared to L-serine, the kinetic data are presented for only D-serine. In a substrate comparison with an enzyme, data must be presented for L-serine as well to make the conclusion about substrate specificity and affinity. Since the authors talk about one versus another substrate, there needs to be a kinetic comparison of both with Km (affinity). (Ref Figure 2 panel).

      (2) Molecular Dynamics simulations, while a good first step in modeling interactions at the active site, rely on force fields. These force fields are approximations and do not represent all interactions occurring in the natural world. Setting up the initial conditions in the simulations can impact the final results in non-equilibrium scenarios. The basic question here is this: Is the simulated trajectory long enough so that the system reaches thermodynamic equilibrium and the measured properties converge? Prior studies have shown mixed results with the conclusion that properties of biological systems tend to converge in multi-second trajectories (not nanosecond scales as reported by the authors) and transition rates to low probability conformations require more time. (Ref Figure 2C).

      (3) The authors use N2a cell line to demonstrate D-serine burden on primary cortical neurons. N2a is an immortalized cell line, and its properties are very different from primary neurons. The authors need to mention a rationale for the use of an immortalized cell line versus primary neurons. The transcriptomic profile of an immortalized cell line is different compared to a primary cell. Hence, the response to D-serine may vary between the two different cell types.

      (4) In Figure 4D, the authors mention that D-serine activates the cleavage of caspase 3. Figure 4D shows only cleaved caspase 3 as a single band. They need to show the full blot that contains the cleaved fragments along with the major caspase 3 band.

      (5) In Figure panel 4, the authors use neural progenitor cells (NPCs). They need to demonstrate that the population they are working with is NPCs and not primary neurons. There must be a figure panel staining for NPC markers like SOX2 and PAX6. Also, Figure S5 needs to be properly labeled. It is confusing from the legend what panels B-E refer to? Also, scale bars are not indicated.

      (6) In Supplementary Figure panel 7F, the authors mention phosphatidyl L-serine and phosphatidyl D-serine. A chromatogram of the two species would clarify their presence as they used 2D-HPLC. On an MS platform, these 2 species are not distinguishable. Including a chromatogram of the 2 species would be helpful to the readers.

      (7) The authors mention about enantiomeric shift of serine metabolism during neural development, which appears to be a discussion of prior published data from Hubbard et al, 2013, Burk et al, 2020, and Bella et a,l 2021 in Supplementary Figure panels 8 A-E. This should not be presented as a figure panel, as it gives the false impression that the authors have performed the experiment, which is clearly not the case. However, its discussion can well serve as part of the manuscript in the discussion section.

      (8) The entire presentation of the section on enantiomeric shift of serine metabolism during neural development (lines 274-312) is a discussion and should be part of the discussion section and not in the results section. This is misleading.

      (9) The discussion section is not well written. There is no mention of recent work related to D-serine that has a direct bearing on its metabolic properties. In the discussion section, paragraph 1, the authors mention that their work demonstrates the selective synthesis of D-serine in mature neurons as opposed to neural progenitor cells. This concept has been referred to in prior publications:

      (a) Spatiotemporal relationships among D-serine, serine racemase, and D-amino acid oxidase during mouse postnatal development. PMID:14531937.

      (b) D-cysteine is an endogenous regulator of neural progenitor cell dynamics in the mammalian brain. PMID:34556581.

      (10) In the abstract, in lines 101 and 102, the authors mention "how D-serine contributes to cellular metabolism beyond neurotransmission remains largely unknown". In 2023, a paper in Stem Cell Reports by Roychaudhuri et al (PMID:37352848) showed that D and L-serine availability impacts lipid metabolism in the subventricular zone in mice, affecting proliferative properties of stem-cell derived neurons using a comprehensive lipidomics approach. There is no mention of this work even in the discussion section, as it bears directly on L and D-serine availability in neurons, which the authors are investigating. In the discussion section in lines 410-411, the authors mention the role of D-serine in neurogenesis, but surprisingly don't refer to the above reference. The role of D-serine in neurogenesis has been demonstrated in the Sultan et al (lines 855-857) and Roychaudhuri et al references.

      (11) Both D-serine and the structurally similar stereoisomer D-cysteine (sulfur versus oxygen atom) have a bearing on 1C metabolism and the folate cycle. With reference to the folate cycle, Roychaudhuri et al in 2024 (PMID:39368613) have shown in rescue experiments in mice that supplementing a higher methionine diet provides folate cycle precursors to rescue the high insulin phenotype in SR-deficient mice. Since 1C metabolism is being discussed in this manuscript, the authors seem to overlook prior work in the field and not include it in their discussion, even when it is the same enzyme (SR) that synthesizes both serine and cysteine. Since the field of D-amino acid research is in its infancy, the authors must make it a point to include prior work related to D-serine at least, and not claim that it is not known. The known D-stereoisomers are not many, hence any progress in the area must include at least a discussion of the other structurally related stereoisomers.

      (12) Racemases (serine and aspartate) in general are promiscuous enzymes and known to synthesize other stereoisomers in addition to D-serine, D-cysteine, and D-aspartate. A few controls, like D-aspartate, D-cysteine, or even D-alanine must be included in their study to demonstrate the specific actions of D-serine, especially in the N2a cell treatment experiments. Cysteine and Serine are almost identical in structure (sulfur versus oxygen atom), and both are synthesized by serine racemase (published). Cysteine has also been very recently shown to inhibit tumor growth and neural progenitor cell proliferation. (PMIDs: 40797101 and 34556581). How the authors' work relates to the existing findings must be discussed, and this would put things in perspective for the reader.

    2. Reviewer #2 (Public review):

      Summary:

      This study by Suzuki et al. reports an interesting stereo-selective role of D-serine in regulating one-carbon metabolism during neurodevelopment to adapt the functional transition, probably through the competition with mitochondrial transport of L-serine. The authors provide a multi-layered set of evidence, including metabolomics, enzyme assays, mitochondrial transport competition, and functional assays in immature/neural progenitor cells, to build up a conceptual integration of D-serine as both a neurotransmitter and a metabolic regulator in the central neural system, which raises a broad potential interest to the neuroscience and metabolism communities.

      Strengths:

      This work provides a conceptual advance that D-serine not only serves as a traditional neurotransmitter in the central neural system but also critically contributes to metabolic regulation of neural cells. The authors performed solid metabolomic assays to validate the suppressive effect of D-serine on the one-carbon metabolic pathway, providing some evidence that D-serine competitively inhibits mitochondrial serine transport, but not directly impairs SHMT2 enzymatic activity. All these data indicate a critical role of D-serine synthesis during neural maturation and suggest a potential translational strategy for targeting serine metabolism in neural tumors.

      Weaknesses:

      (1) The detailed mechanism by which D-serine competes with L-serine for its mitochondrial transport is not investigated. For example, although the authors made some discussion, they did not provide direct genetic or biochemical evidence linking these effects to the specific transporters, such as SFXN1.

      (2) Unlike tumor cells, where SHMT2 usually plays a predominant role in catalyzing serine/THF-derived one-carbon metabolism, normal cells may employ both SHMT1 and SHMT2 to do the work. Even under certain conditions that SHMT2-mediated one-carbon metabolism is suppressed, the activity of SHMT1 could be elevated for compensation. Thus, it is important to investigate whether D-serine affects SHMT1 activity or changes the balance between SHMT1- and SHMT2-mediated one-carbon metabolism. To this aim, the authors are strongly encouraged to perform a metabolic flux assay (MFA) by using 13C-labeled L-serine in the model cells in the presence and absence of D-serine.

      (3) A defect in serine-derived one-carbon metabolism may cause multiple cellular stress responses. It is valuable to detect whether cellular NADPH/NADH, GSH, or ROS is altered before and after D-serine treatment.

      (4) The physiological relevance between D-serine and neural cell maturation/death should be further tested and discussed, since the dosage of D-serine used in the in vitro assay is much higher than that in physiological conditions.

    1. Reviewer #2 (Public review):

      Summary:

      The authors aim to address whether nuclear pore complex components localize and function at PD in plant cells to mediate cell-to-cell communication.

      Strengths:

      (1) Novelty and Significance:<br /> The core hypothesis, drawing parallels between PD and NPC transport, is highly original and addresses a critical gap in understanding plant intercellular communication. The idea that phase-separated domains formed by FG-NUPs could act as diffusion barriers at PD offers a plausible and sophisticated explanation for their complex transport properties, including size exclusion and facilitated translocation. This could fundamentally change how we view PD function.

      (2) Comprehensive Evidence:<br /> The study employs a rigorous and diverse set of experimental approaches, including a comprehensive bioinformatic analysis of both moss and Arabidopsis NUPs in available PD proteomic datasets, extensive imaging analysis of Nup localization in vivo, and functional transport assays using a loss-of-function nup mutant (cpr5). The transport assay is particularly important to provide functional evidence linking CPR5 to PD-mediated transport. The finding that callose levels were not significantly different in cpr5 mutants under these conditions is helpful and supports a distinct, callose-independent mechanism of transport regulation.

      (3) Objectivity:<br /> The authors are forthright in discussing the limitations and potential artifacts of their own data, clearly distinguishing between observations and definitive conclusions.

      Weaknesses:

      While the claims are generally justified as hypotheses or consistent observations, the authors themselves extensively detail the caveats, which are worth reiterating for clarity:

      (1) Potential Overexpression Artifacts in Localization:<br /> Although efforts were made to control expression levels, the authors acknowledge that transient overexpression could still lead to NUP accumulation at PD, either as a physiologically relevant accumulation under excess conditions or due to mis-targeting, or even as storage depots. The resolution of confocal microscopy also does not allow for a definitive conclusion on the nature of the location.

      (2) Proteomics Purity:<br /> The authors note that the presence of NUPs in PD fractions/proteomics cannot definitively rule out contamination, as PD cannot currently be purified to absolute homogeneity and is often contaminated with other organelles, including the nucleus.

      (3) CPR5 Mutant Interpretation:<br /> While cpr5 mutants exhibited reduced macromolecular transport, the authors state that they cannot exclude that the reduced transport is due to secondary effects in the cpr5 mutants, which show rather severe phenotypic defects. This is an important distinction, as CPR5 has known roles in defense responses and hormone signaling that could indirectly influence PD integrity, independent of callose deposition. The lack of effect on small molecule transport is a good control, but the broader pleiotropic effects of cpr5 mutants remain a consideration.

      (4) Conceptual Distinction between NPC and PD:<br /> The authors correctly point out that while similarities exist, the physical assembly of NUPs at PD must differ from that at the NPC due to the presence of the desmotubule and smaller cytoplasmic sleeve width at PD. Moreover, nucleocytoplasmic transport depends on karyopherin proteins that interact with the NPC central channel to complete the transport. Yet the role of karyopherins in this case is not clear. Therefore, the proposed "PD pore complex" may bear some NPC features, but not be identical.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review): 

      Summary: 

      The manuscript by Senn and colleagues presents a comprehensive study on the developing synthetic gene circuits targeting mutant RAS-expressing cells. This study aims to exploit these RAS-targeting circuits as cancer cell classifiers, enabling the selective expression of an output protein in correlation with RAS activity. The system is based on the bacterial two-component system NarX/NarL. A RAS-binding domain, the RBDCRD domain of the RAS effector protein CRAF, is fused to the histidine kinase domain, which carries an inactivating amino acid exchange either in its ATP-binding site (N509A) or in its phosphorylation site (H399Q). Dimerization or nanocluster formation of RAS-GTP reconstitutes an active histidine kinase sensor dimer that phosphorylates the response regulator NarL. The phosphorylated DNA-binding protein NarL, fused to the transcription activator domain VP48, binds its responsive element and induces the expression of the output protein. In comparison to mutated RAS, the effect of the RAS activator SOS-1 and the RAS inhibitor NF1 on the sensing ability as well as the tunability of the RAS sensor were examined. A RAS targeting circuit with an AND gate was designed by expressing the RAS sensor proteins under the control of defined MAPK response elements, resulting in a large increase in the dynamic range between mutant and wild-type RAS. Finally, the RAS targeting circuits were evaluated in detail in a set of twelve cancer cell lines expressing endogenous levels of mutant or wild-type RAS or oncogenes affecting RAS signaling upstream or downstream. 

      Strengths: 

      This proof-of-concept study convincingly demonstrates the potential of synthetic gene circuits to target oncogenic RAS in tumor cell lines and to function, at least in part, as an RAS mutant cell classifier. 

      Weaknesses: 

      The use of an appropriate "therapeutic gene" might revert the oncogenic properties of RAS mutant cell lines. However, a therapeutic strategy based on this four-plasmid-based system might be difficult to implement in RAS-driven solid cancers. 

      Thank you for the insightful comments. We agree that the delivery of a four-plasmid system represents a major challenge for translating RAS-targeting circuits into therapeutic applications. Reducing the number of plasmids –ideally consolidating all components onto a single vector– will be critical for clinical implementation.

      Viral delivery is generally the most efficient strategy for DNA-based therapies, but viral vectors have limited packaging capacities, which differ by virus type[1]. The RAS_sensor_F.L.T. circuit under the EF1α promoter requires ~7.7 kb for the sensing components alone, excluding the output gene. This exceeds the packaging limit of adeno-associated virus (AAV) and is at the upper boundary for lentiviral vectors but could potentially be accommodated by larger vectors such as γ-retroviruses, poxviruses, or herpesviruses¹. Co-transduction with dual AAVs [2] or ongoing engineering to expand packaging capacity [3] may also offer future solutions. An additional route to reduce construct size could be alternative splicing, especially given redundancy between the two NarX fusion proteins[4]. 

      An advantage of our current architecture is that synthetic response elements replace constitutive promoters, reducing construct size. For example, the MAPK-driven PY2_NarX&NarL circuits range between 4.9 and 5.2 kb depending on the transactivation domain, bringing them within AAV packaging limits for the sensor module[5], though co-delivery of the output gene would still be necessary. For lentiviruses, this is within the packaging capacity of 8 kb<sup>1</sup> and would allow for inclusion of ~3 kb output genes.

      Still, assembling multiple modules onto a single vector introduces new challenges, including possible crosstalk or interference between neighboring promoters [6]. For example, placing the output gene too close to MAPK response elements may trigger unwanted MAPKdependent expression, potentially bypassing the intended AND-gate logic. Moreover, expressing three genes under separate response elements may shift expression ratios and reduce circuit functionality. Nonetheless, the absence of constitutive promoters and the RAS-dependence of MAPK response elements could provide partial robustness, since even unintended activation would still reflect RAS signaling to some extent. Further, our data (Fig. 1d) show that some deviation in component levels can be tolerated, provided all parts are sufficiently expressed. Nonetheless, assembling the circuit on a single vector will require careful design and rigorous validation to ensure optimal performance. 

      While addressing this is beyond the scope of the current study, we agree that future efforts should focus on vector consolidation and delivery strategies. We now include a paragraph discussing these challenges in the revised manuscript.

      Reviewer #2 (Public review): 

      The manuscript describes an interesting approach towards designing genetic circuits to sense different RAS mutants in the context of cancer therapeutics. The authors created sensors for mutant RAS and incorporated feed-forward control that leverages endogenous RAS/MAPK signaling pathways in order to dramatically increase the circuits' dynamic range. The modularity of the system is explored through the individual screening of several RAS binding domains, transmembrane domains, and MAPK response elements, and the author further extensively screened different combinations of circuit components. This is an impressive synthetic biology demonstration that took it all the way to cancer cell lines. However, given the sole demonstrated output in the form of fluorescent proteins, the authors' claims related to therapeutic implications require additional empirical evidence or, otherwise, expository revision. 

      Thank you very much for the thoughtful evaluation, precise critique, and constructive suggestions.

      As correctly noted, our study initially focused on developing and optimizing input sensors and processing units for synthetic gene circuits targeting mutated RAS. To address the concern regarding therapeutic relevance, we have now incorporated functional validation using a clinically relevant output protein: herpes simplex virus thymidine kinase (HSV-TK), which converts ganciclovir into a cytotoxic compound. We replaced the mCerulean reporter with HSV-TK and tested the resulting RAS-targeting circuits in both RAS-mutant and wild-type cancer cell lines. The results, now presented in a new chapter (Figure 8 and Supplementary Fig. 14), demonstrate robust killing of RAS-mutant cells and support the potential therapeutic utility of these circuits.

      Major comments: 

      "These therapies are limited to cancers with KRASG12C mutations" is technically accurate. However, in this fast-moving field, there are examples such as MRTX1133 which holds the promise to target the very G12D mutation that is the focus of this paper. There are broader efforts too. It would help the readers better appreciate the background if the authors could update the intro to reflect the most recent landscape of RAS-targeting drugs. 

      Thank you for this helpful suggestion. We have updated the introduction to reflect the rapidly evolving landscape of RAS-targeting therapies, including the development of inhibitors for nonG12C mutations such as KRASG12D (e.g., MRTX1133). Given the pace and breadth of these advances, we also refer readers to a recent comprehensive review that provides an in-depth overview of current RAS-targeting strategies.

      Only KRASG12D was used as a model in the design and optimization work of the genetic circuits. Other mutations should be quite experimentally feasible and comparisons of the circuits' performances across different KRAS mutations would allow for stronger claims on the circuits' generalizability. Particularly, the cancer cell line used for circuit validation harbored a KRASG13D mutation. While the data presented do indeed support the circuit's "generalizability," the model systems would not have been consistent in the current set of data presented. 

      To further support the generalizability of our RAS sensor, we titrated plasmid doses for a panel of oncogenic RAS variants, including multiple KRAS mutants as well as HRAS<sup>G12D</sup and NRAS<sup>G12D</sup. Across all tested variants, we observed concentration-dependent activation of the RAS sensor. At 1.67 ng/well, the sensor output for all oncogenic RAS variants was at least as high as that for KRAS<sup>G12D</sup>, suggesting that the behavior observed in our initial design and optimization is representative of a broader set of RAS mutations.

      We also noted that high overexpression of wildtype HRAS and NRAS can lead to substantial activation of the sensor, exceeding that observed with wildtype KRAS. This underscores the importance of considering all RAS isoforms when assessing circuit specificity and avoiding potential off-target activation in healthy cells.

      In Figure 2a, the text claims that "inactivation of endogenous RAS with NF1 resulted in a lower YFP/RBDCRD-NarX expression," but Figure 2a does not show a statistically significant reduction in expression of SYFP (measured by "membrane-to-total signal ratio [RU]). 

      Thank you for pointing this out. We repeated the experiment to reassess the effect of NF1 on RBDCRD-NarX-SYFP2 expression and were able to confirm statistical significance. Accordingly, we have replaced Figure 2a with updated data. To facilitate better visual comparison across conditions, we also standardized the y-axis range across all relevant flow cytometry plots.

      The therapeutic index of the authors' systems would be better characterized by a functional payload, other than florescent proteins, that for example induce cell death, immune responses, etc. 

      Thank you for this insightful comment. We agree that fluorescent reporters are limited to approximating expression levels, and that a functional output protein is more appropriate for assessing therapeutic potential. To address this, we replaced mCerulean with the therapeutic suicide-gene, HSV-TK, and tested the circuits in RAS-mutant and wild-type cancer cell lines. These experiments demonstrate that our circuits can express functional proteins and induce cell death in two RAS-mutant cell lines while showing low toxicity in a RAS wild type cell line (new chapter including Fig. 8 and Supplementary Fig.14). 

      Comparing confluence of cells transfected with the RAS-targeting circuits to cells transfected with non-toxic GFP-output negative control or the constitutively expressed EF1αHSV-TK positive control allowed us to estimate the killing-strength of the circuits in each cell line. In RAS-mutant HCT-116 the confluence curves were similar to the positive control, indicating effective killing (Fig. 8b). At lower DNA dose in HCT-116, or in SW620 with lower transfection efficiency, the killing of transfected RAS-driven cancer cells was less pronounced, falling approximately midway between the controls (Fig. 8g&j). In the RAS wild type cell line, Igrov-1, the RAS circuits showed continued growth similar to the non-toxic negative control (Fig. 8d), suggesting low toxicity. 

      While this may indicate low circuit activation in Igrov-1, an alternative explanation for the low toxicity could also be insufficient transfection efficiency. Testing in SW620 –which had similar transfection efficiency as Igrov-1 (Supplementary Fig. 14a)– showed that this moderate transfection efficiency was sufficient for RAS-circuit-dependent killing (Fig. 8d & 8g), supporting the notion of low activation in Igrov-1 and selective cytotoxicity in RAS-driven cancer cells.

      Nonetheless, it is important to note that comparisons between the cell lines need to be interpreted cautiously because of inter-cell line differences in transfection, growth, and HSV-TK/ganciclovir (GCV)-sensitivity (Supplementary Fig. 14) and further validation will be essential. 

      A conclusive assessment will require more efficient delivery strategies, such as viral vectors (as discussed above). Efficient delivery would allow to investigate selectivity in a more realistic setting with patient-derived RAS-mutant cancer and healthy cells as well as testing in an vivo model. While beyond the scope of the current study, we view it as a critical direction for future work and have therefore added a paragraph about this to our discussion.

      Regarding data presented in "Mechanism of action" (Figure 2), the observations are interesting and consistent across different fluorescent reporters. However, with regard to interpretations of the underlying molecular mechanisms, it is not clear whether the different output levels in 2b, 2c, and 2d are due to the pathway as described by the authors or simply from varied expression levels of RBDCRD-NarX itself (2a) that is nonlinearly amplified by the rest of the circuit. From a practical standpoint, this caveat is not critical with respect to the signal-to-noise ratios in later parts of the paper. From a mechanistic interpretation standpoint, claims made forth in this section are not clearly substantiated. Some additional controls would be nice. For example, if the authors express NarXs that constitutively dimerize on the membrane, what would the RasG12Dresponsiveness look like? Does RasG12D alter the input-output curve of NarL-RE? How would Figure 4f compare to a NaxR constitutively dimerized control that only relies on transcriptional amplification of the Ras-dependent promoters? 

      This is a great point. We agree that the observed differences in output levels (Fig. 2) could arise from non-linear amplification due to increased expression of RBDCRD-NarX, rather than RAS binding or dimerization alone. To further investigate this possibility, we performed titrations of KRAS<sup>G12D</sup> in combination with the functional RAS sensor and a series of constitutively active and inactive control constructs (Supplementary Fig. 4).

      Inactive controls lacking NarX dimerization showed only a modest increase in output expression, similar to direct mCerulean expression under the EF1α promoter. Transfection of the output plasmid alone, with NarL, or with NarL and non-RAS-binding RBD<sup>R89L</sup> CRD<sup>C168S</sup> -NarX, resulted in minimal RAS-dependent increases (Supplementary Fig. 4a). Importantly, after normalization using the EF1α-driven mCherry transfection control, these effects were fully or even slightly over-compensated (Supplementary Fig. 4b), showing that we don’t include the effect of EF1α-dependent increased leakiness in the data presented throughout the manuscript, but also that –due to the normalization– we potentially underestimate the dynamic range of the RAS-targeting circuits.

      In contrast, constitutively dimerizing NarX controls (both membrane-bound and cytosolic dimerized via the FKBP–FRB system) exhibited a more pronounced RAS-dependent increase in output –even after normalization– confirming the presence of non-linear amplification (up to 3–4fold). However, this effect was still lower than that achieved with the functional RAS-binding sensor (8-fold at 1.67 ng/well KRAS<sup>G12D</sup>; 14-fold at 5–15 ng/well), indicating that the increase in expression of the sensor parts is not the full explanation of the effect we see. Instead, RAS binding and dimerization further amplify the response and are necessary for full activation (Supplementary Fig. 4b).

      We also addressed the reviewer’s suggestion by testing the MAPK response elements used in Fig. 4f with constitutively dimerizing NarX. These controls generally showed lower fold changes between KRAS<sup>G12D</sup>; and KRAS<sup>WT</sup> than the corresponding RAS-binding circuits  (Supplementary Fig. 7), with one exception: the combination of SRE_NarX and PY2_NarL-VP48. 

      Together, these data show that non-linear amplification via increased expression and dimerization contributes to output activation. However, RAS binding and induced dimerization of the NarX sensor are required for full functionality and enhanced signal strength. This underscores that integrating the MAPK response elements with the binding-based RAS sensor into RAS-targeting circuits generally improves the distinction between cells with KRAS<sup>G12D</sup>;  and KRAS<sup>WT</sup> and that it was the combination that allowed to reach maximal fold changes.

      It's also possible that these Ras could affect protein production at the post-transcriptional or even post-translational levels, which were not adequately considered. 

      Thank you for this comment. We now mention in the manuscript the potential mechanisms by which (over-)activated RAS or MAPK signaling can increase protein synthesis. We cite relevant reports of the mechanisms we found, including upregulation of translational initiation and machinery[10]  and ribosomal biogenesis[11].

      The text claims that "in contrast to what we saw in HEK293 overexpressing RAS (Figure 5d), the "AND-gate" RAS-targeting circuits do not generate higher output than the EF1a-driven, bindingtriggered RAS sensor in HCT-116. Instead, the improved dynamic range results from decreased leakiness in HCT- 116k.o." Comparing the experiment from Figure 5d, which looks at activation in KRASG12D and KRASWT, to the experiments in Figure 6b-d, which looks at activation in HCT-116WT and HCT-116KO is misleading. In Fig 5d., cells are transfected with KRASG12D and KRASWT to emulate high levels of mutant RAS and high levels of wild-type RAS. In Figures 6b-d, HCT-116WT has endogenous levels of mutant RAS, while the KCT-116KO is a knock-out cell line, and does not have mutant or WT RAS. Therefore, the improved dynamic range or "decreased leakiness in HCT-116KO" in comparison to Figure 5d. is more comparable to the NF1 condition from Figure 2, which deactivates endogenous RAS. While this may not be feasible, the most accurate comparison would have been an HCT-116KO line with KRASWT stably integrated. 

      Thank you for this input. We understand that comparing the results from HEK293 cells transfected with KRAS<sup>G12D</sup>;  or KRAS<sup>WT</sup> (Fig. 5d) to those from HCT-116<sup>WT</sup>    and HCT-116<sup>k.o</sup>. cells (Fig. 6b–d) may be misleading if interpreted as a direct comparison of RAS signaling levels. Our intent was not to compare HEK293 with KRAS<sup>WT</sup> directly to HCT-116<sup>k.o</sup>.., but rather to contrast the behavior of the EF1α-driven RAS sensor and the MAPK-responsive RAS-targeting circuits within each cell line context.

      Specifically, we observed that in HEK293 cells expressing KRAS<sup>G12D</sup>, the MAPK-based RAS-targeting circuits produced higher output than the EF1α-expressed RAS sensor. In contrast, in HCT-116<sup>WT</sup> cells, the EF1α-expressed RAS sensor resulted in higher output levels than the RAS-targeting circuits. Despite this, the MAPK-driven circuits showed an improved dynamic range compared to the EF1α-expressed RAS sensor in HCT-116, due to the reduced background expression in the HCT-116<sup>k.o</sup>.. cells. We have revised the manuscript text to clarify this distinction.

      We agree that an HCT-116<sup>k.o</sup> cell line with stable integration of KRAS<sup>WT</sup> would provide a more direct comparison. Nonetheless, HCT-116<sup>k.o</sup>.. cells still express endogenous NRAS and HRAS, both of which are capable of activating the RAS sensor (as shown in Fig. 1g). Therefore, we believe that HCT-116<sup>k.o</sup>. cells are more comparable to HEK293 with KRAS<sup>WT</sup> than to the NF1 condition in Fig. 2, in which all endogenous RAS isoforms are inactivated.

      We couldn't locate the citation or discussion of Figure 4d in the text. Conversely, based on the text description, Figure 6g would contain exciting results. But we couldn't find Figure 6g anywhere ... unless it was a typo and the authors meant Figure 6f, in which case the cool results in Figure S8 could use more elaboration in the main text. 

      Thank you for this helpful observation. The figure references were indeed incorrect due to a typo. The results discussed in the text refer to Figure 6f (not 6g), which is now Figure 7a in the revised version. To further highlight these findings, we have added a new Figure 7b that better illustrates how different MAPK response elements enabled us to identify, for each RAS-mutant cell line, a RAS-targeting circuit that showed stronger activation than in all RAS wild-type lines. We have also expanded the corresponding section in the main text to elaborate on these results and their significance.

      Reviewer #3 (Public review): 

      Summary: 

      Mutations that result in consistent RAS activation constitute a major driver of cancer. Therefore, RAS is a favorable target for cancer therapy. However, since normal RAS activity is essential for the function of normal cells, a mechanism that differentiates aberrant RAS activity from normal one is required to avoid severe adverse effects. To this end, the authors designed and optimized a synthetic gene circuit that is induced by active RAS-GTP. The circuit components, such as RAS-GTP sensors, dimerization domains, and linkers. To enhance the circuit selectivity and dynamic range, the authors designed a synthetic promoter comprised of MAPK-responsive elements to regulate the expression of the RAS sensors, thus generating a feed-forward loop regulating the circuit components. Circuit outputs with respect to circuit design modification were characterized in standard model cell lines using basal RAS activity, active RAS mutants, and RAS inactivation. 

      This approach is interesting. The design is novel and could be implemented for other RASmediated applications. The data support the claims, and while this circuit may require further optimization for clinical application, it is an interesting proof of concept for targeting aberrant RAS activity. 

      Strengths: 

      Novel circuit design, through optimization and characterization of the circuit components, solid data. 

      Weaknesses: 

      This manuscript could significantly benefit from testing the circuit performance in more realistic cell lines, such as patient-derived cells driven by RAS mutations, as well as in corresponding non-cancer cell lines with normal RAS activity. Furthermore, testing with therapeutic output proteins in vitro, and especially in vivo, would significantly strengthen the findings and claims. 

      Thank you very much for the thoughtful and supportive comments. We fully agree with the reviewer’s suggestions for improving the translational potential of the RAS-targeting circuits.

      As a first step toward therapeutic relevance, we replaced the fluorescent reporter with HSV-TK, a clinically validated suicide gene, and demonstrated killing in RAS-mutant cancer cell lines. This is described above and in the new section of the manuscript (Figure 8).

      We also agree that testing in patient-derived cancer cells and especially healthy cells with wild-type RAS activity will be essential. However, testing in primary or patient-derived cells presents delivery challenges: transient transfection of our current four-plasmid system is unlikely to achieve sufficient expression. As discussed in our response to Reviewer #1, development of a more efficient delivery strategy –such as viral vector-based delivery– is a necessary next step.

      Once a delivery system is established, identifying relevant off-target tissues throughout the body with high physiological RAS signaling will be key to assessing selectivity. While comparative data on RAS activation across healthy tissues are scarce[12,13], recent atlases of transcription factor activity[14,15] provide insights to identify off-target cells with high activation of RAS-dependent transcription factors and may even approximate RAS activity across healthy tissue. Alternatively, our single-input sensors for RAS and MAPK pathway activity could be used in vivo to identify off-target cells based on endogenous activity.

      Once relevant target and off-target cells have been identified, patient-derived cancer and healthy cells can help select and adapt cancer-specific RAS-targeting circuits and nominate therapeutic candidates for further safety and efficacy assessment[6,8].

      Reviewer #1 (Recommendations for the authors): 

      For the most part, the data in this study are very convincing and very well presented. The cartoons make it easier to understand the complex experimental setups. 

      (1) Did the authors use wild-type Sos-1 or a constitutively active membrane-bound catalytic domain in their studies? How is SOS-1 activated when in case Sos-1 wild-type was used? 

      Thank you for this feedback. We used the constitutively active catalytic domain of Sos-1 (AA5641049; PDB ID 2II0). 

      (2) Figure 1f: In case of KRAS-G12D, it looks like the output expression does not really correlate with the RAS-GTP level. Can the authors give an explanation? 

      Thank you for this interesting question. We believe the observed discrepancy arises primarily from differences in the sensitivity and readout dynamics of the two assays. The RAS-GTP pulldown ELISA appears insufficiently sensitive to detect small changes in RAS-GTP levels at lower KRAS<sup>G12D</sup> plasmid doses (0.19, 0.56, or 1.67 ng). Only at 5 ng and 15 ng do we observe clear increases in RAS-GTP signal (25% and 700%, respectively). In contrast, the RAS sensor shows strong activation already in the 0.56–5 ng range but begins to saturate at higher doses (see Figure 1f and Figure 1e).

      Beyond the differing technical sensitivities of the ELISA (plate reader) and flow cytometry, an important conceptual distinction may further explain this behavior: the RAS sensor likely integrates RAS signaling over time. Once NarX binds RAS-GTP and dimerizes, it activates NarL, triggering mCerulean expression. If the rate of mCerulean production exceeds its degradation, signal accumulates throughout the assay duration. Thus, the flow cytometry readout reflects time-integrated signaling, allowing small differences in RAS-GTP to be amplified into measurable differences in output—especially at low input levels. This may explain why flow cytometry detects circuit activation earlier and more steeply than the pulldown assay, which provides a snapshot of RAS-GTP abundance at a single time point and saturates less readily at high input levels.

      Together, these factors likely explain the observed differences in signal dynamics: the RAS sensor exhibits steep activation followed by saturation at high plasmid doses (flow cytometry), while the ELISA shows limited sensitivity at low doses but a broader linear range at higher doses.

      (3) Figure 2b: It appears that even in the case of KRAS-G12D and Sos-1, only a few cells are positive. Does this result depend on low cell density, low transfection efficiency, or a wide range of the expression level? As a control, nuclear staining could be shown. 

      Thank you for this question. In the experiment shown in Figure 2b, our goal was to assess the membrane localization of the RBD^CRD-NarX-SYFP2 construct, which serves as a proxy for RAS-bound sensor. To enable accurate computational segmentation and separation of membrane signal from adjacent cells, we intentionally reseeded cells at low density in glassbottom plates for confocal imaging.

      The observed variability in signal likely reflects a combination of transient transfection and heterogeneous expression levels. While the overall transfection efficiency was approximately 70%, expression varied between individual cells. To account for this, we analyzed the membrane-to-total signal ratio per cell, which internally normalizes the membrane signal to the total cellular expression of SYFP2 and controls for differences in transfection efficiency.

      In response to the reviewer’s suggestion, we have updated the figure to include nuclear staining to aid interpretation. We would like to emphasize, however, that the images are intended to illustrate subcellular localization per cell, not expression frequency or intensity across the population.

      Minor points 

      (1) Figure 1b: "The third plasmid expresses NarL, .." should be changed to "The third plasmid expresses NarL-VP48, .." 

      Done

      (2) Figure 1c, right part: The orange arrow should be labeled NarX-H399Q (not N509A). 

      Done

      (3) Supplementary Table 6 and 7: [cells/wells] - should probably be [cells 10*3/well]. 

      Thank you for these points, we updated the manuscript accordingly

      Reviewer #2 (Recommendations for the authors): 

      Minor comments: 

      (1) N509A seems mislabeled in Figure 1b. 

      (2) It would help the readers if the authors could elaborate a bit on what is known about the RBD and CRD mutations used here. 

      Thank you for the input, we added a paragraph in the paper to expand on the effect of these commonly used mutations.

      (3) The KRASWT&Sos1 condition is not explained within the text for Figure 1f, which is the first figure with the KRASWT&Sos1 condition, but rather later on for Figure 2a. Adding a description of this condition to the discussion of Figure 1f would add clarity to this figure. 

      Thank you, we corrected this.

      (4) Citing AlphaFold2 structural predictions as having "revealed that longer linkers between the sensor's RBDCRD and NarX-derived domains could bring the NarX domains into closer proximity" is probably an overstatement. AlphaFold2 generally has low confidence in the placement of long flexible linkers, and the longer linkers in the illustration could facilitate NarX and NarL being even farther apart than they are in the original design. 

      Thank you for this input. We agree that AlphaFold2 predictions generally have low confidence in the placement of long, flexible linkers, and we did not intend to imply that the structural models were predictive of actual linker conformations. Rather, the models were used heuristically to generate the hypothesis that longer linkers might facilitate better positioning of the NarX domains for dimerization.

      As described in the Methods, we manually rotated the flexible linker regions to explore plausible conformations. These exploratory models showed that with a short (1x GGGGS) linker, it was more challenging to bring the NarX domains into close proximity, whereas longer linkers allowed greater positional flexibility. This modeling exercise provided a structural rationale for experimentally testing longer linkers. We have revised the manuscript text to clarify that the structural predictions were used to motivate linker design –not to validate or predict structural outcomes.

      (5) Figure 3b shows that the fold change (KRASG12D/KRASWT) is higher at shorter linker lengths and lower at longer linker lengths, and that the output expression of mCerulean is lower at shorter linker lengths and higher at longer linker lengths. Having a bar plot with the output expression mCerulean levels comparing KRASG12D and KRASWT next to each other would be a significantly more informative representation of this data. In particular, the readers might be interested in understanding the effect of linker length on off-target activation from the sensor, which is not clear from this figure. 

      Thank you for the suggestion. We adapted Figure 3b to better present this. 

      (6) While it is implied that the sentence "Among the tested binding domains, the Ras association domain (RA) of the natural RAS effector Rassf5, the RAS association domain 2 (RA2) of the phospholipase C epsilon (PLCe)33, and the synthetic RAS binder K5534 showed a slightly higher or similar dynamic range." is comparing these RAS binding domains to RBDCRD, for clarity it should be noted what the point of reference is for this "slightly higher or similar dynamic range." 

      (7) Claims are made throughout the text that require supporting data, and thus require a reference to a figure, but there are a few instances where the reference is several sentences after the discussion of data and findings begins. For example, the discussion of Figure 3c begins with the claim "Among the tested binding domains, the Ras association domain (RA) of the natural RAS effector Rassf5, the RAS association domain 2 (RA2) of the phospholipase C epsilon (PLCe)33, and the synthetic RAS binder K5534 showed a slightly higher or similar dynamic range," but there is no reference to the data or figure being discussed until the end of the discussion of Figure 3c. This formatting is also present in Figure 3d and Figure 6f. 

      Thank you for mentioning these imprecisions and inconsistencies, we addressed them in the manuscript. 

      (8) In Figures 5d and 5e, the formatting of underscores and dashes is occasionally inconsistent within the text. (ex. "PY2_NarX_FLT or PY2_NarL-FLT" on page 13.). 

      Thank you for this precise observation. The formatting differences were intentional and reflect distinct design principles. Specifically:

      An underscore (e.g., PY2_NarX_FLT) denotes that two separate proteins are expressed –here, PY2-driven RBDCRD-NarX and EF1α-driven NarL-F.L.T.

      A dash (e.g., PY2_NarL-F.L.T.) indicates a fusion protein –i.e., PY2-driven NarL-F.L.T. combined with EF1α-driven RBDCRD-NarX.

      This notation is used to distinguish expression sources and fusion constructs while avoiding redundancy with the base circuit (EF1α_NarX + EF1α_NarL-VP48). We hope the included schematic diagrams in each relevant figure helps the reader interpret these combinations.

      (9) The text claims that "loss-of-function mutations in RBDCRD decreased activation. However, the dynamic range was only 3-fold" and attributes this claim to Figure 6a. For a claim about specific fold-change activation, one would expect a corresponding figure with quantitative measurements of this fluorescence to be referenced. 

      Thank you for this remark. We made a supplementary figure (Supplementary Fig. 11) to show the quantitative measurement of the 3-fold dynamic range between HCT-116<sup>WT</sup> and HCT-116<sup>k.o</sup>. when using the EF1a-expressed RAS sensor with NarL-VP48.

      (10) The claim of this Figure 2d is that the effect of RAS-GTP levels on mCerulean output is amplified in comparison to Figures 2a, 2b, and 3c, representing expression, RAS binding, and dimerization respectively. While visually this might be true from the figure, the readers might be confused by the lack of significance between the control and the NF1 condition, alongside the variation between the triplicates. Could this experiment be repeated to gain clearer data and to support their claim more effectively? 

      Thank you for this important observation. To address the concern regarding variability and statistical significance in Figure 2d, we repeated the experiment using 24-well plates to increase the number of cells analyzed per condition. This improved the consistency of the data and allowed us to reduce variability across replicates. As a result, we now observe a statistically significant difference between the control and the NF1 condition. The updated results are shown in the revised Figure 2.

      (11) The readers might be less familiar with the concept of "composability" than "modularity" and it would be good to explain it if the authors did intend to use the former. 

      Thank you for this comment. We changed it to modularity to avoid confusion. 

      References

      (1) Shahryari, A., Burtscher, I., Nazari, Z. & Lickert, H. Engineering Gene Therapy: Advances and Barriers. Advanced Therapeutics vol. 4 Preprint at https://doi.org/10.1002/adtp.202100040 (2021).

      (2) Mcclements, M. E. & Maclaren, R. E. Adeno-Associated Virus (AAV) Dual Vector Strategies for Gene Therapy Encoding Large Transgenes. YALE JOURNAL OF BIOLOGY AND MEDICINE vol. 90 (2017).

      (3) Wagner, H. J., Weber, W. & Fussenegger, M. Synthetic Biology: Emerging Concepts to Design and Advance Adeno-Associated Viral Vectors for Gene Therapy. Advanced Science vol. 8 Preprint at https://doi.org/10.1002/advs.202004018 (2021).

      (4) Doshi, J., Willis, K., Madurga, A., Stelzer, C. & Benenson, Y. Multiple Alternative Promoters and Alternative Splicing Enable Universal Transcription-Based Logic Computation in Mammalian Cells. Cell Rep 33, 108437 (2020).

      (5) Wu, Z., Yang, H. & Colosi, P. Effect of genome size on AAV vector packaging. Molecular Therapy 18, 80–86 (2010).

      (6) Dastor, M. et al. A Workflow for in Vivo Evaluation of Candidate Inputs and Outputs for Cell Classifier Gene Circuits. ACS Synth Biol 7, 474–489 (2018).

      (7) Preuß, E. et al. TK.007: A novel, codon-optimized HSVtk(A168H) mutant for suicide gene therapy. Hum Gene Ther 21, 929–941 (2010).

      (8) Angelici, B., Shen, L., Schreiber, J., Abraham, A. & Benenson, Y. An AAV gene therapy computes over multiple cellular inputs to enable precise targeting of multifocal hepatocellular carcinoma in mice. Sci Transl Med 13, (2021).

      (9) Mesnil, M. & Yamasaki, H. Bystander Effect in Herpes Simplex Virus-Thymidine Kinase/Ganciclovir Cancer Gene Therapy: Role of Gap-Junctional Intercellular Communication 1. CANCER RESEARCH vol. 60 http://aacrjournals.org/cancerres/articlepdf/60/15/3989/2478218/ch150003989.pdf (2000).

      (10) Proud, C. G. Ras, PI3-kinase and mTOR signaling in cardiac hypertrophy. Cardiovascular Research vol. 63 403–413 Preprint at https://doi.org/10.1016/j.cardiores.2004.02.003 (2004).

      (11) Azman, M. S. et al. An ERK1/2driven RNAbinding switch in nucleolin drives ribosome biogenesis and pancreatic tumorigenesis downstream of RAS oncogene. EMBO J 42, (2023).

      (12) von Lintig, F. C. et al. Ras activation in normal white blood cells and childhood acute lymphoblastic leukemia. Clin Cancer Res 6, 1804–10 (2000).

      (13) Guha, A., Feldkamp, M. M., Lau, N., Boss, G. & Pawson, A. Proliferation of human malignant astrocytomas is dependent on Ras activation. Oncogene 15, 2755–2765 (1997).

      (14) Pan, L. et al. HTCA: a database with an in-depth characterization of the single-cell human transcriptome. Nucleic Acids Res 51, D1019–D1028 (2023).

      (15) Pan, L. et al. Single Cell Atlas: a single-cell multi-omics human cell encyclopedia. Genome Biol 25, (2024).

    1. Reviewer #2 (Public review):

      Summary:

      The authors generate an optimized small molecule inhibitor of SMARCA2/4 and test it in a panel of cell lines. All uveal melanoma (UM) cell lines in the panel are growth inhibited by the inhibitor making the focus of the paper. This inhibition is correlated with loss of promoter occupancy of key melanocyte transcription factors e.g. SOX10. SOX10 overexpression and a point mutation in SMARCA4 can rescue growth inhibition exerted by the SMARCA2/4 inhibitor. Treatment of a UM xenograft model results in growth inhibition and regression which correlates with reduced expression of SOX10 but not discernible toxicity in the mice. Collectively, the data suggest a novel treatment of uveal melanoma.

      Strengths:

      There are many strengths of the study, including the strong challenge of the on-target effect, the assays used and the mechanistic data. The results are compelling as are the effects of the inhibitor. The in vivo data is dose-dependent and doses are low enough to be meaningful and associated with evidence of target engagement.

      Weaknesses:

      The authors have addressed weaknesses in the revised version.

    2. Author response:

      The following is the authors’ response to the current reviews.

      Reviewer 1:

      While BAP1 mutant UM cell lines were included for some of the experiments, it seems the in-vivo data mentioned in the response to the reviewers comment is missing? The authors stated that "MP46 (Supplementary Fig. 3a) is BAP1-null uveal melanoma cell line with no detectable protein expression (Amirouchene-Angelozzi et al., Mol Oncol 2014), and we have observed strong tumor growth inhibition in this CDX model with our BAF ATPase inhibitor." But the CDX model data shown in Figure 4 is from 92.1 cells. If this data is available, then the manuscript would benefit from its addition.

      We thank the reviewer for bringing this to our attention. As the reviewer mentioned, we show 92-1 CDX model in our manuscript. Additionally, strong tumor growth inhibition in MP-46  CDX model treated with our BAF ATPase inhibitor can be found in Vaswani et al., 2025 (PMID:39801091, https://pubmed.ncbi.nlm.nih.gov/39801091/).

      Reviewer 3:<br /> Supplementary Figure 2C<br /> Is the T910M mutation in the parental MP41 cells heterozygous? If so, the authors should indicate this in the figure legend. If this is a homozygous mutation, the authors should explain how the inhibitors suppress SMARCA4 activity in cells that have a LOF mutation.

      We thank the reviewer for bringing this to our attention. We updated the figure legend accordingly to reflect the genotype of the mutations highlighted in the table.


      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      The presented study by Centore and colleagues investigates the inhibition of BAF chromatin remodeling complexes. The study is well-written, and includes comprehensive datasets, including compound screens, gene expression analysis, epigenetics, as well as animal studies. This is an important piece of work for the uveal melanoma research field, and sheds light on a new inhibitor class, as well as a mechanism that might be exploited to target this deadly cancer for which no good treatment options exist.

      Strengths:

      This is a comprehensive and well-written study.

      Weaknesses:

      There are minimal weaknesses.

      We thank the reviewer for the positive comments.

      Reviewer #2 (Public Review):

      Summary:

      The authors generate an optimized small molecule inhibitor of SMARCA2/4 and test it in a panel of cell lines. All uveal melanoma (UM) cell lines in the panel are growth-inhibited by the inhibitor making the focus of the paper. This inhibition is correlated with the loss of promoter occupancy of key melanocyte transcription factors e.g. SOX10. SOX10 overexpression and a point mutation in SMARCA4 can rescue growth inhibition exerted by the SMARCA2/4 inhibitor. Treatment of a UM xenograft model results in growth inhibition and regression which correlates with reduced expression of SOX10 but not discernible toxicity in the mice. Collectively the data suggest a novel treatment of uveal melanoma.

      Strengths:

      There are many strengths of the study including the strong challenge of the on-target effect, the assays used, and the mechanistic data. The results are compelling as are the effects of the inhibitor. The in vivo data is dose-dependent and doses are low enough to be meaningful and associated with evidence of target engagement.

      Weaknesses:

      The authors introduce the field stating that SMARCA4 inhibitors are more effective in SMARCA2 deficient cancers and the converse. Since the desirable outcome of cancer therapy would be synthetic lethality it is not clear why a dual inhibitor is desirable. Wouldn't this be associated with more side effects? It is not known how the inhibitor developed here impacts normal cells, in particular T cells which are essential for any durable response to cancer therapies in patients. Another weakness is that the UM cell lines used do not molecularly resemble metastatic UM. These UM most frequently have mutations in the BAP1 tumor suppressor gene. It is not clear if the described SMARCA2/4 inhibitor is efficacious in BAP1 mutant UM cell lines in vitro or BAP1 mutant patient-derived xenografts in vivo.

      We thank the reviewer for their insightful and constructive comments. As we demonstrate in Fig. 1d, uveal melanoma cells are selectively and deeply sensitive to BAF ATPase inhibition, and provides a therapeutic window. This is confirmed in Fig. 4a-c, as we demonstrated robust tumor growth inhibition, achieved at a dose well-tolerated in xenograft study. FHD-286, a dual BRM/BRG1 inhibitor similar to FHT-1015 with optimized physical properties, has been evaluated in a Phase I trial in patients with metastatic uveal melanoma (NCT04879017) and manuscript describing results of this clinical trial is currently in preparation.

      As the reviewer mentioned, BAP1 loss is a signature of metastatic uveal melanoma. MP38 is a BAP1 mutant uveal melanoma cell line, and we demonstrated growth inhibition and robust caspase 3/7 activity in response to FHT-1015 (Supplementary Fig. 3a and 3f). MP46 (Supplementary Fig. 3a) is BAP1-null uveal melanoma cell line with no detectable protein expression (Amirouchene-Angelozzi et al., Mol Oncol 2014), and we have observed strong tumor growth inhibition in this CDX model with our BAF ATPase inhibitor.

      Reviewer #3 (Public Review):

      Summary:

      This manuscript reports the discovery of new compounds that selectively inhibit SMARCA4/SMARCA2 ATPase activity that work through a different mode as previously developed SMARCA4/SMARCA2 inhibitors. They also demonstrate the anti-tumor effects of the compounds on uveal melanoma cell proliferation and tumor growth. The findings indicate that the drugs exert their effects by altering chromatin accessibility at binding sites for lineage-specific transcription factors within gene enhancer regions. In uveal melanoma, altered expression of the transcription factor, SOX10, and SOX10 target gene underlies the anti-proliferative effects of the compounds. This study is significant because the discovery of new SMARCA4/SMARCA2 inhibitory compounds that can abrogate uveal melanoma tumorigenicity has therapeutic value. In addition, the findings provide evidence for the therapeutic use of these compounds in other transcription factor-dependent cancers.

      Strengths:

      The strengths of this manuscript include biochemical evidence that the new compounds are selective for SMARCA4/SMARCA2 over other ATPases and that the mode of action is distinct from a previously developed compound, BRM014, which binds the RecA lobe of SMARCA2. There is also strong evidence that FHT1015 suppresses uveal melanoma proliferation by inducing apoptosis. The in vivo suppression of tumor growth without toxicity validates the potential therapeutic utility of one of the new drugs. The conclusion that FHT1015 primarily inhibits SMARCA4 activity and thereby suppresses chromatin accessibility at lineage-specific enhancers is substantiated by ATAC-seq and ChIP-seq studies.

      Weaknesses:

      The weaknesses include a lack of more precise information on which SMARCA4/SMARCA2 residues the drugs bind. Although the I1173M/I1143M mutations are evidence that the critical residues for binding reside outside the RecA lobe, this site is conserved in CHD4, which is not affected by the compounds. Hence, this site may be necessary but not sufficient for drug binding or specifying selectivity. A more precise evaluation of the region specifying the effect of the new compounds would strengthen the evidence that they work through a novel mode and that they are selective. Another concern is that the mechanisms by which FHT1015 promotes apoptosis rather than simply cell cycle arrest are not clear. Does SOX10 or another lineage-specific transcription factor underlie the apoptotic effects of the compounds?

      We thank the reviewer for the valuable comments.

      We believe that our dual ATPase inhibitor is selective and additional insights into binding specificity and selectivity for earlier stage compounds of this series were recently published in Vaswani et al., 2025 (PMID:39801091, https://pubmed.ncbi.nlm.nih.gov/39801091/).

      The reviewer also poses a great question regarding the mechanism of apoptosis. The mechanism of apoptosis is extremely complex, but we observed a decrease in pro-survival BCL-2 protein expression in response to FHT-1015, in the experiment corresponding to Supplementary Fig. 5e. In the experiment described in Fig. 3k, we also monitored caspase 3/7 activity over time, and SOX10 overexpression rescued 92-1 cells from FHT-1015 induced apoptosis. This suggests the role of SOX10 as an important mediator of response to BAF ATPase inhibition, including apoptosis induced by FHT-1015.

      Additional Reviews:

      The referees would like to draw the authors' attention to the following issues that would best benefit from additional revision. 

      The clinical relevance of the study would be strengthened by the use of uveal melanoma cell lines with BAP1 mutations that better represent metastatic uveal melanoma. The use of patient-derived xenografts would also be pertinent and would be a useful addition. Similarly, attention to the effects of the inhibitor on non-cancerous proliferative cells such as blood/T/immune cells would also strengthen the manuscript. As the study reports the administration of one of the inhibitors in mice for the xenograft experiments, it would be important to assess any potential effects on blood cell counts and better discuss the eventual toxicity or lack of toxicity and how it was assessed. 

      The authors should better explain how SOX10 over expression can rescue viability in the presence of the inhibitor. Similarly given the critical roles of BRG1, SOX10, and MITF in cutaneous melanoma some specific discussion on the sensitivity of cutaneous melanoma cells to the inhibitor should be considered, and potential differences with uveal melanoma highlighted. 

      Aside from these issues, the authors are urged to consider the other points mentioned below. 

      Reviewer #1 (Recommendations For The Authors): 

      Figure 1d, as well as the text in the manuscript referring to this figure, would benefit from indicating specific cell lines used for UM. The same for the sentence in line 153. 

      We thank the reviewer for bringing this to our attention. We have added the cell line names and updated the manuscript accordingly.

      For any of the studies conducted, is there any link with the genetics of UM? E.g. BAP1 wildtype/BAP1 mutant? 

      As addressed above in the public review section, MP38 is a BAP1 mutant uveal melanoma cell line, and we demonstrated growth inhibition and robust caspase 3/7 activity in response to FHT-1015 (Supplementary Fig. 3a and 3f). MP46 (Supplementary Fig. 3a) is BAP1-null uveal melanoma cell line with no detectable protein expression (Amirouchene-Angelozzi et al., Mol Oncol 2014), and we have observed strong tumor growth inhibition in this CDX model with our BAF ATPase inhibitor.

      Row 191 - How were peaks classified as enhancer-occupied? 

      We used annotatePeaks function of HOMER package to annotate genomic locations, as well as H3K27ac ChIP-seq to annotate peaks as enhancer-occupied. We thank the reviewer to pointing it out and have updated the manuscript accordingly to include this information.

      Row 259, the two cell lines should be named, also in Figure 3i. 

      We have added the cell line names and updated the manuscript accordingly.

      Reviewer #2 (Recommendations For The Authors): 

      As a proof of concept, this study is truly excellent and the authors should be commended. However, it is desirable that new knowledge in cancer is translated to the clinic. To this end there are a few things needed to strengthen the study. 

      I am rephrasing my statements from the public review to say that I would recommend testing the inhibitor in T cells (side effects) and BAP1 mutant cell lines (for clinical relevance). 

      As addressed in the public review section, MP38 is a BAP1 mutant uveal melanoma cell line, and we demonstrated growth inhibition and robust caspase 3/7 activity in response to FHT-1015 (Supplementary Fig. 3a and 3f). MP46 (Supplementary Fig. 3a) is BAP1-null uveal melanoma cell line with no detectable protein expression (Amirouchene-Angelozzi et al., Mol Oncol 2014), and we have observed strong tumor growth inhibition in this CDX model with our BAF ATPase inhibitor.

      Regarding concerns for any potential side effect on T cells, we observed an increase in both CD4 and CD8 T-cell populations in the peripheral blood and the spleen, when naïve, non-tumor bearing CD-1 mice were dosed with SMARCA2/4 dual ATPase inhibitor FHD-286 once daily for 14 days. FHD-286 is a compound similar to FHT-1015 described in Vaswani et al., 2025 (PMID:39801091, https://pubmed.ncbi.nlm.nih.gov/39801091/). In addition, FHD-286 has been tested in tumor bearing syngeneic models. When B16F10 tumor bearing C57BL/6 were dosed with FHD-286 for 10 days, we observed an increase in CD69+ activated CD8 T-cell infiltration in the tumor microenvironment (doi:10.1136/jitc-2022-SITC2022.0888).

      Reviewer #3 (Recommendations For The Authors): 

      (1) Determine drug binding by crystal structure or generate additional SMARCA4 or SMARCA2 mutations in the region near I1173/I1143 that are not conserved in CHD4 and test them in an ATPase assay for effects on drug inhibition. For example, Q1166 in SMARCA4 and Q1136 in SMARCA4 could be changed to Alanine as in CHD4. Would this abrogate drug inhibition? 

      We believe that our dual ATPase inhibitor is selective and additional insights into binding specificity and selectivity for earlier stage compounds of this series were recently published in Vaswani et al., 2025 (PMID:39801091, https://pubmed.ncbi.nlm.nih.gov/39801091/).

      (2) The finding that SOX10 can rescue the antiproliferative effects of FHT1015 suggests that SMARCA4 is primarily needed for SOX10 expression. However, the co-occupancy of SMARCA4 and SOX10 at enhancers suggests that they cooperate to promote chromatin accessibility. It is unclear how over-expression of SOX10 can promote chromatin accessibility in drug-inhibited cells since SOX10 does not have chromatin remodeling activity. ATAC-seq in cells over-expressing SOX10 and treated with the drug could identify SOX10-dependent targets that do not require SMARCA4 activity and clarify the mechanism. It would also be informative to determine if SOX10 over-expression abrogates the effects of FHT1015 on both cell cycle and apoptosis, helping to resolve whether it is a partial or complete rescue of proliferation. 

      We agree that running ATAC-seq in cells overexpressing SOX10 would clarify this mechanism. However, shifts in corporate strategy deprioritized any further experiments for this project. One potential mechanism that SOX10 overexpression can partially rescue BAF inhibition phenotype is through overexpressed SOX10 localizing to open chromatin regions (mostly promoters) across the genome. We know from our ATAC-seq data (Fig. 2) that BAF inhibition leads to loss of chromatin accessibility at SOX10 enhancer sites, while promoter regions are only partially affected. Therefore, we think that overexpression of SOX10 would allow upregulation of its target genes via binding to the promoter regions. In this model, the enhancer-driven SOX10 target genes are likely to remain silenced.  

      (3) Although the in vivo studies indicate that the drugs are well-tolerated, additional in vitro studies to determine the effects of the drug on the proliferation/survival of non-cancerous cells would further validate their therapeutic utility.

      Author Response: The reviewer raises a critical question. FHD-286, a dual BRM/BRG1 inhibitor similar to FHT-1015 with optimized physical properties, has been evaluated in a Phase I trial in patients with metastatic uveal melanoma (NCT04879017), and it was well tolerated at continuous daily dose of up to 7.5 mg QD and at intermittent dose of up to 17.5 mg QD.  Manuscript describing results of this clinical trial is currently in preparation.

    1. Author response:

      Reviewer #1 (Public review):

      It appears obvious that with no or a little fitness penalty, it becomes beneficial to have MHC-coding genes specific to each pathogen. A more thorough study that takes into account a realistic (most probably non-linear in gene number) fitness penalty, various numbers of pathogens that could grossly exceed the self-consistent fitness limit on the number of MHC genes, etc, could be more informative.

      The reviewer seems to be referring to the cost of excessively high presentation breadth.  Such a cost is irrelevant to the inferior fitness of a polymorphic population with heterozygote advantage compared to a monomorphic population with merely doubled gene copy number.  It is relevant to the possibility of a fitness valley separating these two states, but this issue is addressed explicitly in the manuscript.

      An addition or removal of one of the pathogens is reported to affect "the maximum condition", a key ecological characteristic of the model, by an enormous factor 10^43, naturally breaking down all the estimates and conclusions made in [RS]. This observation is not substantiated by any formulas, recipes for how to compute this number numerically, or other details, and is presented just as a self-standing number in the text.

      It is encouraging that the reviewer agrees that this observation, if correct, would cast doubt on the conclusions of Siljestam and Rueffler.  I would add that it is not the enormity of this factor per se that invalidates those conclusions, but the fact that the automatic compensatory adjustment of c<sub>max</sub> conceals the true effects of removing a pathogen, which are quite large.

      I am not sure why the reviewer doubts that this observation is correct.  The factor of 2.7∙10<sup>43</sup> was determined in a straightforward manner in the course of simulating the symmetric Gaussian model of Siljestam and Rueffler with the specified parameter values.  A simple way to determine this number is to have the simulation code print the value to which c<sub>max</sub>  is set, or would be set, by the procedure of Siljestam and Rueffler for different parameter values.  In another section of this response I will describe how to do this with the simulation code written and used by Siljestam and Rueffler; doing so confirms the value that I obtained with my own code.  Furthermore, I will now give a theoretical derivation of this factor.

      As specified by Siljestam and Rueffler, the positions of the m pathogens in (m-1)-dimensional antigenic space correspond to the vertices of a regular simplex centered at the origin, with distance between vertices equal to 1.  The squared distance from the origin to each of the m vertices of such a simplex is (m-1)/2m (https://polytope.miraheze.org/wiki/Simplex).  Thus, the sum of the m squared distances is (m-1)/2.  For the (0, 0) homozygote, condition is multiplied by a factor of exp(-(vr)<sup>2</sup>/2) for each pathogen, where r is the distance from the origin.  It follows that, with v=20, all the pathogens together decrease condition by a factor of exp(20<sup>2</sup>∙(m-1)/4) = exp(100∙(m-1)).  Thus, increasing or decreasing m by 1 changes this value by a factor of exp(100) = 2.7∙10<sup>43</sup>.

      This begs the conclusion that the branching remains robust to changes in c_max that span 4 decades as well.

      That shows only that the results are not extremely sensitive to c<sub>max</sub> or K.  They are, nonetheless, exquisitely sensitive to m and v.  This difference in sensitivities is the reason that a relatively small change to m leads to such a large compensatory change in c<sub>max</sub> a change large enough to have a major effect on the results.

      As I wrote above, there is no explanation behind this number, so I can only guess that such a number is created by the removal or addition of a pathogen that is very far away from the other pathogens. Very far in this context means being separated in the x-space by a much greater distance than 1/\nu, the width of the pathogens' gaussians. Once again, I am not totally sure if this was the case, but if it were, some basic notions of how models are set up were broken. It appears very strange that nothing is said in the manuscript about the spatial distribution of the pathogens, which is crucial to their effects on the condition c.

      I did not explicitly describe the distribution of pathogens in antigenic space because it is exactly the same as in Siljestam and Rueffler, Fig. 4: the vertices of a regular simplex, centered at the origin, with unity edge length.

      The number in question (2.7∙10<sup>43</sup>) pertains to the Gaussian model with v=20.  As specified by Siljestam and Rueffler, each pathogen lies at a distance of 1 from every other pathogen, so the distance of any pathogen from the others is indeed much greater than 1/v.  This condition holds, however, for most of the parameter space explored by Siljestam and Rueffler (their Fig. 4), and for all of the parameter space that seemingly supports their conclusions.  Thus, if this condition indicates that “basic notions of how models are set up were broken”, they must have been broken by Siljestam and Rueffler.

      Overall, I strongly suspect that an unfortunately poor setup of the model reported in the manuscript has led to the conclusions that dispute the much better-substantiated claims made in [SD].

      The reviewer seems to be suggesting that my simulations are somehow flawed and my conclusions unreliable.  I will therefore describe how my conclusions about sensitivity to parameter values can be verified using the simulation code provided by Siljestam and Rueffler themselves, with only small, easily understood modifications.  I will consider adding this description as a supplement when I revise the manuscript.

      The starting point is the Matlab file MHC_sim_Dryad.m, available at https://doi.org/10.5061/dryad.69p8cz98j.  First, we can add a line that prints the value of the variable logcmax, which represents the natural logarithm of cmax determined and used by the code.  Below line 116 (‘prework’), add the line ‘logcmax’ (with no semicolon).

      Now, at the Matlab prompt, execute MHC_sim_Dryad(false, 8, 20, 1) to run the simulation for the Gaussian model with m=8, v=20, and K=1.  The output will indicate that logcmax=700, in accord with the theoretical factor exp(100*(m-1)) derived above.  The allelic diversity, n<sub>e</sub>, will rise to a steady state-level of about 140, as in the red curve of my Fig. 2.

      Now lower m to 7, i.e,  run MHC_sim_Dryad(false, 7, 20, 1).  The output will indicate that logcmax=600.  This confirms that lowering m by 1 causes the code to lower the value of c<sub>max</sub> by a factor exp(100)=2.7∙10<sup>43</sup>, which must also be the factor by which the condition of the most fit homozygote would increase without this adjustment.

      With the change of m to 7 and the compensatory change in c<sub>max</sub>, steady-state allelic diversity remains high.  But what if m changes but c<sub>max</sub> remains the same, as it would in reality?

      To find out, we can fix the value of c<sub>max</sub> to the value used with m=8 by adding the following line below the line previously added: ‘logcmax = 700’.  With this additional modification in place, executing MHC_sim_Dryad(false, 7, 20, 1) confirms that without a compensatory change to c<sub>max</sub>, lowering m from 8 to 7 mostly eliminates allelic diversity, in accord with the corresponding curve in my Fig. 2.  Similarly, raising m from 8 to 9, or changing v from 20 to 19.5 or 20.5 (executing MHC_sim_Dryad(false, 8, 19.5, 1) or MHC_sim_Dryad(false, 8, 20.5, 1)), largely eliminates diversity, confirming the other results in my Fig. 2.  Results for the bitstring model can also be confirmed, though this requires additional changes to the code.

      Thus, the extreme sensitivity of the results of Siljestam and Rueffler to parameter values can be verified with the code that they used for their simulations, indicating that my conclusions are not consequences of my having done a “poor setup of the model”.

      Response to Reviewer #2 (Public review):

      (1) The statement that the model outcome of Siljestam and Rueffler is very sensitive to parameter values is, in this form, not correct. The sensitivity is only visible once a strong assumption by Siljestam and Rueffler is removed. This assumption is questionable, and it is well explained in the manuscript by J. Cherry why it should not be used. This may be seen as a subtle difference, but I think it is important to pin done the exact nature of the problem (see, for example, the abstract, where this is presented in a misleading way).

      I appreciate the distinction, and the importance of clearly specifying the nature of the problem.  However, Siljestam and Rueffler do not invoke the implausible assumption that changes to the number of pathogens or their virulence will be accompanied by compensatory changes to c<sub>max</sub>.  Rather, they describe the adjustment of c<sub>max</sub> (Appendix 7) as a “helpful” standardization that applies “without loss of generality”.  Indeed, my low-diversity results could be obtained, despite such adjustment, by combining the small change to m or v with a very large change to K (e.g., a factor of 2.7∙10<sup>43</sup>).  In this sense there is no loss of generality, but the automatic adjustment of c<sub>max</sub> obscures the extreme sensitivity of the results to m and v.

      (2) The title of the study is very catchy, but it needs to be explained better in the text.

      I had hoped that the final paragraph of the Discussion would make the basis for the title clear.  I will consider whether this can be clarified in a revision.

    1. Reviewer #2 (Public review):

      Summary:

      In 2021 (PMID: 33503405) and 2024 (PMID: 38578830) Constantinou and colleagues published two elegant papers in which they demonstrated that the Topbp1 checkpoint adaptor protein could assemble into mesoscale phase-separated condensates that were essential to amplify activation of the PIKK, ATR, and its downstream effector kinase, Chk1, during DNA damage signalling. A key tool that made these studies possible was the use of a chimeric Topbp1 protein bearing a cryptochrome domain, Cry2, which triggered condensation of the chimeric Topbp1 protein, and thus activation of ATR and Chk1, in response to irradiation with blue light without the myriad complications associated with actually exposing cells to DNA damage.

      In this current report Morano and co-workers utilise the same optogenetic Topbp1 system to investigate a different question, namely whether Topbp1 phase-condensation can be inhibited pharmacologically to manipulate downstream ATR-Chk1 signalling. This is of interest, as the therapeutic potential of the ATR-Chk1 pathway is an area of active investigation, albeit generally using more conventional kinase inhibitor approaches.

      The starting point is a high throughput screen of 4730 existing or candidate small molecule anti-cancer drugs for compounds capable of inhibiting the condensation of the Topbp1-Cry2-mCherry reporter molecule in vivo. A surprisingly large number of putative hits (>300) were recorded, from which 131 of the most potent were selected for secondary screening using activation of Chk1 in response to DNA damage induced by SN-38, a topoisomerase inhibitor, as a surrogate marker for Topbp1 condensation. From this the 10 most potent compounds were tested for interactions with a clinically used combination of SN-38 and 5-FU (FOLFIRI) in terms of cytotoxicity in HCT116 cells. The compound that synergised most potently with FOLFIRI, the GSK3-beta inhibitor drug AZD2858, was selected for all subsequent experiments.

      AZD2858 is shown to suppress the formation of Topbp1 (endogenous) condensates in cells exposed to SN-38, and to inhibit activation of Chk1 without interfering with activation of ATM or other endpoints of damage signalling such as formation of gamma-H2AX or activation of Chk2 (generally considered to be downstream of ATM). AZD2858 therefore seems to selectively inhibit the Topbp1-ATR-Chk1 pathway without interfering with parallel branches of the DNA damage signalling system, consistent with Topbp1 condensation being the primary target. Importantly, neither siRNA depletion of GSK3-beta, or other GSK3-beta inhibitors were able to recapitulate this effect, suggesting it was a specific non-canonical effect of AZD2858 and not a consequence of GSK3-beta inhibition per se.

      To understand the basis for synergism between AZD2858 and SN-38 in terms of cell killing, the effect of AZD2858 on the replication checkpoint was assessed. This is a response, mediated via ATR-Chk1, that modulates replication origin firing and fork progression in S-phase cell under conditions of DNA damage or when replication is impeded. SN-38 treatment of HCT116 cells markedly suppresses DNA replication, however this was partially reversed by co-treatment with AZD2858, consistent with the failure to activate ATR-Chk1 conferring a defect in replication checkpoint function.

      Figures 4 and 5 demonstrate that AZD2858 can markedly enhance the cytotoxic and cytostatic effects of SN-38 and FOLFIRI through a combination of increased apoptosis and growth arrest according to dosage and treatment conditions. Figure 6 extends this analysis to cells cultured as spheroids, sometimes considered to better represent tumor responses compared to single cell cultures.

      Significance:

      Liquid phase separation of protein complexes is increasingly recognised as a fundamental mechanism in signal transduction and other cellular processes. One recent and important example was that of Topbp1, whose condensation in response to DNA damage is required for efficient activation of the ATR-Chk1 pathway. The current study asks a related but distinct question; can protein condensation be targeted by drugs to manipulate signalling pathways which in the main rely on protein kinase cascades?

      Here, the authors identify an inhibitor of GSK3-beta as a novel inhibitor of DNA damage-induced Topbp1 condensation and thus of ATR-Chk1 signalling.

      This work will be of interest to researchers in the fields of DNA damage signalling, biophysics of protein condensation, and cancer chemotherapy.

      Comments on latest version:

      Having read the revised manuscript and rebuttal I am satisfied that the authors have resolved my various original concerns through a combination of clarification/ explanation and textual changes necessary to make the description of certain data precise. My impression is that they have also largely or completely satisfied the concerns of the other reviewers, with the possible exception of reviewer 1's point about the relative toxicity of AZD and FOLFIRI in colorectal cancer cell lines versus the untransformed CCD841 cell line. This is of course an important point with respect to the possible practical application of this combination for cancer therapy, however this seems somewhat subsidiary to the main novelty and significance of the findings, which are that protein liquid phase separation/ condensation can be manipulated pharmacologically to modify signal transduction processes and that existing drugs can be re-purposed to this end.

    2. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1:

      Comments on revised version: 

      I have reviewed the revised manuscript and read the rebuttal. The authors have carefully addressed my concerns. There is however one point that needs further work: 

      This follows up on my major point #1 in my initial review. I had I asked the authors to demonstrate that FOLFIRI + AZD are less toxic to untransformed colorectal cells than colorectal cancer cell lines.  It is good to see that the authors took my advice and show effects of the drug treatments on the untransformed colorectal cell line CCD841. It seems to be less sensitive to AZD and FOLFIRI in the figure in the rebuttal. What surprises me is that I cannot find these new figures anywhere in the revised manuscript. Also, the data seem preliminary, because I do not see any standard errors in the graphs, and I cannot find a description of the time of drug incubation. I ask the authors to make sure that the CCD841 data are reproducible, and make sure they incorporate the data in the revised manuscript. 

      We thank the reviewer for this insightful comment. In the initial revised version of the manuscript, we did not include results from the untransformed colorectal cell line CCD841, as those experiments had only been performed once and were considered preliminary. However, we fully agree with the reviewer on the importance of including these data.

      To address this, we have repeated the experiments in CCD841 cells to ensure reproducibility. We now report the results from three independent experiments testing the combination of AZD2858 and FOLFIRI on healthy epithelial colon cells. These results are shown in Supplementary Figure S7, where blue matrices represent cell viability and black matrices reflect the level of synergy between AZD2858 and FOLFIRI.

      Our results confirm that, individually, each drug has little to no effect on healthy cells, and no consistent synergistic interaction was observed, except in Experiment 1, which could not be reproduced. Importantly, the drug concentrations used were identical to those applied in the cancer cell experiments, allowing for direct comparison between normal and malignant cell responses.

      Reviewer #2:

      Comments on latest version: 

      Morano et al. have revised their manuscript in response to the points raised by reviewer #3 as follows.

      (1) Fig. 2E: Correcting the previously erroneous labelling of this Fig. makes it match the textual description. 

      (2) Figs 3A and B: The revised textual description of the flow cytometry BrdU incorporation is now precise. 

      (3) Fig. 3E: Removing the suspect WB images is a pragmatic decision that does not significantly affect the overall conclusions of the paper. 

      (4) Fig. 3D: Despite its puzzling appearance this data is now described accurately in the text as "DSBs remained elevated after the combined treatment" rather than "increased after the combined treatment. A more convincing increase in the presumed damaged DNA band is evident in Fig. 4D when AZD2858 is combined with a much lower concentration of SN38 (1.5nM) which could mean that the concentration used in Fig. 3D (300nM) induced maximal damage that could not be further enhanced. 

      We thank the reviewer for their thoughtful comments and constructive feedback, which have helped us improve the clarity and rigor of the manuscript.

      Reviewer #3:

      Comments on latest version: 

      The authors have addressed most of the concerns that I raised in the first round of revision and I have no further questions. I appreciate the authors's efforts in carrying out an preliminary in vivo work, although as the authors pointed out the compound seems to be not effective in vivo. Future work is desired to address this to clarify the significance of the work. 

      We thank the reviewer for acknowledging our efforts in addressing the previous concerns. We also appreciate the recognition of our preliminary in vivo work. While these results suggest limited in vivo efficacy of the compound at this stage, we agree that additional studies will be necessary to fully evaluate its therapeutic relevance. We consider this an important next step and are committed to pursuing it in future work.

    1. Reviewer #1 (Public review):

      Summary:

      The authors use Dyngo-4a, a known Dynami inhibitor to test its influence on caveolar assembly and surface mobility. They investigate whether it incorporates into membranes with Quartz-Crystal Microbalance, they investigate how it is organized in membranes using simulations. Finally, they use lipid-packing sensitive dyes to investigate lipid packing in the presence of Dyngo-4a, membrane stiffness using AFM and membrane undulation using fluorescence microscopy. They also use a measure they call "caveola duration time" to claim that something happens to caveolae after Dyngo-4a addition and using this parameter, they do indeed see an increase in it in response to Dyngo-4a, which is reduced back to the baseline after addition of cholesterol.

      Overall, the authors claim: 1) Dyngo-4a inserts into the membrane and this 2) results in "a dramatic dynamin-independent inhibition of caveola scission". 3) Dyngo-4a was inserted and positioned at the level of cholesterol in the bilayer and 4) Dyngo-4a-treatment resulted in decreased lipid packing in the outer leaflet of the plasma membrane 5) but Dyngo-4a did not affect caveola morphology, caveolae-associated proteins, or the overall membrane stiffness 6) acute addition of cholesterol counteracts the block in caveola scission caused by Dyngo-4a.

      Overall, in this reviewers opinion, claims 1, 3, 4, 5 are well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM.

      However, there is no convincing assay for caveolar endocytosis presented besides the "caveola duration" which although unclearly described seems to be the time it takes in imaging until a caveolae is not picked up by the tracking software anymore in TIRF microscopy.

      Since the main claim of the paper is a mechanism of caveolar endocytosis being blocked by Dyngo-4a, a true caveolar internalization assay is required to make this claim. This means either the intracellular detection of not surface connected caveolar cargo or the quantification of caveolar movement from TIRF into epifluorescence detection in the fluorescence microscope. Otherwise, the authors could remove the claim and just claim that caveolar mobility is influenced.

      Significance:

      A number of small molecule inhibitors for the GTPase dynamics exist, that are commonly used tools in the investigation of endocytosis. This goes as far that the use of some of these inhibitors alone is considered in some publications as sufficient to declare a process to be dynamin-dependent. However, this is not correct, as there are considerable off-target effects, including the inhibition of caveolar internalization by a dynamin-independent mechanism. This is important, as for example the influence of dynamin small molecule inhibitors on chemotherapy resistance is currently investigated (see for example Tremblay et al., Nature Communications, 2020).

      The investigation of the true effect of small molecules discovered as and used as specific inhibitors and their offside effects is extremely important and this reviewer applauds the effort. It is important that inhibitors are not used alone, but other means of targeting a mechanism are exploited as well in functional studies. The audience here thus is besides membrane biophysicists interested in the immediate effect of the small molecule Dyngo-4a also cell biologists and everyone using dynamic inhibitors to investigate cellular function.

      Comments on revised version:

      Please include the promised data on caveolar internalization and remove the above mentioned claim on membrane undulations from the text.

    2. Reviewer #2 (Public review):

      Summary:

      In this manuscript, the authors probe the mechanisms by which Dyngo-4a, a dynamin inhibitor used to block endocytosis, disrupts caveolae dynamics. They provide compelling evidence that Dyngo-4a inhibits caveolae dynamics and endocytosis (as well as several other aspects of plasma membrane dynamics) by a dynamin-independent mechanism. They also provide strong computational and experimental data showing that Dyngo-4a inserts into membranes and decreases lipid packing in the outer leaflet of the plasma membrane. Finally, they demonstrate that the addition of excess cholesterol to cells reverses the effects of Dyngo-4a on caveolae dynamics, presumably by reversing lipid packing defects. Based on these findings they conclude that lipid packing regulates caveolae dynamics and endocytosis in a cholesterol-dependent manner.

      This work should be of value to cell biologists interested in plasma membrane remodeling and membrane trafficking, biophysicists that study small molecule/membrane interactions and membrane remodeling processes, and chemists interested in designing drugs to target membrane trafficking machinery and pathways.

      Strengths:

      This work addresses the important topic of how a widely used endocytic inhibitor actually works. In the process of addressing this question, the authors uncover unexpected connections between how lipids are packed in cell membranes and membrane dynamics. The methods are appropriate and many of the claims made in this work are well supported by data.

      Weaknesses:

      I appreciate that the manuscript has already gone through one round of revisions and that many of the concerns from the previous reviewers appear to have been addressed. However, as an interested reader, I would like to offer several additional comments for the authors to consider.

      (1) It is not clear based on the data presented whether the effects of Dyngo-4a on lipid packing give rise to defects in caveolae dynamics or if these effects are merely correlated. To show this more definitively, one might expect additional experimental approaches to be used to perturb lipid packing. I appreciate this is probably beyond the scope of the current study. However, it seems important for the manuscript to be clear about how far this interpretation can be pushed in the absence of additional independent lines of evidence.

      (2) On a related note, it is not obvious how changes in lipid packing in the outer leaflet could impact caveolae dynamics. It would be helpful to include a cartoon illustrating how this might work.

      (3) The authors note that Dyngo-4a inhibits several dynamic processes including generalized plasma membrane mobility (Fig 4A&B), transferrin uptake (Fig S4C), and fusion of fusogenic liposomes (Fig S4G). This clearly indicates there is a major disruption of the plasma membrane going on here that is not limited to caveolae. They go on to show that the addition of cholesterol reverses the effects of Dyngo-4a on caveolae dynamics. However, they do not discuss whether adding back cholesterol has similar effects on plasma membrane mobility and transferrin uptake. This information could help to further pinpoint whether the mechanisms of action are shared, and if the role of cholesterol is more general in controlling these events or is instead specific to caveolae.

      (4) In Fig 4C, the morphology of the neck region of the Dyngo04a treated caveolae structure appears to be "pinched" compared to the control. I appreciate that more EM studies are underway. It would be useful to specifically compare the morphology of the caveolae as part of those studies.

      (5) In Line 91, a statement is made that 8S complex formation requires cholesterol. This is debatable, as they appear to form in E. coli in the absence of cholesterol (reference 14).

    3. Author response:

      General Statements

      In this paper we demonstrate that the lipid packing of the plasma membrane has a huge impact on the stability of caveolae. By using interdisciplinary techniques, we show that the widely used dynamin inhibitor Dyngo-4a adsorbs and inserts to lipid bilayers leading to a decreased lipid packing and hence reduced caveolae dynamics and internalization even in cells lacking dynamin. We have added experiments that validates that Dyngo-4a treatment does not result in fragmentation or disassembly of the caveolae.  A FRAP assay of cytosolic caveolae has been employed to address questions concerning scission. Moreover, as suggested by the reviewers, we have also included new simulation data that show and expand on the fact that Dyngo-4a positions in the lipid leaflet similar to cholesterol and preferentially associates with cholesterol clusters, affecting the spatial distribution of cholesterol in the membrane. We believe that these added data have greatly improved the paper and strengthened our conclusions that the lipid packing is a critical determinant in the balance between internalization and stable plasma membrane association of membrane vesicles.

      As requested, we have expanded the introduction to provide more detailed information about previous findings in the field. Changes and addition to the text has been highlighted in red for easier tracking.

      Point-by-point description of the revisions

      Reviewer #1 (Evidence, reproducibility and clarity):

      The authors use Dyngo-4a, a known Dynami inhibitor to test its influence on caveolar assembly and surface mobility. They investigate, whether it incorporates into membranes with Quartz-Crystal Microbalance, they investigate how it is organized in membranes using simulations. Finally, they use lipid-packing sensitive dyes to investigate lipid packing in the presence of Dyngo-4a, membrane stiffness using AFM and membrane undulation using fluorescence microscopy. They also use a measure they call "caveola duration time" to claim that something happens to caveolae after Dyngo-4a addition and using this parameter, they do indeed see an increase in it in response to Dyngo-4a, which is reduced back to the baseline after addition of cholesterol.

      Overall, the authors claim: 1) Dyngo-4a inserts into the membrane and this 2) results in "a dramatic dynamin-independent inhibition of caveola scission". 3) Dyngo-4a was inserted and positioned at the level of cholesterol in the bilayer and 4) Dyngo-4a-treatment resulted in decreased lipid packing in the outer leaflet of the plasma membrane 5) but Dyngo-4a did not affect caveola morphology, caveolae-associated proteins, or the overall membrane stiffness 6) acute addition of cholesterol counteracts the block in caveola scission caused by Dyngo-4a.

      Overall, in this reviewers opinion, claims 1, 3, 4, 5 are well-supported by the presented data from electron and live cell microscopy, QCM-D and AFM.

      However, there is no convincing assay for caveolar endocytosis presented besides the "caveola duration" which although unclearly described seems to be the time it takes in imaging until a caveolae is not picked up by the tracking software anymore in TIRF microscopy.

      Since the main claim of the paper is a mechanism of caveolar endocytosis being blocked by Dyngo-4a, a true caveolar internalization assays is required to make this claim. This means either the intracellular detection of not surface connected caveolar cargo or the quantification of caveolar movement from TIRF into epifluorescence detection in the fluorescence microscope. Otherwise, the authors could remove the claim and just claim that caveolar mobility is influenced.

      We thank the reviewer for the nice constructive comments, and we very much appreciate the positive critique. We have now included a FRAP experiment of endocytic Cav1-GFP supporting the effect on internalization. In addition, we are currently preforming CTxB HRP experiments to quantify the number of caveolae at PM using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Reviewer #1 (Significance):

      A number of small molecule inhibitors for the GTPase dynamics exist, that are commonly used tools in the investigation of endocytosis. This goes as far that the use of some of these inhibitors alone is considered in some publications as sufficient to declare a process to be dynamin-dependent. However, this is not correct, as there are considerable off-target effects, including the inhibition of caveolar internalization by a dynamin-independent mechanism. This is important, as for example the influence of dynamin small molecule inhibitors on chemotherapy resistance is currently investigated (see for example Tremblay et al., Nature Communications, 2020).

      The investigation of the true effect of small molecules discovered as and used as specific inhibitors and their offside effects is extremely important and this reviewer applauds the effort. It is important that inhibitors are not used alone, but other means of targeting a mechanism are exploited as well in functional studies. The audience here thus is besides membrane biophysicists interested in the immediate effect of the small molecule Dyngo-4a also cell biologists and everyone using dynamic inhibitors to investigate cellular function.

      Reviewer #2 (Evidence, reproducibility and clarity):

      This manuscript uses the small molecule dynamin inhibitors dynasore and dyngo to show that in dynamin triple knockout cells that these inhibitors impact lipid packing and organization in the plasma membrane. Data showing that dyngo affects caveolin dynamics using tirf microscopy is also shown and is interpreted to reflect inhibition of caveolae scission from the membrane.

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane.

      What is less convincing is the conclusion that dyngo is preventing caveolae scission from the membrane. Study of caveolae endocytosis is based on a TIRF assay that has inherent limitations:

      - Caveolae are defined as bright cav1-positive spots in diffraction limited TIRF and their disappearance presumed to be endocytic events. Cav1 spots are presumed to be caveolae but the authors do not consider that they may be flat non-caveolar oligomers. The diffraction limited TIRF approach interprets the large structures as caveolae but evidence to that effect is lacking.

      This is a valid comment and to address this we have now included data showing colocalization of cavin1 and EHD2 to the Cav1-GFP spots. We can however not determine if they are flat or invaginated. We do have extensive experience imaging caveolae using TIRF microscopy and carefully chose cells that display low expression of fluorescently labelled caveolin to avoid non-caveolar structures.

      - The analysis (and the diagram presented in figure 4) considers that caveolae can either diffuse laterally in the membrane or internalize and does not consider that caveolae can flatten and possibly fragment in the membrane. Is it not possible that loss of Cav1 spots is a fragmentation event and not necessarily a scission event?

      This is a good question, yet, fragmentation and disassembly would result in shorter track durations and this is not what is observed in data. We have now also included data showing that cavin1 is persistently associated with the Cav1 spots identified as caveolae during Dyngo-4a treatment indicating that these are caveolae. Furthermore, IF stainings showing colocalization of Cav1GFP with cavin1 or EHD2 after Dyngo-4a treatment have also been added. We have now also expanded on the different interpretations of the data in the results section.

      - The analysis is based on overexpression of Cav1-GFP that may alter the stoichiometry between Cav1 and cavin1 such that while caveolae may be expressed, larger non-caveolar structures may accumulate.

      Yes, this is correct, we have specifically imaged cell expressing low levels of Cav1-GFP to avoid accumulated non-caveolar structures that can be spotted in cells with high expression.

      - Cav1 has been shown to be internalized via the CLIC pathway (Chaudary et al, 2014) and if dyngo is impacting clathrin then maybe it is also impacting CLIC endocytosis and thereby Cav1 endocytosis via this pathway?

      Dyngo-4a has been shown to not affect CLIC endocytosis (McCluskey et al., 2013) and in our data we do not see internalization following Dyngo-4a treatment.

      - The longer Cav1 TIRF track time and shorter displacement with dyngo is consistent with inhibition of caveolae scission. However, as the authors discuss, could not reduced membrane undulations due to dyngo's impact on membrane order be responsible for the longer tracks? Alternatively, perhaps the altered lipid packing is corralling Cav1 movement and reducing non-caveolar Cav1 endocytosis, resulting in shorter tracks of longer duration? The proposed interaction of dyngo with cholesterol could prevent scission but also stabilize large (flat?) Cav1 oligomers in the membrane, perhaps reducing Cav1 oligomer fragmentation.

      We completely agree that membrane undulations contribute to instability of the TIRF-field and therefore disruption of cav1-GFP tracks as we discuss in the results section and have been described in previous work (Larsson et al., 2023). Yet, we have also shown that internalization of caveolae results in shorter tracks (Hubert et al., 2020; Larsson et al., 2023; Mohan et al., 2015). Furthermore, the tracked Cav1-GFP spots are persistently positive for cavin1 both with and without Dyngo-4a treatment showing that the majority do not disassemble become internalized by other pathways. Additionally, the added IF stainings after 30 min Dyngo-4a treatment also show that the Cav1-GFP spots remain positive for cavin1 and EHD2 just as ctrl-treated cells.

      My point here is not to discredit the data but only to suggest that the TIRF approach used is an indirect measure of caveolae scission from the membrane that requires substantiation using other approaches.

      We appreciate these comments and have tried to address these by adding new data and discussions on the interpretation of the tracking data in the results section.

      Dyngo is certainly generally affecting lipid packing via cholesterol and thereby affecting Cav1 dynamics in the plasma membrane. The claim of caveolae scission should be qualified and alternative possibilities considered and discussed. If the authors persist in arguing that dyngo is affecting caveolae scission then the effect should be substantiated by accumulation of caveolae by quantitative EM and high spatial and temporal resolution imaging of Cav1 and cavin1 to define the endocytic events. As the latter represents a new, and potentially very challenging, line of experimentation, I would suggest that it is beyond the scope of the current study. As indicated above the additional experiments are not necessary and qualification of the claims would be sufficient.

      We have now included a FRAP experiment of endocytic Cav1-GFP supporting the effect on internalization. We are also currently preforming CTxB HRP experiments to quantify the number of caveolae at the PM using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Other points

      Figure 1C - Cav1 positive spots cannot be interpreted to be caveolae from diffraction limited confocal images. Same comment applies to Fig 4G - caveola? duration.

      We completely agree with this and that the claims should be qualified. We have added IF stainings showing that the Cav1-GFP structures are also positive for cavin1. We have now clarified that we cannot distinguish between flat or different curved states of caveolae using this methodology. We have also changed the labelling of Fig. 4G.

      Figure 4C - it is not clear why this EM data is not quantified - for both the number of caveolae and clathrin coated pits - as this would help clarify the interpretation of the effect reported.

      We are currently preforming CTxB HRP experiments to quantify the number of caveolae using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long.

      Figure 4D - the AFM experiments should perhaps be repeated as the non-significant effect of dyngo on the Young's modulus may be a result of insufficient n values.

      We would like to clarify that to ensure the robustness of our AFM measurements, we performed the experiments with sufficient biological and technical replicates. Specifically, each data point shown in Figure 4D represents a Young’s modulus value averaged from approximately sixty force-distance curves per cell. For each condition, we collected force-distance maps on eight to nine individual cells, obtained from two separate petri dishes per day. We repeated this process on two independent days. In total, we analysed thirty-one cells for the DMSO control and thirty-three cells for the Dyngo-4a treatment. We performed the “student’s t-test with Welch’s correction” to access the statistical significance between the two conditions, as described in the main text. We believe that the sample size and statistical approach are sufficient to support the conclusions presented. Furthermore, we also analysed cell stiffness by calculating the slope of the linear portion of the force-distance curves. This analysis also did not reveal any statistically significant differences between the conditions (data not shown), further supporting our conclusion that Dyngo-4a treatment does not significantly alter the Young’s modulus under our experimental setup (or conditions).

      Reviewer #2 (Significance):

      This data showing that dyngo and dynasore target membrane order is quite compelling and argues that the effects of these inhibitors is not dynamin specific and that inhibition of endocytosis by these small molecule inhibitors is dynamin-independent. The in vitro and in vivo data they provide is convincing.

      Similarly, the data showing that dynasore and dyngo affect caveolin dynamics and clathrin endocytosis (transferrin) is quite convincing and argues that altered lipid packing is impacting membrane dynamics at the plasma membrane.

      What is less convincing is the conclusion is that dyngo is preventing caveolae scission from the membrane.

      Reviewer #3 (Evidence, reproducibility and clarity):

      Larsson et al present experimental and computational data on the role of Dyngo4a (a compound that was developed to inhibit dynamin) on the dynamics of caveolae. The manuscript mostly documents effects of Dyngo on caveolae, with one experiment to suggest a mechanism for this result. This one rather unconvincing result forms the focus of the manuscript contributing to a disconnect between the data and the presentation. Additionally, there are concerns with data interpretation. The writing could also benefit from revision to address grammar mistakes, strengthen referencing, and increase precision. Overall, the manuscript requires substantial revisions before being considered for publication. The central claim, in particular, needs stronger evidence to support the proposed mechanism.

      We thank the reviewer for the thorough review and for experimental suggestions that we believe has strengthened our data further.

      Significant issues (in approximate order of importance):

      (1) The data supporting the central mechanistic explanation appears limited. There is no evidence that Dyngo remains in one leaflet

      The simulations show that the energy barrier for moving in between bilayers is very high. Furthermore, simulations of C-Laurdan has shown that it does not readily flip in between membrane leaflets (Barucha-Kraszewska et al., 2013) supporting that it reports on the outer lipid leaflet when added to cells. We have however now changed this and state that Dyngo-4a decreased the lipid order in the plasma membrane.

      - the GP of the PM is very low compared to previous measurements,

      The absolute GP-values will vary between setups depending on what filters are used so they are not comparable between laboratories. What is of importance is that we found a significant change in the relative GP-values in cells treated with Dyngo-4a and control cells. It is this change that we report. We have not performed any GP-measurements on this cell type earlier so it is unclear what previous measurements reviewer #3 are referring to.

      - effects on other membranes are not explored,

      The order of the intracellular membranes is as expected lower than that of the plasma membrane. Differentiating different intracellular membranes of interest like endocytotic vesicles from other intracellular membranes would be very difficult but, more importantly, our study is focused on what is happening in the plasma membrane where caveolae reside and would be of minor interest for plasma membrane dynamics.

      - dynamin-directed effects of Dyngo are not considered,

      In the discussion section we discuss the difficulties with disentangling dynamin-direct and indirect effects.

      (2) The QCM-D measurements and claims require explanation as several aspects remains unclear. In Fig S2, the 'softness' (what does this mean?) changes by 4-fold with DMSO alone (what does this mean?), then fractionally more with Dyngo. Then fractionally more again when Dyngo is removed (why?). Then it remains somewhat higher when both Dyngo and DMSO are removed, which is somehow interpreted as Dyngo remaining in the bilayer, but not DMSO.

      We understand the confusion of the reviewer and hope our explanations provide clarity. QCM-D measurements are based on an oscillating quartz crystal sensor. Specifically, alterations in oscillation frequency (ΔF) and the rate of energy dissipation from the sensor surface (ΔD) are what is measured. Allowing the measurement of: 1) materials adsorbing to the sensor surface, 2) changes in the viscoelastic properties of a solution in contact with the sensor surface, 3) changes in the material adsorbed to the sensor surface upone exposure to different solutions. The ratio of ΔD/-ΔF reports the mechanical softness or rigidity of an adsorbed material, in this case the SLB.

      A “buffer shift” is the term used when there is not an adsorption to the sensor surface, but rather an effect from altering the solution above the sensor surface. One reason is because different solutions can have different densities (e.g., a DMSO-buffer mixture vs buffer alone), which impacts the oscillations of the sensor. It was observed that the DMSO-buffer mixture alone gave a large buffer shift in comparison to the adsorption of the Dyngo-4a into the SLB, thereby muddling the data interpretation. Thus, in Fig. S2 the system was first equilibrated with the DMSO-buffer mixture prior to addition of the Dyngo-4a solution to allow for clearer visualization of the two events. In QCMD to assess if something has made a permeant change to the system you change back to the solutions used before the addition, thus first we washed with a DMSO-Buffer mixture followed by buffer alone. Control experiments were carried out in which no Dyngo-4a was added (also shown in Fig. S2). The control shows the same “buffer shift” from the DMSO-buffer mixture occurs in both systems and that upon returning to a buffer only condition there is no permanent change to the system caused from exposure to the DMSO. In contrast, once the system that received Dyngo-4a is changes back to a buffer only system we see that mass has been added to the system (ΔF) with little change to the dissipation (ΔD), thereby resulting in a lower ratio of ΔD/-ΔF, which is to say that the SLB after the adsorption of Dyngo-4a was more rigid that the SLB without Dyngo-4a.

      These interpretations are difficult to grasp, as the authors seem to be implying simple amphiphilic partitioning into the membrane, which should all be removable by efficient washing.

      Amphiphilic partitioning is not fully reversible by “efficient washing” it depends on partitioning coefficients.

      I do not doubt that this compound interacts with membranes, but the quantifications appear ambiguous. A bilayer with 16 mol% (or worse, 30% if all in one leaflet) Dyngo is very unlikely (to remain a bilayer). Even if such a bilayer was conceivable, the authors are claiming an ADDITION of Dyngo that would INCREASE the area of one leaflet by 30%, which needs explanation as it appears unlikely.

      We understand that in our attempt provide numbers in the results section for the amount of binding observed in QCM-D, this can easily be interpreted as this is what is observed to insert into the PM. However, as discussed in the discussion, we also see aggregations of Dyngo-4a that associate with the membrane in the simulations which likely could contribute to the binding observed in QCM-D prior to washing. The precise amount of membrane inserted Dyngo-4a is difficult to measure as we discuss in the text. In order to make this clearer, we have now moved all these details to the discussion section where we elaborate on this. Furthermore, since Dyngo-4a, like cholesterol, is intercalating in between the head groups of the lipids the area would not increase in direct proportion to the mol%.

      Also, there are no replicates shown, so unclear how reproducible these effects are?

      For clarity, only single experiments are shown. However, multiple experiments were performed and the range in measured values for 3 technical repeats can be observed in the standard deviations found in the main text (e.g., 6 ± 2 mol%).

      (3) The simulations are insufficiently described and difficult to interpret. How big are these systems? Why do the figures show the aqueous system with lateral boundaries?

      There are no explicit boundaries used in the simulations, periodic boundary conditions are applied in all three dimensions. The lateral boundaries observed in the figures correspond to the simulation box edges and are a visual artifact of 2D projections with QuickSurf representation. No artificial wall or constraints were introduced laterally. Additional technical details, including the system size and periodic boundary conditions have now been added to the methods section.

      It seems quite important that multiple Dyngo molecules aggregate rather than partition into membranes - is this likely to occur in experiment?

      Yes, this is important and with the additional simulation experiments suggested by Reviewer #3 it has been clarified that they contribute a great deal to the change in lipid packing of lipid bilayers containing cholesterol.  However, it is hard to test aggregation is the cellular system, but we believe that this happens and contribute to the effect on membranes. We have now emphasized the effect of the aggregates in the text.

      PMF simulations are strongly suggesting that Dyngo does not spontaneously cross membranes, which is inconsistent with its drug-like amphiphilicity (cLogP~2.5 is optimally suited for membrane permeation) and known effects on intracellular proteins. This suggests an artefact in these PMFs.

      As stated in the submitted version of the manuscript, logP was used to validate the topology and the observed value was in a very good agreement with cLogP. Moreover, this validation complemented the standard procedure of CHARMM-GUI ligand modelling, that provided a reasonable penalty score (around 20) for the Dyngo-4a topology. POPC and cholesterol molecules are standard in the force field and validated by numerous studies. The parameters used for the membrane simulations and AWH in particular are very common for this type of studies. Thus, we do not see what may cause any artifacts in the free energy profile construction. In fact, amphiphilicity of the molecule may be one of the key reasons that Dyngo-4a molecule remains at the aqueous interface of the membrane and does not cross the membrane spontaneously. Also, we believe that the energy barrier of 40-60 kJ/mol is not prohibitively high and Dyngo-4a molecules may still overcome the barrier eventually, though we expect majority to reside in the upper leaflet.

      The authors should experimentally measure the permeation of Dyngo through bilayers (or lack thereof), to more robustly support their finding that Dyngo does not cross membranes spontaneously.

      We thank the reviewer for the suggestion, however this if very technically challenging and would require establishment of precise systems which is beyond the scope of this manuscript.

      (4) Why not measure effect of Dyngo on lipid packing directly and more broadly in model membranes?

      With the added modelling experiments supporting the previous simulations and the calculated GP values from the C-Laurdan experiments on cellular plasma membrane, we do not find it necessary to include more model membranes experiments than the already existing ones on lipid monolayers and supported lipid bilayers.

      (5) Statistics should not be done on individual cells (n>26), but rather on independent experiment (N=3?)

      We have performed the statistics on live cell particle tracking according to previous literature on similar systems (Boucrot et al., 2011; Larsson et al., 2023; Shvets et al., 2015; Stoeber et al., 2012).

      (6) Fig 1G is important but rather unclear. Firstly, these kymographs are an odd way to show that the caveolae are not moving. More importantly, caveolae in normal cells have been shown to be quite stable and immobile (eg doi: 10.1074/jbc.M117.791400), yet here they are claimed to be very mobile.

      Although this might be an odd and unconventional way to depict dynamic processes, we believe that this is a very illustrative way to show track stability over time in bulk rather than just a kymograph over a few structures in a cell. Furthermore, we are not claiming that caveolae are very mobile but rather the opposite very stable in agreement with previous work (Boucrot et al., 2011; Larsson et al., 2023; Mohan et al., 2015). We have now edited the text to make this even clearer.

      Also, if Dyngo prevents caveolae scission, there should be more of them at the membrane - why no quantification like Fig 1C to show accumulation of caveolae upon Dyngo treatment? Or directly counting caveolae via EM, as in Fig 4C?

      We are currently preforming CTxB HRP experiments using EM but due to reasons out of our control we have not managed to finish these on time, they will be included in the manuscript once they are ready in hopefully not too long. However, Dynasore has previously been shown, by EM, to increase the number of caveolae at the PM (Moren et al., 2012; Sinha et al., 2011).

      (7) The writing can be made more precise and referencing could be strengthened.

      The introduction was written in a short format, and we have now extended this and made it more precise.

      Some examples:

      (a) 'scissoned' is not a word in English,

      Thanks, we have now changed this.

      (b) what is meant by "Cav1 assembly is driven by high chol content"? There are many types of caveolin assemblies.

      We agree that this can be made more precise and have now clarified this in the introduction.

      (c) "This generates a unique membrane domain with distinct lipid packing and a very high curvature." Unclear what 'this' refers to and there is no reference here, so what is the evidence for either of these claims? Caveolin-8S oligomers are not curved. Perhaps 'this' is caveolae, but they are relatively large and also not very highly curved and I am unaware of measurements of lipid packing therein.

      Caveolae are around 50 nm which in biology is a very high curvature of a membrane. It has been extensively proven that caveolae have a distinct lipid composition highly enriched in cholesterol and sphingolipids, which thereby also will generate a unique lipid packing as compared to the surrounding membrane. Yet, the reviewer is correct that lipid packing has not been measured in a caveola for obvious technical challenges. Thus, we have now changed the text to “special lipid composition”.

      The sentence following that one again makes a specific, but unreferenced, claim.

      (d) intro claims that lipid packing is critical for fission, but it is unclear quite what is meant by this claim. The references do not help, as they are often about the basic biophysics of lipids, rather than how packing affects fission.

      We have now edited the text.  

      (e) intro strongly implies that caveolae remain membrane attached because of stalled scission. How strong is the evidence for this? The fact that EHD2 is at the neck is not definitive,

      We used the term stalled scission to describe that all omega shaped membrane invaginations do not scission in the same automatic way as clathrin coated vesicles. We have now changed this in the text. Caveolae are shown to be released (undergo scission) and be detected as internal caveolae if the protein EHD2 is removed. Hence this must be interpreted as if EHD2 stalls scission. The evidence includes data compiled over the last 12 years from others and us which include for example: 1) Caveolae with EHD2 have a longer duration time (Larsson et al., 2023; Mohan et al., 2015; Moren et al., 2012; Stoeber et al., 2012), Knock down of EHD2 results in more internalized caveolae as measured by CTxB HRP using EM (Moren et al., 2012) and shorter duration time at the PM (Hubert et al., 2020; Larsson et al., 2023; Mohan et al., 2015; Stoeber et al., 2012). 2) EHD2 overexpression results in less internalized caveolae as measured by CTxB HRP using EM (Stoeber et al., 2012). Furthermore, 3) overexpression or acute addition of purified EHD2 via microinjection counteracts lipid induced scission of caveolae and hence result in caveolae stabilization at the PM (Hubert et al., 2020). It is very hard to see that the release and internalization of caveolae could result from anything else than that these have undergone scission. EHD2 has been found around the rim of caveolae (Matthaeus et al., 2022) and overexpression of EHD2 oligomerizing mutants have been shown to expand the caveola neck (Hoernke et al., 2017; Larsson et al., 2023).

      (f) unclear what is meant by 'lipid packing frustration' and how Dyngo supposedly induces it.

      Lipid packing frustration refers to what is usually referred to as lipid packing defect, but since lipid membranes are describe as a fluid system it should not have defects whereby, we believe that lipid packing frustration is more accurate. However, we have now changed the text and use “decreased lipid packing” or “decreased lipid order” more thoroughly to describe the effect on the plasma membrane.

      (8) IF of Cav1 is insufficient to claim puncta as caveolae. Co-stained puncta of caveolin with cavin are much stronger evidence. Same issue for Cav1-GFP puncta.

      We agree and have now provided IF showing cavin1 and EHD2 colocalization to Cav1GFP in non and Dyngo-4a-treated cells.

      (9) Fig 3E claims that "preferred position of Dyngo-4a was closer to the head groups" but the minimum looks to be in similar place as Fig 3B without cholesterol. Response:

      We appreciate the reviewer’s observation. The PMF minima in the POPC and POPC:Chol membranes are indeed close in absolute position (~1.1–1.2 nm from the bilayer center). However, as clarified in the revised text, the presence of cholesterol leads to a slight shift of Dyngo-4a closer to the headgroup region and broadens the positional distribution. This is also evident from the added density profiles (Fig. S3A) and is now described more precisely in the manuscript.

      Critically, these results do not support the notion that Dyngo affects lipid packing sufficiently, which is not measured in the simulations (though could be).

      We thank the reviewer for the excellent suggestion. In response, we have now included a detailed analysis of Dyngo-4a’s effect on lipid packing in the simulations. As described in the revised manuscript, we measured deuterium order parameters, area per lipid (APL), and lipid–Dyngo–cholesterol spatial distributions (Figs. 3-H, S3C-E). The results demonstrate that Dyngo-4a decreases lipid order in POPC:Chol membranes. Both single molecules and clusters reduce the order parameter by up to 0.04 units, particularly in the upper leaflet, where Dyngo-4a reside.The reduction is most pronounced in the midchain region of the sn1 tail and around the double bond of the sn2 tail. These effects were accompanied by increased APL in POPC:Chol membranes and by colocalization of Dyngo-4a near cholesterol-rich regions. Together, these data confirm that Dyngo-4a perturbs membrane organization and lipid packing in a composition-dependent manner. We believe these additions directly address the concern and demonstrate that the simulations indeed support the conclusion that Dyngo-4a modulates lipid packing.

      Finally, the simulation data do not show "that Dyngo-4a is competing with cholesterol"; it is unclear what 'competition' means in this context, but regardless, the data only shows that Dyngo sits at a similar location as cholesterol.

      We agree with the reviewer that “competition” was an imprecise term. We have rephrased the relevant sections to clarify that Dyngo-4a and cholesterol localize to overlapping regions and exhibit spatial coordination. As now stated in the manuscript, cholesterol appears to partially displace Dyngo-4a from its preferred depth seen in pure POPC, broadens its membrane distribution, and alters lipid packing. According to the order parameters there is an interplay between chol and Dyngo-4a and the heatmaps show that the distribution of chol in the membrane gets less uniform in the presence of Dyngo-4a. These interactions suggest that Dyngo-4a perturbs cholesterol-rich domains.

      As new analysis routines were added to the study, we have now also added the details on those to the Methods section of the text.

      (10) AFM measures the stiffness of the cell (as correctly explained in Results section) not "overall stiffness of the PM" as stated in the Discussion.

      We thank the reviewer for pointing this out, we have now altered this in the discussion section.

      (11) Fig2A: what was the starting lipid surface pressure? How does Dyngo insertion depend on initial lipid packing?

      The starting pressure lipid pressure was 20 mN m<sup>-1</sup which we now have incorporated in the figure legend. We performed several such experiments with a starting pressure ranging from 20-23 mN m<sup>-1</sup> showing consistent results which we described in the materials and methods section. Given that we also performed QCMD analysis and simulations on bilayers showing that Dyngo-4a adsorbed and inserted respectively, we have not performed a titration of starting pressures resulting in a MIP of Dygo-4a.

      (12) Fig 4B is a strange approach to measure membrane motion. Why not RMSD or some other displacement based method? As its shown, it implies that the area of the cell changes.

      The method that we used to quantify the area of the cell which is attached (or close to) the glass and thereby is visible in TIRF microscopy. This is area indeed changes over time which has been frequently observed and used to describe and quantify the mobility, lamellipodia and filopodia formation among other things. We agree that RMSD can also be used to analyze the data before and after treatments and we have now included RMSD­­­­ analysis in the manuscript.

      Reviewer #3 (Significance):

      The title, abstract, and introduction of the manuscript are largely framed around lipid packing, but most of the data investigate other unexpected effects of treating cells with Dyngo4a. The only measurement for lipid packing (or any other membrane properties) is Fig 4E-F. Therefore, this paper is effectively an investigation of an artefact of a common reagent, which itself could be a valuable contribution. However, the mechanism to explain its effect requires stronger evidence, and its broad biological significance needs further exploration.

      Overall, the impact of documenting the effects of Dyngo4a on membranes appears modest but may be valuable to the membrane trafficking community.

      Barucha-Kraszewska, J., S. Kraszewski, and C. Ramseyer. 2013. Will C-Laurdan dethrone Laurdan in fluorescent solvent relaxation techniques for lipid membrane studies? Langmuir. 29:1174-1182.

      Boucrot, E., M.T. Howes, T. Kirchhausen, and R.G. Parton. 2011. Redistribution of caveolae during mitosis. J Cell Sci. 124:1965-1972.

      Hoernke, M., J. Mohan, E. Larsson, J. Blomberg, D. Kahra, S. Westenhoff, C. Schwieger, and R. Lundmark. 2017. EHD2 restrains dynamics of caveolae by an ATP-dependent, membrane-bound, open conformation. Proc Natl Acad Sci U S A. 114:E4360-E4369.

      Hubert, M., E. Larsson, N.V.G. Vegesna, M. Ahnlund, A.I. Johansson, L.W. Moodie, and R. Lundmark. 2020. Lipid accumulation controls the balance between surface connection and scission of caveolae. Elife. 9.

      Larsson, E., B. Moren, K.A. McMahon, R.G. Parton, and R. Lundmark. 2023. Dynamin2 functions as an accessory protein to reduce the rate of caveola internalization. J Cell Biol. 222.

      Matthaeus, C., K.A. Sochacki, A.M. Dickey, D. Puchkov, V. Haucke, M. Lehmann, and J.W. Taraska. 2022. The molecular organization of differentially curved caveolae indicates bendable structural units at the plasma membrane. Nat Commun. 13:7234.

      McCluskey, A., J.A. Daniel, G. Hadzic, N. Chau, E.L. Clayton, A. Mariana, A. Whiting, N.N. Gorgani, J. Lloyd, A. Quan, L. Moshkanbaryans, S. Krishnan, S. Perera, M. Chircop, L. von Kleist, A.B. McGeachie, M.T. Howes, R.G. Parton, M. Campbell, J.A. Sakoff, X. Wang, J.Y. Sun, M.J. Robertson, F.M. Deane, T.H. Nguyen, F.A. Meunier, M.A. Cousin, and P.J. Robinson. 2013. Building a better dynasore: the dyngo compounds potently inhibit dynamin and endocytosis. Traffic. 14:1272-1289.

      Mohan, J., B. Moren, E. Larsson, M.R. Holst, and R. Lundmark. 2015. Cavin3 interacts with cavin1 and caveolin1 to increase surface dynamics of caveolae. J Cell Sci. 128:979-991.

      Moren, B., C. Shah, M.T. Howes, N.L. Schieber, H.T. McMahon, R.G. Parton, O. Daumke, and R. Lundmark. 2012. EHD2 regulates caveolar dynamics via ATP-driven targeting and oligomerization. Mol Biol Cell. 23:1316-1329.

      Shvets, E., V. Bitsikas, G. Howard, C.G. Hansen, and B.J. Nichols. 2015. Dynamic caveolae exclude bulk membrane proteins and are required for sorting of excess glycosphingolipids. Nat Commun. 6:6867.

      Sinha, B., D. Koster, R. Ruez, P. Gonnord, M. Bastiani, D. Abankwa, R.V. Stan, G. Butler-Browne, B. Vedie, L. Johannes, N. Morone, R.G. Parton, G. Raposo, P. Sens, C. Lamaze, and P. Nassoy. 2011. Cells respond to mechanical stress by rapid disassembly of caveolae. Cell. 144:402-413.

      Stoeber, M., I.K. Stoeck, C. Hanni, C.K. Bleck, G. Balistreri, and A. Helenius. 2012. Oligomers of the ATPase EHD2 confine caveolae to the plasma membrane through association with actin. EMBO J. 31:2350-2364.

    1. Résumé vidéo [00:47:45][^1^][1] - [01:05:30][^2^][2]:

      La troisième partie de la vidéo traite de la question de faire vivre les valeurs de la République en établissement scolaire, avec les témoignages de trois intervenants : une professeure d'histoire-géographie, un principal de collège et une chercheuse en sciences de l'éducation. Ils partagent leurs expériences, leurs pratiques et leurs réflexions sur les enjeux et les moyens de transmettre et d'incarner ces valeurs auprès des élèves, des personnels, des parents et des partenaires extérieurs.

      Points forts : + [00:47:45][^3^][3] L'école comme parenthèse pour les élèves * Leur faire sentir qu'ils peuvent vivre les valeurs républicaines à l'école * Leur faire découvrir la mixité sociale, l'égalité, la culture * Leur proposer des sorties pédagogiques, des projets, des actions + [00:49:01][^4^][4] L'interconnaissance avec les acteurs extérieurs à l'école * Développer des partenariats avec le monde associatif, la politique de la ville, etc. * Coproduire des projets avec ces acteurs pour prendre en charge le jeune dans sa globalité * Former les enseignants et les chefs d'établissement avec ces acteurs pour avoir un référentiel commun + [00:53:05][^5^][5] La question du public allophone * Lever la barrière de la langue pour faire vivre les valeurs de citoyens français * Profiter de la grande diversité et de la fascination pour la France des élèves allophones * Faire partie de collectifs pour penser et débattre ensemble + [00:58:51][^6^][6] La synthèse graphique de la table ronde * Représenter le parcours professionnel des intervenants comme un parcours d'équilibriste * Identifier les valeurs qui les animent, les publics qu'ils touchent, les actions qu'ils mènent * Souligner l'importance du collectif, de la politique, de la mixité sociale harmonieuse

    2. Résumé vidéo [00:23:45][^1^][1] - [00:47:41][^2^][2]:

      La deuxième partie de la vidéo parle de comment faire vivre les valeurs de la République en établissement scolaire, à travers des expériences, des dispositifs et des instances. Les intervenants partagent leurs pratiques, leurs réflexions et leurs témoignages sur les enjeux de l'éducation à la citoyenneté, à la santé, à l'égalité, à la laïcité et à la fraternité.

      Points forts: + [00:24:14][^3^][3] L'articulation entre l'individuel et le collectif * Comment chacun se positionne sur ses valeurs * Comment on travaille avec l'ensemble des acteurs * Comment on crée du lien et du sens entre les actions + [00:24:26][^4^][4] Le comité d'éducation à la santé, à la citoyenneté et à l'environnement (CESCE) * Une instance de réflexion, d'observation, de veille et de proposition * Une instance qui associe les parents, les partenaires, les élus, les référents et les professeurs * Une instance qui définit la politique de prévention et d'éducation dans les domaines de la citoyenneté, de la santé, de la lutte contre les violences, de l'égalité fille-garçon, etc. + [00:31:00][^5^][5] L'expérience du collège Berlios à Paris * Un établissement qui accueille une grande diversité d'élèves * Un établissement qui a mis en place une mixité harmonieuse et une bienveillance éducative * Un établissement qui a amélioré son climat scolaire, sa réussite et son ouverture + [00:38:16][^6^][6] L'éducation universelle en classe * Une approche pédagogique qui vise à lever les obstacles à l'apprentissage * Une approche qui s'interroge sur les besoins éducatifs particuliers des élèves * Une approche qui favorise l'équité et la réussite de tous les élèves

    3. Résumé vidéo [00:00:01][^1^][1] - [00:23:42][^2^][2]:

      Cette vidéo est la première partie d'une table ronde sur le thème "Faire vivre les valeurs en établissement". Trois intervenants partagent leurs expériences et leurs réflexions sur les questions d'éducabilité, de réussite, d'égalité et de citoyenneté à l'école. Ils abordent également les enjeux de l'éducation inclusive, de la mixité sociale, du climat scolaire et du bien-être des élèves et des personnels.

      Points forts: + [00:00:01][^3^][3] Présentation de la table ronde et des intervenants * Sandrine Benevkir, conseillère technique établissement vie scolaire au cabinet de la rectrice de l'académie de Lille * Faride Boualifa, proviseur au lycée professionnel Le Chatelier à Marseille * Anne-Lore Perrin, conseillère pédagogique de circonscription et docteure en psychologie sociale + [00:07:52][^4^][4] Intervention de Sandrine Benevkir * Elle évoque son parcours personnel et professionnel marqué par les questions d'éducabilité, de réussite et d'égalité * Elle présente les différents dossiers qu'elle porte au niveau académique, comme la vie lycéenne, l'égalité fille-garçon, la prévention de la radicalisation ou les valeurs de la République * Elle insiste sur l'importance de l'appartenance à des collectifs pour transmettre et promouvoir ces valeurs + [00:11:47][^5^][5] Intervention de Faride Boualifa * Il raconte son parcours de CPE puis de chef d'établissement dans des zones difficiles, animé par la volonté de lutter contre les déterminismes sociaux * Il relate son expérience de la mixité sociale à travers une expérimentation de montée alternée entre deux collèges aux profils sociologiques opposés * Il souligne le rôle du climat scolaire, du bâti, des projets, des sorties et du management dans la réussite et l'épanouissement des élèves + [00:18:18][^6^][6] Intervention d'Anne-Lore Perrin * Elle expose ses recherches en psychologie sociale sur les attitudes des enseignants à l'égard de l'éducation inclusive * Elle montre que les valeurs personnelles et organisationnelles des enseignants influencent leur perception de l'éducation inclusive * Elle révèle que le terme de réussite peut avoir des effets contradictoires sur les attitudes des enseignants, selon qu'il est défini et compris

    4. il s'agit de l'atelier 2 autour du bien-être à l'école de l'atelier 3 du coup je vous le fais de têtes alors 00:57:45 que j'avais des des notes en tout cas c'est l'atelier 3 et l'atelier 4 les ateliers donc 2 3 et 4 sont annulés en revanche il reste des places dans les autres ateliers du coup vous pouvez vous y rendre de façon spontanée et vous 00:57:58 ajouter sur les listes d'émargements donc il reste le 1 euh valeur de la République avec du coup l'équipe de collègues Carole Janine et j'ai oublié le nom de la troisème personne le 00:58:11 l'atelier 5 l'Escape game autour de l'inclusion le 6 euh le 6 les réseaux sociaux merci beaucoup autour des réseaux sociaux avec le Clémi 00:58:22 euh l'atelier 7 euh qui est autour de l'interculturalité justement du plurilinguisme et l'atelier 8 avec la question de l'expérience du débat en classe
    1. Résumé de la vidéo [00:00:00][^1^][1] - [00:19:25][^2^][2]:

      La vidéo présente une avocate spécialisée dans la défense des enfants atteints de troubles neurodéveloppementaux.

      Elle explique son approche flexible en droit pour s'adapter aux besoins spécifiques de ses clients, souvent confrontés à des diagnostics erronés et à des prises en charge inadaptées.

      Elle souligne l'importance de lutter contre la stigmatisation et de promouvoir des soins adaptés, en mettant l'accent sur la formation des professionnels et l'écoute des parents.

      Points forts: + [00:00:11][^3^][3] Défense des enfants atypiques * Troubles TDAH, TSA * Syndrome d'alcoolisation fœtale + [00:01:00][^4^][4] Flexibilité en droit * Droit diversifié * Centré sur le neurodéveloppement + [00:02:01][^5^][5] Lutte contre les diagnostics erronés * Responsabilité médicale * Prises en charge inadaptées + [00:03:02][^6^][6] Droits des handicapés * Allocations via MDPH * Prise en charge adaptée + [00:04:01][^7^][7] Importance de l'accompagnement humain * AESH individuels * Besoin d'attention soutenue + [00:05:03][^8^][8] Contentieux en droit administratif * Éducation nationale * Conseils de discipline + [00:07:00][^9^][9] Responsabilité pénale des jeunes * Discernement * Alternatives à la prison + [00:09:01][^10^][10] Protection de l'enfance * Formations obsolètes * Importance du diagnostic + [00:11:01][^11^][11] Rôle des parents * Observations parentales * Collaboration avec professionnels + [00:12:01][^12^][12] Risques de placement abusif * Informations préoccupantes * Mesures administratives + [00:15:46][^13^][13] Assistance éducative * Évaluations sociales et psychologiques * Importance des soins adaptés + [00:17:00][^14^][14] Départ des CMP * Risque d'IP pour défaut de soins * Importance de la prise en charge en libéral + [00:19:00][^15^][15] Écoute des parents * Non-toxicité * Besoin de répit, pas de retrait Video summary [00:20:00][^1^][1] - [00:39:54][^2^][2]:

      La vidéo présente une avocate spécialisée dans la défense des enfants avec des troubles neurodéveloppementaux.

      Elle explique son approche flexible en droit pour s'adapter aux besoins spécifiques de ses clients et aborde les défis liés à la responsabilité médicale, les droits des handicapés, et la protection de l'enfance.

      Highlights: + [00:20:00][^3^][3] Défense des enfants * Troubles neurodéveloppementaux * Flexibilité en droit + [00:21:01][^4^][4] Responsabilité médicale * Diagnostics inadaptés * Culpabilisation des parents + [00:22:26][^5^][5] Droits des handicapés * Allocations via MDPH * Prise en charge adaptée + [00:24:02][^6^][6] Protection de l'enfance * Évaluation des besoins * Formations pour professionnels + [00:27:44][^7^][7] Enjeux du placement * Impact sur les familles * Importance du diagnostic + [00:30:03][^8^][8] Rôle du juge des enfants * Lien entre services et familles * Compréhension des troubles