10,000 Matching Annotations
  1. May 2025
    1. Reviewer #2 (Public review):

      Summary:

      Mohanty et al. present a new deep learning method to identify intracellular allosteric modulators of GPCRs. This is an interesting field for e.g. the design of novel small molecule inhibitors of GPCR signalling. A key limitation, as mentioned by the authors, is the limited availability of data. The method presented, Gcoupler, aims to overcome these limitations, as shown by experimental validation of sterols in the inhibition of Ste2p, which has been shown to be relevant molecules in human and rat cardiac hypertrophy models.<br /> They have made their code available for download and installation, which can easily be followed to set up software on a local machine.

      Strengths:

      - Clear GitHub repository

      - Extensive data on yeast systems

      Weaknesses:

      - No assay to directly determine the affinity of the compounds to the protein of interest.

      In conclusion, the authors present an interesting new method to identify allosteric inhibitors of GPCRs, which can easily be employed by research labs. Whilst their efforts to characterize the compounds in yeast cells, in order to confirm their findings, it would be beneficial if the authors show their compounds are active in a simple binding assay.

    1. Expressing loops as iterator transformations can also produce code that is more efficient. In the interests of safety, Rust performs bounds checking on access to contiguous containers such as vectors and slices; an attempt to access a value beyond the bounds of the collection triggers a panic rather than an access to invalid data. An old-style loop that accesses container values (e.g., values[i]) might be subject to these runtime checks, whereas an iterator that produces one value after another is already known to be within range.

      iterator 방식을 사용하는 것이 bound checking을 피하기 때문에 효율적이다.

    2. This rich collection of iterator transformations is there to be used. It produces code that is more idiomatic, more compact, and has clearer intent.

      풍부한 collection들이 코드들을 이상적이고, 명확적으로 보이도록 만들어준다.

    1. It is not a simple task to identify parts of the code where user-controlled data enters the system (so-called ‘sources’ in taint analysis nomenclature), and it is especially hard to find all feasible paths from these to the security sensitive ‘sinks’.

      Could be an interesting problem to solve?

    1. For example, the syntactic analysis stage builds a conventional expression tree, but this tree is expressed as objects—instances of various classes relevant to compilation: Class, Method, Message, Selector, Argument, etc. […] The code for these operations is encapsulated inside each object, and is not spread throughout the system so changes are easily made. And all these specialized classes rely heavily on reusable code inherited from more primitive classes like Object, HashTable, Symbol, Token, and LinkedList.

      This is an interesting claim, given that one of the common criticisms of the classic, inheritance-heavy style of classic OOP is precisely that it leads to code that is indeed "spread throughout". Adele Goldberg commenting on the legacy of Smalltalk made a well-known quip, "In Smalltalk, everything happens somewhere else."

      It's notable that although there many common word sequences between this paper and the later, similarly titled one that appeared in IEEE Software (1984), this passage doesn't appear to be present (based on a quick skim—I could be missing something), and the closest corresponding statements are significantly reworded.

    Tags

    Annotators

    1. Input Risk. An LLM does not challenge a prompt which is leading1 or whose assumptions are flawed or context is incomplete. Example: An engineer prompts, "Provide a thread-safe list implementation in C#" and receives 200 lines of flawless, correct code. It's still the wrong answer, because the question should have been, "How can I make this code thread-safe?" and whose answer is "Use System.Collections.Concurrent" and 1 line of code. The LLM is not able to recognize an instance of the XY problem2 because it was not asked to.

      This is a good example

    1. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      This valuable study investigates prey capture by archer fish, showing that even though the visuomotor behavior unfolds very rapidly (within 40-70 ms), it is not hardwired; it can adapt to different simulated physics and different prey shapes. Although there was agreement that the model system, experimental design, and main hypothesis are certainly interesting, opinions were divided on whether the evidence supporting the central claims is incomplete. A more rigorous definition and assessment of "reflex speed", more detailed evidence of stimulus control, and a more detailed analysis of individual subjects could potentially increase confidence in the main conclusions.

      Thank you very much. There are several points that we had to absolutely make sure that they are very well understood. (1) Explaining in the best possible way the experiment with a fly sliding on top of a glass plate. Here, the virtual ballistic landing point can be calculated using simple high school physics. It turns out that this is where the fish turn to – even though the fly is not falling at all. Once this is understood it becomes clear that we can precisely measure latency and accuracy of the C-start turns. In the new version we explain this essential aspect in more detail and add an extra Figure (new Figure 2). This may, perhaps, help readers to notice this important background (previously covered in Fig. 1C). (2) The full experimental evidence that the VR method works is presented in more detail and all measurements necessary will be clear after the new Figure 2. They will however not be clear if this Figure is ignored. (3) We have rewritten the manuscript to make it easier to understand what we wanted to show, why we needed VR to proceed and why the archerfish highspeed decision lent itself so readily to tackle the problem. (4) We emphasize the importance of speed-accuracy tradeoffs in standard decision-making and also include data on the absence of such a relation in the archerfish highspeed decisions.

      So, in summary, we have emphasized what we wanted to show and what we did not want to show, we have rewritten the text to make it easier for future readers and we have tried to add more guidance to the figures. We do hope very much that the beauty of the quite unexpected findings is more easily visible to those who take the trouble of actually reading the paper.  

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      The authors test whether the archerfish can modulate the fast response to a falling target. By manipulating the trajectory of the target, they claim that the fish can modulate the fast response. While it is clear from the result that the fish can modulate the fast response, the experimental support for the argument that the fish can do it for a reflex-like behavior is inadequate.

      Please note that we have not simply tested whether archerfish can 'modulate the fast response'. We quantitatively test specific hypotheses on the rules used by the fish. For this the accuracy of the decisions is analyzed with respect to specific points that can be calculated precisely in each of the experiments. These points are shown on the figures and in the movies that were meant to illustrate this important aspect. We had to make sure that the way we calculate the predicted point(s) is made as clear as possible in the text. We added more text and separated the fundamentally important aspects in a separate Figure 2 to make it more difficult to overlook the fundamental aspects that lay the foundation for everything that follows.

      Strengths:

      Overall, the question that the authors raised in the manuscript is interesting.

      Thank you and we do hope very much that, with our revision, you will see the beauty of the findings.

      Weaknesses:

      (1) The argument that the fish can modulate reflex-like behavior relies on the claim that the archerfish makes the decision in 40 ms. There is little support for the 40 ms reaction time. The reaction time for the same behavior in Schlegel 2008, is 6070 ms, and in Tsvilling 2012 about 75 ms, if we take the half height of the maximum as the estimated reaction time in both cases. If we take the peak (or average) of the distribution as an estimation of reaction time, the reaction time is even longer. This number is critical for the analysis the authors perform since if the reaction time is longer, maybe this is not a reflex as claimed. In addition, mentioning the 40 ms in the abstract is overselling the result. The title is also not supported by the results.

      Although the minimum latency is indeed 40 ms (it can be slightly less: e.g., see the evidence in the paper, for instance the plots in the new Fig. 4) the paper's statements are not dependent on a specific number. Even if minimum latency was 100 ms (which it is not) the speed of the response and the absence of a speedaccuracy relation (now shown directly in Fig. 4) is what is of importance. To show this we have completely rewritten large parts of the manuscript.

      (2) A critical technical issue of the stimulus delivery is not clear. The frame rate is 120 FPS and the target horizontal speed can be up to 1.775 m/s. This produces a target jumping on the screen 15 mm in each frame. This is not a continuous motion. Thus, the similarity between the natural system where the target experiences ballistic trajectory and the experiment here is not clear. Ideally, another type of stimulus delivery system is needed for a project of this kind that requires fast-moving targets (e.g. Reiser, J. Neurosci.Meth. 2008). In addition, the screen is rectangular and not circular, so in some directions, the target vanishes earlier than others. It must produce a bias in the fish response but there is no analysis of this type.

      Please note that the new Fig. 3 (former Fig. 2) reports all the evidence that is needed to just show this and in a way that could in no way have been better. We have rewritten the text to explain what needs to be shown experimentally in order to be able to proceed, what critical tests were done and what results were obtained. We also add a short comment on another unsuccessful attempt that we have tried before.

      (3) The results here rely on the ability to measure the error of response in the case of a virtual experiment. It is not clear how this is done since the virtual target does not fall. How do the authors validate that the fish indeed perceives the virtual target as the falling target? Since the deflection is at a later stage of the virtual trajectory, it is not clear what is the actual physics that governs the world of the experiment. Overall, the experimental setup is not well designed.

      Understanding this aspect is essential. If the glass plate experiment is not thoroughly understood (new Fig. 2 with new text to emphasize that this is absolutely essential) nothing that follows makes any sense, including what is meant by the statement that the decision could be hardwired to ballistic motion.

      Reviewer #2 (Public review):

      Summary:

      This manuscript studies prey capture by archer fish, which observe the initial values of motion of aerial prey they made fall by spitting on them, and then rapidly turn to reach the ballistic landing point on the water surface. The question raised by the article is whether this incredibly fast decision-making process is hardwired and thus unmodifiable or can be adjusted by experience to follow a new rule, namely that the landing point is deflected from a certain amount of the expected ballistic landing point. The results show that the fish learn the new rule and use it afterward in a variety of novel situations that include height, side, and speed of the prey, and which preserve the speed of the fish's decision. Moreover, a remarkable finding presented in this work is the fact that fish that have learned to use the new rule can relearn to use the ballistic landing point for an object based on its shape (a triangle) while keeping simultaneously the 'deflected rule' for an object differing in shape (a disc); in other words, fish can master simultaneously two decisionmaking rules based on the different shape of objects.

      Strengths:

      The manuscript relies on a sophisticated and clever experimental design that allows changing the apparent landing point of a virtual prey using a virtual reality system. Several robust controls are provided to demonstrate the reliability and usefulness of the experimental setup.

      Overall, I very much like the idea conveyed by the authors that even stimuli triggering apparently hardwired responses can be relearned in order to be associated with a different response, thus showing the impressive flexibility of circuits that are sometimes considered mediating pure reflexive responses.

      Thank you so much for this precise assessment of what we have shown!

      This is the case - as an additional example - of the main component of the Nasanov pheromone of bees (geraniol), which triggers immediate reflexive attraction and appetitive responses, and which can, nevertheless, be learned by bees in association with an electric shock so that bees end up exhibiting avoidance and the aversive response of sting extension to this odorant (1), which is a fully unnatural situation, and which shows that associative aversive learning is strong enough to override preprogrammed responding, thus reflecting an impressive behavioral flexibility.

      That's very interesting, thanks and we are very happy to mention this important study in the revised version.

      Weaknesses:

      As a general remark, there is some information that I missed and that is mandatory in the analysis of behavioral changes.

      Firstly, the variability in the performances displayed. The authors mentioned that the results reported come from 6 fish (which is a low sample size). How were the individual performances in terms of consistency? Were all fish equally good in adjusting/learning the new rule? How did errors vary according to individual identity? It seems to me that this kind of information should be available as the authors reported that individual fish could be recognized and tracked (see lines 620-635) and is essential for appreciating the flexibility of the system under study.

      Secondly, the speed of the learning process is not properly explained. Admittedly, fish learn in an impressive way the new rule and even two rules simultaneously; yet, how long did they need to achieve this? In the article, Figure 2 mentions that at least 6 training stages (each defined as a block of 60 evaluated turn decisions, which actually shows that the standard term 'Training Block' would be more appropriate) were required for the fish to learn the 'deflected rule'. While this means 360 trials (turning starts), I was left with the question of how long this process lasted. How many hours, days, and weeks were needed for the fish to learn? And as mentioned above, were all fish equally fast in learning? I would appreciate explaining this very important point because learning dynamics is relevant to understanding the flexibility of the system.

      First, it is very important to keep the question in mind that we wanted to clarify: Does the system have the potential to re-tune the decisions to other non-ballistic relations between the input variables and the output? This would have been established if one fish was found capable of doing that. We have rewritten the introduction and discussion to specifically say what our aim was. We feel that the paper is already extremely long and difficult to understand (even after we tried very hard in this revision to explain everything in detail and as good as we could), requires the establishment of a method whose success was really unexpected and finding a degree of plasticity that we did not expect at all. We also have added a section in the discussion stating what we can, and we cannot say given the number of fish examined. For instance, we do not know if there are differences in the speed at which the different individuals mastered the new rules and if social learning could play a role to speed up the acquisition. That is a brilliant idea and we are very interested in checking this - but we wanted to stick with the (quite ambitious) goal of the present study.

      Reference:

      (1) Roussel, E., Padie, S. & Giurfa, M. Aversive learning overcomes appetitive innate responding in honeybees. Anim Cogn 15, 135-141, doi:10.1007/s10071011-0426-1 (2012).

      Thanks for this reference!

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Minor comments:

      (1) What is the difference between Reinel, J. Exp. Bio. 2016 and the current study?

      Clearly in that study all objects were strictly falling ballistically, and latency and accuracy of the turn decisions were determined when the initial motion was not only horizontal but had an additional vertical component of speed. The question of that study was if the need to account to an additional variable (vertical speed) in the decision would affect its latency or accuracy. The study showed that also then archerfish rapidly turn to the later impact point. It also showed that accuracy and latency were not changed by the added degree of freedom.

      (2) How do Figures 2 F and G demonstrate that an accurate start is possible?

      See above.

      (3) Figure 4 is hard to follow, it is not clear what is presented and how it supports the claim that the new rule is represented in a way that allows immediate generalization.

      Yes, this is not at all an easy experiment. Briefly, fish were re-trained at only one height level and then are tested at other levels. The strategy is as in the experiments Schuster et al. 2004 Current Biology, Vol. 14, 1565–1568, Figure 5. We have changed text and Figure (new Figure 5) to show how the predictions were reached.

      Reviewer #2 (Recommendations for the authors):

      Minor remarks

      Lines 88-90: I was surprised to see that in this section, the authors did not mention the speed-accuracy trade-off off which has inspired numerous experiments in animal behavior (1). This could be used to back their point, namely, that speed comes with an apparent cost of a loss in accuracy.

      Yes, that is a crucial aspect that was completely missing even though it demonstrates a key aspect of 'standard' versus some 'highspeed' decisions! We definitely had to include it and also to show, directly under the conditions of our present experiments (in the new Fig. 4) the absence of a significant speedaccuracy relation for the archerfish highspeed decisions! Thank you very much for emphasizing this crucial aspect!

      Lines 182-184: Specify that this situation corresponds to the hatched bar in Figure (this can be specified in the caption of the figure, where the bar is not mentioned).

      Thanks!

      Lines 187-188: here and elsewhere (e.g. lines 224-225, etc), the error made by the fish is presented in cm (see Figure 2 where the inset shows how the error was computed). I wonder if it would not be more appropriate to present it in terms of the angular difference between the trajectory made by the fish and the food delivery location.

      Angles could also be used, but because of the large variation in initial distances (that we wanted to make sure that the fish had to capture a rule, allowing them to respond from various distances) another measure was used that we found somehow more natural: it is simply how close a fish would get to the landing point if it continued in the direction assumed after the turn. Although we describe how we defined accuracy we did not discuss why this measure was used in this (and many previous studies). We are very happy to add this. Please also note that running all tests based on angular errors (which we also have done throughout to ensure that the conclusions are independent on an arbitrary measure of the error) leads to no different conclusion. We have added a brief explanation in the text and in the new Fig. 2.

      Lines 299-323: Is it my impression or did fish have more trouble in generalizing their learned rule to the condition untrained larger height (see for instance red curves in Figures 4 D, E, G)? Could the authors elaborate on this point?

      We changed the code to make this more clear. The red curves (before marked A to highlight impact point option A) correspond to the errors to the ballistic impact point without deflection, so what would have to be compared are the black curves (marked P to highlight the virtual impact point that should be chosen had the fish immediately generated to the untrained conditions). We have rewritten the text and the labels in the Figure (now Figure 5) to illustrate the predictions and to name them in more helpful ways and so that they can't be confused with panel labels. At any rate, what needs to be compared, to check the idea, are the black curves, and these are not statistically different between both heights (p=0.525, Mann-Whitney). Interestingly, none of the black curves from all panels (D-G) differ (p>0.3).

      Line 559: if we are speaking here about luminance contrast, it should read 'Michelson Contrast' rather than 'Michelsen Contrast'.

      Absolutely, thanks!

      References

      (1) Chittka, L., Skorupski, P. & Raine, N. E. Speed-accuracy tradeoffs in animal decision making. Trends Ecol Evol 24, 400-407, doi:10.1016/j.tree.2009.02.010 (2009).

      An excellent paper that helps to stress our main question

    1. Reviewer #1 (Public review):

      Summary:

      This paper examines how geometric regularities in abstract shapes (e.g., parallelograms, kites) are perceived and processed in the human brain. The manuscript contains multimodal data (behavior, fMRI, MEG) from adults and additional fMRI data from 6-year-old children. The key findings show that (1) processing geometric shapes lead to reduced activity in ventral areas in comparison to complex stimuli and increased activity in intraparietal and inferior temporal regions, (2) the degree of geometric regularity modulates activity in intraparietal and inferior temporal regions, (3) similarity in neural representation of geometric shapes can be captured early by using CNN models and later by models of geometric regularity. In addition to these novel findings, the paper also includes a replication of behavioral data, showing that the perceptual similarity structure amongst the geometric stimuli used can be explained by a combination of visual similarities (as indexed by a feedforward CNN model of the ventral visual pathway) and geometric features.

      Strengths:

      (1) The study incorporates multi-modal data that uses more than one task and different populations of participants (adults and children).

      (2) It replicates behavioral findings of an earlier study in a larger cohort.

      (3) The paper comes with openly accessible code in a well-documented GitHub repository, and the data will be published with the paper on OpenNeuro.

      Weaknesses:

      I wonder how task difficulty and linguistic labels interact with the current findings. Based on the behavioral data, shapes with more geometric regularities are easier to detect when surrounded by other shapes. Do shape labels that are readily available (e.g., "square") help in making accurate and speedy decisions? Can the sensitivity to geometric regularity in intraparietal and inferior temporal regions be attributed to differences in task difficulty? Similarly, are the MEG oddball detection effects that are modulated by geometric regularity also affected by task difficulty?

    1. For larger projects with multiple interconnected components, monorepos can be a game-changer, providing efficient dependency management, atomic commits, simplified code sharing, and an improved developer experience.
    1. Vendor lock-in is everywhere — Wix, Shopify, no-code platforms, even overly restrictive SaaS tools. If you haven’t written the code, don’t truly own or control the product and data, and can’t deploy or host it wherever and however you choose — then you haven’t actually built anything.

      I would say shopify and Wix are definitely useful for quite a few people who wouldn't otherwise make a website.

      But for those who are able to do it with other means, giving up freedoms with a vendor lock-in might not be worth it will be my interpretation of this statement

    2. LLMs and code assistants are amazing — but they’re just tools. Like a power drill vs. a screwdriver. Faster, maybe smarter, but still dependent on the person holding them.

      profound!

    1. Reviewer #3 (Public review):

      Summary:

      The authors used recurrent neural network modelling of spatial navigation tasks to investigate border and place cell behaviour during remapping phenomena.

      Strengths:

      The neural network training seemed for the most part (see comments later) well-performed, and the analyses used to make the points were thorough.

      The paper and ideas were well-explained.

      Figure 4 contained some interesting and strong evidence for map-like generalisation as environmental geometry was warped.

      Figure 7 was striking and potentially very interesting.

      It was impressive that the RNN path-integration error stayed low for so long (Fig A1), given that normally networks that only work with dead-reckoning have errors that compound. I would have loved to know how the network was doing this, given that borders did not provide sensory input to the network. I could not think of many other plausible explanations... It would be even more impressive if it was preserved when the network was slightly noisy.

      Update:

      The analysis of how the RNN remapped, using a context signal to switch between largely independent maps, and the examination of the border like tuning in the recurrent units of the RNN, were both thorough and interesting. Further, in the updated response I appreciated the additional appendix E which helped substantiate the claim that the RNN neurons were border cells.

      Weaknesses:

      The remapping results were also puzzling. The authors present convincing evidence that the recurrent units effectively form 6 different maps of the 6 different environments (e.g. the sparsity of the code, or fig 6a), with the place cells remapping between environments. Yet, as the authors point out, in neural data the finding is that some cells generalise their co-firing patterns across environments (e.g. grid cells, border cells), while place cells remap, making it unclear what correspondence to make between the authors network and the brain. There are existing normative models that capture both entorhinal's consistent and hippocampus' less consistent neural remapping behaviour (Whittington et al. and probably others), what have we then learnt from this exercise?

      Update: see summary below

      I felt that the neural data analysis was unconvincing. Most notably, the statistical effect was found in only one of seven animals. Random noise is likely to pass statistical tests 1 in 20 times (at 0.05 p value), this seems like it could have been something similar? Further, the data was compared to a null model in which place cell fields were randomly distributed. The authors claim place cell fields have two properties that the random model doesn't (1) clustering to edges (as experimentally reported) and (2) much more provocatively, a hexagonal lattice arrangement. The test seems to collude the two; I think that nearby ball radii could be overrepresented, as in figure 7f, due to either effect. I would have liked to see a computation of the statistic for a null model in which place cells were random but with a bias towards to boundaries of the environment that matches the observed changing density, to distinguish these two hypotheses.

      Update: the authors acknowledge these shortcomings and have appropriately tempered their data related claims.

      Some smaller weaknesses:<br /> - Had the models trained to convergence? From the loss plot it seemed like not, and when including regularisors recent work (grokking phenomena, e.g. Nanda et al. 2023) has shown the importance of letting the regularisor minimise completely to see the resulting effect. Else you are interpreting representations that are likely still being learnt, a dangerous business.<br /> Update: I understand that practical limitations make testing this thoroughly impossible, which is fair enough.

      - The claim that this work provided a mathematical formalism of the intuitive idea of a cognitive map seems strange, given that upwards of 10 of the works this paper cite also mathematically formalise a cognitive map into a similar integration loss for a neural network.<br /> Update: the introduction of these ideas hasn't changed, and my concerns above remain.

      Aim Achieved? Impact/Utility/Context of Work

      I think this is a thorough exploration of how this network with these losses is able to path-integrate its position and remap. This is useful, it is good to know how another neural network with slightly different constraints learns to perform these behaviours.

      In the updated version of the manuscript I am happy to say that I think there are few claims that are unsubstantiated (see weakness section above that has been significantly updated). The link to neuroscience remains the biggest shortcoming of this work in my view. The authors point to two main results in this direction. First, the ability for interactions only between border-type and place cells to produce many observed place-cell results, providing a new hypothesis. Second, a connection between grid cells, place cells, and border cells, in the production of hexagonal arrangements of place cells.

      Regarding the first, as the authors discuss, current evidence suggests border cells are invariant across environments whereas this work finds border cells for specific environments (they use the words rate-remapping boundary-type cells). It seems likely to me that there are many ways a neural network can path-integrate across different environments. In other models where the same base map is re-used (e.g. TEM) grid cells emerge, in this work where the maps for different environments are disjoint these border-like cells that do not match an observed cell type in their tuning to environment are involved. I find this a really interesting alternative (I think what an RNN does is interesting in its own right), but I don't see why I should think it is what the brain does, given that it appears to match observations less well (existence of grid cells, consistent firing patterns of border cells across environments). The smoking gun in favour of the author's hypothesis would be finding these sparse border like cells, or some other evidence of gating like interactions between border and place cells as they discuss. Finding such evidence sounds difficult (so not reasonable to ask for in a rebuttal), and to reiterate, I applaud the authors for clearly outlining an alternative, but I remain unconvinced.

      Regarding the second point, while the grid-like placement of field centres was cool, and I applaud the authors for including real neural data comparisons, as the authors say, the data is preliminary, and further evidence would be required to fully substantiate this claim.

      As such, in my mind it is an interesting alternative hypothesis. I look forward to seeing experimental predictions or comparisons that can tighten the link, substantiating the claim that what this particular RNN is doing reflects the algorithms at work in the brain.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      This work studies representations in a network with one recurrent layer and one output layer that needs to path-integrate so that its position can be accurately decoded from its output. To formalise this problem, the authors define a cost function consisting of the decoding error and a regularisation term. They specify a decoding procedure that at a given time averages the output unit center locations, weighted by the activity of the unit at that time. The network is initialised without position information, and only receives a velocity signal (and a context signal to index the environment) at each timestep, so to achieve low decoding error it needs to infer its position and keep it updated with respect to its velocity by path integration.

      The authors take the trained network and let it explore a series of environments with different geometries while collecting unit activities to probe learned representations. They find localised responses in the output units (resembling place fields) and border responses in the recurrent units. Across environments, the output units show global remapping and the recurrent units show rate remapping. Stretching the environment generally produces stretched responses in output and recurrent units. Ratemaps remain stable within environments and stabilise after noise injection. Low-dimensional projections of the recurrent population activity forms environment-specific clusters that reflect the environment's geometry, which suggests independent rather than generalised representations. Finally, the authors discover that the centers of the output unit ratemaps cluster together on a triangular lattice (like the receptive fields of a single grid cell), and find significant clustering of place cell centers in empirical data as well.

      The model setup and simulations are clearly described, and are an interesting exploration of the consequences of a particular set of training requirements - here: path integration and decodability. But it is not obvious to what extent the modelling choices are a realistic reflection of how the brain solves navigation. Therefore it is not clear whether the results generalize beyond the specifics of the setup here.

      Strengths:

      The authors introduce a very minimal set of model requirements, assumptions, and constraints. In that sense, the model can function as a useful 'baseline', that shows how spatial representations and remapping properties can emerge from the requirement of path integration and decodability alone. Moreover, the authors use the same formalism to relate their setup to existing spatial navigation models, which is informative.

      The global remapping that the authors show is convincing and well-supported by their analyses. The geometric manipulations and the resulting stretching of place responses, without additional training, are interesting. They seem to suggest that the recurrent network may scale the velocity input by the environment dimensions so that the exact same path integrator-output mappings remain valid (but maybe there are other mechanisms too that achieve the same).

      The clustering of place cell peaks on a triangular lattice is intriguing, given there is no grid cell input. It could have something to do with the fact that a triangular lattice provides optimal coverage of 2d space? The included comparison with empirical data is valuable, although the authors only show significant clustering - there is no analysis of its grid-like regularity.

      First of all, we would like to thank the reviewer for their comprehensive feedback, and their insightful comments. Importantly, as you point out, our goal with this model was to build a minimal model of place cell representations, where representations were encouraged to be place-like, but free to vary in tuning and firing locations. By doing so, we could explore what upstream representations facilitate place-like representations, and even remapping (as it turned out) with minimal assumptions. However, we agree that our task does not capture some of the nuances of real-world navigation, such as sensory observations, which could be useful extensions in future work. Then again, the simplicity of our setup makes it easier to interpret the model, and makes it all the more surprising that it learns many behaviors exhibited by real world place cells.

      As to the distribution of phases - we also agree that a hexagonal arrangement likely reflects some optimal configuration for decoding of location.

      And we agree that the symmetry within the experimental data is important; we have revised analyses on experimental phase distributions, and included an analysis of ensemble grid score, to quantify any hexagonal symmetries within the data.

      Weaknesses:

      The navigation problem that needs to be solved by the model is a bit of an odd one. Without any initial position information, the network needs to figure out where it is, and then path-integrate with respect to a velocity signal. As the authors remark in Methods 4.2, without additional input, the only way to infer location is from border interactions. It is like navigating in absolute darkness. Therefore, it seems likely that the salient wall representations found in the recurrent units are just a consequence of the specific navigation task here; it is unclear if the same would apply in natural navigation. In natural navigation, there are many more sensory cues that help inferring location, most importantly vision, but also smell and whiskers/touch (which provides a more direct wall interaction; here, wall interactions are indirect by constraining velocity vectors). There is a similar but weaker concern about whether the (place cell like) localised firing fields of the output units are a direct consequence of the decoding procedure that only considers activity center locations.

      Thank you for raising this point; we absolutely agree that the navigation task is somewhat niche. However, this was a conscious decision, to minimize any possible confounding from alternate input sources, such as observations. In part, this experimental design was inspired by the suggestion that grid cells support navigation/path integration in open-field environments with minimal sensory input (as they could, conceivably do so with no external input). This also pertains to your other point, that boundary interactions are necessary for navigation. In our model, using boundaries is one solution, but there is another way around this problem, which is conceivably better: to path integrate in an egocentric frame, starting from your initial position. Since the locations of place fields are inferred only after a trajectory has been traversed, the network is free to create a new or shifted representation every time, independently of the arena. In this case, one might have expected generalized solutions, such as grid cells to emerge. That this is not the case, seems to suggest that grid cells may somehow not be optimal for pure path integration, or at the very least, hard to learn (but may still play a part, as alluded to by place field locations). We have tried to make these points more evident in the revised manuscript.

      As for the point that the decoding may lead to place-like representations, this is a fair point. Indeed, we did choose this form of decoding, inspired by the localized firing of place cells, in the hope that it would encourage minimally constrained, place-like solutions. However, compared to other works (Sorscher and Xu) hand tuning the functional form of their place cells, our (although biased towards centralized tuning curves) allows for flexible functional forms such as the position of the place cell centers, their tuning width, whether or not it is center-surround activity, and how they should tune to different environments/rooms. This allows us to study several features of the place cell system, such as remapping and field formation. We have revised to make this more clear in the model description.

      The conclusion that 'contexts are attractive' (heading of section 2) is not well-supported. The authors show 'attractor-like behaviour' within a single context, but there could be alternative explanations for the recovery of stable ratemaps after noise injection. For example, the noise injection could scramble the network's currently inferred position, so that it would need to re-infer its position from boundary interactions along the trajectory. In that case the stabilisation would be driven by the input, not just internal attractor dynamics. Moreover, the authors show that different contexts occupy different regions in the space of low-dimensional projections of recurrent activity, but not that these regions are attractive.

      We agree that boundary interactions could facilitate the convergence of representations after noise injection. We did try to moderate this claim by the wording “attractor-like”, but we agree that boundaries could confound this result. We have therefore performed a modified noise injection experiment, where we let the network run for an extended period of time, before noise injection (and no velocity signal), see Appendix Velocity Ablation in the revised text. Notably, representations converge to their pre-scrambled state after noise injection, even without a velocity signal. However, place-like representations do not converge for all noise levels in this case, possibly indicating that boundary interactions do serve an error-correcting function, also. Thank you for pointing this out.

      As for the attractiveness of contexts, we agree that more analyses were required to demonstrate this. We have therefore conducted a supplementary analysis where we run the trained network with a mismatch in context/geometry, and demonstrate that the context signal fixes the representation, up to geometric distortions.

      The authors report empirical data that shows clustering of place cell centers like they find for their output units. They report that 'there appears to be a tendency for the clusters to arrange in hexagonal fashion, similar to our computational findings'. They only quantify the clustering, but not the arrangement. Moreover, in Figure 7e they only plot data from a single animal, then plot all other animals in the supplementary. Does the analysis of Fig 7f include all animals, or just the one for which the data is plotted in 7e? If so, why that animal? As Appendix C mentions that the ratemap for the plotted animal 'has a hexagonal resemblance' whereas other have 'no clear pattern in their center arrangements', it feels like cherrypicking to only analyse one animal without further justification.

      Thank you for pointing this out; we agree that this is not sufficiently explained and explored in the current version. We have therefore conducted a grid score analysis of the experimental place center distributions, to uncover possible hexagonal symmetries. The reason for choosing this particular animal was in part because it featured the largest number of included cells, while also demonstrating the most striking phase distribution, while including all distributions in the supplementary. Originally, this was only intended as a preliminary analysis, suggesting non-uniformity in experimental place field distributions, but we realize that these may all provide interesting insight into the distributional properties of place cells.

      We have explained these choices in the revised text, and expanded analyses on all animals to showcase these results more clearly.

      Reviewer #2 (Public Review):

      Summary:

      The authors proposed a neural network model to explore the spatial representations of the hippocampal CA1 and entorhinal cortex (EC) and the remapping of these representations when multiple environments are learned. The model consists of a recurrent network and output units (a decoder) mimicking the EC and CA1, respectively. The major results of this study are: the EC network generates cells with their receptive fields tuned to a border of the arena; decoder develops neuron clusters arranged in a hexagonal lattice. Thus, the model accounts for entorhinal border cells and CA1 place cells. The authors also suggested the remapping of place cells occurs between different environments through state transitions corresponding to unstable dynamical modes in the recurrent network.

      Strengths:

      The authors found a spatial arrangement of receptive fields similar to their model's prediction in experimental data recorded from CA1. Thus, the model proposes a plausible mechanisms to generate hippocampal spatial representations without relying on grid cells. This result is consistent with the observation that grid cells are unnecessary to generate CA1 place cells.

      The suggestion about the remapping mechanism shows an interesting theoretical possibility.

      We thank the reviewer for their kind feedback.

      Weaknesses:

      The explicit mechanisms of generating border cells and place cells and those underlying remapping were not clarified at a satisfactory level.

      The model cannot generate entorhinal grid cells. Therefore, how the proposed model is integrated into the entire picture of the hippocampal mechanism of memory processing remains elusive.

      We appreciate this point, and hope to clarify: From a purely architectural perspective, place-like representations are generated by linear combinations of recurrent unit representations, which, after training, appear border-like. During remapping, the network is simply evaluated/run in different geometries/contexts, which, it turns out, causes the network to exhibit different representations, likely as solutions to optimally encoding position in the different environments. We have attempted to revise the text to make some of these interpretations more clear. We have also conducted a supplementary analysis to demonstrate how representations are determined by the context signal directly, which helps to explain how recurrent and output units form their representations.

      We also agree that our model does not capture the full complexity of the Hippocampal formation. However, we would argue that its simplicity (focusing on a single cell type and a pure path integration task), acts as a useful baseline for studying the role of place cells during spatial navigation. The fact that our model captures a range of place cell behaviors (field formation, remapping and geometric deformation) without grid cells also point to several interesting possibilities, such that grid cells may not be strictly necessary for place cell formation and remapping, or that border cells may account for many of the peculiar behaviors of place cells. However, we wholeheartedly agree that including e.g. sensory information and memory storage/retrieval tasks would prove a very interesting extension of our model to more naturalistic tasks and settings. In fact, our framework could easily accommodate this, e.g. by decoding contexts/observations/memories from the network state, alongside location.

      Reviewer #3 (Public Review):

      Summary:

      The authors used recurrent neural network modelling of spatial navigation tasks to investigate border and place cell behaviour during remapping phenomena.

      Strengths:

      The neural network training seemed for the most part (see comments later) well-performed, and the analyses used to make the points were thorough.

      The paper and ideas were well explained.

      Figure 4 contained some interesting and strong evidence for map-like generalisation as environmental geometry was warped.

      Figure 7 was striking, and potentially very interesting.

      It was impressive that the RNN path-integration error stayed low for so long (Fig A1), given that normally networks that only work with dead-reckoning have errors that compound. I would have loved to know how the network was doing this, given that borders did not provide sensory input to the network. I could not think of many other plausible explanations... It would be even more impressive if it was preserved when the network was slightly noisy.

      Thank you for your insightful comments! Regarding the low path integration error, there is a slight statistical signal from the boundaries, as trajectories tend to turn away from arena boundaries. However, we agree, that studying path integration performance in the face of noise would make for a very interesting future development.

      Weaknesses:

      I felt that the stated neuroscience interpretations were not well supported by the presented evidence, for a few reasons I'll now detail.

      First, I was unconvinced by the interpretation of the reported recurrent cells as border cells. An equally likely hypothesis seemed to be that they were positions cells that are linearly encoding the x and y position, which when your environment only contains external linear boundaries, look the same. As in figure 4, in environments with internal boundaries the cells do not encode them, they encode (x,y) position. Further, if I'm not misunderstanding, there is, throughout, a confusing case of broken symmetry. The cells appear to code not for any random linear direction, but for either the x or y axis (i.e. there are x cells and y cells). These look like border cells in environments in which the boundaries are external only, and align with the axes (like square and rectangular ones), but the same also appears to be true in the rotationally symmetric circular environment, which strikes me as very odd. I can't think of a good reason why the cells in circular environments should care about the particular choice of (x,y) axes... unless the choice of position encoding scheme is leaking influence throughout. A good test of these would be differently oriented (45 degree rotated square) or more geometrically complicated (two diamonds connected) environments in which the difference between a pure (x,y) code and a border code are more obvious.

      Thank you for pointing this out. This is an excellent point, that we agree could be addressed more rigorously. Note that there is no position encoding in our model; the initial state of the network is a vector of zeros, and the network must infer its location from boundary interactions and context information alone. So there is no way for positional information to leak through to the recurrent layer directly. However, one possible reason for the observed symmetry breaking, is the fact that the velocity input signal is aligned with the cardinal directions. To investigate this, we trained a new model, wherein input velocities are rotated 45 degrees relative to the horizontal, as you suggest. The results, shown and discussed in appendix E (Learned recurrent representations align with environment boundaries), do indicate that representations are tuned to environment boundaries, and not the cardinal directions, which hopefully improves upon this point.

      Next, the decoding mechanism used seems to have forced the representation to learn place cells (no other cell type is going to be usefully decodable?). That is, in itself, not a problem. It just changes the interpretation of the results. To be a normative interpretation for place cells you need to show some evidence that this decoding mechanism is relevant for the brain, since this seems to be where they are coming from in this model. Instead, this is a model with place cells built into it, which can then be used for studying things like remapping, which is a reasonable stance.

      This is a great point, and we agree. We do write that we perform this encoding to encourage minimally constrained place-like representations (to study their properties), but we have revised to make this more evident.

      However, the remapping results were also puzzling. The authors present convincing evidence that the recurrent units effectively form 6 different maps of the 6 different environments (e.g. the sparsity of the code, or fig 6a), with the place cells remapping between environments. Yet, as the authors point out, in neural data the finding is that some cells generalise their co-firing patterns across environments (e.g. grid cells, border cells), while place cells remap, making it unclear what correspondence to make between the authors network and the brain. There are existing normative models that capture both entorhinal's consistent and hippocampus' less consistent neural remapping behaviour (Whittington et al. and probably others), what have we then learnt from this exercise?

      Thanks for raising this point! We agree that this finding is surprising, but we hold that it actually shows something quite important: that border-type units are sufficient to create place-like representations, and learns several of the behaviors associated with place cells and remapping (including global remapping and field stretching). In other words, a single cell type known to exist upstream of place cells is sufficient to explain a surprising range of phenomena, demonstrating that other cell types are not strictly necessary. However, we agree that understanding why the boundary type units sometimes rate remap, and whether that can be true for some border type cells in the brain (either directly, or through gating mechanisms) would be important future developments. Related to this point, we also expanded upon the influence of the context signal for representation selection (appendix F)

      Concerning the relationship to other models, we would argue that the simplicity of our model is one of its core strengths, making it possible to disentangle what different cell types are doing. While other models, including TEM, are highly important for understanding how different cell types and brain regions interact to solve complex problems, we believe there is a need for minimal, understandable models that allows us to investigate what each cell type is doing, and this is where we believe our work is important. As an example, our model not only highlights the sufficiency of boundary-type cells as generators of place cells, its lack of e.g. grid cells also suggest that grid cells may not be strictly necessary for e.g. open-field/sensory-deprived navigation, as is often claimed.

      One striking result was figure 7, the hexagonal arrangement of place cell centres. I had one question that I couldn't find the answer to in the paper, which would change my interpretation. Are place cell centres within a single clusters of points in figure 7a, for example, from one cell across the 100 trajectories, or from many? If each cluster belongs to a different place cell then the interpretation seems like some kind of optimal packing/coding of 2D space by a set of place cells, an interesting prediction. If multiple place cells fall within a single cluster then that's a very puzzling suggestion about the grouping of place cells into these discrete clusters. From figure 7c I guess that the former is the likely interpretation, from the fact that clusters appear to maintain the same colour, and are unlikely to be co-remapping place cells, but I would like to know for sure!

      This is a good point, and you are correct: one cluster tends to correspond to one unit. To make this more clear, we have revised Fig. 7, so that each decoded center is shaded by unit identity, which makes this more evident. And yes, this is, seemingly in line with some form of optimal packing/encoding of space, yes!

      I felt that the neural data analysis was unconvincing. Most notably, the statistical effect was found in only one of seven animals. Random noise is likely to pass statistical tests 1 in 20 times (at 0.05 p value), this seems like it could have been something similar? Further, the data was compared to a null model in which place cell fields were randomly distributed. The authors claim place cell fields have two properties that the random model doesn't (1) clustering to edges (as experimentally reported) and (2) much more provocatively, a hexagonal lattice arrangement. The test seems to collude the two; I think that nearby ball radii could be overrepresented, as in figure 7f, due to either effect. I would have liked to see a computation of the statistic for a null model in which place cells were random but with a bias towards to boundaries of the environment that matches the observed changing density, to distinguish these two hypotheses.

      Thanks for raising this point. We agree that we were not clear enough in our original manuscript. We included additional analyses in one animal, to showcase one preliminary case of non-uniform phases. To mitigate this, we have performed the same analyses for all animals, and included a longer discussion of these results (included in the supplementary material). We have also moderated the discussion on Ripley’s H to encompass only non-uniformity, and added a grid score analysis to showcase possible rotational symmetries in the data. We hope this gets our findings across more clearly

      Some smaller weaknesses:

      - Had the models trained to convergence? From the loss plot it seemed like not, and when including regularisors recent work (grokking phenomena, e.g. Nanda et al. 2023) has shown the importance of letting the regularisor minimise completely to see the resulting effect. Else you are interpreting representations that are likely still being learnt, a dangerous business.

      Longer training time did not seem to affect representations. However, due to the long trajectories and statefulness involved, training was time-intensive and could become unstable for very long training. We therefore stopped training at the indicated time.

      - Since RNNs are nonlinear it seems that eigenvalues larger than 1 doesn't necessarily mean unstable?

      This is a good point; stability is not guaranteed. We have updated the text to reflect this.

      - Why do you not include a bias in the networks? ReLU networks without bias are not universal function approximators, so it is a real change in architecture that doesn't seem to have any positives?

      We found that bias tended to have a detrimental effect on training, possibly related to the identity initialization used (see e.g. Le et al. 2015), and found that training improved when biases were fixed to zero.

      - The claim that this work provided a mathematical formalism of the intuitive idea of a cognitive map seems strange, given that upwards of 10 of the works this paper cite also mathematically formalise a cognitive map into a similar integration loss for a neural network.

      We agree that other works also provide ways of formalizing this concepts. However, our goal by doing so was to elucidate common features across these seemingly disparate models. We also found that the concept of a learned and target map made it easier to come up with novel models, such as one wherein place cells are constructed to match a grid cell label.

      Aim Achieved? Impact/Utility/Context of Work

      Given the listed weaknesses, I think this was a thorough exploration of how this network with these losses is able to path-integrate its position and remap. This is useful, it is good to know how another neural network with slightly different constraints learns to perform these behaviours. That said, I do not think the link to neuroscience was convincing, and as such, it has not achieved its stated aim of explaining these phenomena in biology. The mechanism for remapping in the entorhinal module seemed fundamentally different to the brain's, instead using completely disjoint maps; the recurrent cell types described seemed to match no described cell type (no bad thing in itself, but it does limit the permissible neuroscience claims) either in tuning or remapping properties, with a potentially worrying link between an arbitrary encoding choice and the responses; and the striking place cell prediction was unconvincingly matched by neural data. Further, this is a busy field in which many remapping results have been shown before by similar models, limiting the impact of this work. For example, George et al. and Whittington et al. show remapping of place cells across environments; Whittington et al. study remapping of entorhinal codes; and Rajkumar Vasudeva et al. 2022 show similar place cell stretching results under environmental shifts. As such, this papers contribution is muddied significantly.

      Thank you for this perspective; we agree that all of these are important works that arrive at complementary findings. We hold that the importance of our paper lies in its minimal nature, and its focus on place cells, via a purpose-built decoding that enables place-like representations. In doing so, we can point to possibly under explored relationships between cell types, in particular place cells and border cells, while challenging the necessity of other cell types for open-field navigation (i.e. grid cells). In addition, our work points to a novel connection between grid cells, place cells and even border cells, by way of the hexagonal arrangement of place unit centers. However, we agree that expanding our model to include more biologically plausible architectures and constraints would make for a very interesting extension in the future.

      Thank you again for your time, as well as insightful comments.  

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Even after reading Methods 5.3, I found it hard to understand how the ratemap population vectors that produce Fig 3e and Fig 5 are calculated. It's unclear to me how there can be a ratemap at a single timestep, because calculating a ratemap involves averaging the activity in each location, which would take a whole trajectory and not a single timestep. But I think I've understood from Methods 5.1 that instead the ratemap is calculated by running multiple 'simultaneous' trajectories, so that there are many visited locations at each timestep. That's a bit confusing because as far as I know it's not a common way to calculate ratemaps in rodent experiments (probably because it would be hard to repeat the same task 500 times, while the representations remain the same), so it might be worth explaining more in Methods 5.3.

      We understand the confusion, and have attempted to make this more clear in the revised manuscript. We did indeed create ratemaps over many trajectories for time-dependent plots, for the reasons you mentioned. We also agree that this would be difficult to do experimentally, but found it an interesting way to observe convergence of representations in our simulated scenario.

      Fig 3b-d shows multiple analyses to support output unit global remapping, but no analysis to support the claim that recurrent units remap by rate changes. The examples in Fig 3ai look pretty convincing, but it would be useful to also have a more quantitative result.

      We agree, and only showed that units turn off/become silent using ratemaps. We have therefore added an explicit analysis, showcasing rate remapping in recurrent units (see appendix G; Recurrent units rate remap)

      Reviewer #2 (Recommendations For The Authors):

      Some parts of the current manuscript are hard to follow. Particularly, the model description is not transparent enough. See below for the details.

      Major comments:

      (1) Mathematical models should be explained more explicitly and carefully. I had to guess or desperately search for the definitions of parameters. For instance, define the loss function L in eq.(1). Though I can assume L represents the least square error (in A.8), I could not find the definition in Model & Objective. N should also be defined explicitly in equation (3). Is this the number of output cells?

      Thank you for pointing this out, we have revised to make it more clear.

      (2) In Fig. 1d, how were the velocity and context inputs given to individual neurons in the network? The information may be described in the Methods, but I could not identify it.

      This was described in the methods section (Neural Network Architecture and Training), but we realize that we used confusing notation, when comparing with Fig. 1d. We have therefore changed the notation, and it should hopefully be clearer now. Thanks for pointing out this discrepancy.

      (3) I took a while to understand equations (3) and (4) (for instance, t is not defined here). The manuscript would be easier to read if equations (5) and (6) are explained in the main text but not on page 18 (indeed, these equations are just copies of equations 3 and 4). Otherwise, the authors may replace equations (3) and (4) with verbal explanations similar to figure legend for Fig. 1b.

      (4) Is there any experimental evidence for uniformly strong EC-to-CA1 projections assumed in the non-trainable decoder? This point should be briefly mentioned.

      Thank you for raising this point. The decoding from EC (the RNN) to CA1 (the output layer) consists of a trainable weight matrix, and may thus be non-uniform in magnitude. The non-trainable decoding acts on the resulting “CA1” representation only. We hope that improvements to the model description also makes this more evident.  

      (5) The explanation of Fig. 3 in the main text is difficult to follow because subpanels are explained in separate paragraphs, some of which are very short, as short as just a few lines.

      This presentation style makes it difficult to follow the logical relationships between the subpanels. This writing style is obeyed throughout the manuscript but is not popular in neuroscience.

      Thanks for pointing this out, we have revised to accommodate this.

      (6) Why do field centers cluster near boundaries? No underlying mechanisms are discussed in the manuscript.

      This is a good point; we have added a note on this; it likely reflects the border tuning of upstream units.

      (7) In Fig. 4, the authors presented how cognitive maps may vary when the shape and size of open arenas are modified. The results would be more interesting if the authors explained the remapping mechanism. For instance, on page 8, the authors mentioned that output units exhibit global remapping between contexts, whereas recurrent units mainly rate remapping.

      Why do such representational differences emerge?

      We agree! Thanks for raising this point. We have therefore expanded upon this discussion in section 2.4.

      (8) In the first paragraph of page 10, the authors stated ".. some output units display distinct field doubling (see both Fig. 4c), bottom right, and Fig. 4d), middle row)". I could not understand how Fig. 4d, middle row supports the argument. Similarly, they stated "..some output units reflect their main boundary input (with greater activity near one boundary)." I can neither understand what the authors mean to say nor which figures support the statement. Please clarify.

      This is a good point, there was an identifier missing; we have updated to refer to the correct “magnification”. Thanks!

      (9) The underlying mechanism of generating the hexagonal representation of output cells remains unclear. The decoder network uses a non-trainable decoding scheme based on localized firing patterns of output units. To what extent does the hexagonal representation depend on the particular decoding scheme? Similarly, how does the emergence of the hexagonal representation rely on the border representation in the upstream recurrent network? Showing several snapshots of the two place representations during learning may answer these questions.

      This is an interesting point, and we have added some discussion on this matter. In particular, we speculate whether it’s an optimal configuration for position reconstruction, which is demanded by the task and thus highly likely dependent on the decoding scheme. We have not reached a conclusive method to determine the explicit dependence of the hexagonal arrangement on the choice of decoding scheme. Still, it seems this would require comparison with other schemes. In our framework, this would require changing the fundamental operation of the model, which we leave as inspiration for future work. We have also added additional discussion concerning the relationship between place units, border units, and remapping in our model. As for exploring different training snapshots, the model is randomly initialized, which suggests that earlier training steps should tend to reveal unorganized/uninformative phase arrangements, as phases are learned as a way of optimizing position reconstruction. However, we do call for more analysis of experimental data to determine whether this is true in animals, which would strongly support this observation. We also hope that our work inspires other models studying the formation and remapping of place cells, which could serve as a starting point for answering this question in the future.

      (10) Figure 7 requires a title including the word "hexagonal" to make it easier to find the results demonstrating the hexagonal representations. In addition, please clarify which networks, p or g, gave the results shown here.

      We agree, and have added it!

      Minor comments:

      (11) In many paragraphs, conclusions appear near their ends. Stating the conclusion at the beginning of each paragraph whenever possible will improve the readability.

      We have made several rewrites to the manuscript, and hope this improves readability.

      (12) Figure A4 is important as it shows evidence of the CA1 spatial representation predicted by the model. However, I could not find where the figure is cited in the manuscript. The authors can consider showing this figure in the main text.

      We agree, and we have added more references to the experimental data analyses in the main text, as well as expanded this analysis.

      (13) The main text cites figures in the following format: "... rate mapping of Fig. 3a), i), boundary ...." The parentheses make reading difficult.

      We have removed the overly stringent use of double parentheses, thanks for letting us know.

      (14) It would be nice if the authors briefly explained the concept of Ripley's H function on page 14.

      Yes, we have added a brief descriptor.

    1. method": "ach",

      No support of ach yet. Let's pls do for cards. Forgot to check code snippets in some other pages. Please ensure we are using cards in all snippets

    1. use Razorpay Payment Button. Embed a Payment Button on your website by just adding a couple of lines of code.

      use Razorpay Payment Links. Create a link from the dashboard and start collecting payments from your customers.

    1. This boilerplate code is also brittle, in the sense that a future change that adds a new field to the struct requires an update to every place that builds the structure.

      이러한 보일러플레이트 코드는 다루기 힘든데, 새로운 필드가 추가되면 모든 장소의 코드를 업데이트해줘야 하기 때문이다.

    2. Rust insists that all fields in a struct must be filled in when a new instance of that struct is created. This keeps the code safe by ensuring that there are never any uninitialized values but does lead to more verbose boilerplate code than is ideal.

      struct의 모든 필드 값이 초기화 되는 것을 강제하고 있다. 이는 코드 안정성을 보장하지만 장황한 보일러플레이트 코드를 유발한다.

  2. www.tripleeframework.com www.tripleeframework.com
    1. Students are focused on the task because the software is helping them create the code that represents their content learning goals (characterization, setting, plot..etc).  There are no games or rewards at the end of using the software that distract from the process of learning

      The concept of using games in learning, though fun for students and teachers, seems to be a gateway to distractions if not applied correctly. In the schools I sub at there is a platform called Blooket that allows students to play games together or independently and solve math, reading, or science problems interspliced with video games designed to foster friendly competition to get to the top of a leaderboard. From what I've seen students usually focus more on the leaderboards than they do the actual content they are there to learn. This makes me question the context that platforms like Blooket should be used in (maybe it's better for test review than it is for learning new concepts).

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Reviewer #1

      Evidence, reproducibility, and clarity

      The work by Pinon et al describes the generation of a microvascular model to study Neisseria meningitidis interactions with blood vessels. The model uses a novel and relatively high throughput fabrication method that allows full control over the geometry of the vessels. The model is well characterized. The authors then study different aspects of Neisseria-endothelial interactions and benchmark the bacterial infection model against the best disease model available, a human skin xenograft mouse model, which is one of the great strengths of the paper. The authors show that Neisseria binds to the 3D model in a similar geometry that in the animal xenograft model, induces an increase in permeability short after bacterial perfusion, and induces endothelial cytoskeleton rearrangements. Finally, the authors show neutrophil recruitment to bacterial microcolonies and phagocytosis of Neisseria. The article is overall well written, and it is a great advancement in the bioengineering and sepsis infection field, and I only have a few major comments and some minor.

      Major comments:

      Infection-on-chip. I would recommend the authors to change the terminology of "infection on chip" to better reflect their work. The term is vague and it decreases novelty, as there are multiple infection on chips models that recapitulate other infections (recently reviewed in https://doi.org/10.1038/s41564-024-01645-6) including Ebola, SARS-CoV-2, Plasmodium and Candida. Maybe the term "sepsis on chip" would be more specific and exemplify better the work and novelty. Also, I would suggest that the authors carefully take a look at the text and consider when they use VoC or to current term IoC, as of now sometimes they are used interchangeably, with VoC being used occasionally in bacteria perfused experiments.

      We thank Reviewer #1 for this suggestion. Indeed, we have chosen to replace the term "Infection-on-Chip" by "infected Vessel-on-chip" to avoid any confusion in the title and the text. Also, we have removed all the terms "IoC" which referred to "Infection-on-Chip" and replaced with "VoC" for "Vessel-on-Chip". We think these terms will improve the clarity of the main text.

      Fig 3 and Suppmentary 3: Permeability. The authors suggest that early 3h infection with Neisseria do not show increase in vascular permeability in the animal model, contrary to their findings in the 3D in vitro model. However, they show a non-significant increase in permeability of 70 KDa Dextran in the animal xenograft early infection. This seems to point that if the experiment would have been done with a lower molecular weight tracer, significant increases in permeability could have been detected. I would suggest to do this experiment that could capture early events in vascular disruption.

      Comparing permeability under healthy and infected conditions using Dextran smaller than 70 kDa is challenging. Previous research [1] has shown that molecules below 70 kDa already diffuse freely in healthy tissue. Given this high baseline diffusion, we believe that no significant difference would be observed before and after N. meningitidis infection and these experiments were not carried out. As discussed in the manuscript, bacteria induced permeability in mouse occurs at later time points, 16h post infection as shown previoulsy [2]. As discussed in the manuscript, this difference between the xenograft model and the chip likely reflect the absence in the chip of various cell types present in the tissue parenchyma.

      The authors show the formation of actin of a honeycomb structure beneath the bacterial microcolonies. This only occurred in 65\% of the microcolonies. Is this result similar to in vitro 2D endothelial cultures in static and under flow? Also, the group has shown in the past positive staining of other cytoskeletal proteins, such as ezrin in the ERM complex. Does this also occur in the 3D system?

      We thank the Reviewer #1 for this suggestion. - According to this recommendation, we imaged monolayers of endothelial cells in the flat regions of the chip (the two lateral channels) using the same microscopy conditions (i.e., Obj. 40X N.A. 1.05) that have been used to detect honeycomb structures in the 3D vessels in vitro. We showed that more than 56% of infected cells present these honeycomb structures in 2D, which is 13% less than in 3D, and is not significant due to the distributions of both populations. Thus, we conclude that under both in vitro conditions, 2D and 3D, the amount of infected cells exhibiting cortical plaques is similar. We have added the graph and the confocal images in Figure S4B and lines 418-419 of the revised manuscript. - We recently performed staining of ezrin in the chip and imaged both the 3D and 2D regions. Although ezrin staining was visible in 3D (Fig. 1 of this response), it was not as obvious as other markers under these infected conditions and we did not include it in the main text. Interpretation of this result is not straight forward as for instance the substrate of the cells is different and it would require further studies on the behaviour of ERM proteins in these different contexts.

      One of the most novel things of the manuscript is the use of a relatively quick photoablation system. I would suggest that the authors add a more extensive description of the protocol in methods. Could this technique be applied in other laboratories? If this is a major limitation, it should be listed in the discussion.

      Following the Reviewer's comment, we introduced more detailed explanations regarding the photoablation: - L157-163 (Results): "Briefly, the chosen design is digitalized into a list of positions to ablate. A pulsed UV-LASER beam is injected into the microscope and shaped to cover the back aperture of the objective. The laser is then focused on each position that needs ablation. After introducing endothelial cells (HUVEC) in the carved regions,.." - L512-516 (Discussion): "The speed capabilities drastically improve with the pulsing repetition rate. Given that our laser source emits pulses at 10kHz, as compared to other photoablation lasers with repetitions around 100 Hz, our solution could potentially gain a factor of 100. Also,..." - L1082-1087 (Materials and Methods): "…, and imported in a python code. The control of the various elements is embedded and checked for this specific set of hardware. The code is available upon request."

      Adding these three paragraphs gives more details on how photoablation works thus improving the manuscript.

      Minor comments:

      Supplementary Fig 2. The reference to subpanels H and I is swapped.

      The references to subpanels H and I have been correctly swapped back in the reviewed version.

      Line 203: I would suggest to delete this sentence. Although a strength of the submitted paper is the direct comparison of the VoC model with the animal model to better replicate Neisseria infection, a direct comparison with animal permeability is not needed in all vascular engineering papers, as vascular permeability measurements in animals have been well established in the past.

      The sentence "While previously developed VoC platforms aimed at replicating physiological permeability properties, they often lack direct comparisons with in vivo values." has been removed from the revised text.

      Fig 3: Bacteria binding experiments. I would suggest the addition of more methodological information in the main results text to guarantee a good interpretation of the experiment. First, it would be better that wall shear stress rather than flow rate is described in the main text, as flow rate is dependent on the geometry of the vessel being used. Second, how long was the perfusion of Neisseria in the binding experiment performed to quantify colony doubling or elongation? As per figure 1C, I would guess than 100 min, but it would be better if this information is directly given to the readers.

      We thank Reviewer #1 for these two suggestions that will improve the text clarity (e.g., L316). (i) Indeed, we have changed the flow rate in terms of shear stress. (ii) Also, we have normalized the quantification of the colony doubling time according to the first time-point where a single bacteria is attached to the vessel wall. Thus, early adhesion bacteria will be defined by a longer curve while late adhesion bacteria by a shorter curve. In total, the experiment lasted for 3 hours (modifications appear in L318 and L321-326).}

      Fig 4: The honeycomb structure is not visible in the 3D rendering of panel D. I would recommend to show the actin staining in the absence of Neisseria staining as well.

      According to this suggestion, a zoom of the 3D rendering of the cortical plaque without colony had been added to the figure 4 of the revised manuscript.

      Line 421: E-selectin is referred as CD62E in this sentence. I would suggest to use the same terminology everywhere.

      We have replaced the "CD62E" term with "E-selectin" to improve clarity.}

      Line 508: "This difference is most likely associated with the presence of other cell types in the in vivo tissues and the onset of intravascular coagulation". Do the authors refer to the presence of perivascular cells, pericytes or fibroblasts? If so, it could be good to mention them, as well as those future iterations of the model could include the presence of these cell types.

      By "other cell types", we refer to pericytes [3], fibroblasts [4], and perivascular macrophages [5], which surround endothelial cells and contribute to vessel stability. The main text was modified to include this information (Lines 548 and 555-570) and their potential roles during infection disussed.

      Discussion: The discussion covers very well the advantages of the model over in vitro 2D endothelial models and the animal xenograft but fails to include limitations. This would include the choice of HUVEC cells, an umbilical vein cell line to study microcirculation, the lack of perivascular cells or limitations on the fabrication technique regarding application in other labs (if any).

      We thank Reviewer #1 for this suggestion. Indeed, our manuscript may lack explaining limitations, and adding them to the text will help improve it: - The perspectives of our model include introducing perivascular cells surrounding the vessel and fibroblasts into the collagen gel as discussed previously and added in the discussion part (L555-570). - Our choice for HUVEC cells focused on recapitulating the characteristics of venules that respect key features such as the overexpression of CD62E and adhesion of neutrophils during inflammation. Using microvascular endothelial cells originating from different tissues would be very interesting. This possibility is now mentioned in the discussion lines 567-568. - Photoablation is a homemade fabrication technique that can be implemented in any lab harboring an epifluorescence microscope. This method has been more detailed in the revised manuscript (L1085-1087).

      Line 576: The authors state that the model could be applied to other systemic infections but failed to mention that some infections have already been modelled in 3D bioengineered vascular models (examples found in https://doi.org/10.1038/s41564-024-01645-6). This includes a capillary photoablated vascular model to study malaria (DOI: 10.1126/sciadv.aay724).

      Thes two important references have been introduced in the main text (L84, 647, 648).}

      Line 1213: Are the 6M neutrophil solution in 10ul under flow. Also, I would suggest to rewrite this sentence in the following line "After, the flow has been then added to the system at 0.7-1 μl/min."

      We now specified that neutrophils are circulated in the chip under flow conditions, lines 1321-1322.

      Significance

      The manuscript is comprehensive, complete and represents the first bioengineered model of sepsis. One of the major strengths is the carful characterization and benchmarking against the animal xenograft model. Its main limitations is the brief description of the photoablation methodology and more clarity is needed in the description of bacteria perfusion experiments, given their complexity. The manuscript will be of interest for the general infection community and to the tissue engineering community if more details on fabrication methods are included. My expertise is on infection bioengineered models.

      Reviewer #2

      Evidence, reproducibility, and clarity

      Summary The authors develop a Vessel-on-Chip model, which has geometrical and physical properties similar to the murine vessels used in the study of systemic infections. The vessel was created via highly controllable laser photoablation in a collagen matrix, subsequent seeding of human endothelial cells and flow perfusion to induce mechanical cues. This vessel could be infected with Neisseria meningitidis, as a model of systemic infection. In this model, microcolony formation and dynamics, and effects on the host were very similar to those described for the human skin xenograft mouse, which is the current gold standard for these studies, and were consistent with observations made in patients. The model could also recapitulate the neutrophil response upon N. meningitidis systemic infection.

      Major comments:

      I have no major comments. The claims and the conclusions are supported by the data, the methods are properly presented and the data is analyzed adequately. Furthermore, I would like to propose an optional experiment could improve the manuscript. In the discussion it is stated that the vascular geometry might contribute to bacterial colonization in areas of lower velocity. It would be interesting to recapitulate this experimentally. It is of course optional but it would be of great interest, since this is something that can only be proven in the organ-on-chip (where flow speed can be tuned) and not as much in animal models. Besides, it would increase impact, demonstrating the superiority of the chip in this area rather than proving to be equal to current models.

      We have conducted additional experiments on infection in different vascular geometries now added these results figure 3/S3 and lines 288-305. We compared sheared stress levels as determined by Comsol simulation and experimentally determined bacterial adhesion sites. In the conditions used, the range of shear generated by the tested geometries do not appear to change the efficiency of bacterial adhesion. These results are consistent with a previous study from our group which show that in this range of shear stresses the effect on adhesion is limited [6] . Furthermore, qualitative observations in the animal model indicate that bacteria do not have an obvious preference in terms of binding site.

      Minor comments:

      I have a series of suggestions which, in my opinion, would improve the discussion. They are further elaborated in the following section, in the context of the limitations.

      • How to recapitulate the vessels in the context of a specific organ or tissue? If the pathogen is often found in the luminal space of other organs after disseminating from the blood, how can this process be recapitulated with this mode, if at all?

      • For reasons that are not fully understood, postmortem histological studies reveal bacteria only inside blood vessels but rarely if ever in the organ parenchyma. The presence of intravascular bacteria could nevertheless alter cells in the tissue parenchyma. The notable exception is the brain where bacteria exit the bacterial lumen to access the cerebrospinal fluid. The chip we describe is fully adapted to develop a blood brain barrier model and more specific organ environments. This implies the addition of more cell types in the hydrogel. A paragraph on this topic has been added (Lines 548 and 552-570).

      • Similarly, could other immune responses related to systemic infection be recapitulated? The authors could discuss the potential of including other immune cells that might be found in the interstitial space, for example.

      • This important discussion point has been added to the manuscript (L623-636). As suggested by Reviewer #2, other immune cells respond to N. meningitis and can be explored using our model. For instance, macrophages and dendritic cells are activated upon N. meningitis infection, eliminate the bacteria through phagocytosis, produce pro-inflammatory cytokines and chemokines potentially activating lymphocytes [7]. Such an immune response, yet complex, would be interesting to study in our model as skin-xenograft mice are deprived of B and T lymphocytes to ensure acceptance of human skin grafts.

      • A minor correction: in line 467 it should probably be "aspects" instead of "aspect", and the authors could consider rephrasing that sentence slightly for increased clarity.

      • We have corrected the sentence with "we demonstrated that our VoC strongly replicates key aspects of the in vivo human skin xenograft mouse model, the gold standard for studying meningococcal disease under physiological conditions." in lines 499-503.

        Strengths and limitations

      The most important strength of this manuscript is the technology they developed to build this model, which is impressive and very innovative. The Vessel-on-Chip can be tuned to acquire complex shapes and, according to the authors, the process has been optimized to produce models very quickly. This is a great advancement compared with the technologies used to produce other equivalent models. This model proves to be equivalent to the most advanced model used to date, but allows to perform microscopy with higher resolution and ease, which can in turn allow more complex and precise image-based analysis. However, the authors do not seem to present any new mechanistic insights obtained using this model. All the findings obtained in the infection-on-chip demonstrate that the model is equivalent to the human skin xenograft mouse model, and can offer superior resolution for microscopy. However, the advantages of the model do not seem to be exploited to obtain more insights on the pathogenicity mechanisms of N. meningitidis, host-pathogen interactions or potential applications in the discovery of potential treatments. For example, experiments to elucidate the role of certain N. meningiditis genes on infection could enrich the manuscript and prove the superiority of the model. However, I understand these experiments are time-consuming and out of the scope of the current manuscript. In addition, the model lacks the multicellularity that characterizes other similar models. The authors mention that the pathogen can be found in the luminal space of several organs, however, this luminal space has not been recapitulated in the model. Even though this would be a new project, it would be interesting that the authors hypothesize about the possibilities of combining this model with other organ models. The inclusion of circulating neutrophils is a great asset; however it would also be interesting to hypothesize about how to recapitulate other immune responses related to systemic infection.

      We thank Reviewer #2 for his/her comment on the strengths and limitations of our work. The difficulty is that our study opens many futur research directions and applications and we hope that the work serves as the basis for many future studies but one can only address a limited set of experiments in a single manuscript. - Experiments investigating the role of N. meningitidis genes require significant optimization of the system. Multiplexing is a potential avenue for future development, which would allow the testing of many mutants. The fast photoablation approach is particularly amenable to such adaptation. - Cells and bacteria inside the chambers could be isolated and analyzed at the transcriptomic level or by flow cytometry. This would imply optimizing a protocol for collecting cells from the device via collagenase digestion, for instance. This type of approach would also benefit from multiplexing to enhance the number of cells. - As mentioned above, the revised manuscript discusses the multicellular capabilities of our model, including the integration of additional immune cells and potential connections to other organ systems. We believe that these approaches are feasible and valuable for studying various aspects of N. meningitidis infection.

      Advance

      The most important advance of this manuscript is technical: the development of a model that proves to be equivalent to the most complex model used to date to study meningococcal systemic infections. The human skin xenograft mouse model requires complex surgical techniques and has the practical and ethical limitations associated with the use of animals. However, the Infection-on-chip model is completely in vitro, can be produced quickly, and allows to precisely tune the vessel's geometry and to perform higher resolution microscopy. Both models were comparable in terms of the hallmarks defining the disease, suggesting that the presented model can be an effective replacement of the animal use in this area.

      Other vessel-on-chip models can recapitulate an endothelial barrier in a tube-like morphology, but do not recapitulate other complex geometries, that are more physiologically relevant and could impact infection (in addition to other non-infectious diseases). However, in the manuscript it is not clear whether the different morphologies are necessary to study or recapitulate N. meningitidis infection, or if the tubular morphologies achieved in other similar models would suffice.

      We thank Reviewer #2 for his/her comment, also raised by reviewer 1. To answer this question, we have now infected vessel-on-chips of different geometries, to dissect the impact of flow distribution in N. meningitidis infection (Figures 3 and S3, explained in lines 288-307). In this range of shear stress, we show that bacterial infection is not strongly affected by geometry-induced shear stress variation. These observations are constistent with observations in flow chambers and qualitative observations of human cases and in the xenograft model [6].

      Audience

      This manuscript might be of interest for a specialized audience focusing on the development of microphysiological models. The technology presented here can be of great interest to researchers whose main area of interest is the endothelium and the blood vessels, for example, researchers on the study of systemic infections, atherosclerosis, angiogenesis, etc. Thus, the tool presented (vessel-on-chip) can have great applications for a broad audience. However, even when the method might be faster and easier to use than other equivalent methods, it could still be difficult to implement in another laboratory, especially if it lacks expertise in bioengineering. Therefore, the method could be more of interest for laboratories with expertise in bioengineering looking to expand or optimize their toolbox. Alternatively, this paper present itself as an opportunity to begin collaborations, since the model could be used to test other pathogen or conditions.

      Field of expertise: Infection biology, organ-on-chip, fungal pathogens.

      I lack the expertise to evaluate the image-based analysis.

      References:

      1. Gyohei Egawa, Satoshi Nakamizo, Yohei Natsuaki, Hiromi Doi, Yoshiki Miyachi, and Kenji Kabashima. Intravital analysis of vascular permeability in mice using two-photon microscopy. Scientific Reports, 3(1):1932, Jun 2013. ISSN 2045-2322. doi: 10.1038/srep01932.

      2. Valeria Manriquez, Pierre Nivoit, Tomas Urbina, Hebert Echenique-Rivera, Keira Melican, Marie-Paule Fernandez-Gerlinger, Patricia Flamant, Taliah Schmitt, Patrick Bruneval, Dorian Obino, and Guillaume Duménil. Colonization of dermal arterioles by neisseria meningitidis provides a safe haven from neutrophils. Nature Communications, 12(1):4547, Jul 2021. ISSN 2041-1723. doi:10.1038/s41467-021-24797-z.

      3. Mats Hellström, Holger Gerhardt, Mattias Kalén, Xuri Li, Ulf Eriksson, Hartwig Wolburg, and Christer Betsholtz. Lack of pericytes leads to endothelial hyperplasia and abnormal vascular morphogenesis. Journal of Cell Biology, 153(3):543–554, Apr 2001. ISSN 0021-9525. doi: 10.1083/jcb.153.3.543.

      4. Arsheen M. Rajan, Roger C. Ma, Katrinka M. Kocha, Dan J. Zhang, and Peng Huang. Dual function of perivascular fibroblasts in vascular stabilization in zebrafish. PLOS Genetics, 16(10):1–31, 10 2020. doi: 10.1371/journal.pgen.1008800.

      5. Huanhuan He, Julia J. Mack, Esra Güç, Carmen M. Warren, Mario Leonardo Squadrito, Witold W. Kilarski, Caroline Baer, Ryan D. Freshman, Austin I. McDonald, Safiyyah Ziyad, Melody A. Swartz, Michele De Palma, and M. Luisa Iruela-Arispe. Perivascular macrophages limit permeability. Arteriosclerosis, Thrombosis, and Vascular Biology, 36(11):2203–2212, 2016. doi: 10.1161/ATVBAHA. 116.307592.

      6. Emilie Mairey, Auguste Genovesio, Emmanuel Donnadieu, Christine Bernard, Francis Jaubert, Elisabeth Pinard, Jacques Seylaz, Jean-Christophe Olivo-Marin, Xavier Nassif, and Guillaume Dumenil. Cerebral microcirculation shear stress levels determine Neisseria meningitidis attachment sites along the blood–brain barrier . Journal of Experimental Medicine, 203(8):1939–1950, 07 2006. ISSN 0022-1007. doi: 10.1084/jem.20060482.

      7. Riya Joshi and Sunil D. Saroj. Survival and evasion of neisseria meningitidis from macrophages. Medicine in Microecology, 17:100087, 2023. ISSN 2590-0978. doi: https://doi.org/10.1016/j.medmic.2023.100087.

    1. So if Agent-3 is, for example, obviously writing backdoors into code that would allow it to escape, the weaker models would notice.

      the key word is 'obviously' here.

    1. synthèse détaillée basée sur les extraits fournis de la conférence de Nathalie Bajos avec Camille Lance :

      NOTE DE SYNTHÈSE : La production sociale des inégalités de santé (8) - Nathalie Bajos avec Camille Lance

      Intervenant(e)s : Nathalie Bajos (modératrice, charte de santé publique), Camille Lance (sociologue, spécialiste des inégalités sociales en matière de santé mentale)

      Date de la présentation : Année universitaire 2024-2025

      Objet : Réflexion sur la production sociale des inégalités de santé, en particulier en matière de santé mentale en prison, à travers l'exemple des transformations institutionnelles.

      Points clés :

      La "Prison Asile" : Un Concept pour Décrire une Réalité Pénitentiaire L'expression "prison asile" s'est imposée autour des années 2000 en France pour décrire l'augmentation significative du nombre de personnes présentant des troubles psychiatriques en prison. Des rapports parlementaires (Sénat et Assemblée nationale, 2000) ont souligné ce phénomène, parlant d'une population pénale en mutation conférant à la prison "une vocation asilaire que l'hôpital psychiatrique n'a plus". Ce constat ébranle le principe historique de séparation entre la prison (pour les condamnés/prévenus) et la psychiatrie (pour les aliénés), tel qu'énoncé par la loi de 1838. Des études épidémiologiques rigoureuses confirment la prévalence très élevée des troubles psychiatriques (humeur, anxiété, stress post-traumatique, psychotiques, usage de substances) en population carcérale par rapport à la population générale du même âge et sexe. Les prévalences sont "toujours supérieures et parfois très supérieures". L'étude du professeur Falissard (2006) estime qu'environ 36% des personnes détenues présentent un trouble de gravité marquée à sévère. Ces chiffres sont cohérents avec les études internationales. Cette réalité nourrit l'impression chez les professionnels que la prison est devenue "le dernier asile" et qu'une partie des personnes incarcérées n'y auraient "pas leur place". Transformations Institutionnelles à la Croisée de l'État Social et de l'État Pénal Les prisons se situent au carrefour de transformations multiples, loin d'être le fruit d'une unique rationalité étatique, qui produisent deux processus combinés : Un processus de responsabilisation des personnes avec des troubles psychiatriques :Déshospitalisation de la psychiatrie publique : Depuis les années 1960, la psychiatrie publique a connu une diminution importante du nombre de lits et la multiplication de structures extrahospitalières. Cependant, ce tournant ambulatoire inachevé rend les trajectoires des patients plus dépendantes de leurs ressources propres (financières, familiales, sociales). La psychiatrie est questionnée sur sa capacité à ne pas "abandonner sans filer les malades mentaux dans la société de la précarité". Évolution du regard des tribunaux : Si au 19ème siècle, les troubles mentaux (démence) exemptaient de prison (article 64 du code pénal), le regard a évolué. Les troubles psychiatriques "exemptent de moins en moins de prison les personnes qui en souffrent". Des travaux montrent que "Les malades qui se soignent sont protégés de la prison tandis que les malades qui ne se soignent pas ils sont plus souvent directement conduits". Inflation carcérale : La population carcérale a explosé en France, passant d'environ 29 000 en 1970 à plus de 80 000 aujourd'hui. Ce quasi-triplement est dû à l'allongement des longues peines et l'accélération des courtes peines (comparution immédiate), où les troubles psychiatriques passent souvent inaperçus. Cette "nasse carcérale" capture de nombreux hommes (96,5% de la population carcérale) et femmes avec des troubles, les prenant dans un cercle vicieux de "portes tournantes de la prison" (revolving doors) entre courts séjours et précarité. Ce phénomène est encore plus marqué aux États-Unis. Développement des soins psychiatriques en prison : L'existence, bien que limitée, d'une offre de soins psychiatriques organisée par le service public hospitalier en prison (consultations, hospitalisation de jour, unités d'hospitalisation complète) devient "une justification de plus en plus invoquée par certains magistrats à l'heure de prononcer une peine de prison avec l'argument qu'il ou elle y trouvera des soins adaptés". Un processus de psychologisation des personnes qui ont commis des infractions pénales :La peine comme moyen de "traiter" le crime : Dès le 19ème siècle, le projet de la prison moderne inclut l'idée de "méthode curative" pour "traiter le crime comme les autres espèces de folie". Cette ambition thérapeutique persiste, reposant sur le postulat que le délinquant/criminel souffre d'une "folie partielle", "dégénérescence", "altération morale", ou "trouble de la personnalité ou du comportement". Généralisation de la psychologisation des rapports sociaux : Cette ambition s'inscrit aujourd'hui dans un mouvement plus large de psychologisation de la société. Soins pénalement ordonnés : Des dispositifs tels que les soins pénalement ordonnés (étudiés par Virginie Gautron) obligent des personnes (initialement auteurs d'infractions sexuelles, puis violences aggravées, terrorisme) à un suivi psychiatrique/psychothérapeutique après leur peine. Le soin devient un "auxiliaire de la peine", une façon de garder les individus "sous main de justice" et potentiellement justifier un retour en prison. Cette logique est devenue "presque un automatisme pour les juges d'application des peines". Injonction à se soigner en prison : Ce processus crée en prison une "injonction permanente" aux condamnés, même sans diagnostic, à "entreprendre des soins" avec l'idée que la détention devrait être "un temps de traitement". Des programmes de prévention de la récidive se développent. Résistances professionnelles : Ce projet psychocriminologique rencontre des résistances, notamment de la part des conseillers pénitentiaires d'insertion et de probation (CPIP), dont la mission s'est judiciarisée, et des équipes psychiatriques hospitalières qui ont le sentiment d'être "enrôlées malgré eux dans une mission de contrôle pénal" et voient leur file active de patients augmenter. Ces deux processus modifient la composition de la population pénale et le regard porté sur elle. Une Institution Fragmentée : L'Illustration du Cas de Monsieur Cadar Contrairement à l'idée d'une hybridation harmonieuse ("Le vilain métier de punir serait ainsi retourné dans le beau métier de guérir", Foucault), la prison est une institution fragmentée où les logiques institutionnelles se superposent plus qu'elles ne s'articulent. Le cas de Monsieur Cadar, un homme en état de "décompensation psychotique" agressant un surveillant, illustre cette superposition dramatique : Il est hospitalisé sans consentement (incapable de consentir aux soins). Il est sanctionné disciplinairement (détenu violent à punir). Il est condamné pénalement (justiciable devant répondre de ses actes). Cette vignette montre comment différentes logiques (sanitaire, disciplinaire, judiciaire) s'appliquent à une même personne, parfois de manière contradictoire. La Composition Sociale de la Population Carcérale et les Inégalités de Santé "Il n'y a pas d'égalité devant la prison". L'incarcération procède d'une "sélection sociale", déterminée par les politiques pénales et l'activité de la chaîne pénale et médico-sociale. Les statistiques pénitentiaires montrent une composition très spécifique : hommes (96,5%), jeunes (moitié < 30 ans), d'origine populaire, issus des minorités racisées, forte représentation d'étrangers, souvent sans diplôme ni emploi officiel. Les chiffres de l'étude de Camille Lance montrent que 75% des hommes détenus ont subi des formes de négligence ou d'abus dans l'enfance (85% des femmes). Ce sont "certains individus" qui sont pris dans les "parcours de soins entre prison, précarité et psychiatrie". Ce "cercle vicieux" est difficile à briser, comme l'illustre le cas de Boris, un homme pris dans des allers-retours entre prison et tentatives de suivi psychiatrique/social. Dialogue entre Épidémiologie et Sociologie : Questions Soulevées par les Chiffres L'étude sur la santé mentale des sortants de prison, à laquelle Camille Lance a participé en tant que chef de projet, est "très psychiatrique, très médicale". Ses chiffres sont "révélateur[s] de la position qu'occupe[nt] les psychiatres qui l'ont mené". Ils "ne sont pas neutres". Ces chiffres : "Ne disent rien des associations possibles entre troubles psychiatriques et motifs d'incarcération". "N'explore[nt] d'ailleurs pas les possibles troubles de la personnalité ou du comportement qui pourrait être associé à des infractions sanctionnées d'une peine de prison". "Prennent position qui refusent de participer au processus de psychologisation des auteurs d'infraction pénale". "Suggèrent une stricte délimitation médicale du mandat de la psychiatrie en milieu pénitentiaire". Proposent de "porter le regard sur la sortie de prison" (moment de forte surmortalité) pour "visibiliser voire même de construire un problème de santé publique" et convaincre les autorités d'investir (équipes mobiles, mobilisation de la psychiatrie publique extérieure). La sociologie peut venir "étouffer" cette mesure épidémiologique en soulevant des questions : Place et rôle de la prison dans l'émergence des troubles : Les troubles préexistent-ils ou sont-ils produits/attisés par la détention ? Il faut penser l'articulation entre souffrance psychique et expérience carcérale. Les travaux d'Alexis Van Nasbrook sur les suicides en prison (risque 10x supérieur chez les hommes, 40x chez les femmes) montrent le rôle précipitant des événements carcéraux (quartier disciplinaire x20, première semaine x7, annonce condamnation, transfert). Il faut compléter en réfléchissant à la construction des "carrières de malades" en prison, façonnée par des facteurs sociaux et institutionnels. Existence des troubles mesurés pour les personnes concernées : Les chiffres épidémiologiques reposent sur des questionnaires structurés basés sur des classifications (DSM/CIM) et des comptages de symptômes ("objectivation statistique"). Ils "ne dit rien des subjectivités", des "réalités vécues", des troubles tels qu'ils sont vécus. La question est de savoir si "Les troubles mesurés existent-ils toujours pour les personnes concernées ?". Compte tenu du profil social des détenus (hommes jeunes, faiblement diplômés, classes populaires) ayant une "certaine distance aux soins de santé mentale", il existe une "très grande hétérogénéité des usages sociaux de la santé mentale en prison". Certains refusent les soins/contestent la pathologisation, rejetant la stigmatisation. D'autres saisissent l'offre pour se raconter, comprendre, trouver du réconfort. Il est important d'explorer les "masculinités des groupes populaires" face à la santé mentale dans un contexte (la prison) de "tension constante entre une incitation à se soigner (...) et une pénurie de soin". Conclusion : L'Intérêt du Dialogue Épidémiologie-Sociologie La recherche sur les inégalités de santé a "beaucoup à gagner du dialogue entre épidémiologie et sociologie". Ce dialogue prend du temps et doit se faire dans le "respect mutuel des cadres épistémologiques de chaque discipline". Ensemble, ces disciplines peuvent "décrire ces inégalités mais également de tenter de les expliquer", ce qui est une tâche complexe. Citations clés :

      "La prison pour asile" "une vocation asilaire que l'hôpital psychiatrique n'a plus" (rapports parlementaires) Prévalences en prison "toujours supérieures et parfois très supérieures" à la population générale. Psychiatrie questionnée sur sa capacité à "ne pas abandonner sans filer les malades mentaux dans la société de la précarité". "Les malades qui se soignent sont protégés de la prison tandis que les malades qui ne se soignent pas ils sont plus souvent directement conduits" (La Ramaï). "portes tournantes de la prison" (revolving doors) Soins en prison : "une justification de plus en plus invoquée par certains magistrats à l'heure de prononcer une peine de prison avec l'argument qu'il ou elle y trouvera des soins adaptés". "Le vilain métier de punir serait ainsi retourné dans le beau métier de guérir." (Foucault) "Il n'y a pas d'égalité devant la prison" (Didier Fassin) "une injonction permanente (...) à entreprendre des soins avec l'idée que la détention devrait être (...) un temps de traitement". Chiffres épidémiologiques : "Ces chiffres par exemple ne disent rien des associations possibles entre troubles psychiatriques et motifs d'incarcération". Objectivation statistique vs subjectivités : "Les troubles mesurés existent-ils toujours pour les personnes concernées ?". "une très grande hétérogénéité des usages sociaux de la santé mentale en prison". "une tension constante entre une incitation à se soigner (...) et une pénurie de soin". "la recherche sur les inégalités de santé a beaucoup pas gagner du dialogue entre épidémiologie et sociologie". Implications / Perspectives :

      La compréhension de la santé mentale en prison nécessite une approche interdisciplinaire (épidémiologie, sociologie, psychiatrie, histoire, droit). Les transformations institutionnelles de l'État social et pénal ont des conséquences directes sur la population carcérale et la santé mentale. La sélection sociale en amont de l'incarcération est un facteur déterminant des inégalités de santé observées en prison. Les chiffres épidémiologiques, bien qu'utiles pour documenter la prévalence, ne suffisent pas à expliquer la complexité des expériences vécues et l'interaction entre santé mentale et contexte carcéral. La sociologie peut éclairer les parcours individuels, l'impact des conditions de détention et les usages sociaux des soins en milieu carcéral. La collaboration entre chercheurs et professionnels de différents champs est essentielle pour aborder la problématique des inégalités de santé en prison.

    1. NOTE D'INFORMATION

      Objet : Analyse des enjeux et des défis de l'orientation au collège et au lycée en France, selon un rapport de la Cour des comptes et les réponses du Ministère de l'Éducation nationale et de l'Association Régions de France.

      Date : 19 mars 2025

      Sources :

      Excerpts from "https://www.ccomptes.fr/sites/default/files/2025-03/20250319-RPA2025-volume1-orientation-college-lycee.pdf"

      1. Introduction : L'orientation au cœur du système éducatif – Enjeux et Contexte

      L'orientation des élèves au collège et au lycée est un processus essentiel qui les guide à travers leur parcours scolaire et vers l'insertion professionnelle.

      Ce processus est décrit comme un "cheminement qui court sur plusieurs années".

      Les moyens publics consacrés à cette politique sont estimés à environ 400 M€ et 8 000 équivalents temps-plein.

      Les enjeux de l'orientation sont considérables, tant au niveau individuel que collectif. Ils visent à garantir l'équité, en dépassant les déterminismes sociaux, de genre et territoriaux, et à répondre aux besoins en compétences du marché de l'emploi et de la société.

      La loi du 5 septembre 2018 a établi une compétence partagée sur l'information à l'orientation entre les régions académiques et les régions, entraînant une réorganisation de l'Onisep.

      Cependant, la loi "ne permet pas d’aboutir à une situation claire". Une "multitude d’acteurs, publics et privés, ont émergé et jouent un rôle croissant".

      Les jeunes et leurs parents se sentent souvent "en manque de repères face à une offre de formation foisonnante, aux règles du jeu opaques et instables".

      Un sondage de mai 2024 par la Cour indique que 35 % des collégiens et 25 % des lycéens se disent insatisfaits des informations et conseils reçus. De plus, près de 20 % des bacheliers qui poursuivent leurs études regrettent leur choix d'inscription.

      2. Déterminismes et Inégalités dans l'Orientation

      Le processus d'orientation est fortement influencé par les résultats scolaires, l'image sociale des filières et l'offre de formation.

      Déterminismes sociaux et territoriaux : L'orientation "tend donc plutôt à amplifier le déterminisme social". Après la classe de troisième, près d'un tiers des élèves (32% en 2022) rejoignent la voie professionnelle.

      Les élèves issus de milieux peu favorisés sont davantage représentés dans les lycées professionnels.

      "À notes équivalentes, les familles de milieux défavorisés font beaucoup moins souvent des choix d’orientation vers la seconde générale et technologique" et les décisions des équipes éducatives peuvent être influencées par le contexte social de l'élève.

      Le lieu de résidence joue également un rôle, notamment pour les élèves en zone rurale éloignée où la voie professionnelle est plus souvent choisie en raison de difficultés de mobilité et d'un attachement au marché du travail local.

      Le dispositif des Cordées de la réussite, bien que non évalué de manière exhaustive, semble avoir un impact positif sur le taux de proposition d'admission dans Parcoursup pour les élèves participants.

      Déterminismes de genre : Les choix d'orientation "restent très genrés à tous les niveaux de formation". Les parents jouent un rôle majeur dans cette reproduction sociale.

      Malgré des actions visant à sensibiliser et à lutter contre les stéréotypes, la mixité nécessite "une action concertée et continue de l’enseignement supérieur et du monde du travail dans les secteurs traditionnellement très genrés".

      3. La Valorisation des Enseignements Professionnels et Technologiques

      Un objectif prioritaire du gouvernement est de "Refonder l’orientation des élèves", en mesurant le nombre de métiers connus en fin de troisième et le taux d'élèves ayant bénéficié de la découverte des métiers.

      La découverte des métiers est généralisée "dans toute la mesure du possible" dans tous les collèges depuis la rentrée 2023.

      La part des élèves en voie professionnelle a diminué globalement (de 38% en 2007 à 32% en 2022), mais les demandes des familles pour cette voie ont augmenté récemment (de 32,5% en 2020 à 36,2% en 2024), témoignant d'une "réels progrès dans la valorisation de la voie professionnelle".

      Cependant, le secteur industriel souffre encore d'un "déficit d’image".

      La coexistence des voies professionnelle et technologique, avec une "image de relégation attachée à la voie professionnelle auprès des familles et des enseignants", contribue à ce sentiment.

      Un rapprochement entre ces voies est suggéré, comme le modèle suisse où la filière professionnelle est "fortement valorisée" et suivie par 70 % des élèves, incluant les enfants de cadres.

      4. L'Accompagnement à l'Orientation : Un Besoin Insatisfait

      Il existe un "important besoin d’accompagnement à l’orientation insatisfait" qui a conduit à l'émergence d'une "multitude d’intervenants".

      Les lycéens s'informent principalement via les forums/salons de l'étudiant (41%), les journées portes ouvertes/immersions dans les établissements d'enseignement supérieur (38%) et le site de l'Onisep (36%).

      Le marché privé du conseil en orientation se développe en réponse à ce besoin, "accroissant en retour les inégalités entre les jeunes".

      5. L'Organisation et les Moyens de l'Orientation dans les Établissements Scolaires

      Le corps des psychologues de l'éducation nationale (psyEN), remplaçant les conseillers d'orientation-psychologues, s'est recentré sur le suivi psychologique des élèves.

      Le ratio d'élèves par psyEN est de 1 147 élèves par psychologue de l’Éducation nationale, ne permettant pas une action individualisée.

      Le transfert des missions des psyEN vers les régions est jugé "peu praticable, à l’heure actuelle".

      La mission d'éducation à l'orientation, confiée aux enseignants, est "insuffisamment accompagnée".

      Bien que l'orientation fasse partie de leur référentiel de compétences, les maquettes de formation initiale ne comportent pas de "formation obligatoire sur ce sujet".

      Les heures d'accompagnement à l'orientation prévues au collège et au lycée général et technologique ne sont pas des "moyens fléchés" et sont perçues comme "non financées et non obligatoires".

      La Cour estime le besoin de financement complémentaire pour sécuriser ces heures à entre 134 M€ et 309 M€ selon les modalités.

      Deux outils sont en développement pour appuyer les enseignants : le programme Avenir(s) de l'Onisep (doté de 20 M€ sur 10 ans) qui vise à structurer l'outillage pédagogique et à créer un portfolio pour les élèves, et le module MonProjetSup (développé par le ministère de l'enseignement supérieur en lien avec l'Onisep) pour aider les lycéens à préparer leurs vœux dans Parcoursup.

      L'absence de transparence des critères de décision dans Parcoursup est soulignée comme une difficulté.

      6. La Répartition des Compétences entre l'État et les Régions

      La répartition actuelle des compétences entre l'État et les régions "prête à confusion".

      Les régions ont la responsabilité de la coordination et de l'animation de l'orientation tout au long de la vie pour les adultes, mais leur compétence pour les publics scolaires, étudiants et apprentis se limite à "l'information sur les métiers et les formations".

      La phrase de la loi "la région organise des actions d’information", et non "les actions d’information", est jugée limitée.

      Les régions revendiquent un "transfert plein et entier de la compétence « orientation »", mais contestent l'analyse de la Cour selon laquelle cela impliquerait de leur confier les décisions d'orientation et d'affectation.

      Elles estiment que ces décisions relèvent de l'Éducation nationale.

      La profusion de sites d'information et d'orientation (avec des exemples de sites publics et privés) illustre cette complexité.

      Le développement de la plateforme Avenir(s) par l'Onisep est perçu par les régions comme faisant "doublon avec la compétence régionale sur l’information métiers" et rendant "invisibles les sites que toutes les régions ont développés".

      Les régions demandent une clarification en remplaçant "des actions" par "les actions" dans le code du travail et un transfert global des missions et moyens nécessaires.

      L'enveloppe actuelle de 8 M€ transférée aux régions pour l'information territorialisée est jugée "très insuffisante" au regard de leur engagement réel (plus de 170 M€ en 2024).

      La mise en synergie des nombreuses structures accueillant les jeunes (CIO, bureaux Information Jeunesse, missions locales, structures régionales, chambres consulaires) est nécessaire pour améliorer l'efficience et la lisibilité pour le public.

      Le "bassin d’éducation et de formation apparaît comme le niveau pertinent pour que les acteurs se connaissent et travaillent de manière harmonisée".

      7. La Transformation de l'Offre de Formation Professionnelle

      L'offre de formation "détermine pour une large part l’orientation des jeunes".

      Les taux d'emploi à six mois des diplômés du baccalauréat professionnel (48%) et du CAP (33%) sont relativement bas (mais plus élevés que pour les bacheliers généraux ou technologiques n'ayant pas poursuivi d'études supérieures).

      L'application Inserjeunes vise à fournir des informations sur les taux de poursuite d'études et d'insertion.

      La "transformation indispensable de l'offre de formation professionnelle" est une priorité pour une meilleure insertion des jeunes en lien avec les besoins des territoires.

      L'affectation dépend des capacités disponibles, et 25% à 30% des élèves n’obtiennent pas leur premier vœu, ce qui "peut renforcer le sentiment d’orientation subie".

      La baisse prévue des effectifs de lycéens à partir de 2028 offre une "opportunité de conduire cette transformation".

      Les campus des métiers et des qualifications sont un exemple d'articulation entre offre de formation et orientation, mais leur impact est variable.

      8. Conclusions et Recommandations de la Cour des Comptes

      Malgré les efforts, l'orientation reste marquée par de forts déterminismes sociaux, territoriaux et de genre.

      L'éducation à l'orientation au collège est difficile à mettre en place en raison du manque de formation des enseignants et de l'absence de moyens fléchés pour les heures dédiées.

      La répartition des compétences entre l'État et les régions doit être clarifiée, et une meilleure articulation locale des acteurs est nécessaire.

      L'orientation est indissociable de l'offre de formation professionnelle, qui doit s'adapter aux besoins locaux.

      La Cour formule les recommandations suivantes :

      • Insérer dans les maquettes de formation initiale des enseignants un module obligatoire sur la compétence orientation.
      • Adapter l'emploi du temps des professeurs principaux ou référents pour leur permettre d'assurer leur mission d'orientation.
      • Expérimenter avec une ou plusieurs régions volontaires un rapprochement entre les voies du lycée, professionnelle, technologique et générale.

      9. Réponses et Perspectives

      Le Ministère de l'Éducation nationale signale qu'une concertation nationale sur l'orientation est en cours (novembre 2024 - mars 2025) pour "faire de la politique d’orientation un levier de réussite et d’égalité des chances".

      Il souligne l'importance du programme Avenir(s) pour créer un "service public national, gratuit, exhaustif et égalitaire de l’orientation".

      Concernant la recommandation 1, le ministère indique que la compétence d'orientation est déjà incluse dans le référentiel de formation des enseignants et que la réforme de la formation initiale permettra de réfléchir à l'évolution des contenus.

      Il souligne également l'effort en matière de formation continue sur l'orientation (50 294 journées en 2023-2024).

      Concernant la recommandation 2, le ministère ne prévoit pas de quantifier le volume horaire de la mission d'orientation pour éviter la rigidité et ne pas nuire à la mission de coordination.

      Il précise que le dispositif actuel permet l'intervention de divers personnels et que tous les professeurs ont un rôle en matière d'orientation.

      Concernant la recommandation 3, le ministère y voit un intérêt pour une perspective décloisonnée et travaillera en articulation avec l'enseignement supérieur.

      Le ministère mentionne également une expérimentation d'"option santé" dans les lycées de déserts médicaux pour lutter contre les déterminismes territoriaux.

      Il précise que MonProjetSup est intégré à Avenir(s) et vise à fournir des suggestions personnalisées basées sur les données de Parcoursup.

      Il rappelle le rôle des psyEN en matière d'orientation, malgré une charge de travail accrue sur d'autres missions.

      Régions de France partage le constat de la confusion des compétences et le manque de repères pour les jeunes et les parents.

      Elles souscrivent aux objectifs de lutte contre les déterminismes et de valorisation des métiers.

      Elles partagent aussi le constat de l'émergence des acteurs privés.

      Cependant, elles contestent l'idée que leur demande de compétence pleine et entière implique de prendre les décisions d'orientation et d'affectation, qui relèvent selon elles de l'Éducation nationale.

      Elles demandent une clarification législative en remplaçant "des actions" par "les actions" dans le code du travail.

      Elles voient la plateforme Avenir(s) comme un doublon.

      Elles réclament un transfert global des missions et moyens nécessaires, incluant les équipes des directions territoriales ONISEP et les directeurs de CIO, et jugent l'enveloppe actuelle transférée pour l'information "très insuffisante".

      Elles soulignent que les moyens dédiés aux CIO diminuent et que les psyEN se concentrent davantage sur le suivi psychosocial, déportant l'accompagnement de l'affectation vers les régions sans moyens additionnels.

      Régions de France souhaite que le rapport de la Cour contribue à une clarification de la compétence et regrette l'absence de recommandation sur ce sujet.

      Elles souscrivent à l'expérimentation de rapprochement des voies au lycée et partagent le constat de l'inadéquation entre orientation et besoins réels des territoires.

      En synthèse, les points clés à retenir sont :

      Le système d'orientation actuel est complexe, coûteux, et manque de clarté, entraînant une insatisfaction chez les jeunes et leurs parents.

      L'orientation est fortement marquée par les déterminismes sociaux, de genre et territoriaux, avec un besoin de renforcer l'équité.

      La valorisation de la voie professionnelle est un enjeu majeur, avec un modèle comme la Suisse cité en exemple.

      Le besoin d'accompagnement à l'orientation est important mais insuffisamment satisfait par le système éducatif public, favorisant l'émergence d'acteurs privés et l'accroissement des inégalités.

      La formation des enseignants à l'orientation est jugée insuffisante.

      Les heures dédiées à l'orientation dans les établissements manquent de moyens fléchés et ne sont pas toujours concrétisées.

      La répartition des compétences entre l'État et les régions est source de confusion, et les régions réclament une compétence pleine et entière sur l'information et l'orientation métiers, mais pas sur les décisions d'affectation.

      La profusion d'outils et de structures d'information et d'orientation nécessite une meilleure synergie et lisibilité locale.

      L'offre de formation professionnelle doit être transformée pour mieux s'adapter aux besoins locaux et offrir des parcours d'insertion réussie.

      Des expérimentations, comme le rapprochement des voies au lycée et l'option santé, sont des pistes explorées pour améliorer le système.

      Cette note d'information résume les principaux éléments et les divergences de points de vue entre les acteurs sur la manière d'améliorer l'orientation des jeunes en France.

    1. Depending on your editor and its configuration, you probably now see some “Unbound module” errors about OUnit2 and Sum. Don’t worry; the code is actually correct. We just need to set up dune and tell it to link OUnit. Create a dune file and put this in it:

      I don't know if this was mentioned in previous chapters but on windows I actually needed to install ounit2 with opam first...

    1. A common measurement of code bulk is to count the number of lines of code, and a common measure for software productivity is to count the number of lines of code produced per unit time. These numbers are relatively easy to collect, and have actually demonstrated remarkable success in getting a quick reading of where things stand. But of course the code bulk metric pleases almost no one. Surely more than code bulk is involved in software productivity. If productivity can really be measured as the rate at which lines of code are produced, why not just use a tight loop to spew code as fast as possible, and send the programmers home?

      The modern incarnation of the fallacy of bulk-is-better is judging the quality of a project (esp. a component) by being overly concerned with whether it is more or less "active" than some other project.

      The question posed here is an obvious response for when you encounter instances of the bulk-is-better mindset (although it's not as easy to automate as Cox suggests here).

      I've often mused about the thought of pulling a Reddit* and automating some level of churn within a codebase for a project that's hosted on e.g. GitHub for the sole purpose of making sure that it appears "active".

      * A story about the early days of Reddit has become well-known after the creators volunteered some information about the early days when they made use of a bunch of "fake" accounts to submit links that the creators had aggregated/curated for the purpose of seeding the site.

    2. The way you make code last a long time is you minimize dependencies that are likely to change and, to the extent you must take such dependencies, you minimize the contact surface between your program and those dependencies.

      It's strange that this basic truth is something that has to be taught/explained. It indicates a failure in analytical thinking, esp. regarding those parts of the question which the quoted passage is supposed to be a response to.

    1. Reaching basic code quality prevents large merge conflicts, and allows for testing so PRs don’t break functionality. It also allows for a speedy pr review process. Until then, I’ll be maintaining an active fork with best practices for python development. Feel free to fork this code to make the idea work in your repo.
    1. I'm hoping with the recent additions of to/from_dict(), we'll be able to support serialization of many llama index objects. Path to model definition code (hopefully) won't be necessary for most use cases.

      IMPLEMENTATION

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      We thank the reviewer for their constructive comments and the fair and interesting discussion between reviewers.

      __Reviewer #1 __

      We are delighted to read that the reviewer finds the manuscript “very clear and of immediate impact […] and ready for publication” regarding this aspect. We have toned down the conclusion, proposing rather than concluding that “the incapacitation of Cmg2[KO] intestinal stem cells to function properly […] is due to their inability to transduce Wnt signals”.

      We have addressed the 3 points that were raised as well as the minor comments.

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength.

      Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      We now provide and explanation and characterization the Cmg2KO mice. Exon 3 indeed only encodes a short 24 amino acid sequence. This exon however encodes a ß-strand that is central to the vWA domain of CMG2, and therefore critical for the folding of this domain. As now shown in Fig. S1c, CMG2Dexon3 is produced in cells but cleared by the ER associated degradation pathway, therefore it is only detectable in cells treated with the proteasome inhibitor MG132, at a slightly lower molecular weight than the full-length protein. This is consistent, and was inspired by the fact that multiple Hyaline Fibromatosis missense mutations that map to the vWA domain lead to defective folding of CMG2, further illustrating that this domain is very vulnerable to modifications. In Fig. S1c, we moreover now show immunoprecipitation of Cmg2 from colonic tissue of wild-type (WT) and knockout (KO) mice, which confirm the absence of Cmg2 protein in Cmg2KO samples.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      The expression pattern of Cmg2 in the gut has not been characterized and is indeed essential to understanding its function. To address this gap, we now added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      In the RESULTS section we now mention:

      To investigate Cmg2 expression in the gut, we first analyzed publicly available spatial and scRNA-seq datasets to identify which cell types express Cmg2 across different gut regions. Spatial transcriptomic data from the mouse small intestine and colon revealed that Cmg2 is broadly expressed throughout the gut, including in the muscular, crypt, and epithelial layers (Fig. 1A–C). To validate these findings, we performed RNAscope in situ hybridization targeting Cmg2 in the duodenum and colon of wild-type mice. The expression pattern observed was consistent with the spatial transcriptomics data (Fig. 1D–E). We then analyzed scRNA-seq data from the same dataset to assess cell-type-specific expression in the mouse colon. Cmg2 was detected at varying levels across multiple cell types, including enterocytes and intestinal stem cells, as well as mesenchymal cells, notably fibroblasts.

      Of note for the reviewer, not mentioned in the manuscript, this wide-spread distribution of Cmg2 across the different cell types is not true for all organs. We have recently investigated the expression of Cmg2 in muscle and found that it is almost exclusively expressed in fibroblasts (so-called fibro-adipocyte progenitors) and very little in any other muscle cells, in particular fibers.

      Interestingly also, as now mentioned in the manuscript and shown in Fig. S1,the ANTXR1 protein, which is highly homologous to Cmg2 at the protein level and share its function of anthrax toxin receptor, displayed a much more restricted expression pattern, being confined primarily to fibroblasts and mural cells, and notably absent from epithelial cells. This differential expression highlights a potentially unique and epithelial-specific role for Cmg2 in maintaining intestinal homeostasis.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      We agree that understanding whether Cmg2KO intestinal stem cells are intrinsically unable to transduce Wnt signals, or whether this defect is contextually induced following injury (such as DSS treatment), is a critical point.

      As a first line of evidence, we show than under homeostatic condition, Wnt signaling appears largely intact in Cmg2KO crypts, with comparable levels of ß-catenin and expression levels of canonical Wnt target genes (e.g., Axin2, Lgr5) to those observed in WT animals (Figs. S1j-l and S3d-e). This indicates that Cmg2 is not essential for basal Wnt signaling under steady-state conditions.

      These findings thus support the idea that the requirement for Cmg2 in Wnt signal transduction is context-dependent—not only at the tissue level but also temporally, being specifically required during regenerative processes or in altered microenvironments such as during inflammation or epithelial damage. This context-dependence may reflect changes in the composition or accessibility of Wnt ligands, receptors, or matrix components during repair, where Cmg2 could play a scaffolding or stabilizing role.

      These aspects are now discussed in the text.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      We have addressed points 1 and 2, and provided evidence and discussion for Point 3.

      __Minor points __

      1- Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.

      We thank the reviewer for picking up this error. We modified : “In (c), results are median” and “In (d, f and g) Results are mean ± SEM.”

      2- Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.

      The legend was clarified as followed : “(d) __Body weight loss, and (e) aspect of the feces and presence of occult blood were monitored and used to evaluate Disease activity index in (f).__

      3 - Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).

      The Y axis was changed from 'survival' to “percentage of mice not reaching the euthanasia criterion”.

      4 - Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).

      Because the aera of tissue can vary significantly between damages, regenerating and undamaged tissue, we reported the length of epithelium screened as suggested : “per 800um tissue screened” in Fig S1c and Fig 2b.

      5 - Figure 2E should read "percent involvement"

      This has been corrected.

      6 - Figure 2J should read "lipocalin..."

      This has been corrected.

      7 - In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.

      We added : “In addition to the fetal-like stem cell marker Ly6a, which is a YAP/TAZ target gene, we measured the mRNA levels of two others YAP target genes, Cyr61 and CTGF”

      8 - In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.

      We added: “Dotted line represents the average homeostatic levels of Cmg2 WT” in figure legends

      9 - In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.

      It was modified as suggested : “(c) Results are presented as violin plot of the Ly6a mean intensity of all data points from the coarse-grain analysis. Each symbol represents the mean per mice of n=4 mice per condition. Results are mean ± SEM. Dotted line represents the average homeostatic levels of Cmg2WT. P values obtained by two-tailed unpaired t test.”

      10 - In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.

      We unfortunately do not understand the suggestion of the reviewer, given that these panels show the 7+3 time point.

      11 - In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.

      We added : “Initially, we quantified the expression of key molecular components involved in Wnt signaling in mice colon 3 days after DSS withdrawal using qPCR.”

      12 - Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.

      We have changed the text and now mention “Next, we analyzed β-catenin activation in the colon of Cmg2WT and Cmg2KO mice during the recovery phase.”

      13 - A typo in the discussion: tunning for tuning.

      This has been corrected.

      14 - In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.

      We have now used Cmg2, whenever referring to the mouse protein.

      15 - Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code used for immunofluorescence analysis”.

      This has been corrected.

      __Reviewer #2 __

      We are very pleased to read that the reviewer found the study “overall well designed, meticulously carried out, and with clear and convincing results that are most reasonably and thoughtfully interpreted”.

      For this reader, one additional thought comes to mind. If I understand the field correctly it would be informative to know with greater confidence where - in what cell type, epithelial or mesenchymal - the CMG2-LRP6-WNT interaction occurs.

      This point was also raised by Reviewer I, and we have now added a new Figure 1, that describes Cmg2 expression in the gut, based both on from publicly available RNA-seq datasets and our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      After injury the CMG2-KO mouse epithelium exhibits defective WNT signal transduction - as evidenced by failure of b-catenin to translocate into the nucleus. At first glance, this result is a disconnect with the paper by van Rijin that claims the defect in Hyaline Fibromatosis Syndrome cannot be due to loss of CMG2 expression/function in the barrier epithelial cell - a claim based on the mostly normal phenotypes of human CMG2 KO duodenal organoids. But the human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated.

      We thank the reviewer for raising this point and we fully agree. We now specify in the text that the human CMG2-KO duodenoids showed blisters, indeed consistent with a defect in epithelial structure/function, and that they were grown on high Wnt media which likely obscure the CMG2 requirement.

      I think it would be informative to prepare colon organoids (and duodenoids) from WT and CMG2-KO mice to quantify their WNT dependency during establishment and maintenance of the stem-like (and WNT-dependent) state. If CMG2 acts within the epithelial cell to affect WNT signaling (regardless of WNT source), organoids prepared from colons of CMG2-KO mice would require more WNT in culture media to establish and maintain the stem cell proliferative state - when compared to organoids prepared from WT mice. This can be quantified (and confirmed molecularly by transgene expression if successful). Enhanced dependency of high concentrations of exogenous WT would be evidence for a primary defect in WNT-(LRP2)-CMG2 signal transduction localized to the epithelial barrier cell - thus addressing the apparent discrepancy with the van Rijin paper - and for my part, advancing the field. And the discovery of a defect in the epithelium itself for WNT signal transduction would implicate a biologically most plausible mechanism for development of protein losing enteropathy.

      By no means do I consider these experiments to be required for publication (especially if considered to be incremental or already defined - WNT-CMG2 is not my field of research). This study already makes a meaningful contribution to the field as I state above. But in the absence of new experimentation, the issue should probably be discussed in greater depth.

      We are working out conditions to grow colon organoids that from WT and Cmg2 KO mice, indeed playing around with the concentrations of Wnt in the various media to identify those that would best mimic the regeneration conditions. This is indeed a study in itself. We have however included a discussion on this point in the manuscript as suggested.

      __Reviewer #3: __

      We thank the reviewer for her/his insightful comments.

      The premise is that the causative germline mutated gene, CMG2/ANTRX2, may have a functional role in colonic epithelium in addition to controlling the ECM composition. There is little background information but one study has shown no primary defect in epithelial organoids grown from patients with the syndrome. This leads the authors to wonder if non-homeostatic, conditions might reveal a function role for the gene in regeneration.

      Reviewer 2 commented on the fact that “human organoids studied in the van Rijin paper, like all others, are established and cultured in very high WNT conditions, perhaps obscuring the lack of the CMG2-LRP6-WNT interaction. And in fact, the phenotypes of these human CMG2-KO duodenoids were not entirely normal - the CMG2-KO stem-like organoids (even when cultured in high WNT/R-spondin conditions) developed abnormal intercellular blisters consistent with a defect in epithelial structure/function - of unknown cause and not investigated”.

      We have now added a discussion on this point in the manuscript.

      The authors' approach to test the hypothesis is to use a mouse germline knockout model and to induce colitis and regeneration by the established protocol of introducing dextran sodium sulfate (DSS) into the drinking water for five days. In brief there is no phenotype apparent in the untreated knockout (KO) but these animals show a more severe response to DSS that requires them to be killed by 10 days after the start of treatment. This effect following phenotypic characterisation of the colonic epithelium is interpreted as showing the CMG2 is a Wnt modifier required for the restoration of the intestinal stem cell population in the final stages of repair.

      The experiment and analysis seem reasonably well executed - although a few specific comments follow below. The narrative is simple and easy to understand. However, there are significant caveats that cast doubts on the interpretation made that loss of CMG2 impairs the transition of colonic epithelial cells from a fetal like state to adult ISCs.

      First there is only a single approach and single type of experiment performed. There is a lack of independent validation of the phenotype and how it is mediated.

      We do not fully understand what type of independent validation of the phenotype the reviewer would have liked to see. Is it the induction of intestinal damage using a stress other than DSS?

      The DSS dose in this kind of experiment is often determined empirically in individual units. Here the 3% used is within published range but at upper end. The control animals show a typical response with symptoms of colitis worsening for 2-3 days after the removal of DSS and then recovery commonly over another 5-7 days. Here the CMG2 KO mice fail to recover and are killed by 9 or 10 days. The authors attempt to exploit the time course by identifying normal initial (7days) and defective late (10days) repair phases in KO animals when compared to controls. It is from this comparison that conclusions are drawn. However, the alternative interpretation might be that the epithelium of KO animals is so badly damaged, and indeed non-existent (from viewing Fig2a), that it is incapable of mounting any other response other than death and that the profiling shown is of an epithelium in extremis. The repair capability and dynamics of the KO would have been better tested under more moderate DSS challenge, if this experiment had been regarded as a pilot rather than as definitive.

      The choice of 3% DSS was in fact based on a pilot experiment. As now shown in Fig. S4, we tested different concentrations and found that 3% DSS was the lowest concentration that reliably induced the full spectrum of colitis-associated symptoms, including significant body weight loss, diarrhea, rectal bleeding (summarized in the Disease Activity Index), as well as macroscopic signs such as colon shortening and spleen enlargement. Based on these criteria, we selected 3% DSS for the study described in the manuscript.

      In this model, WT mice showed a typical progression: body weight stabilized rapidly after DSS withdrawal, with resolution of diarrhea and rectal bleeding. Histological analysis at day 9 revealed signs of epithelial regeneration, including hypertrophic crypts and increased epithelial proliferation.

      In contrast, Cmg2KO mice failed to initiate this recovery phase. Clinical signs such as weight loss, diarrhea, and bleeding persisted after DSS withdrawal, ultimately necessitating euthanasia at day 9–10 due to humane endpoint criteria. Unfortunately, this prevented us from exploring later timepoints to determine whether regeneration was delayed or completely abrogated in the absence of Cmg2.

      Regarding the severity of epithelial damage, as raised by Reviewer 1, we now provide detailed histological scoring in the supplementary data. This analysis shows that the severity of inflammation and crypt damage was similar between WT and KO animals, as were inflammatory markers such as Lipocalin-2. The key difference lies in the extent of tissue involvement. While the lesions in WT mice were more localized, Cmg2KO mice displayed widespread and diffuse damage with no sign of regeneration as shown by the absence of hypertrophic crypts and a marked reduction in both epithelial coverage and proliferative cells. Importantly, at day 7, the percentage of epithelial and proliferating cells was comparable between genotypes, further supporting the idea that Cmg2KO mice failed to initiate this recovery phase and present a defective repair response.

      The animals used were young (8 weeks) and lacked any obvious defect in collagen deposition. Does this change with treatment? Even if not, is it possible that there is a defect in peristalsis or transit time of gut contents, resulting in longer dwell times and higher effective dose of DSS to the KO epithelium?

      Collagen deposition, particularly of collagen VI, is known to increase in response to intestinal injury and plays a critical role in promoting tissue repair following DSS-induced damage (Molon et al., PMID: 37272555). As suggested, we investigated whether Cmg2KO mice exhibit abnormal collagen VI accumulation following DSS treatment.

      Our results show that, consistent with published data, WT mice exhibit a marked increase in collagen VI expression during the acute phase of colitis, with levels returning toward baseline following DSS withdrawal. A similar expression pattern was observed in Cmg2KO mice, with no significant differences in Col6a1 mRNA levels between WT and KO animals throughout the entire time course of the experiment. This observation was further confirmed at the protein level by western blot and immunohistochemistry analyses, suggesting that the impaired regenerative capacity observed in Cmg2KO mice is independent of Collagen VI.

      Regarding the possibility of altered peristalsis or intestinal transit time contributing to increased DSS exposure in KO mice, this is indeed a possibility. Although we did not directly measure gut motility in this study, we did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Indeed, during the experiment, animals were single caged for 30min in order to collect feces and no difference in the amount of feces collected was observed between WT and KO mice, arguing against a substantial difference in transit time (see figure below). The possible altered peristalsis and these observations are now mentioned in the discussion.

      Is CMG2 RNA and protein expressed in the colonic epithelium? It is not indicated or tested in the submitted manuscript. This reviewer struggled to find evidence, notably it did not seem to be referenced in the organoid paper they reference in introduction (ref 13).

      This very valid point was also raised by Reviewers 1 and 2. The expression pattern of Cmg2 in the gut has indeed not been characterized and is essential to understanding its function. To address this gap, we added a figure (Fig. 1) providing data from publicly available RNA-seq datasets and from our RNAscope experiments on Cmg2WT mice. Of note, we unfortunately have never managed to detect Cmg2 protein expression by immunohistochemistry of mouse tissue with any of the antibodies available, commercial or generated in the lab.

      __Specific comments: __

      Figure 3 c-e and associated text are confusing. In c the Y scale seems inappropriate to show percentages up to 15,000%.

      In this graph values are normalized to homeostatic level of WT mice which represent 100%

      In d and e the use of percentages may by correct. However, it is claimed in text that Cty61 and CTFG are upregulated in the KO. That is not what the plots appear to show as the compare to WT untreated cells, in which case the KO have not downregulated these genes in the way the controls have.

      As clarified in the text, under regenerative conditions, a transient activation of YAP signaling is crucial to induce a fetal-like reversion of intestinal stem cells. However, in a subsequent phase, the downregulation of YAP and the reactivation of Wnt signaling are necessary to complete intestinal regeneration. Several studies have highlighted a strong interplay between the Wnt and YAP pathways, suggesting that their coordinated regulation is essential for effective gut repair. Nevertheless, the precise mechanisms governing this interaction remain incompletely understood.

      In our model, this critical transition—YAP downregulation and Wnt reactivation—appears to be impaired. CMG2 may either hinder Wnt reactivation directly, or lead to sustained YAP signaling, which in turn suppresses activation of the Wnt pathway. Further studies, using in-vivo model and organoid models, will be necessary to understand the mechanistic role of Cmg2 in this regulatory process.

      A precision of the figure has been updated as followed: both of which were significantly upregulated in the injured colons of Cmg2KO mice compared to DSS-injured Cmg2WT mice

      __**Referees cross-commenting** __

      Rev2 Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      We have indeed addressed these 2 points.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. […] For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      The representative image has been updated, and a transverse section has been added to better illustrate that, although both epithelium and crypt structures can be present, the epithelial morphology differs significantly. Indeed, the regenerating epithelium of Cmg2WT mice displays a thick epithelial layer with well-polarized epithelial cells, whereas in cmg2KO mice, the epithelium appears atrophic, characterized by a thinner epithelial layer and elongated epithelial cells.

      __Rev 3 __

      This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      This has now been done.

      __Rev 1 __

      Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      As suggested, we included a detail of histological score including the crypt damage score in Supplementary Fig 3i showing no significant differences in crypt damage between Cmg2WT and Cmg2KO mice.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      This is an interesting point, and we now allude to this aspect in the manuscript.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      We have attempted to obtain and purchase Heparin Red to perform this assay. Unfortunately, we have not obtained the reagent, which has never been delivered. We now also mention the following in the Discussion:

      One could envision that Cmg2KO mice have a defect in peristalsis resulting in longer dwell times and possibly higher effective dose of DSS to the KO epithelium. We however did not observe any signs of intestinal obstruction or fecal retention in Cmg2KO mice. Animals were single-caged for 30 min to collect feces. We did not observe any difference in amounts collected from WT and KO mice, arguing against a substantial difference in transit time of gut contents. Moreover, if DSS affected the recovery, one would have expected a more severe histological phenotype in the colon of Cmg2KO since the tissue likely already attempts regeneration during the 7 days of DSS treatment. But this was not the case. Therefore, while we cannot formally rule out the presence of residual DSS in Cmg2KO mice during the DSS withdrawal phase, there is currently no indication that this was the case.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      We thank the reviewer for this statement.

    2. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #1

      Evidence, reproducibility and clarity

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. This aspect of the paper, I think, is ready for publication, and would merit immediate dissemination on its own. It is great that the manuscript is in bioRxiv already.

      I am not so thoroughly convinced about the mechanism that the author propose to explain the incapacitation of Cmg2[KO] intestinal stem cells to function properly. The authors propose that it is due to their inability to transduce Wnt signals, and while this is plausible, I think there are few things that the paper should contain before this can be proposed firmly:

      Point #1

      The mouse mutant is just described as 'KO', referring to the previous work by the authors. The cited work simply states that this is a zygotic deletion of exon 3, which somehow leads to a decrease in protein abundance that is almost total in the lung but not so clear in the uterus. Exon 3 happens to be 72 bp long [https://www.ncbi.nlm.nih.gov/nuccore/NM_133738], so its deletion (assuming there are no cryptic splicing sites used) leads to an internal in-frame deletion of 24 amino acids. So, at best, this 'KO' is not a null, but a hypomorphic allele of context-dependent strength. Unfortunately, neither the previous work nor this paper (unless I have missed it!) contains information provided about the expression levels of Cmg2 in the intestine of KO mice - nor which cell types usually express it (see below). I think that using anti Cmg2 in WB and immunohistofluorescence of with ISC markers with intestine homogenate/sections of wild-type and mutant mice would be necessary to set the stage for the rest of the work.

      Point #2

      Connected to the previous point, the expression pattern of Cmg2 in the intestine is not described. Maybe this is already established in the literature, but the authors do not refer to the data. This is important when considering that the previous work of the authors suggests that Cmg2 might contribute to Wnt signalling transduction through physical, cis interactions with the Wnt co-receptor LRP6. Therefore, one would expect that Cmg2 would be cell-autonomously required in the intestinal stem cells.

      Point #3

      The authors establish that the regenerating crypts of Cmg2[KO] mice are unable to transduce Wnt signalling, but it is not clear whether this situation is provoked by the DSS-induce injury or existed all along. Can Cmg2[KO] intestinal stem cells transduce Wnt signalling before the DSS challenge? If they were, it might suggest that the 'context-dependence' of the Cmg2 role in Wnt signalling is contextual not only because of the tissue, but because of the history of the tissue or its present structure. It would also suggest that Cmg2 mutant mice, unless reared in a germ-free facility for life, would eventually lose intestinal homeostasis, and maybe suggest the level of intervention/monitoring that HFS patients would require. It might also provide an explanation in case Cmg2 was not expressed in ISCs - if the state of the tissue was as important as the presence of the protein, then the effect on Wnt transduction could be indirect and therefore it might not be required cell-autonomously.

      I think points 1 and 2 are absolutely fundamental in a reverse genetics investigation. Point 3 would be nice to know but the outcome would not change the tenet of the paper. I believe that the work needed to deal these points can be performed on archival material. I do not think the mechanism proposed can be taken from 'plausible' to 'proven' without proposing substantial additional investigation, so I will not suggest any of it, as it could well be another paper.

      A few minor points picked along the way:

      1. Figure 1 legend says "In (c), results are mean {plus minus} SEM" - this seems applicable to (d) as (c) does not show error whiskers.
      2. Figure 1 legend says "(d) Body weight loss, (f) the aspect of the feces and presence of occult blood were monitored and used for the (e) DAI. Results are mean {plus minus} SEM. Each dot represents the mean of n = 12 mice per genotype". This part looks like has suffered some rearrangement of words. The first instance of (f) should be (e), I guess, and I am not sure what "(e) DAI" means. And for (e), "mean {plus minus} SEM" does not seem applicable. This needs some light revision.
      3. Figure 1H legend does not say which statistical test was made in the survival experiment in (h) - presumably log-rank? A further comment on the survival statistics: euthanised animals should not be counted towards true mortality when that is what is recorded as an 'event'. They should be right-censored. However, in this case, reaching the euthanasia criterion is just as good an indicator of health as mortality itself. So, simply by changing the Y axis from 'survival' to 'event-free survival' (or something to that effect), where 'events' are either death or reaching the euthanasia criterion, leaves the analysis as it is, and authors do not need to clarify that figure 1H shows "apparent mortality", as it is straightforward "complication-free survival" (just not entirely orthogonal to weight loss).
      4. Some density measurements are made unnecessarily on arbitrary units (per field of view) - this should be simple to report in absolute measures (i.e. area of tissue screened or, better still, length of epithelium screened).
      5. Figure 2E should read "percent involvement"
      6. Figure 2J should read "lipocalin..."
      7. In section "CMG2 Is Dispensable for YAP/TAZ-Mediated Reprogramming to Fetal-Like Stem Cells", the authors write ""We measured the mRNA levels of two additional YAP target genes, Cyr61 and CTGF...". I presume the "additional" is because Ly6a is also a target of YAP/TAZ, but if the reader does not know, it is puzzling. I would suggest to make this link explicit.
      8. In Figures S2, 3 and S3, I think that the measures expressed as "% of homeostatic X in WT" really mean "% of average homeostatic X in WT". This should be made clear somewhere.
      9. In panel C, the nature of the data is not entirely clear. First, the corresponding part of the legend says "Representative images of n=4 mice per genotype" which I presume should refer to panel B. Then, the graph plots 4 data points, which suggests that they correspond to 4 mice - but how many fields of view? Also, the violin plot outline is not described - I presume it captures all the data points from the coarse-grained pixel analysis, but it should be clarified.
      10. In Figure 3H and 3I, I would suggest to add the 7+3 timepoint where the data come from.
      11. In section "CMG2 Is Critical for Restoring the Lgr5+ Intestinal Stem Cell Pool", the authors say "...The mRNA levels of ... LRP6, β-catenin (Fig. S3a-b), and Wnt ligands (Wnt5a, 5b, and 2b) were comparable between the colons of Cmg2WT and Cmg2KO mice (Fig. S3c)..." without clarifying in which context - one needs to read the figure legend to realise this is "timepoint 7+3". I suggest to add "in the recovery phase" or "in regenerating colons" or something shorter, just to guide the reader.
      12. Like with the previous point, it is not clear when the immunohistofluorescence of B-catenin is made - not even in the legend, as far as I could see. The only hint is that authors say "the nuclei of cells in the atrophic crypts of Cmg2KO..." with 'atrophic' probably indicating again the 7+3 timepoint.
      13. A typo in the discussion: tunning for tuning.
      14. In the discussion, the authors talk about the 'CMG2' protein (all caps - formatting convention for human proteins) but before they were referring to 'Cmg2' (formatting convention for mouse proteins). That is fine but some of the statements where "CMG2" is used clearly refer to observations made in the mouse.
      15. Typos in methods: "antigen retrieval by treating [with] Proteinase K"; "Image acquisition and analyze [analysis]"; "All details regarding code[s] used for immunofluorescence analysis"

      Referees cross-commenting

      *this session contains comments from ALL the reviewers"

      Rev2

      Points 1 and 2 made by Referee 1 (and point 4 of Referee 3) appear most reasonable, and if not already done should be.

      I also noted the more severe morphology of DSS damaged epithelium shown in Fig 2a noted by Referee 3 - and this I agree is a confounding factor. But overall, multiple lines of evidence were assembled to show that the KO mice and WT mice suffered DSS-induced colitis with equal severity - and with closely equal severity of damage to the intestinal epithelium (though the image in Fig 2a is disturbing). For my part, the concern is understandable but likely not operating in a confounding way. And the evidence for the reprogramming of the damaged epithelium into "fetal-like stem cells" (the 1st step in restitution of lost stem cells) occurs in both WT and KO mice - and these data are strong. For this reader, the block convincingly shows up for KO mouse at the WNT dependent step

      Rev 3 This reviewer remains sceptical. I agree the authors performed the experiment well to confirm that DSS dosing was as equivalent as possible across the study. But DSS acts to induce colitis because it is concentrated in the colonic lumen as water is absorbed. Also ECM responses and remodelling are a central part of colitis models. And my concern is that the actual exposure in the KO group is influenced by transit of faeces/DSS is secondary to the known action of CMG2 on collagen deposition. The consequence of this being a protracted damage phase in which a restoration of adult stem cells would not be expected and leading to epithelial failure.

      However, we differ. I might propose that the authors are asked to investigate and confirm expression of CMG2 in the epithelium and to repeat the analysis of collagen levels they performed on untreated CMG2 KO mice on colons from CMG2 KO mice having received DSS to see if these differ from controls.

      Rev 1 Both reviewer #2 and reviewer #3 make relevant points, from the point of view of extracting as much biological knowledge as we can from the observations reported in the manuscript.

      Reviewer #2 suggestion to use Cmg2[KO] organoids to investigate the dependence of Wnt transduction on Cmg2 is the type of experiments I refrained to propose. However, I think the "skeleton" of the mechanism is there and is reasonably solid. Fleshing it out may well be another paper.

      I agree with Reviewer #3 objections to the timing and severity of the DSS damage. However, I am not sure how much they invalidate the main tenet of the paper:

      • DSS may affect Cmg2[KO] more severely, but the overall disease score is comparable during the DSS treatment. If this severity was enough to be the main driver of the phenotype, it should have left a mark in the Histological and Disease activity scores. In this regard, I think it would be helpful if the authors provided an expanded version of Figure 2A with examples of the different levels of "Crypt damage" scored, and the proportions for each. This could be in the supplementary material and would balance the impressions induced by a single image.

      • If DSS affected the recovery, this would also be compatible with having a more severe histological phenotype (which is not shown overall, just in Fig 2A) because one would also expect the tissue to attempt regeneration during the 7 days of DSS treatment.

      • The only objection that I find difficult to argue is the effective duration of the treatment. If indeed peristalsis is affected, it may be that during the 'recovery' phase there is still DSS in the intestine. This could be perhaps verified using a DS detection assay (e.g. https://arxiv.org/pdf/1703.08663) on the intestinal contents or the faeces of the mice during the 3-day recovery period.

      I think of what the aim of scholarly publication is, with this paper, and I find myself going back to a statement of the authors' discussion - that this work suggests that infants risking death may be offered (compassionate, I guess) IBD treatment. What does this hinge upon? I think, on the basic observation that diarrhoea (in the mouse model) is not intrinsic but caused by an inflammation-promoting insult. Is this substantiated? I think it is. Could we learn more biology from this disease model, about Wnt and about how ECM affects tissue regeneration? Certainly. Can this learning wait? I believe it can.

      Significance

      In this work, Bracq and colleagues provide clear evidence that the persistent diarrhoea seen in a mouse model of Hyaline Fibromatosis Syndrome is related to the inability of their intestinal epithelium to properly regenerate. This is very clear and of immediate impact. For instance, the authors themselves point at the possibility of applying treatments for Inflammatory Bowel Disease to HFS patients. While what happens in a mouse model is not necessarily the same as in human patients, the fact that persistent diarrhoea is a life-threatening symptom in HFS make this proposal, at least in compassionate use of the therapies and until its efficacy is disproven, very plausible. This is a clear gap of knowledge that addresses an unmet medical need.

      I find that the work shows clearly that HFS mouse model subjects have normal intestinal function until challenged with a standard chemically-induced colitis. Then, the histological and health deterioration of the HFS mouse model is clear in comparison with normal mice, which can regenerate appropriately. This is shown with a multiplicity of orthogonal techniques spanning molecular, histological and organismal, which are standard and very well reported in the paper.

      The authors propose a specific cellular and molecular mechanism to explain the incapacity of the intestinal epithelium in the mouse model of HFS to regenerate. According to this mechanism, the protein Cmg2, whose mutation causes HFS in humans, would be necessary for intestinal stem cells to transduce the signal of Wnt ligands and therefore support their behaviour as regenerative cells. This mechanism is plausible, but more basic and advanced work would be needed to take it as proven.

      This work would be of interest to both the clinical, biomedical, and basic research communities interested in rare diseases, the gastrointestinal system, collagen and extracellular matrix, and Wnt signalling.

      My general expertise is in developmental and stem cell biology using reverse genetics, transgenesis and immunohistological and molecular methods of data production, and lineage tracing, digital imaging and bioinformatic analytical methods; I work with Drosophila melanogaster and its adult gastrointestinal system.

    1. HC4 Motivatie Grit (Angela Duckworth): Mensen met grit geven niet snel op, blijven gemotiveerd ondanks tegenslag, en blijven jarenlang werken aan iets wat ze belangrijk vinden.

      Motivatie bestaat uit 3 factoren: 1. direction: wat doet de persoon 2. intensity (hoe hard een persoon werkt) 3. persistence (hoe lang doen ze het)

      Process van een iemand motiveren: eerst energizing the behavior= het activeren van onderliggende behoeften en drives daarna directing behavior naar het doel toe als laatste moet het gedrag nog volgehouden worden Sustaining behavior (door bijvoorbeeld een reward). Voor de motivatie om effectief te zijn is feedback ook nog nodig, want dan gaat het weer terug naar energizing behavior.

      Goal setting: a goal without a plan is just a wish> SMART goal setting (specific, measurable, achievable, relevant, time-based)

      Maslow's hierarchy of needs: Mensen hebben een hierarchie in motivatie: 1. belangrijkste eerst fysieke benodigdheden (honger, huis, drinken)-een lower order need 2. veiligheid (fysiek en emotioneel)-lower order 3. sociale benodigdheden (vriendschap en belonging)- higher order 4. esteem needs (status en recognition from others)-higher order 5. self-actualization: je volle potentie als een persoon willen behalen - higher order GEEN bewijs voor deze theorie

      McClelland: stelt 3 fundamentele behoeften: achievement, power en affiliation (close personal relationships), deze worden aangeleerd in de jeugd en deze behoeften kunnen dus niet op latere leeftijd nog meer worden.

      Hertzberg's motivator-hygiene theory (two-factor theory): Hygiene is wat mensen niet leuk vinden aan hun werk en wat de werk tevredenheid omlaag brengt. Motivator is het tegenovergestelde.

      Regulatory focus theory RFT: promotion focused (oriented towards growth and development, becoming their ideal self > learning goals orientation (LGO) prevention focused (vigilant and careful, emphasize fears, focus on avoiding threats and set prevention goals. Ze willen alleen dingen doen die binnen hun kunnen vallen om stress te voorkomen.

      Job characteristics theory: niet alleen andere mensen maar ook het werk zelf kan mensen motiveren. Er zijn vijf core job dimensies: 1. skill variety 2. task identity 3. task significance 4. autonomy 5. feedback

      The job charcteristics model: core job charcteristics-critical psychological states- personal and work outcomes

      Skill variety, task idenity en task significance > experiended meaningfullness of the work

      Job characteristics theory-designed jobs give internal rewards Individual's growth needs are moderating factors motivating jobs must: be autonomous, provide feedback, have at least one of the three meaningfulness factors (Skill variety, task idenity en task significance )

      Om de motivatie voor werk hoger te maken gebruiken organisaties: work redesign of job enrichment (horizontaal toevoegen, meer taken toedienen op hetzelfde level OF verticaal meer vertrouwen in keuzes maken, dus job loading.

      VB horizontaal: job rotation (iemand een keer een ander soort baan geven) en combining tasks (nieuwe taak leren die toegevoegd kan worden aan de taak die diegene al deed)

      Vertical: prosocial motivation (iemand krijgt de kans om met de klant te praten die voordelen heeft van de moeite die diegene in zijn werk stopt) en meer autonomy krijgen

      Job crafting: de hoeveelheid initiatief dat iemand toont in het vormgeven van zijn of haar werk. (changing job responsibilities, changing interactions with others, changing how the work is seen)

      Importance of fairness -equity theory (minder motivatie als werknemers vinden dat ze meer geven dan ze krijgen): distributive justice bestaat uit balans, underpayment inequity en overpayment inequity

      There are 4 types of organisational justice: (it is the members semse of the moral propriety of how they are treated) 1. distributive: is de uitkomst goed vergeleken met je performance? 2. procedural: kan je je mening en behoeften uitspreken bij het proces? 3. interpersonal: respect? 4. informational: heeft de leider alles goed uitgelegd?

      Expectancy theory: (the valance-instrumentality-expectancy (VIE) theory: deze theorie stelt dat de self-efficacy van de werknemer sterker gemaakt moet worden door de leider. 3 principes: 1. werknemers doen meer hun best als ze geloven dat de effort tot een goede performance leidt (E-P) 2. De werknemers performance wordt goed geevalueerd en het leidt tot bonussen (P-O)- instrumentally 3. de werknemers hechten waarden aan de rewards (V van Valences)

      DUS expectancy + instrumentally +valance = high effort and motivation

      De expectancy theorie = vanuit het perspectief van de werknemer

      The Pygmalion effect= perspectief vanjuit de leider

      Het Pygmalion-effect is een psychologisch fenomeen waarbij hogere verwachtingen leiden tot betere prestaties. Het wordt ook wel een zelfvervullende voorspelling genoemd (self-fulfilling prophecy)

      The galatea effect (gaat over de verwachtingen van de werknemer zelf):individually-set high expectations and then performs to these expectations

      Het tegenovergestelde is het Golem-effect: lage verwachtingen leiden tot slechtere prestaties. De-golemization > reversing golem effect

      Who will lead: motivation to lead (MTL) 1. affective identity MTL: een natuurlijke neigin gom mensen te leiden 2. Social-normative MTL: leiden omdat je een gevoel van verantwoordelijkheid hebt (volgens sociale normen) 3. noncalculative MTL: willen leiden zonder de voordelen en nadelen van het leiderschap hebben afgewogen: altruism

      H9 motivatie: applications

      The reinforcement theory: De Law of Effect (de wet van het effect) is een psychologisch principe dat werd geformuleerd door Edward Thorndike in 1898. Het is een fundamenteel principe binnen het behaviorisme.

      Definitie: De Law of Effect stelt dat: Gedrag dat gevolgd wordt door een bevredigend gevolg, zal in de toekomst vaker voorkomen. Gedrag dat gevolgd wordt door een onaangenaam gevolg, zal minder vaak voorkomen. (operant conditioneren)

      Punishment by application of punishment by removal (van iets positief) Negative reinforcement (iets negatief niet doen) of positive reinforcement (beloning)

      Social learning theory: extends operant conditioning, people can learn from watching other people succed or fail. (modeling of observational learning) Bestaat uit 4 stappen: 1. attention 2. retention (informatie moet gesorteerd worden om later te gebruiken) 3. reproduction (imiteren van het gedrag en oefenen zodat je beter wordt) 4. motivation: je moet gemotiveerd zijn om te imiteren (als je ziet dat iemand anders een beloning er voor krijgt wil je het sneller ook)

      Intrinsieke vs extrinsieke beloningen

      Self-determination theory: SDT stelt dat deze intrinsieke motivatie vooral wordt bevorderd wanneer aan drie basisbehoeften wordt voldaan: -1. Autonomy (Zelfbeschikking) Betekenis: de behoefte om je eigen keuzes te maken en handelen in overeenstemming met je eigen waarden en interesses.

      Voorbeeld: in het onderwijs krijgen leerlingen de vrijheid om een onderwerp voor hun project te kiezen. Omdat ze zelf de koers bepalen, voelen ze meer eigenaarschap en zijn ze gemotiveerder om er iets van te maken.

      1. Competence (Bekwaamheid) Betekenis: de behoefte om effectief en vaardig te zijn in wat je doet, en het gevoel te hebben dat je kunt groeien door uitdagingen te overwinnen.

      Voorbeeld: een softwareontwikkelaar die steeds nieuwe technieken leert en ziet hoe zijn code écht werkt in een product, ervaart dat hij steeds capabeler wordt. Dat stimuleert hem om door te gaan met leren en verbeteren.

      1. Relatedness (Verwantschap) Betekenis: de behoefte aan warme, ondersteunende relaties en het gevoel ergens bij te horen.

      Voorbeeld: op de werkvloer vormt een team dat open communiceert en elkaar steunt een sociale basis. Medewerkers voelen zich dan verbonden en gewaardeerd, wat hun betrokkenheid vergroot.

      Motivation wrk cylcle match: er zijn inverschillende fases van het werk andere vormen van motivatie nodig: intrinsieke motivatie voorspeld kwaliteit van de performance en extrinsieke motivatie voorspeld kwantiteit van de performance

    1. reply to u/highspeed_steel at https://reddit.com/r/typewriters/comments/1krspvh/im_totally_blind_and_new_to_typewriters_wax/

      Your question is a great one, but I'll go another direction since I'd dug into some of the history and details of Helen Keller's mid-century typewriters a while back. You can find some details and descriptions here (and in the associated links which includes an accessible video of Ms. Keller using a solid and sexy black Remington Noiseless standard typewriter): https://www.reddit.com/r/typewriters/comments/1ihot96/helen_kellers_typewriters/

      She managed on both her Remington as well as her brailler as well as any sighted person, though obviously had someone to check her printed work.

      I recently saw another heavily modified midcentury typewriter for someone who, if I recall correctly was not only blind, but had no arms. It was set up so that they could move a selector and type using a custom chin rest. Sadly, I didn't index it at the time, but it's interesting to know that such things existed for accessibility reasons.

      As for Braillers, you might appreciate this recent article about a repairman in Britain who was retiring: https://www.theguardian.com/society/2025/jan/02/wed-be-stuck-alarm-as-uks-last-braille-typewriter-repairer-ponders-retirement

      I've got my own brailler, which is a sleek-looking art-deco industrial piece of art with the loveliest shade of dark shiny gray paint I've ever seen on a typewriter. (I'm both a mathematician and information theorist into the areas of coding and cryptography, so Morse code, Braille, etc. are professionally fascinating to me.) I still need to take it apart and repair a few portions to get it back to perfection, but it generally works well.

      As for the aesthetics, I personally enjoy the solid industrial look and feel of the machines from the 1930s-1960s. The early 30s and some 40s have glossy black enamel and machines like the Corona Standard/Silent from the 30s are low slung with flat tops that sort of resemble small pianos and just scream out "I'm a writer" with a flair for dark academia and just a hint of classical Roman design. Many of these machines come with gold tinged water-slide decals which really set themselves off against the black enamel, though on the majority of machines the gold is beginning to dim from time, wear, and uncareful application of cleaning solutions.

      I love the Royal KMM, KMG, and the Remington 17, Standard, and Super-Riter for their industrial chonkiness and (usually) their glass keytops. One of my favorites is the Henry Dreyfuss designed Royal Quiet De Luxe from 1948 which always gives me the feel of what it would look like if a typewriter wore a tuxedo or the 1948 gray and chrome model which is similar but has the feel of a sleek gray flannel suit on a 1950s advertising executive prone to wearing dapper hats, smoking cigarettes, and always with a cocktail in his hand. Into the 50s and 60s almost everyone had moved to plastic keytops which I don't think are as pretty as the older glass keytops with the polished metal rings around them.

      At the opposite end of that spectrum are the late 50s Royal FP and Futura 800s which have some colorful roundness which evokes the aesthetic of the coming space age. They remind me of the modern curves and star shapes of the television show The Jetsons. Similarly space-aged are the sexy curves of the silver metalic spray paint on wooden cases for the Olympia SM3 from the same period. These to me are quintessential typewriter industrial design. In gray, green, maroon, brown, and sometimes yellow crinkle paint with just a hint of sparkle in their keytops I really love the combination of roundedness and slight angularity these German designed machines provide. They have a definite understated sort of elegance most other typewriters just miss. I suspect that late-in-life Steve Jobs would have had an Olympia SM3.

      There's something comforting about the 40s and 50s sports-car vibe of the smaller Smith-Corona portables of the 5 series machines in the 1950s with their racing stripes on the hood. They feel like the sort of typewriter James Dean would have used as a student—just hip enough to be cool while still be solid and functional.

      Sadly into the 70s, while machines typically got a broader range of colors outside of the typical black, gray, and browns things became more plastic and angular. They also begin to loose some of the industrial mid-century aesthetic that earlier machines had. They often feel very 70s in an uncomplimentary way without the fun color combinations or whimsy that art and general design of of that period may have had in the music or fashion spaces. They make me think of politics and war rather than the burgeoning sexual revolution of the time period.

      Interestingly, for me, I feel like most typewriter design was often 10-20 years behind the general design aesthetic/zeitgeist for the particular decades in which they were made.

      Good luck in your search for the right typewriter(s) for your own collection.

    1. Briefing sur le programme d'Éducation à la Vie Affective, Relationnelle et à la Sexualité (EVARS)

      Contexte et Objectifs

      Ce briefing est basé sur un podcast produit par l'Institut des hautes études de l'éducation et de la formation (I2F), abordant l'éducation à la vie affective, relationnelle et à la sexualité (EVARS) en milieu scolaire en France.

      L'émission met en lumière l'historique, les enjeux, le nouveau programme et sa mise en œuvre.

      L'éducation à la sexualité en milieu scolaire est une préoccupation du Ministère de l'Éducation depuis plus de 50 ans, jalonnée par des événements importants comme Mai 68, les lois sur la contraception (1967) et l'IVG (1975), et la circulaire Fontana (1973).

      Un tournant majeur a été la loi de 2001 sur l'IVG, instaurant l'obligation de trois séances annuelles d'éducation à la sexualité par tranche d'âge.

      Le contexte actuel, marqué par le développement des réseaux sociaux, l'accès facile à l'information (et à la désinformation), le partage d'images sexuellement explicites, l'accès à la pornographie, et la prise de conscience accrue des violences sexuelles (y compris intrafamiliales et entre jeunes, avec des statistiques alarmantes sur les agressions sexuelles d'enfants et de jeunes), a souligné la nécessité de réinterroger la mise en œuvre de ces séances.

      Le constat a été fait que ces trois séances annuelles ne sont pas totalement mises en place sur le terrain.

      L'Inspection Générale de l'Éducation, du Sport et de la Recherche a identifié la clarification des objectifs de formation des élèves comme un levier essentiel.

      C'est dans ce cadre que le Ministre a souhaité l'élaboration d'un programme spécifique.

      Le Nouveau Programme EVARS

      Le nouveau programme EVARS, élaboré par le Conseil Supérieur des Programmes et fortement soutenu (voté 60 voix pour et 0 contre au Conseil Supérieur de l'Éducation fin janvier 2025), vise à clarifier les objectifs de formation des élèves de la maternelle au lycée.

      Il a été conçu avec un large éventail d'experts et a fait l'objet d'une consultation nationale.

      Ses intentions principales, clarifiées dans le préambule, visent à trouver un équilibre entre les enjeux de citoyenneté et de santé.

      Le programme encourage le développement de l'esprit critique des élèves pour construire des relations respectueuses et acquérir des connaissances scientifiques objectives leur permettant de faire des choix éclairés pour leur santé et celle des autres.

      Le programme est décliné en :

      • Éducation à la vie affective et relationnelle pour le premier degré (maternelle et élémentaire), axée sur les émotions et les relations.

      • Éducation à la vie affective et relationnelle et à la sexualité pour le second degré (collège et lycée), incluant des questionnements plus avancés adaptés à l'âge et à la maturité des élèves.

      Il est structuré autour de trois grands axes, présents de la maternelle au lycée :

      • Se connaître, vivre et grandir avec son corps.
      • Construire des relations, s'y épanouir.

      • Trouver sa place dans la société, s'y sentir libre et responsable.

      Ces axes sont déclinés en objectifs d'apprentissage par tranche d'âge, avec un souci de progressivité.

      Un tableau synoptique est proposé pour visualiser ces objectifs et inspirer les personnels.

      Points de Vue et Enjeux Clés

      • Sociologie et Utilité des Programmes :

      Yael Am Salem Mingi, sociologue, souligne que les pays ayant mis en place l'éducation à la sexualité en milieu scolaire légitiment ces questions et la reconnaissance sociale de la sexualité des jeunes.

      Cela permet une prévention des violences (intrafamiliales, sexuelles) et facilite l'accès à l'information sur la contraception, les relations, et la place des jeunes.

      Les pays prônant l'abstinence ont de moins bons résultats en matière de grossesses adolescentes et d'IST. L'éducation à la sexualité a des effets positifs sur la connaissance et les pratiques.

      • Point de Vue des Élèves :

      Les élèves considèrent l'école comme un espace légitime pour parler de la vie affective, relationnelle et sexuelle, à condition que l'école ne juge pas leur comportement, écoute leurs questionnements et adapte le discours. Ils attendent de l'école des informations scientifiques et documentées, pas des jugements de valeur.

      Les questionnements évoluent avec l'âge, allant de "c'est quoi l'amour ou l'amitié" au premier degré à la définition de termes liés à la sexualité à l'adolescence (pansexualité, plurisexualité, LGBTQ+).

      Ils veulent un langage collectif et une définition collective des termes.

      • Intégration et Adaptabilité :

      Le programme est conçu pour être inclusif et s'adapter aux questionnements spécifiques de chaque classe et groupe d'élèves, influencés par le contexte géographique, social et de genre.

      Il peut être décliné dans différentes matières (arts plastiques, musique, sport, français, SES, SVT, etc.) et à différents niveaux, soulignant son lien avec les autres disciplines et la promotion de l'égalité et des rapports de genre.

      • Postulat des Pratiques :

      Yael Am Salem Mingi précise que les élèves ne souhaitent pas que l'école aborde les pratiques sexuelles, considérées comme relevant de la sphère intime.

      En revanche, ils attendent des outils pour la lecture critique de contenu, notamment sur les réseaux sociaux, pour discerner le vrai du faux.

      • Rôle des Personnels et Mise en Œuvre :

      L'EVARS est l'affaire de tous les personnels de l'éducation nationale. La co-animation est encouragée là où elle est possible pour croiser les regards.

      Il est essentiel de réfléchir à la posture de l'enseignant.

      Les chefs d'établissement et directeurs d'école ont un rôle déterminant dans le pilotage, la constitution d'équipes, la désignation éventuelle de référents, l'organisation de la formation et la gestion des partenariats (associations agréées, de préférence au second degré, avec un travail préparatoire conjoint et une co-animation).

      • Freins et Leviers :

      Les freins incluent les peurs de certaines familles et des défis organisationnels.

      Les leviers incluent l'existant (ce qui est déjà fait), l'appui des pilotes académiques, des directeurs d'école et chefs d'établissement, les référents égalité, et les instances (conseils de maître, conseils d'école, conseils école-collège).

      La conscientisation des personnels sur l'enjeu de santé publique et la nécessité de fournir des repères aux élèves sont cruciales.

      • Mise en Œuvre Opérationnelle :

      La mise en œuvre se traduit par des plans de formation pour les personnels.

      Il s'agit d'accompagner les enseignants dans l'appropriation du programme et l'utilisation des ressources.

      Dans le premier degré, l'organisation des séances est facilitée par le format classe unique pour l'enseignant.

      Dans le second degré, la coordination est essentielle, impliquant les conseils pédagogiques et les CESCE (Comités d'éducation à la santé, à la citoyenneté et à l'environnement).

      L'interdisciplinarité (lettres, philosophie, EPS, EMC, etc.) est un élément clé.

      Les heures de vie de classe peuvent être mobilisées.

      • Accompagnement et Formation :

      L'accompagnement des personnels est une priorité.

      Des groupes de formateurs experts existent dans les académies.

      Des formations d'établissement et des parcours inter-degrés sont envisagés.

      Des personnels ressources de proximité (infirmières, CPE, professeurs documentalistes, etc.) peuvent intervenir en co-animation.

      Un plan national de formation et un parcours Magistère en ligne sont en cours de déploiement, complétés par des livrets de séances types.

      • Communication :

      La communication avec les familles est cruciale pour lever les appréhensions.

      Le conseil d'école est une instance importante pour communiquer.

      Il est essentiel de communiquer de manière positive sur la légitimité de l'école dans ce domaine, en soulignant qu'il ne s'agit pas d'aborder l'intime.

      En cas de contestation ou de refus d'enseignement, il est important d'engager le dialogue, de rappeler l'obligation de cet enseignement et de s'appuyer sur les ressources académiques (cellules valeurs de la République).

      Une page Eduscol dédiée et une page education.gouv.fr pour les familles sont disponibles, avec des éléments de langage, des plaquettes et une FAQ.

      • Cadre Juridique :

      Un point juridique rappelle que la notion de droit à la sexualité des mineurs n'est pas reconnue par les textes. La majorité sexuelle en France est fixée à 15 ans (article 227-25 du code pénal).

      Toute relation sexuelle avec un mineur de moins de 15 ans est considérée comme une atteinte sexuelle, même si elle est consentie.

      La loi du 21 avril 2021 a posé un seuil de non-consentement pour les mineurs de moins de 15 ans avec des majeurs et a renforcé la lutte contre l'inceste et la sexorsion.

      Une clause dite "Roméo et Juliette" permet de préserver les relations entre mineurs de moins de 5 ans d'écart, sauf inceste, absence de consentement ou prostitution.

      La notion de consentement valable implique plusieurs exigences cumulatives (volontaire, libre, éclairé, spécifique, réversible, exprimé, perçu).

      Le contrôle des intervenants extérieurs est également mentionné comme un point délicat.

      En Synthèse

      L'introduction de ce programme marque une étape historique en France, offrant une opportunité institutionnelle précieuse pour l'éducation des élèves sur des sujets fondamentaux.

      Il répond à un besoin criant de clarification et d'équité dans l'accès à cette éducation.

      Au-delà de la prévention, le programme vise à développer la réflexion des élèves et à construire une culture commune sur ces sujets.

      Sa mise en œuvre requiert une intelligence et une inventivité collectives, un accompagnement soutenu des personnels et une communication proactive avec les familles.

      Il s'agit également d'un programme qui écoute les questionnements des élèves et leur offre un espace pour forger leur liberté.

      Franck Durbage résume l'importance de ce programme comme une opportunité de "désinquiétude" pour les familles et les établissements privés, une chance pour la "culture" en offrant un espace de pensée et de réflexion, une occasion pour l'"intelligence collective" et l'"inventivité collective" des équipes pédagogiques, et un engagement envers la "liberté" des élèves en accueillant leurs questions et en leur permettant de cheminer.

    1. Functions are reusable blocks of code that can be executed multiple times from different parts of your program. They help in organizing code, making it more readable, and reducing redundancy. Functions can take input arguments and return output values.

      Das verstehe ich gar nicht!

    1. #Creating an if, elif, else block of code # Created 3 variables to use for testing against the block isMorning=False isAfternoon=False isEvening=False if isMorning: print("Good Morning!") # prints if true and then the code ends elif isAfternoon: print("Good Afternoon!") # prints if isAfternoon is true and isMorning is false, then ends. elif isEvening: print("Good Evening!") # prints if isMorning and isAfternoon is false and isEvening is true, then ends else: print("It's late! Go to bed!") # prints if isMorning, isAfternoon, and isEvening is False. Then ends.

      An if, elif, else block is created

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      __Reviewer #1 (Evidence, reproducibility and clarity (Required)): __ Summary In this work, the authors present a careful study of the lattice of the indirect flight muscle (IFM) in Drosophila using data from a morphometric analysis. To this end, an automated tool is developed for precise, high-throughput measurements of sarcomere length and myofibril width, and various microscopy techniques are used to assess sub-sarcomeric structures. These methods are applied to analyze sarcomere structure at multiple stages in the process of myofibrillogenesis. In addition, the authors present various factors and experimental methods that may affect the accurate measurement of IFM structures. Although the comprehensive structural study is appreciated, there are major issues with the presentation/scope of the work that need to be addressed: Major Comments 1. The main weakness of the paper is in its claim of presenting a model of the sarcomere. Indeed, the paper reports a structural study that is drawn onto a 3D schematic. There is no myofibrillogenesis model that would provide insights into mechanisms. Therefore, the use of the word model is grossly overstated.

      In biology, the term “model” is used in various contexts, but it generally refers to a simplified representation of a biological system, a structure or a process. Accordingly, we consider “model” the most fitting phrase for what we present in Figure 4 (Figure 7 in the revised manuscript). These are not arbitrary 3D schematics; they are scaled representations in which the length, the number and the relative three-dimensional arrangement of thin and thick filaments are based on measurements. These measurements are primarily based on our own data (presented in the main text and provided in the supplementary materials), as published data were either lacking or inconsistent. Moreover, we would like to highlight that we do not claim to present a conceptual or mechanistic model of myofibrillogenesis, but we do present structural reconstructions or models for four developmental time points. Therefore, we disagree with the remark that “the use of the word model is grossly overstated”, as our wording fully corresponds to the common sense.

      In general, the major focus and contribution of the work is unclear. How does the comprehensive nature of the measurements contribute to existing literature?

      We significantly revised the text to highlight the main points more firmly, and added an additional section to help non-specialist readers to better understand our aims and findings.

      Figure labels are often rather confusing - for example it is unclear why there is a B, B', B' etc instead of B,C,D, etc.

      The figure labels have been revised in accordance with the reviewer’s recommendation.

      Some comments in the text are not clearly tied to the figures. For example, in lines 108-109, are the authors referring to the shadow along the edges of the myofibril when saying they are not clearly defined (Figure 1C)?

      The lines refer to the fact that identifying the boundary of an “object” in a fluorescence microscopy image is inherently challenging - even under ideal conditions where the object’s image is not affected by nearby signals or background noise. To improve clarity, we revised this section and now it reads: The other key parameter - myofibril diameter - is typically measured using phalloidin staining. However, accurately delineating their boundaries in micrographs is difficult - even under optimal conditions (high signal‑to‑noise ratio, no overlapping fibers, etc.; Fig. 1C). This limitation arises from the fundamental nature of light microscopy as the image produced is a blurred version of the actual structure, due to convolution with the microscope’s point spread function.

      In line 116, it is unclear what "surrounding structures" the authors are referring to if the myofibrils are isolated.

      We revised the text for clarity. It now states: Once isolated, myofibrils lie flat on the coverslip, aligning with the focal plane of the objective lens. This orientation allows for high-resolution, undistorted imaging and accurate two-dimensional measurements, free from interference by neighboring biological structures (e.g.: other myofibrils).

      In lines 141-142, there is no reference of data to back up the claim of validation.

      We addressed this mistake by including a reference to Fig. S1E (Fig. S1D in the revised manuscript).

      In line 170, the authors mention the mef2-Gal4/+ strain as a Gal4 driver line but do not clearly state how this strain is different from the wildtypes or how this impacts their results.

      Mef2-Gal4 is a muscle-specific Gal4 driver, often used in Drosophila muscle studies. It is a convention between Drosophila geneticists that presence of a transgene (i.e. Mef2-Gal4) changes the genetic background, and although it does not necessariliy cause any phenotypic effect, it is clearly distinguished from the wild type situation, and whenever relevant, Mef2-Gal4/+ is the preferred choice (if not the correct choice) as a control instead of wild type. As clear from our data, presence of the Mef2-Gal4 driver line does not affect the length or width of IFM sarcomeres as compared to wild type.

      In lines 182-185, the authors discuss the effects of tissue embedding on morphometrics. Were factors such as animal sex, age, fiber type, etc. conserved in these experiments? If not, any differences in results may be confounding.

      We fully agree with the reviewer that when testing the effect of a single variable, all other variables should remain constant. This is actually one of the main points emphasized in the results section. Additionally, this information is already provided in the Source Data files for each panel.

      In lines 199-201, the authors discuss results of myofibril diameter using different preparation methods, yet no data is cited to support the claims. In line 220, the phrase "6 independent experiments" is unclear. Is each independent experiment performed using a different animal? Furthermore, are 6 experiments performed for each time point?

      We substantially revised the relevant paragraphs and ensured that the corresponding data (Figure 2A in the revised manuscript) is cited each time when it is discussed. We conducted six independent experiments at each time point. This is consistently indicated in the figures and can be verified in the SourceData files (specifically, Fig3SourceData in this case). To clarify what we mean by "independent experiments," we added the following sentence to the Methods section: Experiments were considered independent when specimens came from different parental crosses, and each experiment included approximately six animals to capture individual variability.

      In line 254, the authors refer to "number of sarcomeres". It must be clearly stated if this refers to sarcomeres per myofibril, image area, etc.

      It is now clearly stated as: "number of sarcomeres per myofibril".

      In line 274, the authors refer to "myofilament number". It must be clearly stated if this refers to myofilaments per myofibril, image area, etc.

      We counted the number of myofilaments in developing myofibrils, and this is now clearly stated in the text and in the legend of Figure 3 (Figure 4 in the revised manuscript).

      In line 299, the authors mention that thin filaments measured less than 560 nm in length, yet no data is cited to support this.

      The previously missing reference to Figure 4 (Figure 7 in the revised manuscript) has now been added in addition to the revised Supplementary Figure 5.

      In the "Quantifying sarcomere growth dynamics" section of the summary (starting from line 402) the authors introduce data that would be more naturally placed in the results and discussion section.

      As suggested by the reviewer, we incorporated the key aspects of sarcomere growth dynamics into the Results and Discussion section.

      In lines 422-423, it is not mentioned what the controls are for.

      This was already explained in the main text between lines 167 and 173.

      In the caption of Figure 1C, it is not mentioned what the red dashed lines in the microscope images represent.

      The caption has been updated to include the following clarification: The red dashed lines border the ROI used for generating the intensity profiles.

      In the caption of Figure 1D, the difference between the lighter and darker grey points is not mentioned.

      This was already explained in each relevant figure legend. In this specific case, it is stated between lines 850 and 852: “Light gray dots represent individual measurements of sarcomere length and myofibril diameter, while the larger dots indicate the mean values from independent experiments.”

      In line 849, the stated p-value (0.003) does not match that mentioned in the figure (0.0003).

      We thank the reviewer for noticing this small mistake; correction was made to display the accurate p-value of 0.0003 at both places.

      In line 874, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      We refer the reviewer to point 9, where this question has already been addressed.

      Figure 2A is hard to read. Using different colored dots for different time points might help.

      As suggested by the reviewer, we generated a plot with the individual points color-coded by time.

      The significant figures presented in Figure 4 give a completely inaccurate representation of the variability of the measurements achieved with these techniques.

      Certainly, each measured parameter exhibits inherent biological and technical variability. We have made all the raw data available to the reader through the SourceData files, and this variability is also evident in Figures 1, 2, 3, Supplementary Figure 1, 3, and 5 (Figure 1, 2, 3, 4, 6, and Supplementary Figure 1 in the revised manuscript). Also we have included an additional plot (Supplementary Figure 5 in the revised manuscript) that presents the calculated thin and thick filament lengths and their uncertainty. However, in Figure 4 (Figure 7 in the revised manuscript), our goal was to present an easily understandable visual representation of the sarcomeric structures for each time point, based on the averages of the relevant measurements.

      In line 877, it should be mentioned that the number of filaments is counted per myofibril. The y-axes in the figure should also be adjusted to clarify this.

      As suggested by the reviewer, both the figure legend and the plot have been updated to clearly indicate that the filament count refers to the number per myofibril.

      In line 883, it is not clear what an "independent experiment" refers to (different animal, etc.?).

      We refer the reviewer to point 9, where this question has already been addressed.

      The statement of sample sizes in all figures is a little confusing.

      Following general guidelines, we used SuperPlots to effectively present the data, as nicely demonstrated in the JCB viewpoint article by Lord et al., 2020 (PMID: 32346721). Individual measurements are shown as pooled data points, allowing readers to appreciate the spread, distribution and number of measurements. Overlaid on these pooled dot plots are the mean values from each independent experiment, with error bars representing variability between independent experiments. Sample sizes are provided for both individual measurements and independent experiments. This is now clearly explained in the Materials and Methods section, and we corrected the legends to improve clarity (“n” indicates the number of independent experiments/individual measurements).

      In lines 1007-1008, the authors imply that the lattice model is needed for calculation of myofilament length. However, from the equations and previous data, it seems that this can be estimated using the confocal and dSTORM images.

      As the reviewer correctly noted, myofilament length can be estimated using measurements from confocal and dSTORM images, following the equations provided. However, constructing even a simplified model requires multiple constraints to be defined and applied in a specific order. In practice, one must first determine the number and arrangement of myofilaments in a cross-sectional view of an “average sarcomere” before attempting to build a longitudinal model, where length calculations become relevant. This is now clarified in the text.

      A more specific discussion of future directions is needed to put this paper in context. For example: Can anything from the overall process be used to better understand sarcomere dynamics in larger animals/humans? Can this be applied to disease modelling?

      To address these questions, we have added a section titled STUDY LIMITATIONS, which states: “Our study is focused on describing the growth of IFM sarcomeres during myofibrillogenesis at the level of individual myofilaments. Additionally, we developed a user-friendly software tool for precise sarcomere size measurements and demonstrate that these measurements are sensitive to varying conditions. Whereas, this tool can be used successfully on whole muscle fiber preparations as well, our pipeline was intentionally optimized for individual IFM myofibrils ensuring higher measurement precision in our hands than other type of preparations. Thus, we predict that future work will be required to extend it to sarcomeres from other muscle tissues or species. Nevertheless, our study exemplifies a workflow how to measure sarcomere dimensions precisely. With some variations, it should be possible to adopt it for other muscles, including vertebrate and human striated muscles. To facilitate this and to enhance the accessibility and usability of this dataset, we welcome any feedback and suggestions from researchers in the field.”

      One of the major claims of the paper is that there is a measurable variability with sex and other parameters. However, this data is never clearly summarized, presented (except for supplement), or discussed for its implications.

      We followed the suggestion of the reviewer, and we moved this supplementary data into a main figure, and thoroughly revised the corresponding paragraphs to present and discuss the findings more clearly.

      Minor Comments: 1. Lines 60-65 seem to break the flow of the introduction. As the authors discuss existing methods in literature for IFM analysis in the previous couple sentences, the following sentences should clearly state the limitations of existing methods/current gap in literature and a general idea of what the current work is contributing.

      We agree with this remark, and we substantially revised the Introduction to clearly define the existing gap in the literature and to articulate how our work addresses this gap.

      In line 104, the acronym for ZASPs is not spelled out.

      The acronym has now been spelled out for clarity.

      **Referee Cross-commenting**

      I agree as well.

      Reviewer #1 (Significance (Required)):

      In summary, this paper provides a multi-scale characterization of Drosophila flight muscle sarcomere structure under a variety of conditions, which is potentially a significant contribution for the field. However, the paper scope is overstated in that it does not provide an actual sarcomere model. Further, there are multiple issues with data presentation that impact the readability of the manuscript.

      Although it is somewhat unclear what would be “an actual sarcomere model” for the reviewer, but we cannot accept that we made on overstatement by using the word “model”, because one of the main outcomes of our work are indeed the myofilament level sarcomere models depicted in Figure 4 (Figure 7 in the revised manuscript). As said above, we do not claim that these would be molecular models, or mechanistic models or developmental models, but it makes absolutely nonsense (even in common terms!) that our scaled graphical representations (based on a wealth of measurements) should not be or cannot be called models.

      As to the comment with data presentation, we thank the reviewer for the numerous suggestions, and we substantially revised the manuscript to increase clarity and overall readability.

      __Reviewer #2 (Evidence, reproducibility and clarity (Required)): __ Summary: In this manuscript titled "A myofilament lattice model of Drosophila flight muscle sarcomeres based on multiscale morphometric analysis during development," Görög et al. perform a detailed analysis of morphological parameters of the indirect flight muscle (IFM) of D. melanogaster. The authors start by illustrating the range of measurements reported in the literature for mature IFM sarcomere length and width, showing a need to revisit and determine a standardized measurement. They develop a new Python-based tool, IMA, to analyze sarcomere lengths from confocal micrographs of isolated myofibrils stained with phalloidin and a z-disc marker. Using this tool, they demonstrate that sample preparation (especially mounting medium), as well as fiber type, sex, and age influence sarcomere measurements. Combining IMA, TEM, and STORM data, they measure sarcomere parameters across development, providing a comprehensive and up-to-date set of "standardized" sarcomere measurements. Using these data, they generate a model integrating all of the parameters to model sarcomeres at four discrete timepoints of development, recapitulating key phases of sarcomere formation and growth.

      Major comments: Line 200 & 901 - Figure S1B - The authors make a strong statement about the use of liquid versus hardening media, and it is clear from the image provided in Figure S1 that there is a difference in the apparent sarcomere width. The identity of the "liquid media" versus the "hardening media" should be clearly identified in the Results, in addition to the legend for Figure S1. The authors show that "glycerol-based solutions" increase sarcomere width, but the Materials only list 90% glycerol and PBS. However, a frequently used liquid mounting media is Vectashield. Based on the literature, measurements in liquid Vectashield show diameters significantly less than 2.2 microns observed here with presumably 90% glycerol or PBS. Can the authors qualify this statement, or provide data that all forms of liquid mounting media cause this effect? Does this also apply to hemi-thorax and sectioned preparations, or just isolated myofibrils?

      We used a PBS-based solution containing 90% glycerol as our liquid medium, as now stated in the main text. In response to the reviewer’s suggestion, we also tested a non-hardening version of Vectashield (H-1000). Myofibrils in Vectashield were significantly thicker than those in ProLong Gold but still thinner than those in the 90% glycerol–PBS solution, shown in Figure 2B. The mechanisms that could potentially explain these observations have been described in several studies (Miller et al., 2008; Tanner et al., 2011, 2012). Briefly, IFM is a densely packed macromolecular assembly. Upon removal of the cell membrane, myofibrillar proteins attract water, leading to overhydration of the myofilament lattice. This increases the spacing between filaments, resulting in an expansion of overall myofibril diameter. The extent of hydration depends on the osmolarity of the surrounding medium, as the system eventually reaches osmotic equilibrium. While both liquid media induced significant swelling, the observed differences likely reflect variations in their osmotic properties. In contrast, dehydration - an essential step in electron microscopy sample preparation - reduces the spacing between filaments, making myofibrils appear thinner. This explains why EM micrographs consistently show significantly smaller myofibril diameters (Chakravorty et al., 2017).

              Hardening media such as ProLong Gold introduce additional artifacts: during polymerization, these media shrink, exerting compressive forces on the tissue (Jonkman et al., 2020). We therefore propose that isolated myofibrils first expand due to overhydration in the dissection solution, and are then compressed back toward their *in vivo* dimensions during incubation in ProLong Gold. The average *in vivo* diameter of IFM myofibrils can be estimated without direct measurements, as it is determined by two key factors: (i) the number of myofilaments, which has been quantified in EM cross-sections in several studies (Fernandes & Schöck, 2014; Shwartz et al., 2016; Chakravorty et al., 2017) including our own, and (ii) the spacing between filaments, which can be measured by X-ray diffraction even in live *Drosophila* or under various experimental conditions (Irving & Maughan, 2000; Miller et al., 2008; Tanner et al., 2011, 2012). Our findings suggest that the effects of lattice overhydration and media-induced shrinkage are most pronounced in isolated myofibrils. In larger tissue preparations, the inter-myofibrillar space likely acts as a mechanical and osmotic buffer, reducing the extent of such distortions
      

      Can the authors comment on whether the length of fixation or fixation buffer solution, in addition to the mounting medium, make a difference on sarcomere length and diameter measurements? This is another source of variation in published protocols.

      The effect of fixation time on sarcomere morphometrics in whole-mount IFM preparations has been previously demonstrated by DeAguero et al. (2019), as briefly noted in our manuscript. To extend these findings, we performed a comparison using isolated myofibrils, assessing morphometric parameters after fixation for 10, 20 (standard) and 60 minutes. We found no difference between the 10- and 20-minute fixation conditions; however, fixation for 60 minutes resulted in significantly increased myofibril diameter (and these data are now shown in Supplementary Figure 1C). A comparable increase in thickness was also observed when using a glutaraldehyde-based fixative. These results suggest that more extensively fixed myofibrils may better resist the compressive forces exerted by hardening media.

      Line 237-238. The authors conclude that premyofibrils are much thinner than previously measured. The use of Airyscan to more accurately measure myofibril width at this timepoint is a good contribution, as indeed diffraction and light scatter likely contribute to increased width measured in light microscopy images. I also wonder, though, how well the IMP software performs in measuring width at 36h APF, given how irregular the isolated myofibrils at this stage look (wide z-lines but thinner and weaker H and I bands as shown in Fig. 2B)?

      The reviewer is correct that measurements during the early stages of myofibrillogenesis require additional effort. However, in addition to its automatic mode, IMA can also operate in semi-automatic or manual modes, ensuring complete control over the measurements. Myofibril width is determined from the phalloidin channel at the Z-line (as described in the software’s User Guide and Supplementary Figure 2), where it is at its thickest.

      Also, how much of the difference in sarcomere width arises due to effects of "stripping" components off of the sarcomere at the earliest timepoint (for example alpha-actinin or Zasp proteins)?

      A comparison between isolated myofibrils and those from microdissected muscles (Supplementary Figure 3B, Figure 3C in the revised manuscript) shows that the isolation process does not alter the morphometric measurements of sarcomeres. Moreover, the measured myofibril width aligns well with what we expect based on the number of myofilaments observed in TEM cross-sections of myofibrils at 36 hours APF (Figure 3A, now Figure 4A in the revised manuscript), supporting the consistency of our model.

      Myofibrils at early timepoints do contain more than 4-12 sarcomeres in a line (they extend the full length of the myofiber), so it is possible they are breaking due to the detergent and mechanical disruption induced by the isolation method.

      The reviewer is correct - myofibrils likely span the full length of the myofiber from the onset of myofibrillogenesis. However, during the isolation of individual myofibrils, they often break, and even mature myofibrils typically fragment into pieces of about 300 µm in length (illustrated in Figure 1E, now Figure 2A in the revised manuscript). Importantly, our measurements show that this fragmentation does not affect the assessed sarcomere length or width (as shown in Supplementary Figure 3B, now Figure 3C in the revised manuscript).

      Line 312 - What does "stable association" mean in this context? The authors mention early timepoints lack stable association of alpha-Actinin or Zasp52, and they reference Fig. S4C, but this figure only shows 72h and 24 AE, not 36h and 48 h APF. Previous reports have seen localization of both alpha-Actinin and Zasp52, so presumably the detergent or mechanical isolation is stripping these components off of the isolated myofibrils up until 72h.

      In agreement with previous reports, we also detected both α-Actinin (as shown in former Supplementary Figure 3B, now Figure 3C) and Zasp52 in microdissected IFM starting from 36 hours APF. However, these markers were largely absent from the isolated myofibrils of young pupae (36 to 60 hours APF). By 60 hours APF, strong α-Actinin and Zasp52 staining became evident in isolated myofibrils, whereas dTitin epitopes were clearly detectable from the earliest time point examined. This indicates that some proteins, such as α-Actinin and Zasp52, can be lost during the isolation process, whereas others like dTitin are retained and this differential sensitivity appears to depend on developmental stage. A likely explanation is that α-Actinin and Zasp52 are recruited early to Z-bodies but are only fully incorporated as more mature Z-disks form between 48 and 60 hours APF. This incomplete incorporation at the earlier stages could account for their loss during the isolation process. This interpretation is supported by our morphological analysis of the Z-discs, as shown in the dSTORM dataset (former Figure 3B, B’’, now Figure 4C, E) and in longitudinal TEM sections (former Supplementary Figure 5B, now in Figure 6B). Because α-Actinin and Zasp52 are not detected in isolated myofibrils at 36 and 48 hours APF, they are not included in Figure S4C (Figure 5C in the revised manuscript). This is explained in the updated figure legend.

      This same type of issue comes up again in Lines 325-334, where the authors talk about 3E8 and MAC147. They state that 3E8 signal significantly declines in later stages and that MAC147 is not suitable to label myofibrils in young pupae, but they only show data from 72 APF and 24 AE (which looks to have decent staining for both 3E8 and MAC147). A clearer explanation here would be helpful.

      To put it simply: we used one myosin antibody to label the A-band in the IFM of 36h APF and 48h APF animals, and a different antibody for the 72h APF and 24h AE stages. In more detail: Myosin 3E8 is a monoclonal antibody targeting the myosin heavy chain and labels the entire length of mature thick filaments except for the bare zone (former Supplementary Figure 4D, now in Figure 5D), suggesting its epitope is near the head domain. As a result, we expect a uniform A-band staining - excluding the bare zone - which is exactly what we observe in the IFM of young pupae (36h APF and 48h APF; formerly Figure 3B, now Figure 4C in the revised manuscript). However, at 72h APF and 24h AE, Myosin 3E8 produces a different staining pattern: two narrow stripes flanking the bare zone and two broader, more diffuse stripes near the A/I band junction (former Supplementary Figure 4D, now Figure 5D). This change is likely due to restricted antigen accessibility at these later developmental stages - a common issue in the densely packed IFM - making this antibody unsuitable for reliably measuring thick filament length in these stages.

      MAC147 is another monoclonal antibody against Mhc that recognizes an epitope near the head domain. However, it only works reliably in more mature myofibrils (72h APF and 24h AE; formerly Figure 3B, now Figure 4C in the revised manuscript), likely due to its specificity for a particular Mhc isoform. This is why we do not include images from earlier developmental stages using this antibody. We added a revised, concise explanation in the main text for general readers, and provided a more detailed description for specialist readers in the legend of Supplementary Figure 4D (updated as Figure 5D in the revised manuscript).

      Figure 3B. The authors show the H, Z, and I lengths in B', B', and B' and discuss these lengths in the text (lines 305-320). It would also be nice to actually have the plots showing the measured/calculated lengths for thin and thick filaments. These are mentioned in the results, but I cannot find the plots in the figures and there is no panel reference.

      A summary table of the measured and calculated parameters is provided in Fig4SourceData (Fig7Source Data in the revised manuscript). However, following the reviewer’s suggestion, we also generated an additional plot (Supplementary Figure 5 in the revised manuscript) that displays the calculated thin and thick filament lengths.

      Line 400. Does the model in Figure 4 actually have molecular resolution as the authors claim? From these views, thick and thin filaments appear to be represented by cylindrical objects. Localization of specific molecules would require further modeling with individual proteins. Or do the authors mean localization from STORM imaging relative to the ends of the thick and/or thin filaments? The model itself is a useful contribution, but based on Figure 4, resolution of individual molecules is not evident.

      The reviewer is correct; and we fully agree that we do not present a molecular model of sarcomeres in this study - nor do we claim to. Instead we present a myofilament level model. Nevertheless, the scaled myofilament lattice model we introduce could serve as a geometric constraint when constructing supramolecular models of sarcomeres. As the reviewer rightly notes, implementing such an approach would require additional effort.

      The main Results section of the text is condensed into 4 figures. However, I found myself flipping back and forth between the main figures and the supplement continuously, especially parts of Supplemental Figures 1, 3, 4, and 5. With such large amounts of detail in the Results relying on the supplement, it may be worth considering reorganizing the main and supplemental figures, and having 7 main figures, to include important panels that are currently in the supplement (esp. Fig S1B, S1C, S1D, S3B, S4, S5).

      We found it a very useful suggestion, and we substantially reorganized the figures in the revised manuscript according to the recommendations of the reviewer.

      Minor comments: On the plots in Fig. S1B, D, and F, it is hard to see the color of the dots because the red error bars are on top of them. Can the other distribution dots be tinted the correct color or the x-axis labels be added, so it is clear which dataset is which?

      We significantly enlarged the dots to enhance visual clarity.

      Line 142 needs a reference to Figure S1, Panel E, which shows the accuracy and precision measurements.

      The requested panel reference has now been included in the revised manuscript.

      Lines 198 - is this range from the above publications? Needs to be clearly cited.

      The range has indeed been estimated using measurements from the aforementioned publications, and this point is now further clarified in the revised text.

      Figure S3B is confusing - why do the blow-ups overlap both the top (presumably microdissected) and the bottom (presumably isolated) images? The identity of microdissected images should be labeled, as they are hard to see underneath of the blown-up images and the identity of individual image planes wasn't immediately obvious.

      We refined the panel structure of Figure S3B (Figure 3C in the revised manuscript) to enhance clarity as the reviewer suggested.

      Line 298. By "misaligned," do the authors mean the pointed ends are not uniformly anchored in the z-disc, leading to the wide z-disc measurements? At this early stage, I'm not sure "misaligned" is the right word - perhaps "were not yet aligned in register at the z-disc" or something similar.

      We revised the text for clarity. It now reads: At 36 hours APF, thin filaments had not yet aligned in perfect register at the Z-disc, with most measuring less than 560 nm in length - and exhibiting considerable variability.

      Figure S6 - spelling mistake in label of panel A, "sarcomer" should be "sarcomere"

      The typo is corrected.

      Line 487. Spelling "Zaps52" should be "Zasp52"

      The typo is corrected.

      Line 887. Spelling "Myofilement" should be "Myofilament"

      The typo is corrected.

      Line 946-947. In the legend for Supp. Fig. 3., the authors should specify which published datasets on sarcomere length are shown in the figure by including the references in the legend. Presumably the "isolated individual myofibrils" are the blue "this study" lines, leaving the "microdissected muscles" as the magenta "previous reports" on the figure. Without the reference, it is not clear if these are microdissected, isolated myofibrils, hemi-thorax sections, cryosections, or another preparation method for the "previous reports" data.

      The references have now been added to both the figure and its legend.

      **Referee Cross-commenting**

      I agree with the comments from the other reviewers. Many of the major themes are consistent across the reviews, including regarding the model, preparation methods, and the software tool.

      Reviewer #2 (Significance (Required)):

      Strengths: This manuscript is an important contribution to the field of sarcomere development. The authors use modern technologies to revisit variation in morphometric measurements in the literature, and they identify parameters that influence this variation. Notably, sex-specific differences, DLM versus DVM measurements, and mouting media are potential contributors to the variability. Combining TEM and STORM with a confocal timecourse of isolated myofibrils, they refine previously published values of sarcomere length and width, and add more comprehensive data for filament length, number and spacing. This highly accurate timecourse demonstrates continual growth of sarcomeres after 48 h APF, and correct some inconsistencies from previous large-scale timecourse datasets. These data are very valuable to the field, especially Drosophila muscle biologists, and will serve as a comparative resource for future studies. Weaknesses: At early timepoints, loss of sarcomere components through mechanical or detergent-mediated artifacts may influence the authors' measurements. In addition, isolating myofibrils is not always the most ideal approach, as it loses information on myofiber structure as well as organization and structure of the myofibrils in vivo.

      We believe that the control experiments we presented here adequately demonstrate that sarcomere measurements are not affected by the myofibril isolation process at early timepoints (Figure 3C). Nevertheless, we certainly agree with the reviewer that isolated myofibrils alone cannot capture the entire complexity of muscle tissues, and additional approaches should also be applied in complex projects. Yet, we are confident that our approach offers the most reliable and efficient method for precise morphometric analysis of the sarcomeres, and although alone it is very unlikely to be sufficient to address all questions of a muscle development project, it can still be applied as a very useful and robust tool.

      The point regarding liquid versus hardening mounting media is valuable, but remains to be tested and validated with the diverse liquid and hardening media used by other labs.

      Whereas it would not be feasible for us to test all possible liquid and hardening media used by others in all possible conditions, we tested the effect of Vectashield (the most commonly used liquid media) according to the suggestion of the reviewer, and the results are now included in the manuscript. We think that this is a valuable extension of the list of the materials and conditions we tested, although we need to point out that our primary goal was not necessarily to test as many conditions as possible (because the number of those conditions is virtually endless), rather to raise awareness among colleagues that these variables can significantly impact the data obtained and affect their comparability.

      The IMA software seems to be designed specifically for analysis of isolated myofibrils, and it is unclear if it would work for other types of IFM preparations.

      As stated in the manuscript, IMA is a specialized tool designed for the analysis of individual myofibrils. While it can also process other types of IFM preparations in semi-automatic or manual modes, we believe these approaches compromise both efficiency and accuracy. This is further clarified in the revised manuscript.

      A last point is that TEM and STORM may not be available on a regular basis to many labs, hindering wide implementation of the approach used in this manuscript to generate very accurate and detailed measurements of sarcomere morphometrics.

      Regarding the availability of TEM and STORM, we acknowledge that these techniques are not universally accessible. However, that is exactly one major value of our work that our open-source software tool now allows researchers to generate valuable data using only a confocal microscope in combination with our published datasets.

      Audience: Scientists who study sarcomerogenesis or Drosophila muscle biology.

      My expertise: I study muscle development in the Drosophila model.

      __Reviewer #3 (Evidence, reproducibility and clarity (Required)): __ Summary: This manuscripts presents a computational tool to quantify sarcomere length and myofibril width of the Drosophila indirect flight muscles, including developmental samples. This tool was applied to confocal and STORM super-resolution images of isolated myofibrils from adult and developing flight muscles. Thick filament numbers per myofibril were counted during development of flight muscles. A myofilament model of developing flight muscle myofibrils is presented that remains speculative for the early developmental stages.

      Major comments: 1. The title of the manuscript appears unclear. What is a lattice model? Lattice is an ordered array. The filament array parameters for mature flight muscles was aready measured. It appears that the authors speculate how this order might be generated during sarcomere assembly, which is not studied in this manuscript as it is limited to periodic arrays after 36h APF.

      As the reviewer correctly points out, a lattice refers to an ordered array - in the case of IFM sarcomeres, this includes both thin and thick filaments. Therefore, the phrase "myofilament lattice model of Drosophila flight muscle sarcomeres" specifically describes a model representing the spatial organization of these filament arrays within the sarcomere. To provide additional clarity for readers, we have revised the title to include more context. It now reads: Developmental Remodeling of Drosophila Flight Muscle Sarcomeres: A Scaled Myofilament Lattice Model Based on Multiscale Morphometrics

      To create a model of these arrays, three essential pieces of information are required:

      1) The length of the filaments,

      2) The number of filaments, and

      3) The relative position of the filaments.

      While some direct measurements are available in the literature, and others can be used to calculate the necessary values, available data is often contradictory or simply different from each other (as described in our ms) making them unsuitable for constructing scaled models of the myofilament arrays. In contrast to that, here we present a comprehensive and consistent set of measurements that enabled us to build models not only of mature sarcomeres but also of sarcomeres at three other significant developmental time points.

      Regarding the mention of "sarcomere assembly" in line 37, we intended it to refer to the growth of the sarcomeres, not their initial formation. We do not speculate about sarcomere assembly anywhere in the text. In fact, we have clearly stated multiple times that our focus is on the growth of the IFM myofilament array during myofibrillogenesis. Nevertheless, to avoid confusion, we revised the phrase in line 37 to "sarcomere growth".

      The authors review the flight muscle sarcomere length literature and conclude it is variable because of imprecise measurements. Likely this is partially true, however, more importantly is that the sarcomere length and width changes during isolation methods of the myofibrils, as well as by various embedding methods, as the authors show here as well in Figure 1B-E.

      We dedicated two sections of the Results - “An automated method to accurately measure sarcomeric parameters” and “IFM sarcomere morphometrics are affected by sex, age, fiber type, and sample preparation” - to exploring potential sources of variability in published IFM sarcomere measurements. Based on these analyses, we conclude that such variability stems from both measurement imprecision and biological or technical factors, including sex, age, fiber type and, of foremost, sample preparation. Because it is difficult to quantify the relative impact of each variable across published studies, we have refrained from speculations about the relative contribution of the different factors in the revised manuscript.

      Hence, I find the strongly claims the authors make here surprising, while they are isolating the myofibrils. Hence, these myofibrils are ruptured at the ends, relaxed or contracted, depending on buffer choice and passive tension is released. On page 8, the authors correctly state that the embedding medium causes shrinkage of the myofibrils. While isolation is state of the art for electron microscopy techniques, other methods including sectioning or even whole mount preparation have been developed for high resolution microscopy of IFMs that avoid these artifacts. Unfortunately, this manuscript only uses isolated myofibrils that were fixed and then mechanically dissociated by pipetting. This method likely induces variations as seen by the large spread of sarcomere length reported in Figure 1C (2.8-3.9µm?) and even bigger spreads for myofibril widths. Are these also seen in tissue without dissections? Unfortunately, no comparision to intact flight muscles are reported with the here presented quantification tool. The sarcomere length spread in the developmental samples is even larger.

      The major issue raised in this paragraph is the use of isolated myofibril versus intact flight muscle preparations. The reviewer claims that the latter might be superior because the isolated myofibrils are ruptured at their ends. Clearly, the intact IFMs cannot be imaged in vivo by light microscopy because the adult fly cuticle is opaque. To visualize these muscles, one must open the thorax, but neither microdissection nor sectioning preserves them perfectly, even the cleanest longitudinal cuts sever some myofibrils, and dissection itself can damage the tissue. Although published images often show only the most pristine regions, the practice of selective cropping cannot be taken as a scientific argument. Here, by comparing sarcomere lengths measured in isolated myofibrils with those from whole-mount longitudinal DLM sections and microdissected IFM myofibers, we demonstrate that isolation does not alter sarcomere length (Figure 1E, now Figure 2A in the revised manuscript). As to myofibril width, it is determined by two parameters: the number of myofilaments and the spacing between them. In vivo filament spacing has been measured directly, and filament counts can be obtained from EM cross-sections of DLM fibers. Combining these values gives an expected in vivo myofibril diameter. While isolated myofibrils measure thinner than those in whole-mount or microdissected samples (Figure 1E, now Figure 2A in the revised manuscript), their diameter closely matches this in vivo estimate (see manuscript, lines 187–198). Therefore, we conclude that isolated myofibrils (even if it seems counterintuitive for this reviewer) are superior for sarcomere measurements than whole-mount preparations - and that is why we primarily rely on them here.

      Despite that, we certainly recognize that isolated myofibrils cannot recapitulate every aspect of an IFM fiber, and the need for whole-mount preparations during our IFM studies is not questioned by us.

              In addition to this general answer to the issues raised in the above paragraph of the reviewer, we would like to specifically reflect for some of the remarks:
      

      „Unfortunately, this manuscript only uses isolated myofibrils that were fixed and then mechanically dissociated by pipetting.”

      This is a false statement that “this manuscript only uses isolated myofibrils” as we used different preparation methods for initial comparisons (see Figure 1E, now Figure 2A in the revised manuscript). Additionally, unlike the reviewer assumed, the myofibrils were first dissociated and then fixed, and not vice versa (as described in the Materials and Methods section).

      „This method likely induces variations as seen by the large spread of sarcomere length reported in Figure 1C (2.8-3.9µm?) and even bigger spreads for myofibril widths. Are these also seen in tissue without dissections?”

      This remark makes absolutely no sense, as we do not report sarcomere length values in Figure 1C at all. By assuming that the reviewer meant to refer to Figure 1B, it still remains a misunderstanding or a false statement, because that panel refers to the variations found in published data (not in our current data), and this is clearly explained both in the figure legend and the main text. Regardless of that, the stated spread does not appear unusual. In the article by Spletter et al. (2018), the authors report a similar spread (2.576–3.542 µm) for sarcomere length in mature IFM using whole-mount DLM cross-sections. As to the second question here, we do observe a comparable spread in other preparations as well (see Figure 1E, now Figure 2A in the revised manuscript), which is again the opposite conclusion as compared to the (clearly false) assumption of the reviewer.

      „Unfortunately, no comparision to intact flight muscles are reported with the here presented quantification tool. „

      This is also a false statement; as we do report comparison to whole mount cross sections which we belive the reviewer considers „intact” in Figure 1E (Figure 2A in the revised manuscript).

      „The sarcomere length spread in the developmental samples is even larger.”

      The spread is not larger at all than in previous reports, as clearly shown in Supplementary Figure 3A.

      The authors suggest that there are sex differences in sarcomere length and pupal development duration. This is potentially interesting, unfortunately they then use mixed sex samples to analyse sarcomeres during flight muscle development.

      In the revised manuscript, we now provide a more detailed description of a subtle post-eclosion difference in IFM sarcomere metrics between male and female Drosophila. We attribute this variation to the well-established observation that female pupae develop slightly faster than males, a property that may last till shortly after eclosion. Confirming this experimentally would require considerable effort with limited scientific benefit. Nonetheless, the subtle nature of this sex-linked variation reinforced our decision to include IFM sarcomeres from both male and female flies in our comprehensive developmental analysis.

      The IMA software tool lacks critical assessment of its performance compared to other tools and the validation presented is too limited. IMA seems to generate systematic errors, based on Fig S1E, as it does not report the ground truth. These have to be discussed and compared to available tools. The principles of fitting used in IMA seem well adapted to IFM myofibrils in low noise conditions, but may not be usable in other situations. This should be assessed and discussed.

      IMA is a specialized software tool developed to address a specific need, notably, to accurately and efficiently measure sarcomere length and myofibril diameter in individual IFM myofibril images labeled with both phalloidin and Z-disc markers. For our purposes, it remains the most suitable and reliable option, and we are confident that IMA outperforms all other available tools. To demonstrate this, we have included a table comparing the few alternatives (MyofibrilJ, SarcGraph, and sarcApp) capable of both measurements, which further supports our conclusion. Given IMA's focused application, extensive validation under artificially low signal-to-noise conditions is unnecessary. While IMA may introduce minor systematic errors (~0.01 µm for sarcomere length and ~0.03 µm for myofibril diameter), these are negligible errors relative to the limitations of the simulated ground truth data used for benchmarking. This point is now addressed in the manuscript.

      It is claimed that validation was achieved on simulated IFM images: do the authors rather mean simulated isolated IFM myofibril images? This is not quite the same in terms of algorithm complexity and this should be corrected if this is the case.

      Indeed, we used simulated individual IFM myofibril images, where both phalloidin labeling and Z-disc labeling are present. This is clearly shown in Supplementary Figure 1A, and stated in the text when first introduced: „we generated artificial images of IFM myofibrils with known dimensions, simulating the image formation process”

      The authors need to revise their comparison to other tools. It is incomplete and seemingly incorrect. It should be clearly stated that IMA is limited to isolated myofibrils, which is a far easier segmentation task than what other tools can do, such as sarcApp (Neininger-Castro et al. 2023, PMID: 37921850). Defining the acronym would be valuable in that sense. The claim line 129-130 "none can adequately measure myofibril diameter from regular side view images" is unclear. What do the authors refer to as "side view images"? Sarc-Graph from Zhao et al 2021, PMID: 34613960, and sarcApp from Neininger-Castro et al. 2023 provide sarcomere width, in conditions that are very similar to what IMA does, e.g. on xy images based on the documentation provided on github. A performance comparison with these tools would be valuable. Does installation and use of IMA require computational skills?

      Motivated by the reviewer’s comments, we revised the section introducing IMA. However, we chose not to include an extensive comparison with other software tools, as this would divert the manuscript’s focus without impacting the main conclusions. Instead, we added a summary table highlighting the key requirements for analyzing IFM sarcomere morphometrics from Z-stacks of phalloidin- and Z-line-labeled individual myofibrils and compared the available tools accordingly. In our experience, most software tools are developed to address very specific problems, even those marketed as general-purpose solutions. Consequently, applying them beyond their intended scope often results in reduced efficiency and suboptimal performance. Although sarcApp was initially available as a free tool, one of its dependencies (PySimpleGUI 5) has since adopted a commercial license model. Using a trial version of PySimpleGUI 5, we evaluated sarcApp on our dataset. The software is limited to single-plane image input, hence raw image stacks must be preprocessed into a suitable format, which is a time consuming step. Furthermore, implementation requires basic programming proficiency, as parameter adjustments must be performed directly within the source code to accommodate dataset-specific configurations. Once appropriately configured, sarcApp reliably quantifies both sarcomere length and myofibril width with accuracy comparable to that of IMA. However, it lacks built-in diagnostic feedback or visualization tools to facilitate measurement verification or troubleshooting during batch processing. SarcGraph also supports only single-plane image inputs and requires prior image preprocessing. Additionally, images must be loaded manually one by one, which further reduces processing efficiency. Parameter optimization relies on direct code modification through a trial-and-error process, demanding a certain level of programming proficiency. Even with these adjustments, the software frequently introduces artifacts - such as Z-line splitting - when applied to our dataset. Even when segmentation is successful, sarcomere length is often overestimated, whereas myofibril diameter is consistently underestimated. As compared to these issues, IMA was designed for ease of use and does not require any programming experience to install or operate. It can automatically handle raw microscopic image formats without the need for preprocessing. Segmentation is fully automated, with no requirement for parameter tuning. The tool provides visual feedback during both the segmentation and fitting steps, allowing users to confidently assess and validate the results. IMA produces accurate and precise measurements of sarcomere length and diameter. Batch processing is enabled by default, significantly improving efficiency when analyzing multiple images. Finally, unlike the reviewer stated, IMA is not limited to isolated myofibrils. It is optimized for isolated myofibrils (i.e. full performance is achieved on these samples), but it can also work on whole-mount preparations in semi-automatic and manual mode, which still allow precise measurements (with some reduction in processing efficiency).

      As to the minor comments, the acronym IMA was already defined in lines 541 and 917–918 of the original submission, as well as on the software’s GitHub page. Additionally, we replaced the phrase "side view images" with "longitudinal myofibril projections" to improve clarity.

      How do the authors know that the bright phallodin signal visible that the Z-disc at 36h and 48h APF is due to actin filament overlap, as suggested? An alternative solution are more short actin filaments at the early Z-discs.

      It is widely accepted that the bright phalloidin signal at the Z-line in mature sarcomeres reflects actin filament overlap (e.g., Littlefield and Fowler, 2002; PMID: 11964243). Accordingly, in slightly stretched myofibrils, this bright signal diminishes, and in more significantly stretched myofibrils, a small gap appears (e.g., Kulke et al., 2001; PMID: 11535621). The width of this bright phalloidin signal corresponds to the electron-dense band seen in longitudinal EM sections (Figure 3B and Supplementary Figure 5B, now Figure 4B and Figure 6B in the revised manuscript) and matches the actin filament overlap observed in Z-disc cryo-EM reconstructions from other species (Yeganeh et al., 2023; Rusu et al., 2017), where individual thin filaments can be resolved. By extension, we interpret the bright phalloidin signals at the Z-discs observed at 36 h and 48 h APF as arising from similar actin filament overlaps, given their comparable width to the electron-dense Z-bodies described both in our study (Supplemantary Figure 5B, now Figure 6B in the revised manuscript) and by Reedy and Beall (1993). While we cannot fully rule out the reviewer’s alternative interpretation, for the time being it remains a bold speculation without supporting evidence, and therefore we prefer to stay with the conventional view.

      The authors seem to doubt their own interpretation that actin filaments shrink when reading line 304 and following. This is obviously critical for the "model" presented.

      Unlike the reviewer implies, we certainly do not doubt our own interpretation, but to avoid confusion we revised the corresponding paragraph in the manuscript and provided more details on our explanation, and we also provide a brief overview of it here. Between 36 h and 48 h APF we observe a pronounced structural transition in the IFM sarcomeres. In EM cross-sections, the previously irregular myofilament lattice becomes organized into a regular hexagonal pattern (Figure 3A, now Figure 4A in the revised manuscript) with filament spacing typical of mature myofibrils (Supplementary Figure 5A, now Figure 6A in the revised manuscript). In longitudinal EM sections, the elongated, amorphous Z-bodies condense along the myofibril axis to form well-defined, adult-like Z-discs (Supplementary Figure 5B, now Figure 6B in the revised manuscript). Similarly, dSTORM imaging shows that the Z-disc associated D-Titin epitopes become more compact and organized during this period (Supplementary Figure 4E, now Figure 5E in the revised manuscript). The edges of the thick filament arrays also become more sharply defined, and the appearance of a distinct bare zone indicates the establishment of a regular register (Figure 3B, now Figure 4B in the revised manuscript). By assuming that a similar reorganization occurs within the thin filament array, the apparent length of the thin filament array would decrease—not due to shortening of individual filaments, rather due to improved alignment. Although we cannot directly resolve single thin filaments, this reorganization offers the most plausible explanation for the observed change.

      Minor comments: 1. Figure S1B is not called out in the text.

      The reviewer might have missed this, but in fact, it is explicitly called out in line 181.

      Fig. 1: Please state whenever images are simulations?

      We appreciate the reviewer’s observation that the simulated IFM myofibril images are indistinguishable from the real ones, as this confirms the adequacy of these images for testing our software tool. However, this is already clearly indicated: Figure 1B features simulated images, as noted in the figure legend (line 824), and Supplementary Figure 1A similarly shows simulated images, as stated both in the legend (line 886) and in the figure.

      Fig. 2: Length-width correlation - please provide individual points color-coded by time point?

      As suggested by the reviewer, we generated a plot with the individual points color-coded by time.

      "newly eclosed males and females, we observed that males have slightly shorter sarcomeres and narrower myofibrils". Please provide a statistical test supporting the difference.

      In the revised manuscript, we compared sarcomere length and myofibril width between males and females from 0 to 96 hours AE using a two-way ANOVA with Sidak’s multiple comparisons test. We expanded our description of these observations in the main text, and details of the statistical analysis are now included in the revised figure legend (Figure 1E). Briefly, newly eclosed males showed slightly shorter sarcomeres than females - a consistent but non-significant trend (p = 0.9846) - which resolved by 12 h AE, with sarcomere lengths remaining similar thereafter (p = 0.1533; Figure 1E). In contrast, myofibril width was significantly narrower in the newly eclosed males (p = 0.0374), but this difference disappeared between 24 and 48 h AE as myofibrils expanded in diameter during post-eclosion development (p

      Were statistical tests performed using animals as sample numbers? Please clarify in the images what are animal and what are sarcomere numbers.

      Following standard guidelines, statistical tests were performed using the means of independent experiments, as noted in the figure legends. For each experiment, we used approximately 6 animals, and this information is now included in the Materials and Methods section.

      mef2-Gal4 should be spelled Mef2-GAL4 according to Flybase.

      This has been corrected in the revised text and figures.

      Are the images shown in Figure 2B representative? 96h AE appears thicker than 24h AE but the graph reports no difference.

      We aimed to show representative images, however, in the case of 96h APF we may have selected a wrong example. We now changed the image for a more appropriate one.

      The authors only found Zasp52 and alpha-Actinin at the Z-discs from 72h APF onwards, which is different to what others have reported.

      Similarly to former reports, we detected both α-Actinin (see Supplementary Figure 3B, now Figure 3C in the revised manuscript) and Zasp52 in microdissected IFMs as early as 36 hours APF. However, these markers were largely absent in isolated myofibrils from the early pupal stages (36–60 hours APF). By 60 hours APF, strong α-Actinin and Zasp52 signals were clearly visible in isolated myofibrils (the closest timepoint captured by dSTORM is 72h APF). As discussed in the manuscript, a likely explanation is that α-Actinin and Zasp52 are recruited to developing Z-bodies early on but are only fully incorporated into mature Z-discs between 48 and 60 hours APF. Their incomplete integration at earlier stages may lead to their loss during the isolation procedure.

      Thick filament length during development has also been estimated by Orfanos and Sparrow, which should be cited (PMID: 23178940)

      Contrary to the reviewer’s claim, the article 'Myosin isoform switching during assembly of the Drosophila flight muscle thick filament lattice' does not provide any measurements or estimates of thick filament length; it only includes a schematic illustration where the length of the thick filaments is not based on empirical data.

      **Referee Cross-commenting**

      I also agree with my colleagues comments, which are largely consistent.

      Reviewer #3 (Significance (Required)):

      This paper introduces a tool to measure sarcomere length. Easy to use tools that do this as well already exist. The tool can also measure sarcomere width, which it claims as unique point, which is not the case, see above comment.

      We are aware that other tools exist to measure sarcomere parameters (and we did not claim the opposite in our ms), nevertheless, we need to emphasize that based on our comparisons, IMA is superior to all three alternatives. Three software tools could, in principle, be used to measure both sarcomere length and myofibril diameter: MyofibrilJ, SarcGraph, and sarcApp. However, two of them - MyofibrilJ and SarcGraph - consistently under- or overestimate these values. The only tool capable of performing these measurements reliably, sarcApp, is no longer freely available, it requires programming expertise, and it does not support raw image file formats, making it difficult to use in practice (see above comments for more details). In contrast, IMA is user-friendly and does not require any programming expertise to install or operate. It can automatically process raw microscopic image formats without the need for preprocessing. Segmentation is fully automated, and no parameter tuning is necessary. The tool offers visual feedback on both the segmentation and fitting processes, enabling users to validate results with confidence. IMA delivers accurate and precise measurements of sarcomere length and diameter. Additionally, batch processing is enabled by default, significantly enhancing workflow efficiency.

      This manuscript shows that depending on the isolation and embedding media sarcomere and myofibrils width changes and hence artifacts can be introduced. While this is not suprising, it has not been well controlled in a number of previous publications.

      Furthermore, this paper measures sarcomere length and width during flight muscle development and consolidates what was already known from previous publications. Sarcomeres are added until 48 h APF, then they grow in diameter. Despite strong claims in the text, I do not see any significant novel findings how sarcomeres grow in length or width or any significant deviations from what has been published before. This is even documented in the supplementary graphs by comparing to published data. It is close to identical.

      The overall process has been quantitatively described in four previous studies (Reedy and Beall, 1993, Orfanos et al., 2015, Spletter et al., 2018, Nikonova et al., 2024). While there is general agreement on the pattern of sarcomere development, significant discrepancies exist among these datasets; differences that become particularly problematic when attempting to build structural models. More specifically: Reedy and Beall (1993) report substantially shorter sarcomeres compared to all other datasets, including ours. This discrepancy likely stems from two factors: (i) their use of longitudinal EM sections, where sample preparation is known to cause considerable tissue shrinkage; and (ii) the maintenance of their flies at 23 °C, a temperature that clearly delays development relative to the more commonly used 25 °C. Interestingly, Spletter et al. (2018) and Nikonova et al. (2024) conducted their experiments at 27 °C, which also deviates from standard conditions and may complicate comparisons. Orfanos et al. (2015) suggested that mature sarcomere length is reached by approximately 88 hours after puparium formation (APF). In contrast, our measurements show that sarcomeres continue to elongate beyond this point, reaching mature length between 12 and 24 hours post-eclosion. All four earlier studies report a mature sarcomere length around 3.2-3.3 µm, only slightly longer than the ~3.2 µm length of thick filaments (Katzemich et al., 2012; Gasek et al., 2016). This would imply an I-band length below ~100 nm, which is an implausibly short distance. In contrast, our data, along with several recent studies (González-Morales et al., 2019; Deng et al., 2021; Dhanyasi et al., 2020; DeAguero et al., 2019), support a mature sarcomere length of approximately 3.45 µm, placing the length of the I-band at around 250 nm. This estimate is more consistent with high-resolution structural observations from longitudinal EM sections and fluorescent nanoscopy (Szikora et al., 2020; Schueder et al., 2023). Although Reedy and Beall (1993) provide limited data on myofibril diameter during myofibrillogenesis, a more detailed quantitative analysis is presented by Spletter et al. (2018) and by Nikonova et al. (2024). Interestingly, Spletter et al. report two separate datasets - one based on longitudinal sections and another on cross-sections of DLM fibers. While the measurements are consistent during early pupal stages, they diverge significantly in mature IFMs (1.116 ± 0.1025 µm vs. 1.428 ± 0.0995 µm), a discrepancy that is not addressed in their publication. Nikonova et al. (2024) report even narrower myofibril widths (0.9887 ± 0.1273 µm). Moreover, the reported diameters of early myofibrils in all three datasets are nearly twice as large as those reported by Reedy and Beall (1993) and in our own measurements, directly contradicting the reviewer's claim that the values are “close to identical.” Finally, our data clearly demonstrate that both the length and diameter of IFM sarcomeres reach a plateau in young adults, which is a key developmental feature not examined in previous studies.

      In summary, we did not and we do not intend to claim that our conclusions are novel as to the general mechanisms of myofibril and sarcomere growth. Rather, our contribution lies in providing a high-precision, robust analysis of the growth process using a state-of-the-art toolkit, resulting in a comprehensive description that aligns with structural data obtained from TEM and dSTORM. We therefore believe that expert readers will recognize numerous valuable aspects of our approaches that will advance research in the field.

      Counting the total number of thick filaments during myofibril development is nice, however, this also has been done (REEDY, M. C. & BEALL, C. 1993, PMID: 8253277). In this old study, the authors reported the amount of filament across one myofibril. How does this compare to the new data here counting all filaments? Unfortunatley, this is not discussed.

      Indeed, the study by Reedy and Beall (1993) was primarily based on longitudinal DLM sections, which were used to estimate myofibril width and count the number of thick filaments on this lateral view images (e.g., ~15 thick filaments wide at 75 hours APF), but total thick filament numbers were not provided. While such data could theoretically be used to estimate the number of myofilaments per myofibril, these estimations would depend on the unverified assumption that the section includes the full width of the myofibril. Additionally, the study did not provide standard deviations or the number of measurements, limiting the interpretability and reproducibility of their findings. These points highlight the need for a more rigorous and quantitative approach. For these reasons, we chose to quantify myofilament number using cross-sections, providing more accurate and reliable assessments.

      Besides the difference between the lateral versus cross sections, a direct comparison of our studies is further complicated by differences in the developmental time points and experimental conditions used. Reedy and Beall (1993) reports data from pupae aged 42, 60, 75 and 100 hours, as well as from adults, whereas we present data from 36, 48, and 72 hours APF, and from 24 hours after eclosion, which corresponds to approximately 124 hours APF. Moreover, their experiments were carried out at 23 °C, a temperature that somewhat slows down pupal development and results in adult eclosion at around 112 hours APF, as stated in their study. In contrast, our experiments were carried out at the more commonly used 25 °C, where adults typically emerge around 100 hours APF.

      Collectively, these differences prevented meaningful comparisons between the two datasets, and therefore we preferred to avoid lengthy discussions on this issue.

    1. Get your code into Android Studio Cloud from a version control system using Get from VCS from the welcome dialog. For more information about importing projects from Git, see Set up a Git repository in the IntelliJ documentation.

      Đã thử sử dụng Android Studio Cloud, nhưng sau phiên làm việc thì không thể Bookmark URL nên chưa biết lưu Project ở đâu 17/05/2025 : Sẽ thử nghiệm lại sau

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review):

      The ventral nerve cord (VNC) of organisms like Drosophila is an invaluable model for studying neural development and organisation in more complex organisms. Its well-defined structure allows researchers to investigate how neurons develop, differentiate, and organise into functional circuits. As a critical central nervous system component, the VNC plays a key role in controlling motor functions, reflexes, and sensory integration.

      Particularly relevant to this work, the VNC provides a unique opportunity to explore neuronal hemilineages - groups of neurons that share molecular, genetic, and functional identities. Understanding these hemilineages is crucial for elucidating how neurons cooperate to form specialized circuits, essential for comprehending normal brain function and dysfunction.<br /> A significant challenge in the field has been the lack of developmentally stable, hemilineage-specific driver lines that enable precise tracking and measurement of individual VNC hemilineages. The authors address this need by generating and validating a comprehensive, lineage-specific split-GAL4 driver library.

      Strengths and weaknesses

      The authors select new marker genes for hemilineages from previously published single-cell data of the VNC. They generate and validate specific and temporally stable lines for almost all the hemilineages in the VNC. They successfully achieved their aims, and their results support their conclusions. This will be a valuable resource for investigating neural circuit formation and function.

      We thank the reviewer for her/his positive comments and time reviewing our manuscript. We are pleased that the reviewer recognized the value of our work in generating a comprehensive, lineage-specific split-GAL4 driver library for VNC hemilineages. We agree that this will be a critical resource for investigating neural circuit formation and function, and we are encouraged by the positive comments regarding the novelty and potential impact of our approach.

      Reviewer #1 (Recommendations for the authors):

      I have no suggestions for further experiments, data, or analyses. There are some grammatical errors and referencing issues throughout, but the editors will hopefully catch them.

      We appreciate the reviewer’s comments regarding the grammatical errors and referencing issues and have carefully checked the revised manuscript.

      Reviewer #2 (Public review):

      It is my pleasure to review this manuscript from Soffers, Lacin, and colleagues, in which they identify pairs of transcription factors unique to (almost) every ventral nerve cord hemilineage in Drosophila and use these pairs to create reagents to label and manipulate these cells. The advance is sold as largely technical-as a pipeline for identifying durably expressed transcription factor codes in postmitotic neurons from single cell RNAseq data, generating knock-in alleles in the relevant genes, using these to match transcriptional cell types to anatomic cell types, and then using the alleles as a genetic handle on the cells for downstream explication of their function. Yet I think the work is gorgeous in linking the expression of genes that are causal for neuron-type-specific characteristics to the anatomic instantiations of those neurons. It is astounding that the authors are able to use their deep collective knowledge of hemilineage anatomy and gene expression to match 33 of 34 transcriptional profiles. Together with other recent studies, this work drives a major course correction in developmental biology, away from empirically identified cell type "markers" (in Drosophila neuroscience, often genomic DNA fragments that contain enhancers found to be expressed in specific neurons at specific times), and towards methods in which the genes that generate neuronal type identity are actually used to study those neurons. Because the relationship between fate and form/function is built into the tools, I believe that this approach will be a trojan horse to integrate the fields of neural development and systems neuroscience.

      We thank the reviewer for their time reviewing our manuscript, generous compliments, and appreciation of the potential of our study to drive a major shift in developmental biology, moving away from traditional marker-based methods toward utilizing the genes that mark neuronal type identity in “omics” datasets. Much like the Trojan Horse, which, though initially a concealed and subtle tool, we hope that the strategy outlined here will have continued impact, as we and others plan to leverage future high-resolution and developmental series of scRNAseq datasets to generate driver lines to target neuronal cell types with uttermost precision.

      Reviewer #2 (Recommendations for the authors):

      Line 126-127: I'm not sure if it is true to say "most TFs in the CNS are expressed in a hemilineage-specific manner." As the authors haven't formally interrogated how different neuronal features relate to expression patterns of all ~600 Drosophila TFs, how about replacing "most" with "many?"

      The reviewer makes an excellent point. Work by Lacin and colleagues has demonstrated via genetic studies that lineage-specific transcription factors that regulate the specification and differentiation of postembryonic neurons are stably expressed during development. This was documented for 15 transcription factors in Lacin et al., 2014, and our lab has identified additional examples since. When we refer to the stable expression of transcription factors, we refer to such transcription factors, not the complete set of ~600 transcription factors described to date. We have added this citation to clarify this statement and replaced p6 line 135 ”Most”  by “Many”. We have also address this now in the introduction (p5 line 109-116). Of note, as we conducted this study, we found that is closer to be a rule than an exception that if a transcription factor acted cluster as marker, it was also stably expressed during development. Thus, a growing number of transcription factors is now documented to be stably expressed in a hemilineage-specific manner

      Line 265: Typo? 334 should be 34?

      We thank the reviewer for noting this type error. We have corrected this typographical error.

      Line 522: Refs 56, 57 here related to chinmo, mamo, br-c don't show br-c or mamo mark temporal cohorts of postmitotic neurons. Consider adding PMID: 19883497, 18510932, and 31545163.

      We thank the reviewer for pointing this out and have added these references that demonstrate that broad, Mamo and Chinmo mark temporal cohorts in the developing adult CNS (p17 line 535).

      Reviewer #3 (Public review):

      Soffers et al. developed a comprehensive genetic toolkit that enables researchers to access neuronal hemilineages during developmental and adult time points using scRNA-seq analysis to guide gene cassette exchange-based or CRISPR-based tool building. Currently, research groups studying neural circuit development are challenged with tying together findings in the development and mature circuit function of hemilineage-related neurons. Here, authors leverage publicly available scRNA-seq datasets to inform the development of a split-Gal4 library that targets 32 of 34 hemilineages in development and adult stages. The authors demonstrated that the split-Gal4 library, or genetic toolkit, can be used to assess the functional roles, neurotransmitter identity, and morphological changes in targeted cells. The tools presented in this study should prove to be incredibly useful to Drosophila neurobiologists seeking to link neural developmental changes to circuit assembly and mature circuit function. Additionally, some hemilineages have more than one split-Gal4 combination that will be advantageous for studies seeking to disrupt associated upstream genes.

      Strengths:

      Informing genetic tool development with publicly available scRNA-seq datasets is a powerful approach to creating specific driver lines. Additionally, this approach can be easily replicated by other researchers looking to generate similar driver lines for more specific subpopulations of cells, as mentioned in the Discussion.

      The unification of optogenetic stimulation data of 8B neurons and connectomic analysis of the Giant-Fiber-induced take-off circuit was an excellent example of the utility of this study. The link between hemilineage-specific functional assays and circuit assembly has been limited by insufficient genetic tools. The tools and data present in this study will help better understand how collections of hemilineages develop in a genetically constrained manner to form circuits amongst each other selectively.

      Weaknesses:

      Although cell position, morphology (to some extent), and gene expression are good markers to track cell identity across developmental time, there are genetic tools available that could have been used to permanently label cells that expressed genes of interest from birth, ensuring that the same cells are being tracked in fixed tissue images.

      Although gene activation is a good proxy for assaying neurochemical features, relying on whether neurochemical pathway genes are activated in a cell to determine its phenotype can be misleading given that the Trojan-Gal4 system commandeers the endogenous transcriptional regulation of a gene but not its post-transcriptional regulation. Therefore, neurochemical identity is best identified via protein detection. (strong language used in this section of the paper).

      The authors mainly rely on the intersectional expression of transcription factors to generate split-Gal4 lines and target hemilineages specifically. However, the Introduction (Lines 97-99) makes a notable point about how driver lines in the past, which have also predominantly relied on the regulatory sequences of transcription factors, lack the temporal stability to investigate hemilineages across time. This point seems to directly conflict with the argument made in the Results (Lines 126-127) that states that most transcription factors are stably expressed in hemilineage neurons that express them. It is generally known that transcription factors can be expressed stably or transiently depending on the context. It is unclear how using the genes of transcription factors in this study circumvents the issue of creating temporally stable driver lines.<br />

      We thank the reviewer for their time to thoroughly and carefully review our manuscript. We appreciate the reviewer’s comments on its strengths, and we to hope that this body of work will prove to be incredibly useful to Drosophila neurobiologists seeking to link neural developmental changes to circuit assembly and mature circuit function. Likewise, we also appreciate the reviews careful consideration of its weaknesses, as the reviewer raises valid points. We have addressed these in our revised manuscript and believe this has significantly improved our manuscript.

      Weakness 1: Although cell position, morphology (to some extent), and gene expression are good markers to track cell identity across developmental time, there are genetic tools available that could have been used to permanently label cells that expressed genes of interest from birth, ensuring that the same cells are being tracked in fixed tissue images.

      The reviewer is fully correct, and we are aware of techniques developed by the laboratories of U. Banerjee, T. Lee, and J. Truman that can make transient GAL4 expression permanent, such as G-TRACE and lineage filtering. A common feature of these techniques is that effector activity is permanent (FLP-mediated removal of the FRT-flanked stop codon preceding GFP in G-TRACE or LexA in lineage filtering) but not the GAL4 activity, which is needed to take advantage of the vast UAS based effector lines such as RNAi libraries. For example, the study of Harris et al., 2015 from the Truman lab beautifully showed the strength of this kind of approaches for labeling the hemilineages but their approach cannot be used for functional studies for the reasons mentioned above. Fly lines using these approaches already have several transgenes and require the addition of several more to be used for functional studies. Our approach requires only two transgenes and is compatible with all UAS lines. One additional advantage of the splitGAL4 combinations that we identify here is that they are inserted in genes that are stably expressed throughout larval and pupal development in postmitotic cells, such that they can be used for functional manipulations during development. We emphasized this point in the discussion on page 16 under the heading “Mapping and manipulating morphological outgrowth patterns of hemilineages during development”. 

      Weakness 2: Although gene activation is a good proxy for assaying neurochemical features, relying on whether neurochemical pathway genes are activated in a cell to determine its phenotype can be misleading given that the Trojan-Gal4 system commandeers the endogenous transcriptional regulation of a gene but not its post-transcriptional regulation. Therefore, neurochemical identity is best identified via protein detection. (strong language used in this section of the paper).

      We thank the reviewer for bringing up this important point. We agree that the Trojan-GAL4 approach will not faithfully recapitulate expression of genes that undergo posttranscriptional regulation. Our previous eLife paper (Lacin et al., 2019) showed that this is the case for Trojan driver lines for the ChAT gene. This study demonstrated that ChAT drivers unexpectedly but strongly labeled many GABAergic and Glutamatergic neurons in both the brain and VNC. With RNA in situ hybridization and immunostainings approaches, we showed that these neurons indeed express ChAT mRNA but not the protein. After our publication, another group showed a class of miRNA binds to the 3’UTR of the ChAT gene and regulates its expression post-transcriptionally (Griffith 2023). We believe that one major reason the Trojan driver lines do not faithfully recapitulate this expression pattern is due to the presence of the Hsp70 transcriptional terminator located at the 5’ end of the trojan exon which prematurely ends the transcript and affects the host gene’s 3’ UTR regulation. For this reason, we have recently generated new Trojan plasmids which allow the retention of the 3’UTR of the host gene in the transcript. We have revised the result section “Neurotransmitter use on pages 11-12 to address this point and have modified the language.

      Weakness 3: The authors mainly rely on the intersectional expression of transcription factors to generate split-Gal4 lines and target hemilineages specifically. However, the Introduction (Lines 97-99) makes a notable point about how driver lines in the past, which have also predominantly relied on the regulatory sequences of transcription factors, lack the temporal stability to investigate hemilineages across time. This point seems to directly conflict with the argument made in the Results (Lines 126-127) that states that most transcription factors are stably expressed in hemilineage neurons that express them. It is generally known that transcription factors can be expressed stably or transiently depending on the context. It is unclear how using the genes of transcription factors in this study circumvents the issue of creating temporally stable driver lines.

      We thank the reviewer for pointing out this apparent paradox, which we have clarified in the manuscript (p4. lines 94-102). Driver lines in the past have relied on the intersection of genes to label a defined set of neurons, which helped marking more narrow cell populations compared to enhancer traps in the adult CNS. Elegant and elaborate screening methods have been devised to identify hemidriver combinations that mark specific subset of neurons in the adult (Meissner et al, 2025 (eLife 98405.2) and citations therein). However, these hemidrivers do not leverage the expression pattern of hemilineage marker genes. Instead, their expression is controlled by random 2-3 kb genomic fragments. We and others observed that these drivers are not stably expressed during development. Hence, hemidrivers combinations that work beautifully to target adult neuronal cel populations can oftentimes not be directly used for developmental studies. Work by Lacin et al. 2014 has demonstrated that transcription factors that mark hemilineages are oftentimes stably expressed in the embryo larvae and even adult. When we made driver lines for these TF, using artificial exons, its complete endogenous enhancers elements remain intact. Consequently, we find that Trojan driver lines recapitulate the expression pattern of the transcription factor gene in which it was inserted, and the hemidrivers are stably expressed during development. Hence, leveraging scRNAseq cluster markers for hemilineages and converting them to Trojan driver lines, the approach we took in this paper, has proven a powerful method to generate stable driver lines for developmental studies.

      Reviewer #3 (Recommendations for the authors):

      (1) Line 14: Affiliations typo should be correct to "St. Louis".

      We thank the reviewer for catching this and have corrected the typo.

      (2) Line 26: "model systems have focused on only on a few".

      We have replaced the words “a few regions” by “select regions” to better contrast that studies to date have been performed, but not at CNS level, due to the lack of genetic driver lines.

      (3) Line 52: The use of "medium" here is ambiguous without a comparison.

      We agree that the term “medium” in line 52 could be ambiguous without context, and we appreciate your suggestion to clarify this. The revised sentence now reads: “Drosophila has served as a powerful model system to investigate how neuronal circuits function due to its medium complexity compared to vertebrate models”

      (4) Line 91-92: Consider shortening to "of behavioral circuit assembly".

      Thank you for this suggestion, we have revised p4 lines 90-91 to: “Thus, taking a hemilineage-based approach is essential for a systematic and comprehensive understanding of behavioral circuit assembly during development in complex nervous systems.”

      (5) Line 216-217: Consider establishing what the expected morphology and neurochemical phenotype for 2A neurons is before presenting findings.

      This suggestion is well-taken, and agree that this paragraph did not fully get the point across we were trying to make. This purpose of this paragraph is to explain our workflow of how we assigned 16 hemilineages to orphan clusters, which is why we present the data in this order and present the morphology of hemilineage 2A last. To accommodate the reviewer’s suggestion, we have now clarified our approach before diving into the results to improve the flow of this paragraph (p8 lines 218-223). Briefly, the starting point to annotate the 16 orphan scRNAseq clusters was each time taking one orphan scRNAseq cluster, picking its top cluster marker genes that had not been established yet as marker genes for any hemilineage, and visualizing the morphology of the neurons that expressed such cluster marker using a reporter line for the cluster marker or an antibody stain for its protein. We then compared this to documented hemilineage morphologies, and to narrow down our search, we compared the observed trajectories to those of unannotated hemilineages that used the same neurotransmitter as the orphan scRNAseq. The evaluation of the documented morphologies of the hemilineages came at the last part of our method to annotate the hemilineages to orphan scRNAseq clusters, which is why we chose to present the expected morphology of a hemilineage at the end.

      (6) If "neurochemical" phenotype and "neurotransmitter" identity are sometimes used interchangeably but seem to mean the same thing. Consider choosing one term throughout.

      We thank the reviewer for this suggestion and have changed the terminology to “neurotransmitter use” (p11-12 lines 326-359).

      (7) Line 235: MARCM technique citation needed.

      We thank the reviewer for pointing this out, the citation (no. 37, p9 line 249) was present in the method section, but we had inadvertently omitted it in the main text and we have now corrected this.

      (8) Line 281: typo, should be "patterns".

      We thank the reviewer for noting this and have corrected this.

      (9) Line 469: End of sentence needs a ".".

      We have added the punctuation mark.

      (10) Line 516: "driver line combinations to express...".

      We have inserted the word “to” to correct it.

      (11) Please make sure that the correct genotypes are matched in the figure legends and Table 1. For instance, knot-GAL4-DBD is listed as the hemi driver for 10B neurons in Figure 3 but only knot-p65.AD is listed in Table 1.

      We thank the reviewer for catching this, we made a mistake and the correct hemidriver combination used in Figure 3L i: knot-GAL4-AD with hb9-GAL4-DBD. We have updated the legend and carefully checked the legends and tables.

      (12) Consider making different color choices for readability when possible and be consistent with labeling CadN. For instance, in Figure 1 the magenta color has three separate meanings: CadN, Acj6, and unc-4. Either of the three genes can be mistaken for the other for a reader mainly paying attention to the magenta color. I find that one color can mean two things in a figure if organized properly but any more begs for confusion. Also, CadN can be easily labeled if used in a new figure (e.g. Figure 1-Supplment 1).

      We thank the reviewer for this insightful observation and have adjusted figure 1 so that cadN is displayed in blue and reporter genes expressing Acj6, Unc-4 or their intersection in green. The legend is modified to reflect these changes.

      (13) If Seurat object changes or additional quality control steps were taken from the original studies, please provide these changes. Similarly, provide any scRNA-seq code used or cite code used for readers to access. Also, provide a section in the methods briefly describing how genes were chosen (criteria) for tool development.

      We thank the reviewer for nothing we had not described our scRNA analysis pipeline and criteria to select transcription factors in the methods section of the manuscript. We have added this section at p19 lines 548-558. Briefly, we used the Seurat object generated by Allen et al., 2015, and did not change quality control steps, normalizations or scaling. Candidate genes to make split-GAL4 drivers from were chosen based on their ability to mark the clusters defined by Allen et al. We did not use computer-based algorithms and made a list of the top cluster markers. Then, we made binary combinations amongst these cluster markers and with hemilineages markers we had identified before (Lacin et al, 2014; Lacin et al 2019), and used the code generated by Allen et al., 2015 (deposited on Github) with Seurat v5 to test if these combinations marked unique clusters. We then prioritized testing these combinations based on the availability of antibodies, BAC lines and CRiMIC/MiMIC constructs to validate their expression pattern prior to creating split-GAL4 lines for these candidates.

      (14) In regard to the seemingly contradictory argument that most transcription factors are stably expressed when most drivers of the past used regulatory elements of transcription factors: the paper could be strengthened by either a) describing how older driver lines differ from the lines presented in the paper or b) remarking on the endogenous temporal stability of the transcription factors used in this study.

      We thank the reviewer for pointing this out, and we agree that it is necessary to clarify this apparent paradox since it is essential for understanding the impact of the present work. We have revised our manuscript described in our response to weakness 1.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This manuscript is a focused investigation of the phosphor-regulation of a C. elegans kinesin-2 motor protein, OSM-3. In C-elegans sensory ciliary, kinesin-2 motor proteins Kinesin-II complex and OSM-3 homodimer transport IFT trains anterogradely to the ciliary tip. Kinesin-II carries OSM-3 as an inactive passenger from the ciliary base to the middle segment, where kinesin-II dissociates from IFT trains and OSM-3 gets activated and transports IFT trains to the distal segment. Therefore, activation/inactivation of OSM-3 plays an essential role in its ciliary function.

      Strengths:

      In this study, using mass spectrometry, the authors have shown that the NEKL-3 kinase phosphorylates a serine/threonine patch at the hinge region between coiled coils 1 and 2 of an OSM-3 dimer, referred to as the elbow region in ubiquitous kinesin-1. Phosphomimic mutants of these sites inhibit OSM-3 motility both in vitro and in vivo, suggesting that this phosphorylation is critical for the autoinhibition of the motor. Conversely, phospho-dead mutants of these sites hyperactivate OSM-3 motility in vitro and affect the localization of OSM3 in C. elegans. The authors also showed that Alanine to Tyrosine mutation of one of the phosphorylation rescues OS-3 function in live worms.

      Weaknesses:

      Collectively, this study presents evidence for the physiological role of OSM-3 elbow phosphorylation in its autoregulation, which affects ciliary localization and function of this motor. Overall, the work is well performed, and the results mostly support the conclusions of this manuscript. However, the work will benefit from additional experiments to further support conclusions and rule out alternative explanations, filling some logical gaps with new experimental evidence and in-text clarifications, and improving writing before I can recommend publication.

      We appreciate Reviewer #1’s comments and suggestions. We have now provided additional evidences and discussions to further support our conclusions and fill the logical gaps. We have also provided alternative explanations to our data and improved writing.

      Reviewer #2 (Public review):

      Summary:

      The regulation of kinesin is fundamental to cellular morphogenesis. Previously, it has been shown that OSM-3, a kinesin required for intraflagellar transport (IFT), is regulated by autoinhibition. However, it remains totally elusive how the autoinhibition of OSM-3 is released. In this study, the authors have shown that NEKL-3 phosphorylates OSM-3 and releases its autoinhibition.

      The authors found NEKL-3 directly phosphorylates OSM-3 (although the method is not described clearly) (Figure 1). The phophorylated residue is the "elbow" of OSM-3. The authors introduced phospho-dead (PD) and phospho-mimic (PM) mutations by genome editing and found that the OSM-3(PD) protein does not form cilia, and instead, accumulates to the axonal tips. The phenotype is similar to another constitutive active mutant of OSM-3, OSM-3(G444A) (Imanishi et al., 2006; Xie et al., 2024). osm-3(PM) has shorter cilia, which resembles with loss of function mutants of osm-3 (Figure 3). The authors did structural prediction and showed that G444E and PD mutations change the conformation of OSM-3 protein (Figure 3). In the single-molecule assays G444E and PD mutations exhibited increased landing rate (Figure 4). By unbiased genetic screening, the authors identified a suppressor mutant of osm-3(PD), in which A489T occurs. The result confirms the importance of this residue. Based on these results, the authors suggest that NEKL-3 induces phosphorylation of the elbow domain and inactivates OSM-3 motor when the motor is synthesized in the cell body. This regulation is essential for proper cilia formation.

      Strengths:

      The finding is interesting and gives new insight into how the IFT motor is regulated.

      Weaknesses:

      The methods section has not presented sufficient information to reproduce this study.

      We appreciate that Reviewer #2 is also positive to our study. We have now provided sufficient information in the revised Methods section.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Major Concerns

      (1) Why do the authors think that NEKL-3 phosphorylates OSM-3 in the first place? This seems to come out of nowhere and prior evidence indicating that NEKL-3 may be phosphorylating OSM-3 is not even mentioned in the Introduction.

      We thank the Reviewer for raising this important point. Our hypothesis that NEKL-3 phosphorylates OSM-3 stems from prior findings in our lab. In a previous study (Yi et al., Traffic, 2018, PMID: 29655266), we identified NEKL-4, a member of the NIMA kinase family, as a suppressor of the OSM-3(G444E) hyperactive mutation. This discovery prompted us to explore the broader role of NIMA kinases in regulating OSM3. Subsequent genetic screens (Xie et al., EMBO J, 2024, PMID: 38806659) revealed that both NEKL-3 and NEKL-4 suppress multiple OSM-3 mutations, further supporting their functional interaction. Given the established role of NIMA kinases in phosphorylation-dependent processes (Fry et al., JCS, 2012, PMID: 23132929; Chivukula et al., Nat. Med., 2020, PMID: 31959991; Thiel, C. et al. Am. J. Hum. Genet. 2011, PMID: 21211617; Smith, L. A. et al., J. Am. Soc. Nephrol., 2006, PMID: 16928806), we hypothesized that NEKL-3/4 may directly phosphorylate OSM-3 to modulate its activity.

      To test this hypothesis, we expressed recombinant C. elegans NEKL-3 and OSM-3 proteins and conducted in vitro phosphorylation assays. While we were unable to obtain active recombinant NEKL-4 (limitations noted in the revised text), our experiments with NEKL-3 revealed phosphorylation at residues 487-490 (YSTT motif) in OSM-3’s tail region, as confirmed by mass spectrometry. These findings are now explicitly contextualized in the Introduction and Results sections of the revised manuscript.

      Page #4, Line #11:

      “...In our previous study (Yi et al., Traffic, 2018, PMID: 29655266), a genetic screen targeting the OSM-3(G444E) hyperactive mutation identified NEKL-4, a member of the NIMA kinase family, as a suppressor of this phenotype. This finding, combined with reports that NIMA kinases regulate ciliary processes independently of their canonical mitotic roles (Fry et al., JCS, 2012, PMID: 23132929; Chivukula et al., Nat. Med., 2020, PMID: 31959991; Thiel, C. et al. Am. J. Hum. Genet. 2011, PMID: 21211617; Smith, L. A. et al., J. Am. Soc. Nephrol., 2006, PMID: 16928806), prompted us to investigate whether NIMA kinases modulate OSM-3-driven intraflagellar transport. We hypothesized that NEKL-3/4, as paralogs within this family, might directly phosphorylate OSM-3 to regulate its motility...”

      Page #4, line #26:  

      “... To determine whether NIMA kinase family members could directly phosphorylate

      OSM-3, we purified prokaryotic recombinant C. elegans NEKL-3/NEKL-4 and OSM3 protein in order to perform in vitro phosphorylation assays. We were able to obtain active recombinant NEKL-3 but not NEKL-4. The in vitro phosphorylation assays showed that NEKL-3, directly phosphorylates OSM-3 (Fig. 1A-B, Appendix Table S1). Subsequent mass spectrometric analysis revealed phosphorylation at residues 487-490, which localize to the conserved "YSTT" motif within OSM-3’s C-terminal tail region ...”

      (2) The authors need to characterize the proteins they expressed and purified for in vitro ATPase and motility assays. Are these proteins monomers or dimers?

      For our in vitro ATPase and motility assays, OSM-3 was expressed in E. coli BL21(DE3) and purified using established protocols (Xie et al., EMBO J, 2024, PMID: 38806659; Imanishi et al., JCB, 2006, PMID: 17000874). To confirm its oligomeric state, we analyzed recombinant OSM-3 by size-exclusion chromatography coupled with multiangle light scattering (SEC-MALS). As reported in Xie et al. (2024), OSM-3 (~80 kDa monomer) elutes with a molecular weight of 173–193 kDa under physiological buffer conditions, consistent with a homodimeric assembly. These findings confirm that the functional unit used in our assays is the biologically relevant dimer. This characterization has been added to the revised manuscript on Page #35, Line #7.

      “…OSM-3 was expressed in E. coli BL21(DE3) and purified for in vitro assays using established protocols (REFs). Size-exclusion chromatography coupled with multiangle light scattering (SEC-MALS) (Xie et al., EMBO J., 2024) confirmed that recombinant OSM-3 forms a homodimer (173–193 kDa) under physiological conditions, ensuring its dimeric state remained intact....” 

      (3) The authors primarily used PD and PM mutations, which affect all four amino acids in the region. This may or may not be physiologically relevant. Figure 5 indicates that T489 is a critical regulatory site. However, this conclusion is undermined by reliance on PD mutations, which affect all four amino acids. Creating PM (T489E) and PD (T489A) mutations based on WT OSM-3 would better reflect physiological relevance. In vitro assays with a single phosphomimic or phosphor-dead mutation at residue 489 are missing at the end of this story. This would better link Figure 5 with the rest of the manuscript.

      We thank the reviewer for this constructive critique. Below, we address the concerns and integrate new data to strengthen the link between T489 and autoinhibition:

      To probe the regulatory role of T489 phosphorylation, we generated osm-3(T489E) (phosphomimetic, PM) and osm-3(T489A) (phospho-dead, PD) mutant animals. Strikingly, both mutants formed axonal puncta (Figure S7), recapitulating the hyperactive phenotype of the OSM-3G444E mutant. While the similar puncta formation in PM and PD mutants initially appeared paradoxical, this observation underscores the necessity of dynamic phosphorylation cycling at T489 for proper autoinhibition. Specifically, the PD mutant (T489A) likely disrupts phosphorylationdependent autoinhibition stabilization, leading to constitutive activation, where as the PM mutant (T489E) may mimic a "locked" phosphorylated state, preventing dephosphorylation-dependent release of autoinhibition in cilia and trapping OSM-3 in an aggregation-prone conformation. These results highlight T489 as a structural linchpin whose post-translational modification dynamically regulates motor activity. While the precise molecular mechanism—such as how phosphorylation modulates tailmotor domain interactions—remains to be elucidated, our data conclusively demonstrate that perturbing T489 (even in isolation) destabilizes autoinhibition, driving puncta formation and the constitutive activity.

      We have integrated the above paragraph in the revised manuscript on page #8, line #27.

      (4) There seems to be a disconnect between the MT gliding assays in Figure 4C and single molecule motility assays in Figure 4E. The gliding assays show that all constructs can glide microtubules at near WT speeds. Yet, the motility assays show that WT and PM cannot land or walk on MTs. The authors need to explain why this is the case. Is this because surface immobilization of kinesin from its tail disrupts autoinhibition? Alternatively, the protein preparation may include monomers that cannot be autoinhibited and cannot land and processively walk on surface-immobilized microtubules (because they only have one motor domain) but can glide microtubules when immobilized on the surface from their tail.

      The surface immobilization of OSM-3 via its tail domain disrupts autoinhibition, a phenomenon previously observed in other kinesins such as kinesin-1 (Nitzsche et al, Methods Cell Biol., 2010, PMID: 20466139). In our assays, OSM-3 was nonspecifically immobilized on glass surfaces, enabling microtubule gliding by motors whose autoinhibition was relieved through tail anchoring. Critically, the PD and PM mutations reside in the tail region and do not alter the intrinsic properties of the motor head domain. Consequently, once autoinhibition is released via immobilization, the gliding velocities reflect the conserved motor head activity, which is expected to remain comparable across all constructs. While we cannot entirely rule out the presence of monomeric OSM-3 in solution, several lines of evidence argue against this possibility. First, the mutations are located in the elbow region, which is dispensable for motor dimerization. Second, SEC-MALS analysis from prior studies confirms that purified OSM-3 exists predominantly as dimers in solution. 

      We have discussed these issues in the revised text on page #10, line #18: 

      “…In our gliding assays, OSM-3PM has an increased gliding speed of 0.69 ± 0.07 μm/s (Fig. 4 C-D), similar to PD mutant. PD and PM mutations are confined to the elbow region, leaving the motor head’s mechanochemical properties intact. Upon tail immobilization—which releases autoinhibition—the gliding speeds reflect motor head activity. Single-molecule assays, however, directly resolve their native regulatory states: PD mutants are constitutively active, whereas PM mutants persist in an autoinhibited state (Fig. 4E-G). Although monomeric OSM-3 could theoretically mediate singlemotor gliding, the previous SEC-MALS data demonstrate that OSM-3 purifies as stable dimers (Xie et al., EMBO J, 2024, PMID: 38806659). Thus, dimeric OSM-3 is perhaps the predominant functional species in our assays…”

      (5) An alternative explanation for the data is that both PD and PM mutations result in loss-of-function effects, disrupting OSM-3 activity. For instance:

      a) In Figure 2C, both mutations cause shorter cilia than the wild type (WT).

      b) In Figure 4A, both mutations result in higher ATPase activity than WT.

      c) In Figure 4D, both mutations show increased gliding velocity compared to WT. These results suggest the observed effects could stem from loss of function rather than phosphorylation-specific regulation.

      Although PD and PM mutations exhibit superficially similar "loss-of-function" phenotypes in certain assays, they mechanistically disrupt motor regulation in distinct ways:

      a) Ciliary Length (Figure 2C) PD Mutants: Hyperactivation causes OSM-3-PD to prematurely aggregate into axonal puncta, preventing ciliary entry. Consequently, cilia are built solely by the weaker Kinesin-II motor, which only constructs shorter middle segments.

      PM Mutants: OSM-3-PM retains autoinhibition during transport (enabling ciliary entry) but cannot be dephosphorylated in cilia. This blocks activation, leaving OSM-3-PM partially functional and resulting in cilia intermediate in length between WT and PD.

      We have discussed this issue in the revised text on page #5, line #30:

      “…These findings indicate that OSM-3-PM is in an autoinhibited state capable of ciliary delivery, yet fails to achieve full activation due to defective dephosphorylation. This incomplete activation results in suboptimal motor function and intermediate ciliary length phenotypes (Fig.2 B-C). In contrast, OSM-3-PD exhibits constitutive activation leading to aggregation into axonal puncta, which completely abolishes its ciliary entry capacity (Fig.2 A-B)...”

      b) ATPase Activity (Figure 4A)

      PD Mutants: Fully autoinhibition-released (98.15% of KHC ATPase activity), consistent with constitutive activation.

      PM Mutants: Show partial ATPase activity (34.28% of KHC), reflecting imperfect phosphomimicry. While the DDEE substitution introduces negative charges, it fails to fully replicate the steric/kinetic effects of phosphorylated tyrosine (Y486; phenyl ring absent), resulting in incomplete autoinhibition stabilization. Despite this, the residual inhibition is sufficient to phenocopy shorter cilia in vivo.

      We have discussed this issue in the revised text on page #7, line#19:

      “…The PM mutant’s partial ATPase activity (34.28% of KHC) might arise from imperfect phosphomimicry—while the DDEE substitution introduces negative charges, it lacks the steric bulk of phosphorylated tyrosine (pY487). And this incomplete mimicry allows residual autoinhibition, sufficient to limit ciliary construction in vivo...”

      c) Microtubule Gliding Velocity (Figure 4D)

      Gliding Assay Limitation: Tail immobilization artificially releases autoinhibition, masking regulatory differences. Thus, all constructs (PD, PM) exhibit similar velocities (~0.7 µm/s), reflecting conserved motor head activity.

      Single-Molecule Assay (Figure 4E): Directly resolves native autoinhibition states:

      PD mutants show robust motility (autoinhibition released).

      PM mutants remain largely inactive (autoinhibition retained).

      We have discussed this issue in the revised text on page #10, line#18:

      “…In our gliding assays, OSM-3PM has an increased gliding speed of 0.69 ± 0.07 μm/s (Fig. 4 C-D), similar to PD mutant. PD and PM mutations are confined to the elbow region, leaving the motor head’s mechanochemical properties intact. Upon tail immobilization—which releases autoinhibition—the gliding speeds reflect motor head activity. Single-molecule assays, however, directly resolve their native regulatory states: PD mutants are constitutively active, whereas PM mutants persist in an autoinhibited state (Fig. 4E-G)...”

      Minor Suggestions and Concerns

      (1) Lines 60-66: References that support these observations are missing from this section.

      We have added the relevant references.

      (2) Lines 66-67: I would revise this sentence as "It remains unclear how OSM-3 becomes enriched...".

      We have made the changes.

      (3) Line 85: The authors should describe how they perform these assays (i.e. recombinantly expressed NEKL-3 and OSM-3, are these C. elegans proteins, and which expression system was used...).

      We have described them in the main text and methods

      Page #4 line #26

      “...To determine whether NIMA kinase family members could directly phosphorylate OSM-3, we purified prokaryotic recombinant C. elegans NEKL-3/NEKL-4 and OSM-3 protein in order to perform in vitro phosphorylation assays...”

      Page #35 line#12

      “...Basically, point mutations was introduced in to pET.M.3C OSM-3-eGFP-His6 plasmid for prokaryotic expression. Plasmid transformed E. coli (BL21) was cultured at 37°C and induced overnight at 23°C with 0.2 mM IPTG. Cells were lysed in lysis buffer (50 mM NaPO4 pH8.0, 250 mM NaCl, 20 mM imidazole, 10 mM bME, 0.5 mM ATP, 1 mM MgCl¬2, Complete Protease Inhibitor Cocktail (Roche)) and Ni-NTA beads were applied for affinity purification. After incubation, beads were washed with wash buffer (50 mM NaPO4 pH6.0, 250 mM NaCl, 10 mM bME, 0.1 mM ATP, 1 mM MgCl¬2) and eluted with elute buffer (50 mM NaPO4 pH7.2, 250 mM NaCl, 500 mM imidazole, 10 mM bME, 0.1 mM ATP, 1 mM MgCl¬2). Protein concentration was determined by standard Bradford assay. C elegans nekl-3 cDNA was cloned in to pGEX-6P GST vector and expressed in E. coli BL21 (DE3) and purified for in vitro phosphorylation assays. Plasmid transformed E. coli (BL21) was cultured at 37°C and induced overnight at 18°C with 0.5 mM IPTG. Cells were lysed in lysis buffer (50 mM NaPO4 pH8.0, 250 mM NaCl, 1 mM DTT, Complete Protease Inhibitor Cocktail (Roche)) and GST beads were applied for affinity purification. After incubation, beads were washed with wash buffer (50 mM NaPO4 pH6.0, 250 mM NaCl, 1 mM DTT) and eluted with elute buffer (50 mM NaPO4 pH7.2, 150 mM NaCl, 10 mM GSH, 1 mM DTT). Purified proteins were dialyzed against storge buffer (50 mM Tris-HCl, pH 8.0, 150 mM NaCl). Protein concentration was determined by standard Bradford assay...”

      (4) Line 141: The first sentence of this paragraph lacks motivation. I would start this sentence with "To directly observe the effects of phosphor mutants in the elbow region in microtubule binding and motility of OSM-3, we...".

      We have made the change.

      (5) Figure 1B: The mass spectrometry data in Figure 1B lacks adequate explanation. The Methods section should detail the experimental protocol, data interpretation, and any databases used. Additionally, the manuscript should list all identified phosphorylation sites on OSM-3 to provide context, including whether Y487_T490 is the major site.

      We have provided the detailed experimental protocol, data interpretation, and databases used in methods. We have provided all identified sites as Appendix table S1.

      (6) Figure 1C: Is it possible to model the effect of PM and PD mutations using AlphaFold? The authors should also show PAE or pLDDT scores of their model.

      AlphaFold cannot well model the effect of mutants, but we conducted the Rosetta relax to capture their possible conformational changes, as shown in the revised Figure 3. We have provided PAE and pLDDT as a new figure, Figure S2.

      (7) Figure 2D: The unit for speed should use a lowercase "s" for seconds.

      We have fixed it.

      (8) Figure 3: I am not sure whether this figure stands for a main text figure on its own, as it is only a Rosetta prediction and is not supported by any experimental data. In addition, it remains unclear what the labels on the x-axis mean.

      We have updated the figure and explain the labels on the x-axis in Figure S4 to make it more reader-friendly.

      (9) Figure 4: NEKL-3-treated OSM-1 should be included as a positive control in the in vitro experiments.

      We suspect that the Reviewer asked for NEKL-3-treated OSM-3. 

      In our other study which has just been accepted by the Journal of Cell Biology, NEKL3-treated OSM-3 significantly reduced the affinity between OSM-3 motor and microtubules and showed very low ATPase activity. We have cited and discussed this in the revised text on page #10, line #28: 

      “…As demonstrated in our recent study (Huang et al., JCB, 2025, In press, attached), phosphorylation of OSM-3 by NEKL-3 at two distinct regions—Ser96 and the conserved "elbow" motif—differentially regulates its activity and localization. Phosphorylation at Ser96 reduces OSM-3’s ATPase activity and alters its ciliary distribution from the distal segment to a uniform localization, while elbow phosphorylation induces autoinhibition, retaining OSM-3 in the cell body. Strikingly, in vitro phosphorylation of OSM-3 by NEKL-3 significantly reduces its microtubulebinding affinity, likely arising from combined modifications at both sites. We propose a model wherein elbow phosphorylation ensures anterograde ciliary transport, while Ser96 phosphorylation fine-tunes distal segment targeting. This multistep regulation may involve distinct phosphatases to reverse phosphorylation at specific sites, a hypothesis warranting further investigation….”

      (10) Figure 4C, D, and F: The unit of velocity is wrong. The authors should use the same units they used in the table shown in Figure 4B.

      We have fixed these errors

      (11) Figure 4F: The velocity of PD is a lot lower than G444E. Therefore, it would be more appropriate to refer to PD as partially active, rather than hyperactive.

      We have made the change. 

      (12) Figure 5: There is too much genetics jargon on this figure (EMF, F2, 100%Dyf,...). How are the alleles numbered? Is it OK to refer to them as Alleles 1 and 2 for simplicity?

      According to the established C. elegans allele nomenclature, each worm allele has a unique number named after the lab code for identification. We have simplified the labels and updated the figure to make it more reader-friendly.

      (13) Figure 5E: A plot would be more reader-friendly than a table. Additionally, the legend for Fig. 5E mistakenly refers to it as "D."

      We have changed the table to a plot and fixed the mistakes. We thank the Reviewer for pointing them out.

      Reviewer #2 (Recommendations for the authors):

      (1) The model appears as if NEKL-3 induces dephosphorylation of OSM-3 (Figure 6). This is not consistent with the conclusions described in the Discussion and is confusing.

      We have updated the model figure and fixed the error.

      (2) It should be described why the authors hypothesized NEKL-3 phosphorylates OSM3. Was there genetic evidence? Did the authors screened cilia-related kinases? or Did the authors identify it incidentally? Providing this information would help readers to understand the context of the research.

      We appreciate both Reviewers for pointing out this issue. 

      Our hypothesis that NEKL-3 phosphorylates OSM-3 stems from prior findings in our lab. In a previous study (Yi et al., Traffic, 2018, PMID: 29655266), we identified NEKL-4, a member of the NIMA kinase family, as a suppressor of the OSM-3(G444E) hyperactive mutation. This discovery prompted us to explore the broader role of NIMA kinases in regulating OSM-3. Subsequent genetic screens (Xie et al., EMBO J, 2024, PMID: 38806659) revealed that both NEKL-3 and NEKL-4 suppress multiple OSM-3 mutations, further supporting their functional interaction. Given the established role of NIMA kinases in phosphorylation-dependent processes (Fry et al., JCS, 2012, PMID: 23132929; Chivukula et al., Nat. Med., 2020, PMID: 31959991; Thiel, C. et al. Am. J. Hum. Genet. 2011, PMID: 21211617; Smith, L. A. et al., J. Am. Soc. Nephrol., 2006, PMID: 16928806), we hypothesized that NEKL-3/4 may directly phosphorylate OSM3 to modulate its activity.

      To test this hypothesis, we expressed recombinant C. elegans NEKL-3 and OSM-3 proteins and conducted in vitro phosphorylation assays. While we were unable to obtain active recombinant NEKL-4 (limitations noted in the revised text), our experiments with NEKL-3 revealed phosphorylation at residues 487-490 (YSTT motif) in OSM-3’s tail region, as confirmed by mass spectrometry. These findings are now explicitly contextualized in the Introduction and Results sections of the revised manuscript.

      Page #4, Line #11:

      “... In our previous study (Yi et al., Traffic, 2018, PMID: 29655266), a genetic screen targeting the OSM-3(G444E) hyperactive mutation identified NEKL-4, a member of the NIMA kinase family, as a suppressor of this phenotype. This finding, combined with reports that NIMA kinases regulate ciliary processes independently of their canonical mitotic roles (Fry et al., JCS, 2012, PMID: 23132929; Chivukula et al., Nat. Med., 2020, PMID: 31959991; Thiel, C. et al. Am. J. Hum. Genet. 2011, PMID: 21211617; Smith, L. A. et al., J. Am. Soc. Nephrol., 2006, PMID: 16928806), prompted us to investigate whether NIMA kinases modulate OSM-3-driven intraflagellar transport. We hypothesized that NEKL-3/4, as paralogs within this family, might directly phosphorylate OSM-3 to regulate its motility...”

      Page #4, line #26: 

      “... To determine whether NIMA kinase family members could directly phosphorylate OSM-3, we purified prokaryotic recombinant C. elegans NEKL-3/NEKL-4 and OSM3 protein in order to perform in vitro phosphorylation assays. We were able to obtain active recombinant NEKL-3 but not NEKL-4. The in vitro phosphorylation assays showed that NEKL-3, directly phosphorylates OSM-3 (Fig. 1A-B, Appendix Table S1). Subsequent mass spectrometric analysis revealed phosphorylation at residues 487-490, which localize to the conserved "YSTT" motif within OSM-3’s C-terminal tail region...”

      (3) It is curious the authors have not addressed the cilia phenotype and the localization of OSM-3 in nekl-3 mutant. Regardless of whether these observations agrees with the proposed mechanisms, it is essential for the authors to show and discuss the cilia phenotype and OSM-3 localization in nekl-3 mutants.

      We thank the Reviewer for highlighting this critical point. Indeed, nekl-3 null mutants are inviable due to essential mitotic roles (Barstead et al., 2012, PMID: 23173093), precluding direct analysis of ciliary phenotypes. To bypass this limitation, we recently generated nekl-3 conditional knockouts (cKOs) in ciliated neurons (Huang et al., JCB, 2025 in press, attached). In these mutants, OSM-3—which is normally enriched in the ciliary distal segment—becomes uniformly distributed along the cilium. This redistribution correlates with premature activation of OSM-3-driven anterograde motility in the ciliary middle region, consistent with our proposed model where NEKL3 phosphorylation suppresses OSM-3 activity. We have now integrated this result and discussion into the revised manuscript, reinforcing the physiological relevance of NEKL-3-mediated regulation in ciliary transport. 

      Page #6 line #10

      “… While nekl-3 null mutants are inviable due to essential mitotic roles (Barstead et al., 2012, PMID: 23173093), conditional knockout (cKO) of nekl-3 in ciliated neurons (Huang et al., JCB, 2025 in press, attached) revealed its critical role in regulating OSM3 dynamics. In nekl-3 cKO animals, OSM-3—normally enriched in the ciliary distal segment—redistributed uniformly along the cilium, concomitant with premature activation of anterograde motility in the middle ciliary region. This phenotype aligns with our model wherein NEKL-3 phosphorylation suppresses OSM-3 activity, ensuring spatiotemporal regulation of IFT.…”

      (4) The methods section lacks some information, which is critical to reproducing this study.

      We have now provided detailed information in the methods section in the revised manuscript.

      (a) It is not described how the authors determined phosphorylation of OSM-3 by NEKL-3. In methods, nothing is described about the assay.

      We performed in vitro phosphorylation assays using recombinant OSM-3 and NEKL3 purified from bacteria. We then used LC-MS/MS for identification of phosphorylation sites. We have now updated the methods section to include all the information.

      Page #4 line #26

      “... To determine whether NIMA kinase family members could directly phosphorylate OSM-3, we purified prokaryotic recombinant C. elegans NEKL-3/NEKL-4 and OSM3 protein in order to perform in vitro phosphorylation assays. We were able to obtain active recombinant NEKL-3 but not NEKL-4. The in vitro phosphorylation assays showed that NEKL-3, directly phosphorylates OSM-3 (Fig. 1A-B, Appendix Table S1). Subsequent mass spectrometric analysis revealed phosphorylation at residues 487-490, which localize to the conserved "YSTT" motif within OSM-3’s C-terminal tail region...”

      Page #36, line #19

      “In vitro phosphorylation assay 20 μM purified OSM-3 was incubated with 1 μM GST-NEKL-3 at 30 °C in 100 μL reaction buffer (50 mM Tris-HCl pH 8.0, 10 mM MgCl2, 150 mM NaCl, and 2 mM ATP) for 30 min. The reaction was terminated by boiling for 5 min with an SDS-sample buffer.

      Mass spectrometry

      Following NEKL-3 treatment, OSM-3 proteins were resolved by SDS-PAGE and visualized with Coomassie Brilliant Blue staining. Protein bands corresponding to OSM-3 were excised and subjected to digestion using the following protocol: reduction with 5 mM TCEP at 56°C for 30 min; alkylation with 10 mM iodoacetamide in darkness for 45 min at room temperature, and tryptic digestion at 37°C overnight with a 1:20 enzyme-to-protein ratio. The resulting peptides were subjected to mass spectrometry analysis. Briefly, the peptides were analyzed using an UltiMate 3000 RSLCnano system coupled to an Orbitrap Fusion Lumos mass spectrometer (Thermo Fisher Scientific). We applied an in-house proteome discovery searching algorithm to search the MS/MS data against the C. elegans database. Phosphorylation sites were determined using PhosphoRS algorithm with manual validation of MS/MS spectra.”

      (b) The method of structural prediction by Alfafold2 and LocalColabFold needs clarification. In general, the prediction gives several candidates. How did the authors choose one of these candidates?

      We generated five candidate models and all of them showed similar conformation. We thus chose the model with the highest confidence. We have provided PAE and pLDDT as additional data in Figure S2 and discussed them in the revised text on, Page #4, line #32: 

      “...To gain structural insights from this motif, we employed LocalColabFold based on AlphaFold2 to predict the dimeric structure of OSM-3 (Evans et al., 2022; Jumper et al., 2021; Mirdita et al., 2022). The highest-confidence model was selected for further analysis (Fig. 1C, Fig. S2)...”

      (c) The methods to predict conformational changes by introducing various point mutations are interesting (Figure 3). However, the methods require more detailed descriptions. In the current form, the manuscript only lists the tools used. The pipelines and parameters need to be described. This information is important because AlphaFoldbased predictions often give folded conformations because the training data are mainly composed of folded proteins. It is surprising that the methods applied here give open conformations induced by point mutations.

      We have described the pipelines in the revised Methods section on page#34, line#25: 

      “…OSM-3 model was predicted using LocalColabFold (Evans et al., 2022; Jumper et al., 2021; Mirdita et al., 2022). Mutated proteins were designed by Pymol 2.6, choosing the rotamer of the mutated residues in G444E, PM and PD models with the least clash as the initial conformation. To predict mutation-induced conformational changes, the initial models were subjected to Pyrosetta (Chaudhury et al., 2010). The energies of pre-relaxed models were evaluated with Rosetta Energy Function 2015 (Alford et al., 2017), and then the relax procedure were applied to the models with default parameters to obtain the relaxed models visualized by Pymol to minimize the energy of these models. In detail, to obtain the relaxed models visualized by Pymol and minimize the energy of these models, the classic relax mover was used in the procedure mentioned above with default settings. The relax script has been uploaded to Github: https://github.com/young55775/RosettaRelax_for_OSM3...”

      (5) The authors have purified proteins. Do they show different properties in gel filtration that are consistent with the structural prediction? It is anticipated that open-form mutants are eluted from earlier than closed forms.

      We thank the reviewer for this insightful suggestion. Indeed, our recent study supported that the open-from of the active OSM-3 G444E mutation were eluted earlier than the wild-type closed form (Xie et al., EMBO J., 2024). While the current study did not perform gel filtration chromatography (SEC) to directly compare the hydrodynamic properties of the OSM-3 mutants, our functional assays provide robust evidence for conformational changes predicted by structural modeling. For example: ATPase activity assays revealed that the open-state mutants (e.g., G444E and PD muatnts) exhibited significantly enhanced enzymatic activity (Figure 4A), consistent with structural predictions of an active, destabilized autoinhibitory interface (Figure 3A). These functional readouts collectively validate the predicted structural states. While SEC could further corroborate these findings by distinguishing compact (closed) versus extended (open) conformations, we prioritized assays that directly link structural predictions to in vitro enzymatic activity and in vivo ciliary transport dynamics. Future studies incorporating SEC or cryo-EM will provide additional biophysical validation of these states.

      We have revised the text in the manuscript (Page #7, Lines #22): 

      “…Notably, the open-state OSM-3 mutants (e.g., G444E) displayed elevated ATPase activity, consistent with structural predictions of autoinhibition release (Fig. 3A, Fig. 4A) (Xie et al., 2024). While hydrodynamic profiling (e.g., SEC) could further resolve conformational states, our functional assays directly connect predicted structural changes to altered biochemical and cellular activity...”

      Minor point

      (1) Line 85 "MIMA kinase family" should be "NIMA kinase family".

      We have corrected the typo and appreciate that the Reviewer for pointing it out. 

      (2) M.S. and D.S. need to be defined in Figure 2D.

      We have updated the figures.

    1. Note: This response was posted by the corresponding author to Review Commons. The content has not been altered except for formatting.

      Learn more at Review Commons


      Reply to the reviewers

      Manuscript number: RC-2025-02887

      Corresponding author(s): Philippe Bastin

      1. General Statements [optional]

      • *

      We thank the reviewers for their constructive suggestions. We are delighted to see that they appreciated our work and its interest for the broad cell biology community, as well as the potential impact of the inducible expression of tagged tubulin as a new tool to investigate microtubule assembly at large.

      We are now providing a full revision that contains two major modifications and that addresses all the minor points detailed below. The two major modifications are:

      • A simplification and a shortening of the text as requested by reviewers 1 and 3
      • The addition of a new experiment evaluating the role of the locking protein CEP164C to gain insight into the mechanism, as suggested by reviewers 1 and 2 Briefly, CEP164C is a protein localised to the transition fibres (structures that dock the basal body of the flagellum to the membrane) of only the old flagellum. Its depletion leads to an excessive elongation of the old flagellum and the production of a shorter new flagellum, suggesting competition between the two flagella for tubulin incorporation (Atkins et al., 2021). In the new figure 5, we have expressed tagged tubulin in the CEP164CRNAi cell line and formally demonstrated simultaneous incorporation in both flagella. Unexpectedly, the new flagellum incorporated more tubulin than the old one, suggesting a bias of tubulin targeting in favour of the new flagellum and the existence of additional contributors to the Grow-and-Lock model.

      2. Point-by-point description of the revisions

      This section is mandatory. *Please insert a point-by-point reply describing the revisions that were already carried out and included in the transferred manuscript. *

      • *

      Reviewer #1

      Evidence, reproducibility and clarity

      The manuscript by Daniel Abbühl on "A novel approach to tagging tubulin reveals MT assembly dynamics of the axoneme in Trypanosoma brucei" uses an innnovative approach to label tubulin, which allows the authors to unveil new mechanisms in flagellar length regulation.

      The manuscript is very nice and will be very interesting for the cell biology community and therefore should be accepted. In some parts it becames a bit complex with all the models and complex phrasing, I wonder whether the text could be simplified to be more appealing. I have a few minor comments:

      We agree that some of the explanations are lengthy and complex. We have simplified the explanations and hopefully made the models more accessible. Complexity comes from the fact that trypanosomes do not have a synchronized cell cycle.

      -From the model the authors show in Figure 8- there should be a way of pulsing the cells in G1 for a short amount of time -2 hours- and getting both flagella tips labelled. But the authors seem to require longer labelling to get that result. This should be better explained.

      We are not quite sure what is meant here with both flagella as in G1-phase, all cells are mono-flagellated. We do see mono-flagellated cells with a labelled tip after 2 hours, both with the HALO-tag or the Ty-1-tubulin system.

      In regard to bi-flagellate cells, we believe that incorporation in the OF happened at the beginning of G1-phase when the cell was mono-flagellated. If tubulin is present at that point, it will be incorporated at the tip. This cell then approaches the end of G1-phase and starts to initiate NF assembly. Since tagged tubulin is already present it will be incorporated along the whole length of the NF.

      A short induction of 2h would not suffice as it wouldn't cover the duration of the G1-phase and the initiation of a NF (duration of G1-phase is ~4h). We attempted to explain this in Fig. 4 and reworked the text to make this clearer.

      -Why do some cells not express the construct? Weren´t they all selected?

      We never managed to get a cell line where inducible expression is present in 100% of cells. Here, around 95% of cells were positive for Ty-1-tubulin after 24h of induction. Non-expression is not a phenomenon restricted to this tubulin cell line but also observed with other ectopically expressed proteins (e.g. Sunter et al. JCS 2015, Bastin et al. MCB 1999). All these cell lines represent clonal populations and are resistant to antibiotic treatment, however not all cells express the respective protein. For each experiment where we believed the number of expressing cells matter (for example the washout), we quantified in how many cells Ty-1-tubulin was present in the cell body microtubules.

      -"The linear regression line in Fig. 3C was corrected by subtracting 45 minutes from each timepoint due to the previously reported delay between addition of tetracycline and the expression of the respective protein". However, in the authors data the delay may amount to one hour (western analysis- S4). Shouldn´t they use their data.

      Indeed, the western blot shows expression after 1-hour, however we did not take a 45-minute timepoint, so we don't know if the protein was detectable at that time. In addition, IFA is more sensitive than western blot. We cannot say exactly when the average cell starts to express the induced protein.

      -Fig 3: To measure the timepoints of flagella growth, wouldn´t it be better to do it with NF that started to grow before induction, rather than starting to grow after induction, to be sure that the timing of incorporation is fully accounted for?

      We indeed did consider only NFs, which started to grow before induction, as suggested by the reviewer. In the revised version the description of the experiment can be found on page 9 line 22 - 28.

      -Although it is not the focus of the manuscript it would have been very interesting to use the CEP164C mutant to see whether it would change the dynamics of incorporation and fully test their model and discussion.

      This is a great suggestion, so we performed some experiments to address this issue. When CEP164C was knocked down before Ty-1-tubulin expression, integration is seen at the distal tip of both NF and OF. This is coherent with the idea of removal of the locking protein from the OF. However, lengths of the green segments in NF and OF do not have the same length (NF ~6 µm, OF ~2 µm), which indicates that CEP164C might not be the only protein involved in regulating flagellum length. A new figure explaining this experiment was added (Fig. 5, Fig. S6). We believe this data provides novel insight on the locking mechanism and strengthens the manuscript.

      -In some parts of the manuscript/supplemental material the authors say they insert the Ty-1- tag one aminoacid after the acetylated lysine- other parts they say two aminoacids after- this should be consistent.

      We thank the reviewer for spotting these mistakes, we have changed the text accordingly.

      -Fig. S1: 'Binding epitope of the TAT-1 antibody is highlighted in red'. There is no highlighting in red in this figure?

      This sentence was removed.

      -Fig. S2: Western blots are not very clear. What is the 'X' present in the C (first lane)? Weight of markers should be shown also in S4.

      Molecular weight markers have been added. X is an empty lane, we have now indicated this in the figure legend.

      -Fig 5: 'C: Frequency of bi-flagellated cells grouped by the different types of' The authors didn't finish the sentence.

      Previous Fig. 5 is now Fig. 6. Sentence has been completed. "Frequency of bi-flagellated cells grouped by different types of old flagella"

      -Fig. S7: The 'B' is missing in both picture and legend.

      This has been added


      Significance

      This study advances our knowledge of flagellar length regulation and maintenance. Moreover, the tools designed in this work will be very useful for the cell biology community in general.


      Reviewer #2

      Evidence, reproducibility and clarity

      Summary: The length of the old flagellum of Trypanosome is constant during G1 phase as well as during cell cycle progression when the new flagellum is assembled. The authors have previously proposed a "Grow and Lock" model for the flagellar length control in which no flagellar building blocks are incorporated. To test this hypothesis, the authors used a tagging strategy for alpha-tubulin and tracking its incorporation. The authors showed that the new flagellum incorporates new tubulins, as is expected. For the mature flagellum, tubulins are incorporated at the flagellar tip and only when the cells start to assemble the new flagellum. Thus, it shows that old flagellum is stable but not completely locked for the incorporation of tubulins.

      Major comments: The study is methodologically rigorous, integrating fluorescence microscopy, biochemical approaches, and proteomic analyses to validate the functionality of the tagged tubulin. The use of both inducible expression and endogenous protein tagging (HaloTag) strengthens the conclusions. This study has supported the "Grow-and-Lock" model" that the authors previously proposed. In addition, they have revealed that the stability of the old flagellum is temporally controlled.

      The data showed that brief incorporation of tubulins at the tip of the old flagellum occurs when the cells start to form the new flagellum. I thought the assembly of the new flagellum occurs during the cell division. However, in the abstract, it says that "The restriction is lifted briefly after the bi-flagellated cell has divided." Is my understanding wrong?

      We believe incorporation at the tip of the "OF" occurred after the cell has divided, when the OF daughter is mono-flagellated. It happens before this daughter cells starts assembling its new flagellum is formed. Of course, when looking at biflagellated cells, the NF as well as the tip of the OF will be green, but our data supports that incorporation happened in G1-phase and not during the biflagellated stage as the lock seals the OF before the NF emerges. To clarify on terminology: The bi-flagellate stage begins when basal bodies are duplicated, shortly after the beginning of S-phase and ends with cytokinesis. This means G1-phase and the mono-flagellated stage are nearly the same (Woodward and Gull, JCS1990) and occupy ~40% of the cell cycle.

      P12, "The cartoon in Fig. 5A illustrates the progression of the cells in scenario 2 (Fig. 4A) over the duration of one cell cycle (~9 hours)" I thought that one cell cycle should start with cell with only one flagellum, followed by assembly of a new flagellum during cell division, the cell then divides when the new flagellum is almost completely assembled. If my understanding is correct, perhaps the cartoon should be modified accordingly.

      Indeed, the cell cycle starts with a cell in G1-phase. Here, we have chosen the initiation of a NF assembly as our starting point because we focused the investigation on bi-flagellated cells. We have now illustrated the cell cycle (adapted from Woodward and Gull 1990) and when cells are biflagellated in Fig. 6A (revised version).

      Minor comments:

      1) Several references are not correctly formatted. P3: (Flavin and Slaughter, 1974) (Rosenbaum 1969). P10, (Sherwin et al., 1987)(Sheriff et al., 2014) 2) In several places there are no space between the number and the unit. For eample, P3, 9 - 24µm/h. 7, 1μg/m; P8, 50kDa; P9, 1M; 8-9h; P11, 2.9µm/h and etc. 3) P11, Flagella were extracted. I thought the cells were extracted.

      Thank you for pointing these out, we have changed these in the text.


      Significance

      Cilia and eukaryotic flagella are considered dynamic structures in which the flagellar components especially tubulins under constant turnovers even in steady state. This work demonstrates that in Trypanosome the stable old flagellum is temporally controlled for tubulin turnovers, suggesting a tight regulation of microtubule dynamics. Future elucidation of the regulatory mechanism will be more interesting. This work will be interesting to the field of cilia and microtubules. In addition, the new technique used for tracking tubulins will also be interesting.

      I am an expert on ciliary biology.

      Reviewer #3 (Evidence, reproducibility and clarity (Required)):

      Summary:

      This study seeks to investigate the mechanism by which the length of an eukaryotic cilium is set and maintained in a constant state. The flagellated protist Trypanosoma brucei serves as the study model and the authors take advantage of the genetic tools that allow precise modification and tagging of flagellar proteins and they build on prior knowledge about the well-characterised flagellar assembly cycle, which allows tracking the assembly of a new flagellum alongside an existing old one in the course of one cell cycle. The group of Bastin has previously reported a very interesting "Grow-and-Lock Model for the Control of Flagellum Length in Trypanosomes" and this current manuscript provides a test of this model, and a refinement. Key to this is an advance in technique, reported here, namely expression of an epitope tagged version of alpha tubulin. The epitope is inserted in an internal loop, which apparently for the first time provides a traceable tubulin that is reliably incorporated into the cytoskeleton (subpellicular array, spindle and cilium). Expressing an inducible version of this Ty-1-tubulin allows for a set of experiments that measure the place and timing of tubulin incorporation into cilia. The results are largely confirmatory of previous findings (incorporation exclusively into the new flagellum, at the distal end, linear growth rate that matches previous estimates). Examination of tubulin incorporation patterns then reveal additional information about the old flagellum: evidence from Ty-1-tubulin labelling, corroborated by incorporation patterns of another ciliary protein (RSP 4/6) suggest that the "lock" on the old flagellum is relieved for short periods after cell division, leading to a refined model presented in Figure 8.

      Major comments:

      This study provides an elegant test of the grow-and-lock model and the major conclusions are supported by the data. I have no major concerns.

      Minor comments:

      There are several minor points that could be addressed to make the manuscript easier to follow (and adding line numbers to the manuscript would help with reviewing).

      The introduction is quite long. Some of the well-established background information on the T. brucei cell cycle could be shortened. If the paper is intended for a broader audience, it would be valuable instead to cite studies that have succeeded in tagging tubulin and tracing its incorporation in other cilia. Could the Ty-1-tubulin approach be relevant more broadly or are simpler methods already established?

      The introduction has been shortened, we now also cite two published studies that tracked tubulin integration in Chlamydomonas and C. elegans respectively.

      On p.6 the rationale for endogenous tagging was to "reduce the risk of artifacts portentially due to untimely expression or unnatural protein levels". However most of the experiments were done with ectopically expressed inducible Ty-1-tubulin. For the experiments it is crucial to use an inducible system but the authors may wish to comment why the risk of artifacts was no longer a concern.

      The reasoning here was that in case the Ty-1-tubulin would not have been incorporated into MTs, we could have attributed it solely to the presence of the tag and no other factors, but this was not the case. This therefore allowed us to move to the inducible expression system.

      On p.7 / Fig S2A-B there appears to be a mistake in the presentation. Spindles are mentioned in the text - I can't see any in the figure. Fig S2A and B both show cytoskeletons, but the text suggests only B is about cytoskeletons. None of the blot shows BB2 staining of different cell fractions, contrary to statements in the text. The letter codes in the panel (T, C, D) don't match the codes in the legend (T, P, S).

      We thank the reviewer for spotting the mistakes. A panel with the spindle was added in Fig. S2. We did not stain fraction blots of the in-situ tagged cell lines with BB2. However, this was done with the inducible cell line and is shown in Fig. 1D. Letter code in the legend was adapted to match the figure.

      Figure 1. The evidence for incorporation into spindles is not strong. The structure indicated by the arrive could be a spindle but it's not very clear. There is a great example of a labelled spindle only in figure S5A. Here, at the start, it would be good to show a panel of cells in successive cell cycle stages (best, whole cells and cytoskeletons) to clearly show the structures that are labelled with Ty-1-tubulin.

      The current Fig. 1B (Fig. 1A before) depicts whole cells of an induced and a non-induced culture; we show whole cells to provide a complete picture of tubulin integration. A panel with detergent extracted cytoskeletons from the in situ tagged cell line has been added to Fig. 1A. We chose to show cytoskeletons or isolated flagella instead of whole cells because (1) the flagella are easier to see and (2) it formally demonstrates that tagged tubulin is incorporated in MTs.

      In general, tubulin labelling of the spindle was more consistently observed in whole cells as we did not use spindle preserving extraction buffers when preparing cytoskeletons. However, we did observe clear spindles in cytoskeletons as well (see Fig. S5 for example). The same was observed for the beta-tubulin specific KMX1 antibody in the past which is the gold standard to visualize the spindle (Sasse and Gull JCS1988). Regardless, a panel depicting spindle progression through mitosis using staining of Ty-1-tubulin has been added in Fig. S2 (The panel is a mix of whole cells and cytoskeletons).

      On p.8 (end of first paragraph) there is reference to cell cycle analyses, but no data is shown. Also on p.8, please clarify what the evidence is that "a fraction of cells did not respond to tetracycline". The fact that they remain unstained by Ty-1-tubulin is not in itself evidence they did not respond to tetracycline.

      We did not show the cell cycle data as it was similar to non-induced and does not provide any new information in our opinion. Hence, the sentence has been removed.

      The reviewer is correct that we do not have evidence that these cells did not respond to tetracycline. Some cells remained completely devoid of Ty-1-tubulin even after multiple days of induction. This was typically between 5-10% of cells. In experiments where the exact number is important, we counted the amount of "non-expressers" in whole cells.

      Figure S4A. The blot for the soluble fraction is not of great quality. I don't see how the conclusion was reached that the Ty-1-tubulin bands were faint.

      The blot of the soluble fraction that was stained with BB2 had to be exposed a lot longer compared to the blot stained with TAT-1. The soluble blots were repeated with the same result (lots of background noise when using BB2, a clear blot with TAT-1). In the TAT-1 blot only the endogenous tubulin band is clearly visible, with some very faint signal above corresponding to the Ty-1-tubulin. Soluble Ty-1-tubulin with BB2 or TAT-1 is visible in Fig. 1D after longer inductions.

      On p.11, it would be interesting to compare measured elongation rates with previously measured estimates for flagellum growth, comparing the growth rates, and relating them to cell cycle times in the corresponding experiments (which vary slightly between labs and studies).

      We attempted to address this in the discussion by comparing our experiments to the assembly rate measured with the PFR as reporter (Bastin et al. 1999). We could mention the corresponding doubling times in correlation to how many cells are bi-flagellated, but this was only done with the Ty-1-tubulin cell line and not with the PFR. In our experiments the average doubling time was ~9 hours with 52% of cells being bi-flagellated. This was measured with FTZC (marker of the transition zone at the base of the flagellum) and Mab25 (marker of the axoneme of the flagellum) which will lead to a slight underestimate of the real number of bi-flagellated cells, as the NF is initially very close which makes it difficult to notice/differentiate from the old one.

      Figure S6. I find the presentation of this figure confusing. It should be revised with clearer labelling of "cell cycle 1", "cell cycle 2", and the precise meaning of "type 3" should be clarified. There are two instances of "type 1" in the drawing, but one of these seems to fulfil the criteria of "type 3" (OF 1-4µm).

      We agree with the reviewer and therefore decided to remove this figure. We also considered the comments of the other two reviewers about complexity of the manuscript and changed the text of figure 5 to make it more approachable. This includes a simpler explanation for the expected amounts of flagella.

      Figure 7. In panel A, the absence of label at the NF distal end is not total, a purple line is still visible. Was any quantitation attempted (signal intensity, changes in length of labelled fragments over time?). Minimally, say how many cells were analysed for the numbers in panels D and E, and how many times this experiment was done.

      We agree with the reviewer that the decrease in the TMR signal in the NF of the cell in the original Fig. 7A (currently Fig. 8A) is gradual and not abrupt. Similarly to the Ty-1-tubulin experiments where the tagged protein becomes progressively more available (increasing intensity), the intensity of TMR-ligand becomes progressively less abundant (gradually decreasing intensity) as new (not TMR labelled) protein gets synthesized during the period of NF construction, progressively diluting the initially fully labeled population of RSP4/6. The slope of the gradient may differ between axonemal constituents, as it reflects the kinetics of protein synthesis, degradation, its incorporation into the axoneme, as well as the size of the soluble protein pool in the cytosol. We classify this type of signal as gradients, as opposed to the sharp decrease. At initial times after TMR-ligand washout (e.g. 4 hours in Fig. 8C), this long gradient is observed at the distal end of NFs and in some uniflagellated cells (NF-inheriting daughters). The distal ends of OFs in these experiments (if not fully labelled) display a sharp decrease, as do frequent uniflagellated cells, likely OF-inheriting daughters. The existence of these two different patterns demonstrates that two different mechanisms are responsible for incorporation of fresh RSP4/6 into the NF and OF axoneme, respectively. While incorporation into the NF is gradual, incorporation into the distal region of the OF is stepwise (restricted in time). Numbers of cells quantified for the table in Fig. 8 have been added. The NFs and OFs displaying the patterns of the gradient and sharp decrease, respectively, were observed in multiple experiments.

      Reviewer #3 (Significance (Required)):

      • General assessment: strengths and limitations

      Strengths: Trypanosoma brucei is a powerful model system in which to ask detailed questions about the assembly dynamics and hierarchy of microtubule-based cytoskeletal structures in general and cilia in particular. This elegant and well-designed study overcomes a previous technical limitation by allowing for the direct labelling of alpha tubulin, one of the main building blocks of the ciliary axoneme. The study sets out to test a specific hypothesis (grow-and-lock model) and provides evidence in support, leading to a refined model for cilia length regulation in trypanosomes.

      Limitations: With this system, visualisation of new tubulin incorporation requires de novo synthesis. There is a time lag between inducing expression of Ty-1-tubulin with tetracycline and being able to visualize the tagged proteins that needs to be taken into consideration. This time lag was estimated based on previous studies and the relatively quick appearance of Ty-1-tubulin on Western blots (within hours). This inevitably creates a situation where levels of tagged tubulin change rapidly, creating gradients of signal intensity (and variations in levels) that lead to some uncertainty in estimations of length of labelled microtubule fragments. Furhtermore, the epitope label is not compatible with live cell imaging, restricting analyses to fixed cells. The Ty-1-tubulin data is well ducmented; the RSP4/6 data appear to corroborate these findings but are less extensively documented.

      • Advance: The results succeed in integrating several recent findings from different research groups into a refined coherent model about cilia length regulation in trypanosomes. The tubulin tagging method could be gainfully transferred to other systems (although the state of the field in tubulin tagging in other systems is not clearly laid out in the paper).

      This paper could be of interest to a broad cell biology community interested in cilia and cytoskeletal dynamics.

    1. Reviewer #2 (Public review):

      Summary:

      The authors assemble 222 publicly available bone marrow single-cell RNA sequencing samples from healthy donors and primary AML, including pediatric, adolescent, and adult patients at diagnosis. Focusing on one specific subtype, t(8;21), which, despite affecting all age classes, is associated with better prognosis and drug response for younger patients, the authors investigate if this difference is reflected also in the transcriptomic signal. Specifically, they hypothesize that the pediatric and part of the young population acquires leukemic mutations in utero, which leads to a different leukemogenic transformation and ultimately to differently regulated leukemic stem cells with respect to the adult counterpart. The analysis in this work heavily relies on regulatory network inference and clustering (via SCENIC tools), which identifies regulatory modules believed to distinguish the pre-, respectively, post-natal leukemic transformation. Bulk RNA-seq and scATAC-seq datasets displaying the same signatures are subsequently used for extending the pool of putative signature-specific TFs and enhancer elements. Through gene set enrichment, ontology, and perturbation simulation, the authors aim to interpret the regulatory signatures and translate them into potential onset-specific therapeutic targets. The putative pre-natal signature is associated with increased chemosensitivity, RNA splicing, histone modification, stem-ness marker SMARCA2, and potentially maintained by EP300 and BCLAF1.

      Strengths:

      The main strength of this work is the compilation of a pediatric AML atlas using the efficient Cellxgene interface. Also, the idea of identifying markers for different disease onsets, interpreting them from a developmental angle, and connecting this to the different therapy and relapse observations, is interesting. The results obtained, the set of putative up-regulated TFs, are biologically coherent with the mechanisms and the conclusions drawn. I also appreciate that the analysis code was made available and is well documented.

      Weaknesses:

      There were fundamental flaws in how methods and samples were applied, a general lack of critical examination of both the results and the appropriateness of the methods for the data at hand, and in how results were presented. In particular:

      (1) Cell type annotation:

      a) The 2-phase cell type annotation process employed for the scRNA-seq sample collection raised concerns. Initially annotated cells are re-labeled after a second round with the same cell types from the initial label pool (Figure 1E). The automatic annotation tools were used without specifying the database and tissue atlases used as a reference, and no information was shown regarding the consensus across these tools.

      b) Expression of the CD34 marker is only reported as a selection method for HSPCs, which is not in line with common practice. The use of only is admitted as a surface marker, while robust annotation of HSPCs should be done on the basis of expression of gene sets.

      c) During several analyses, the cell types used were either not well defined or contradictory, such as in Figure 2D, where it is not clear if pySCENIC and AUC scores were computed on HSPCs alone or merged with CMPs. In other cases, different cell type populations are compared and used interchangeably: comparing the HSPC-derived regulons with bulk (probably not enriched for CD34+ cells) RNA samples could be an issue if there are no valid assumptions on the cell composition of the bulk sample.

      (2) Method selection:

      a) The authors should explain why they use pySCENIC and not any other approach. They should briefly explain how pySCENIC works and what they get out in the main text. In addition they should explain the AUCell algorithm and motivate its usage.

      b) The obtained GRN signatures were not critically challenged on an external dataset. Therefore, the evidence that supports these signatures to be reliable and significant to the investigated setting is weak.

      (3) There are some issues with the analysis & visualization of the data.

      (4) Discussion:

      a) What exactly is the 'regulon signature' that the authors infer? How can it be useful for insights into disease mechanisms?

      b) The authors write 'Together this indicates that EP300 inhibition may be particularly effective in t(8;21) AML, and that BCLAF1 may present a new therapeutic target for t(8;21) AML, particularly in children with inferred pre-natal origin of the driver translocation.' I am missing a critical discussion of what is needed to further test the two targets. Put differently: Would the authors take the risk of a clinical study given the evidence from their analysis?

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      In this manuscript, the authors performed an integration of 48 scRNA-seq public datasets and created a single-cell transcriptomic atlas for AML (222 samples comprising 748,679 cells). This is important since most AML scRNA-seq studies suffer from small sample size coupled with high heterogeneity. They used this atlas to further dissect AML with t(8;21) (AML-ETO/RUNX1-RUNX1T1), which is one of the most frequent AML subtypes in young people. In particular, they were able to predict Gene Regulatory Networks in this AML subtype using pySCENIC, which identified the paediatric regulon defined by a distinct group of hematopoietic transcription factors (TFs) and the adult regulon for t(8;21). They further validated this in bulk RNA-seq with AUCell algorithm and inferred prenatal signature to 5 key TFs (KDM5A, REST, BCLAF1, YY1, and RAD21), and the postnatal signature to 9 TFs (ENO1, TFDP1, MYBL2, KLF1, TAGLN2, KLF2, IRF7, SPI1, and YXB1). They also used SCENIC+ to identify enhancer-driven regulons (eRegulons), forming an eGRN, and found that prenatal origin shows a specific HSC eRegulon profile, while a postnatal origin shows a GMP profile. They also did an in silico perturbation and found AP-1 complex (JUN, ATF4, FOSL2), P300, and BCLAF1 as important TFs to induce differentiation. Overall, I found this study very important in creating a comprehensive resource for AML research.

      Strengths:

      (1) The generation of an AML atlas integrating multiple datasets with almost 750K cells will further support the community working on AML.

      (2) Characterisation of t(8;21) AML proposes new interesting leads.

      We thank the reviewer for a succinct summary of our work and highlighting its strengths.

      Weaknesses:

      Were these t(8;21) TFs/regulons identified from any of the single datasets? For example, if the authors apply pySCENIC to any dataset, would they find the same TFs, or is it the increase in the number of cells that allows identification of these?

      The purpose of our study was to gain biological insights by integrating multiple datasets, to overcome limitations from small sample size. We expect that the larger dataset would improve network inference, which is what we implemented in the manuscript, hence we have not looked at individual datasets. However, we will investigate this further in the revised manuscript by running pySCENIC on individual datasets and comparing to the results drawn from the whole atlas.

      Reviewer #2 (Public review):

      Summary:

      The authors assemble 222 publicly available bone marrow single-cell RNA sequencing samples from healthy donors and primary AML, including pediatric, adolescent, and adult patients at diagnosis. Focusing on one specific subtype, t(8;21), which, despite affecting all age classes, is associated with better prognosis and drug response for younger patients, the authors investigate if this difference is reflected also in the transcriptomic signal. Specifically, they hypothesize that the pediatric and part of the young population acquires leukemic mutations in utero, which leads to a different leukemogenic transformation and ultimately to differently regulated leukemic stem cells with respect to the adult counterpart. The analysis in this work heavily relies on regulatory network inference and clustering (via SCENIC tools), which identifies regulatory modules believed to distinguish the pre-, respectively, post-natal leukemic transformation. Bulk RNA-seq and scATAC-seq datasets displaying the same signatures are subsequently used for extending the pool of putative signature-specific TFs and enhancer elements. Through gene set enrichment, ontology, and perturbation simulation, the authors aim to interpret the regulatory signatures and translate them into potential onset-specific therapeutic targets. The putative pre-natal signature is associated with increased chemosensitivity, RNA splicing, histone modification, stem-ness marker SMARCA2, and potentially maintained by EP300 and BCLAF1.

      Strengths:

      The main strength of this work is the compilation of a pediatric AML atlas using the efficient Cellxgene interface. Also, the idea of identifying markers for different disease onsets, interpreting them from a developmental angle, and connecting this to the different therapy and relapse observations, is interesting. The results obtained, the set of putative up-regulated TFs, are biologically coherent with the mechanisms and the conclusions drawn. I also appreciate that the analysis code was made available and is well documented.

      We thank the reviewer for reviewing our work, and highlighting its key features, including creation of AML atlas, downstream analysis and interpretation for t(8;21) subtype.

      We also appreciate useful critique of our paper provided below.

      Weaknesses:

      There were fundamental flaws in how methods and samples were applied, a general lack of critical examination of both the results and the appropriateness of the methods for the data at hand, and in how results were presented. In particular:

      (1) Cell type annotation:

      a) The 2-phase cell type annotation process employed for the scRNA-seq sample collection raised concerns. Initially annotated cells are re-labeled after a second round with the same cell types from the initial label pool (Figure 1E). The automatic annotation tools were used without specifying the database and tissue atlases used as a reference, and no information was shown regarding the consensus across these tools.

      We believe that most of the reviewer’s criticisms stem from a misunderstanding, and we apologize for not explaining certain aspects of our work more clearly.

      The two types of cell type annotation applied were different and served distinct purposes:

      • One was using general bone marrow/blood reference datasets to annotate blood subtype lineage clusters.

      • The other was using a CD34 purified AML specific reference dataset which included leukaemia-associated annotations, to identify HSPC subpopulations. We also implemented this on a single-cell level to allow more robust identification of these rare populations in a large dataset.

      This is probably not well explained in the methods and figure presentation. We will clearly indicate in the revised manuscript that different HSPC annotations represent separate analysis and will update the figures to highlight this. We will provide a comprehensive review of the annotation strategies implemented, including the automated tool outputs, which may be useful for the single-cell community.

      b) Expression of the CD34 marker is only reported as a selection method for HSPCs, which is not in line with common practice. The use of only is admitted as a surface marker, while robust annotation of HSPCs should be done on the basis of expression of gene sets.

      We used CD34 expression in conjunction with other cell type annotations and marker sets to identify LSCs, although results are same when we use HSPC annotated cells without condition on CD34 expression.  In the revised manuscript, we will simplify this analysis to use HSPC clusters as suggested by the reviewer.

      c) During several analyses, the cell types used were either not well defined or contradictory, such as in Figure 2D, where it is not clear if pySCENIC and AUC scores were computed on HSPCs alone or merged with CMPs. In other cases, different cell type populations are compared and used interchangeably: comparing the HSPC-derived regulons with bulk (probably not enriched for CD34+ cells) RNA samples could be an issue if there are no valid assumptions on the cell composition of the bulk sample.

      As mentioned in the Methods, we only excluded lymphoid cell types from the pySCENIC analysis to overcome the bias that some samples were enriched using CD34 selection when preparing them for scRNA-seq. We will make this clearer in the text and figures of the revised manuscript. It is difficult to overcome this bias when using bulk RNA samples, which may explain why some of our samples do not fit into our defined signature groups. However, as we do not have access to primary samples ourselves, we cannot provide a better matched experimental cohort for validation.

      (2) Method selection:

      a) The authors should explain why they use pySCENIC and not any other approach. They should briefly explain how pySCENIC works and what they get out in the main text. In addition they should explain the AUCell algorithm and motivate its usage.

      pySCENIC is state-of-the-art method for network inference from scRNA data and is widely used within the single-cell community (over 5000 citations for both versions of the SCENIC pipeline). The pipeline has been benchmarked as one of the top performers for GRN analysis (Nguyen et al, 2021. Briefings in Bioinformatics). AUCELL is a module within the pySCENIC pipeline to summarise the activity of a set of genes (a regulon) into a single number which helps compare and visualise different regulons. We agree with reviewer that this could have been more clearly explained within the manuscript. We will update text in the revised manuscript to add more explanation.

      b) The obtained GRN signatures were not critically challenged on an external dataset. Therefore, the evidence that supports these signatures to be reliable and significant to the investigated setting is weak.

      These signatures were inferred from the best suitable AML single-cell RNA datasets available to date, and we used two independent datasets to validate our findings (the TARGET AML bulk RNA sequencing cohort, and the Lambo et al. scRNA-seq dataset). To our knowledge, there are no other better suited datasets for validation. Experimental validations on patient samples are beyond the scope of this study.

      (3) There are some issues with the analysis & visualization of the data.

      We will provide new statistical tests to improve robustness of the analysis as well as presentation and visualization of the data in the revised manuscript.

      (4) Discussion:

      a) What exactly is the 'regulon signature' that the authors infer? How can it be useful for insights into disease mechanisms?

      The ’regulon signature’ here refers to a gene regulatory program (multiple gene modules, each defined by a transcription factor and its targets) which are specific to different age groups. Further investigation into this can be useful for understanding why patients of different ages confer a different clinical course. We will add more text on the utility of our discovered 'regulon signature' in the discussion section of revised manuscript.

      b) The authors write 'Together this indicates that EP300 inhibition may be particularly effective in t(8;21) AML, and that BCLAF1 may present a new therapeutic target for t(8;21) AML, particularly in children with inferred pre-natal origin of the driver translocation.' I am missing a critical discussion of what is needed to further test the two targets. Put differently: Would the authors take the risk of a clinical study given the evidence from their analysis?

      Of course, many extensive studies would be required before these findings are clinically translatable. We can include some perspectives on what further work is required in terms of further experimental validation and potential subsequent clinical study.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Recommendations for the authors):

      We appreciate the reviewers' thoughtful comments and suggestions. Below, we provide point-by-point responses to the recommendations and outline the updates made to the manuscript.

      (1) Discussion, "the obvious experiment is to manipulate a neuron's anatomical embedding while leaving stimulus information intact."] The epiphenomenon can arise from the placement and types of a neuron's neurotransmitters and neuromodulators, too.

      The content of vesicles released by a neuron is obviously of great importance in determining postsynaptic impact. However, we’re suggesting that (assuming vesicular content is held constant) the anatomically-relevant patterning of spiking might additionally affect the postsynaptic neuron’s integration of the presynaptic input. To avoid confusion, we updated the text accordingly: “the obvious experiment is to manipulate a neuron's anatomical embedding while minimally impacting external and internal variables, such as stimulus information and levels of neurotransmitters or neuromodulators” (Line 594 - 596).

      (2) “In all conditions, the slope of the input duration versus sensitivity line was still positive at 1,800 seconds (Fig. 3B)". This may suggest that the estimate of the calculated statistics (ISI, PSTH) is more reliable with more data, rather than (or in addition to) specific information being extracted from faraway time points. Another potential confound is the training statistics were calculated from all training data, so the test data is a better match to training data when test statistics are calculated from more data. Overall, the validity of the conclusions following this observation is not clear to me.

      This is a great point. Accordingly, we revised the text to include this possibility: “Because the training data were of similar duration, this could be explained by either of two possibilities. First, the signal is relatively short, but noisy—in this case, extended sampling will increase reliability. Second, the anatomical signal is, itself, distributed over time scales of tens to hundreds of seconds.” (Line 252 - 255).

      (3) "This further suggests that there is a latent neural code for anatomical location embedded within the spike train, a feature that could be practically applied to determining the brain region of a recording electrode without the need for post-hoc histology". The performance of the model at the subregion level, which is a typical level of desired precision in locating cells, does not seem to support such a practical application. Please clarify to avoid confusion.

      The current model should not be considered a replacement for traditional methods, such as histology. Our intention is to convey that, with the inclusion of multimodal data and additional samples, a computational approach to anatomical localization has great promise. We updated the manuscript to clarify this point: “While significantly above chance, the structure-level model still lacks the accuracy for immediate practical application. However, it is highly likely that the incorporation of datasets with diverse multi-modal features and alternative regions from other research groups will increase the accuracy of such a model. In addition, a computational approach can be combined with other methods of anatomical reconstruction.” (Line 355 - 359).

      Additionally, we directly addressed this point in our original manuscript (Discussion section: Line 498 - 505 in the current version). Furthermore, following the release of our preprint, independent efforts have adopted a multimodal strategy with qualitatively similar results (Yu et al., 2024). Other recent work expands on the idea of utilizing single-neuron features for brain region/structure characterization (La Merre et al., 2024).

      Yu, H., Lyu, H., Xu, E. Y., Windolf, C., Lee, E. K., Yang, F., ... & Hurwitz, C. (2024). In vivo cell-type and brain region classification via multimodal contrastive learning. bioRxiv, 2024-11.

      Le Merre, P., Heining, K., Slashcheva, M., Jung, F., Moysiadou, E., Guyon, N., ... & Carlén, M. (2024). A Prefrontal Cortex Map based on Single Neuron Activity. bioRxiv, 2024-11.

      (4) "These results support the notion the meaningful computational division in murine visuocortical regions is at the level of VISp versus secondary areas.". The use of the word "meaningful" is vague and this conclusion is not well justified because it is possible that subregions serve different functional roles without having different spiking statistics.

      Precisely! It is well established that different subregions serve different functional purposes - but they do not necessitate different regional embeddings. It is important to note the difference between stimulus encoding and the embedding that we are describing. As a rough analogy, the regional embedding might be considered a language, while the stimulus is the content of the spoken words. However, to avoid vague words, we revised the sentence to “These results suggest that the computational differentiability of murine visuocortical regions is at the level of VISp versus secondary areas.” (Line 380 - 381)

      (5) Figure 3D left/right halves look similar. A measure of the effect size needs to accompany these p-values.

      We assume the reviewer is referring to Figure 3E. Although some of the violin plots in Figure 3E look similar, they are not identical. In the revision, we include effect sizes in the caption.

      (6) Figure 3A, 3F: Could uncertainty estimates be provided?

      Yes. We added uncertainty estimates to the text (Line 272 - 294) and to the caption of Figure S2, which displays confusion matrices corresponding to Figure 3A. The inclusion of similar estimates for 3F would be so unwieldy as to be a disservice to the reader—there are 240 unique combinations of stimulus parameters and structures. In the context of the larger figure, 3F serves to illustrate a relationship between stimulus, region, and the anatomical embedding.

      (7) Page 21. "semi-orthogonal". Please reword or explain if this usage is technical.

      We replaced “semi-orthogonal” with “dissociable” (Line 549).

      (8) Page 11, "This approach tested whether..."] Unclear sentence. Please reword.

      We changed “This approach tested whether the MLP’s performance depended on viewing the entire ISI distribution or was enriched in a subset of patterns” to “This approach identified regions of the ISI distribution informative for classification” (Line 261).

      Reviewer #2 (Recommendations for the authors):

      We appreciate the reviewer’s comments and summary of the results. We agree that the introductory results (Figs. 1-3) are not particularly compelling when considered in isolation. They provide a baseline of comparison for the subsequent results. Our intention was to approach the problem systematically, progressing from well-established, basic methods to more advanced approaches. This allows us to clearly test a baseline and avoid analytical leaps or untested assumptions. Specifically:

      ● Figure 1 provides an evaluation of the standard dimensionality reduction methods. As expected, these methods yield minimal results, serving as a clear baseline. This is consistent, for example, with an understanding of single units as rate-varying Poisson processes.

      ● Figures 2 and 3 then build upon these results with spiking features frequent in neuroscience literature such as firing rate, coefficient of variation, etc using linear supervised and more detailed spiking features such as ISI distribution using nonlinear supervised machine learning methods.

      By starting from the standpoint of the status quo, we are better able to contextualize the significance of our later findings in Figures 4–6.

      Response to Specific Points in the Summary

      (6) Separability of VISp vs. Secondary Visual Areas

      I found the entire argument about visual areas somewhat messy and unclear. The stimuli used might not drive the secondary visual areas particularly well and might necessitate task engagement.

      We appreciate your feedback that the dissection of visual cortical structures is unclear. To summarize, as shown in the bottom three rows of Figure 6, there is a notable lack of diagonality in visuocortical structures. This means that our model was unable to learn signatures to reliably predict these classes. In contrast, visuocortical layer is returned well above chance, and superstructures (primary and secondary areas) are moderately well identified, albeit still well above chance.

      Consider a thought experiment, if Charlie Gross had not shown faces to monkeys to find IT, or Newsome and others shown motion to find MT and Zeki and others color stimuli to find V4, we would conclude that there are no differences.

      The thought experiment is misleading. The results specifically do not arise from stimulus selectivity—much of Newsome’s own work suggests that the selectivity of neurons in IT etc. is explained by little more than rate varying Poisson processes. In this case, there should be no fundamental anatomical difference in the “language” of the neurons in V4 and IT, only a difference in the inputs driving those neurons. In contrast, our work suggests that the “language” of neurons varies as a function of some anatomical divisions. In other words, in contrast to a Poisson rate code, our results predict that single neuron spike patterns might be remarkably different in MT and IT— and that this is not a function of stimulus selectivity. Notably, the anatomical (and functional) division between V1 and secondary visual areas does not appear to manifest in a different “language”, thus constituting an interesting result in and of itself.

      We regret a failure to communicate this in a tight and compelling fashion on the first submission, but hope that the revision is limpid and accessible.

      Barberini, C. L., Horwitz, G. D., & Newsome, W. T. (2001). A comparison of spiking statistics in motion sensing neurones of flies and monkeys. Motion Vision: Computational, Neural, and Ecological Constraints, 307-320.

      Bair, W., Zohary, E., & Newsome, W. T. (2001). Correlated firing in macaque visual area MT: time scales and relationship to behavior. Journal of Neuroscience, 21(5), 1676-1697.

      Similarly, why would drifting gratings be a good example of a stimulus for the hippocampus, an area thought to be involved in memory/place fields?

      The results suggest that anatomical “language” is not tied to stimuli. It is imperative to recall that neurons are highly active absent experimentally imposed stimuli, such as when an animal is at rest, when an animal is asleep, and when an animal is in the dark (relevant to visual cortices). With this in mind, also recall that, despite the lack of stimuli tailored to the hippocampus, neurons therein were still reliably separable from neurons in seven nuclei in the thalamus, 6 of which are not classically considered visual regions. Should these regions (including hippocampus) have been inert during the presentation of visual stimuli, there would have been very little separability.

      (7) Generalization across laboratories

      “[C]omparison across laboratories was somewhat underwhelming. It does okay but none of the results are particularly compelling in terms of performance.

      Any result above chance is a rejection of the null hypothesis: that a model trained on a set of animals in Laboratory A will be ineffective in identifying brain regions when tested on recordings collected in Laboratory B (in different animals and under different experimental conditions). As an existence proof, the results suggest conserved principles (however modest) that constrain neuronal activity as a function of anatomy. That models fail to achieve high accuracy (in this context) is not surprising (given the limitations of available recordings)---that models achieve anything above chance, however, is.

      Thus, after reading the paper many times, I think part of the problem is that the study is not cohesive, and the authors need to either come up with a tool or demonstrate a scientific finding.

      We demonstrate that neuronal spike trains carry robust anatomical information. We developed an ML architecture for this and that architecture is publicly available.

      They try to split the middle and I am left somewhat perplexed about what exact scientific problem they or other researchers are solving.

      We humbly suggest that the question of a neurons “language” is highly important and central to an understanding of how brains work. From a computational perspective, there is no reason for a vast diversity of cell types, nor a differentiation of the rules that dictate neuronal activity in one region versus another. A Turing Complete system can be trivially constructed from a small number of simple components, such as an excitatory and inhibitory cell type. This is the basis of many machine learning tools.

      Please do not confuse stimulus specificity with the concept of a neuron’s language. Neurons in VISp might fire more in response to light, while those in auditory cortex respond to sound. This does not mean that these neurons are different - only that their inputs are. Given the lack of a literature describing our main effect—that single neuron spiking carries information about anatomical location—it is difficult to conclude that our results are either commonplace or to be expected.

      I am also unsure why the authors think some of these results are particularly important.

      See above.

      For instance, has anyone ever argued that brain areas do not have different spike patterns?

      Yes. In effect, by two avenues. The first is a lack of any argument otherwise (please do not conflate spike patterns with stimulus tuning), and the second is the preponderance of, e.g., rate codes across many functionally distinct regions and circuits.

      Is that not the premise for all systems neuroscience?

      No. The premise for all systems neuroscience (from our perspective) is that the brain is a) a collection of interacting neurons and b) the collective system of neurons gives rise to behavior, cognition, sensation, and perception. As stated above, these axiomatic first principles fundamentally do not require that neurons, as individual entities, obey different rules in different parts of the brain.

      I could see how one could argue no one has said ISIs matter but the premise that the areas are different is a fundamental part of neuroscience.

      Based on logic and the literature, we fundamentally disagree. Consider: while systems neuroscience operates on the principle that brain regions have specialized functions, there is no a priori reason to assume that these functions must be reflected in different underlying computational rules. The simplest explanation is that a single language of spiking exists across regions, with functional differences arising from processing distinct inputs rather than fundamentally different spiking rules. For example, an identical spike train in the amygdala and Layer 5 of M1 would have profoundly different functional impacts, yet the spike timing itself could be identical (even as stimulus response). Until now, evidence for region-specific spiking patterns has been lacking, and our work attempts to begin addressing this gap. There is extensive further work to be conducted in this space, and it is certain that models will improve, rules will be clarified, and mechanisms will be identified.

      Detailed major comments

      (1) Exploratory trends in spiking by region and structure across the population:

      The argument in this section is that unsupervised analyses might reveal subtle trends in the organization of spiking patterns by area. The authors show 4 plots from t-SNE and claim to see subtle organization. I have concerns. For Figure 1C, it is nearly impossible to see if a significant structure exists that differentiates regions and structures. So this leads certain readers to conclude that the authors are looking at the artifactual structure (see Chari et al. 2024) - likely to contribute to large Twitter battles. Contributing to this issue is that the hyperparameter for tSNE was incorrectly chosen. I do think that a different perplexity should be used for the visualization in order to better show the underlying structure; the current visualization just looks like a single "blob". The UMAP visualizations in the supplement make this point more clearly. I also think the authors should include a better plot with appropriate perplexity or not include this at all. The color map of subtle shades of green and yellow is hard to see as well in both Figure S1 and Figure 1.

      In response to the feedback, we replaced t-SNE/UMAP with LDA, while keeping PCA for dimensionality reduction.

      As stated in the original methods, t-SNE/UMAP hyperparameters were chosen based on the combination that led to the greatest classifiable separability of the regions/structures in the space (across a broad range of possible combinations). It just so happens that the maximally separable structure from a regions/structures perspective is the “blob”. This suggests that perhaps the predominant structure the t-SNE finds in the data is not driven by anatomy. If we selected hyperparameters in some other way that was not based specifically on regions/structures (e.g. simple visual inspection of the plots) the conformation would of course be different and not blob-like. However, we removed the t-SNE and UMAP to avoid further confusion.

      The “muddy appearance” is not an issue with the color map. As seen in Figure 1B, the chosen colors are visibly distinct. Figure 1C (previous version) appeared muddy yellow/green because of points that overlap with transparency, resulting in a mix of clearly defined classes (e.g., a yellow point on top of a blue point creating green). This overlap is a meaningful representation of the separability observed in this analysis. We also tried using 2D KDE for visualization, but it did not improve the impression of visual separability.

      We are removing p-values from the figures because they lead to the impression that we over-interpret these results quantitatively. However, we calculated p-values based on label permutation similar to the way R2 suggests (see previous methods). The conflation with the Wasserstein distances is an understandable misunderstanding. These are unrelated to p-values and used for the heatmaps in S1 only (see previous methods).

      Instead of p-values, we now use the adjusted rand index, which measures how accurately neurons within the same region are clustered together (see Line 670 - 671, Figure 1C, and Figure S1) (Hubert & Arabie 1985). This quantifies the extent to which the distribution of points in dimensionally-reduced space is shaped by region/structure.

      Hubert, L., & Arabie, P. (1985). Comparing partitions. Journal of Classification, 2(1), 193–218. https://doi.org/10.1007/BF01908075

      (2) Logistic classifiers:

      The results in this section are somewhat underwhelming. Accuracy is around 40% and yes above chance but I would be very surprised if someone is worried about separating visual structures from the thalamus. Such coarse brain targeting is not difficult. If the authors want to include this data, I recommend they show it as a control in the ISI distribution section. The entire argument here is that perhaps one should not use derived metrics and a nonlinear classifier on more data is better, which is essentially the thrust of the next section.

      As outlined above, our work systematically increases in model complexity. The logistic result is an intermediate model, and it returns intermediate results. This is an important stepping stone between the lack of a result based on unsupervised linear dimensionality reduction and the performance of supervised nonlinear models.

      From a purely utilitarian perspective, the argument could be framed as “one should not use derived metrics, and a nonlinear classifier on more data is better.” However, please see all of our notes above.

      (3) MLP classifiers:

      Even in this section, I was left somewhat underwhelmed that a nonlinear classifier with large amounts of data outperforms a linear classifier with small amounts of data. I found the analysis of the ISIs and which timescales are driving the classifier interesting but I think the classifier with smoothing is more interesting. So with a modest chance level decodability of different brain areas in the visual system, I found it somewhat grandiose to claim a "conserved" code for anatomy in the brain. If there is conservation, it seems to be at the level of the coarse brain organization, which in my opinion is not particularly compelling.

      The sample size used for both the linear and nonlinear classifiers is the same; however, the nonlinear classifier leverages the detailed spiking time information from ISIs. Our goal here was to systematically evaluate how classical spike metrics compare to more detailed temporal features in their ability to decode brain areas. We chose a linear classifier for spike metrics because, with fewer features, nonlinear methods like neural networks often offer very modest advantages over linear methods, less interpretability, and are prone to overfitting.

      Respectfully, we stand by our word choice. The term “conserved” is appropriate given that our results hold appreciably, i.e., statistically above chance, across animals.

      (4) Generalization section:

      The authors suggest that a classifier learned from one set of data could be used for new data. I was unsure if this was a scientific point or the fact that they could use it as a tool.

      It can be both. We are more driven by the scientific implications of a rejection of the null.

      Is the scientific argument that ISIs are similar across areas even in different tasks?

      It appears so - despite heterogeneity in the tuning of single neurons, their presynaptic inputs, and stimuli, there is identifiable information about anatomical location in the spike train.

      Why would one not learn a classifier from every piece of available data: like LFP bands, ISI distributions, and average firing rates, and use that to predict the brain area as a comparison?

      Because this would obfuscate the ability to conclude that spike trains embed information about anatomy.

      Considering all features simultaneously and adding additional data modalities—such as LFP bands and spike waveforms—has potential to improve classification accuracy at the cost of understanding the contribution of each feature. The spike train as a time series is the most fundamental component of neuronal communication. As a result, this is the only feature of neuronal activity of concern for the present investigation.

      Or is the argument that the ISIs are a conserved code for anatomy? Unfortunately, even in this section, the data are underwhelming.

      We appreciate the reviewer’s comments, but arrive at a very different conclusion. We were quite surprised to find any generalizability whatsoever.

      Moreover, for use as a tool, I think the authors need to seriously consider a control that is either waveforms from different brain areas or the local field potentials. Without that, I am struggling to understand how good this tool is. The authors said "because information transmission in the brain arises primarily from the timing of spiking and not waveforms (etc)., our studies involve only the timestamps of individual spikes from well-isolated units ". However, we are not talking about information transmission and actually trying to identify and assess brain areas from electrophysiological data.

      While we are not blind to the “tool” potential that is suggested by our work, this is not the primary motivation or content in any section of the paper. As stated clearly in the abstract, our motivation is to ask “whether individual neurons [...] embed information about their own anatomical location within their spike patterns”. We go on to say “This discovery provides new insights into the relationship between brain structure and function, with broad implications for neurodevelopment, multimodal integration, and the interpretation of large-scale neuronal recordings. Immediately, it has potential as a strategy for in-vivo electrode localization.” Crucially, the last point we make is a nod to application. Indeed, our results suggest that in-vivo electrode localization protocols may benefit from the incorporation of such a model.

      In light of the reviewer’s concerns, we have further dampened the weight of statements about our model as a consumer-ready tool.

      Example 1: The final sentence of the abstract now reads: “Computational approximations of anatomy have potential to support in-vivo electrode localization.”

      Example 2: The results sections now contains the following text: “While significantly above chance, the structure-level model still lacks the accuracy for immediate practical application. However, it is highly likely that the incorporation of datasets with diverse multi-modal features and alternative regions from other research groups will increase the accuracy of such a model. In addition, a computational approach can be combined with other methods of anatomical reconstruction.” (Line 355 - 359).

      Example 3: We replaced the phrase "because information transmission in the brain arises primarily from the timing of spiking and not waveforms (etc) " with the phrase “because information is primarily encoded by the firing rate or the timing of spiking and not waveforms (etc)” (Line 116 - 118).

      (5) Discussion section:

      In the discussion, beginning with "It is reasonable to consider . . ." all the way to the penultimate paragraph, I found the argumentation here extremely hard to follow. Furthermore, the parts of the discussion here I did feel I understood, I heavily disagreed with. They state that "recordings are random in their local sampling" which is almost certainly untrue when it comes to electrophysiology which tends to oversample task-modulated excitatory neurons (https://elifesciences.org/articles/69068). I also disagree that "each neuron's connectivity is unique, and vertebrate brains lack 'identified neurons' characteristic of simple organisms. While brains are only eutelic and "nameable" in only the simplest organisms (C. elegans), cell types are exceedingly stereotyped in their connectivity even in mammals and such connectivity defines their computational properties. Thus I don't find the premise the authors state in the next sentence to be undermined ("it seems unlikely that a single neuron's happenstance imprinting of its unique connectivity should generalize across stimuli and animals"). Overall, I found this subsection to rely on false premises and in my opinion it should be removed.

      At the suggestion of R2, we removed the paragraph in question. However, we would like to address some points of disagreement:

      We agree that electrophysiology, along with spike-sorting, quality metrics, and filtering of low-firing neurons, leads to oversampling of task-modulated neurons. However, when we stated that recordings are random in their local sampling, we were referring to structural (anatomical) randomness, not functional randomness. In other words, the recorded neurons were not specifically targeted (see below).

      Electrode arrays, such as Neuropixels, record from hundreds of neurons within a small volume relative to the total number of neurons and the volume of a given brain region. For instance, the paper R2 referenced includes a statement supporting this: “... assuming a 50-μm ‘listening radius’ for the probes (radius of half-cylinder around the probe where the neurons’ spike amplitude is sufficiently above noise to trigger detection) …, the average yield of 116 regular-spiking units/probe (prior to QC filtering) would imply a density of 42,000 neurons/mm³, much lower than the known density of ~90,000 neurons/mm³ for excitatory cells in mouse visual cortex….”

      If we take the estimated volume of V1 to be approximately 3 mm³, this region could theoretically be subdivided into multiple cylinders with a 100-μm diameter. While stereotaxic implantation of the probe mitigates some variability, the natural anatomical variability across individual animals introduces spatially random sampling. This was the randomness we were referring to, and thus, we disagree with the assertion that our claim is “almost certainly untrue.”

      Additionally, each cortical pyramidal neuron is understood to have ~ 10,000 presynaptic partners. It is highly unlikely that these connections are entirely pre-specified, perfectly replicated within the same animal, and identical across all members of species. Further, there is enormous diversity in the activity properties of even neighboring cells of the same type. Consider pyramidal neurons in V1. Single neuron firing rates are log normally distributed, there are many of combinations of tuning properties (i.e., direction, orientation) that must occupy each point in retinotopic space, and there is powerful experience dependent change in the connectivity of these cells. We suggest that it is inconceivable that any two neurons, even within a small region of V1, have identical connectivity.

      Minor Comments:

      (1) Although the description of confusion matrices is good from a didactic perspective, some of this could be moved to methods to simplify the paper.

      We thank the reviewer for the suggestion. However, given the broad readership of eLife, we gently suggest that confusion matrices are not a trivial and universally appreciated plotting format. For the purpose of accessibility, a brief and didactic 2-sentence description will make the paper far more comprehensible to many readers at little cost to experts.

      (2) Figure 3A: It is concluded in their subsequent figure that the longer the measured amount of time, the better the decoding performance. Thus it makes sense why the average PSTHs do not show significant decoding of areas or structures

      That is a good observation. However, all features were calculated from the same duration of data, except in Figure 3B, where we tested the effect of duration. The averaged PSTH was calculated from the same length of data as the ISI distribution and binned to have the same number of feature lengths as the ISI distribution (refer to Methods section). Therefore, we interpreted this as an indication of information degradation through averaging, rather than an effect of data length (Line 234 - 237).

      (3) Figure 3D: A Gaussian is used to fit the ISI distributions here but ISI distributions do not follow a normal distribution, they follow an inverse gamma distribution.

      We agree with the reviewer and we are familiar with the literature that the ISI distribution is best fitted by a gamma family distribution (as a recent, but not earliest example: Li et al. 2018). However, we did not fit a gaussian (or any distribution) to the data, we just calculated the sample mean and variance. Reporting sample mean and variance (or standard deviation) is not something that is only done for Gaussian distributions. They are broadly used metrics that simply have additional intrinsic meaning for Gaussian distributions. We used the schematic illustration in Fig 3D because mean and variance are much more familiar in Gaussian distribution context, but ultimately that does not affect our analyses in Fig 3 E-F. Alternatively, the alpha and beta intrinsic parameters of a gamma distribution could have been used, but they are known by a much smaller portion of neuroscientists.

      Li, M., Xie, K., Kuang, H., Liu, J., Wang, D., Fox, G. E., ... & Tsien, J. Z. (2018). Spike-timing pattern operates as gamma-distribution across cell types, regions and animal species and is essential for naturally-occurring cognitive states. Biorxiv, 145813(10.1101), 145813.

      (4) Figure 3G: Something is wrong with this figure as each vertical bar is supposed to represent a drifting grating onset but yet, they are all at 5 hz despite the PSTH being purportedly shown at many different frequencies from 1 to 15 hz.

      We appreciate your attention to detail, but we are not representing the onset of individual drifting gratings in this. We just meant to represent the overall start\end of the drifting grating session. We did not intend to signal the temporal frequency of the drifting gratings (or the spatial frequency, orientation, or contrast).

    1. But for his sins against me, I will this very day take vengeance on Hippolytus; for long ago I cleared the ground of many obstacles, so it needs but trifling toil. For as he came one day from the home of Pittheus to witness the solemn mystic rites and be initiated therein in Pandion's land, Phaedra, his father's noble wife, caught sight of him, and by my designs she found her heart was seized with wild desire. And ere she came to this Troezenian realm, a temple did she rear to Cypris hard by the rock of Pallas where it o'erlooks this country, for love of the youth in another land; and to win his love in days to come she called after his name the temple she had founded for the goddess.

      In this passage the goddess makes her case for the eventual penalty leveled against Hippolytus by the wrathful goddess. The audience is presented with the transgression as well as the intended punishment. Just as Greek society had a strict code of laws regarding civil responsibilities they also had a moral code from the Gods. The reiteration of these standards in this form promotes general acceptance among the masses.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      This study examined the interaction between two key cortical regions in the mouse brain involved in goal-directed movements, the rostral forelimb area (RFA) - considered a premotor region involved in movement planning, and the caudal forelimb area (CFA) - considered a primary motor region that more directly influences movement execution. The authors ask whether there exists a hierarchical interaction between these regions, as previously hypothesized, and focus on a specific definition of hierarchy - examining whether the neural activity in the premotor region exerts a larger functional influence on the activity in the primary motor area than vice versa. They examine this question using advanced experimental and analytical methods, including localized optogenetic manipulation of neural activity in either region while measuring both the neural activity in the other region and EMG signals from several muscles involved in the reaching movement, as well as simultaneous electrophysiology recordings from both regions in a separate cohort of animals.

      The findings presented show that localized optogenetic manipulation of neural activity in either RFA or CFA resulted in similarly short-latency changes in the muscle output and in firing rate changes in the other region. However, perturbation of RFA led to a larger absolute change in the neural activity of CFA neurons. The authors interpret these findings as evidence for reciprocal, but asymmetrical, influence between the regions, suggesting some degree of hierarchy in which RFA has a greater effect on the neural activity in CFA. They go on to examine whether this asymmetry can also be observed in simultaneously recorded neural activity patterns from both regions. They use multiple advanced analysis methods that either identify latent components at the population level or measure the predictability of firing rates of single neurons in one region using firing rates of single neurons in the other region. Interestingly, the main finding across these analyses seems to be that both regions share highly similar components that capture a high degree of variability of the neural activity patterns in each region. Single units' activity from either region could be predicted to a similar degree from the activity of single units in the other region, without a clear division into a leading area and a lagging area, as one might expect to find in a simple hierarchical interaction. However, the authors find some evidence showing a slight bias towards leading activity in RFA. Using a two-region neural network model that is fit to the summed neural activity recorded in the different experiments and to the summed muscle output, the authors show that a network with constrained (balanced) weights between the regions can still output the observed measured activities and the observed asymmetrical effects of the optogenetic manipulations, by having different within-region local weights. These results put into question whether previous and current findings that demonstrate asymmetry in the output of regions can be interpreted as evidence for asymmetrical (and thus hierarchical) inputs between regions, emphasizing the challenges in studying interactions between any brain regions.

      Strengths:

      The experiments and analyses performed in this study are comprehensive and provide a detailed examination and comparison of neural activity recorded simultaneously using dense electrophysiology probes from two main motor regions that have been the focus of studies examining goal-directed movements. The findings showing reciprocal effects from each region to the other, similar short-latency modulation of muscle output by both regions, and similarity of neural activity patterns without a clear lead/lag interaction, are convincing and add to the growing body of evidence that highlight the complexity of the interactions between multiple regions in the motor system and go against a simple feedforward-like network and dynamics. The neural network model complements these findings and adds an important demonstration that the observed asymmetry can, in theory, also arise from differences in local recurrent connections and not necessarily from different input projections from one region to the other. This sheds an important light on the multiple factors that should be considered when studying the interaction between any two brain regions, with a specific emphasis on the role of local recurrent connections, that should be of interest to the general neuroscience community.

      Weaknesses:

      While the similarity of the activity patterns across regions and lack of a clear leading/lagging interaction are interesting observations that are mostly supported by the findings presented (however, see comment below for lack of clarity in CCA/PLS analyses), the main question posed by the authors - whether there exists an endogenous hierarchical interaction between RFA and CFA - seems to be left largely open. 

      The authors note that there is currently no clear evidence of asymmetrical reciprocal influence between naturally occurring neural activity patterns of the two regions, as previous attempts have used non-natural electrical stimulation, lesions, or pharmacological inactivation. The use of acute optogenetic perturbations does not seem to be vastly different in that aspect, as it is a non-natural stimulation of inhibitory interneurons that abruptly perturbs the ongoing dynamics.

      We do believe that our optogenetic inactivation identifies a causal interaction between the endogenous activity patterns in the excitatory projection neurons, which we have largely silenced, and the downstream endogenous activity that is perturbed. The effect in the downstream region results directly from the silencing of activity in the excitatory projection neurons that mediate each region’s interaction with other regions. Here we have performed a causal intervention common in biology: a loss-of-function experiment. Such experiments generally reveal that a causal interaction of some sort is present, but often do not clarify much about the nature of the interaction, as is true in our case. By showing that a silencing of endogenous activity in one motor cortical region causes a significant change to the endogenous activity in another, we establish a causal relationship between these activity patterns. This is analogous to knocking out the gene for a transcription factor and observing causal effects on the expression of other genes that depend on it. 

      Moreover, our experiments are, to our knowledge, the first that localize a causal relationship to endogenous activity in motor cortex at a particular point during a motor behavior. Lesion and pharmacological or chemogenetic inactivation have long-lasting effects, and so their consequences on firing in other regions cannot be attributed to a short-latency influence of activity at a particular point during movement. Moreover, the involvement of motor cortex in motor learning and movement preparation/initiation complicates the interpretation of these consequences in relation to movement execution, as disturbance to processes on which execution depends can impede execution itself. Stimulation experiments generate spiking in excitatory projection neurons that is not endogenous.

      That said, we would agree that the form of the causal interaction between RFA and CFA remains unaddressed by our results. These results do not expose how the silenced activity patterns affect activity in the downstream region, just as knocking out a transcription factor gene does not expose how the transcription factor influences the expression of other genes. To show evidence for a specific type of interaction dynamics between RFA and CFA, a different sort of experiment would be necessary. See Jazayeri and Afraz, Neuron, 2017 for more on this issue.

      Furthermore, the main finding that supports a hierarchical interaction is a difference in the absolute change of firing rates as a result of the optogenetic perturbation, a finding that is based on a small number of animals (N = 3 in each experimental group), and one which may be difficult to interpret. 

      Though N = 3, we do show statistical significance. Moreover, using three replicates is not uncommon in biological experiments that require a large technical investment.

      As the authors nicely demonstrate in their neural network model, the two regions may differ in the strength of local within-region inhibitory connections. Could this theoretically also lead to a difference in the effect of the artificial light stimulation of the inhibitory interneurons on the local population of excitatory projection neurons, driving an asymmetrical effect on the downstream region? 

      We (Miri et al., Neuron, 2017) and others (Guo et al., Neuron, 2014) have shown that the effect of this inactivation on excitatory neurons in CFA is a near-complete silencing (90-95% within 20 ms). There thus is not much room for the effects on projection neurons in RFA to be much larger. We have measured these local effects in RFA as part of other work (Kristl et al., biorxiv, 2025), verifying that the effects on RFA projection neuron firing are not larger.

      Moreover, the manipulation was performed upon the beginning of the reaching movement, while the premotor region is often hypothesized to exert its main control during movement preparation, and thus possibly show greater modulation during that movement epoch. It is not clear if the observed difference in absolute change is dependent on the chosen time of optogenetic stimulation and if this effect is a general effect that will hold if the stimulation is delivered during different movement epochs, such as during movement preparation.

      We agree that the dependence of RFA-CFA interactions on movement phase would be interesting to address in subsequent experiments. While a strong interpretation of lesion results might lead to a hypothesis that premotor influence on primary motor cortex is local to, or stronger during, movement preparation as opposed to execution, at present there is to our knowledge no empirical support from interventional experiments for this hypothesis. Moreover, existing results from analysis of activity in these two regions have produced conflicting results on the strength of interaction between these regions during preparation. Compare for example BachschmidRomano et al., eLife, 2023 to Kaufman et al., Nature Neuroscience, 2014.

      That said, this lesion interpretation would predict the same asymmetry we have observed from perturbations at the beginning of a reach - a larger effect of RFA on CFA than vice versa.

      Another finding that is not clearly interpretable is in the analysis of the population activity using CCA and PLS. The authors show that shifting the activity of one region compared to the other, in an attempt to find the optimal leading/lagging interaction, does not affect the results of these analyses. Assuming the activities of both regions are better aligned at some unknown groundtruth lead/lag time, I would expect to see a peak somewhere in the range examined, as is nicely shown when running the same analyses on a single region's activity. If the activities are indeed aligned at zero, without a clear leading/lagging interaction, but the results remain similar when shifting the activities of one region compared to the other, the interpretation of these analyses is not clear.

      Our results in this case were definitely surprising. Many share the intuition that there should be a lag at which the correlations in activity between regions may be strongest. The similarity in alignment across lags we observed might be expected if communication between regions occurs over a range of latencies as a result of dependence on a broad diversity of synaptic paths that connect neurons. In the Discussion, we offer an explanation of how to reconcile these findings with the seemingly different picture presented by DLAG.

      Reviewer #2 (Public review):

      Summary:

      While technical advances have enabled large-scale, multi-site neural recordings, characterizing inter-regional communication and its behavioral relevance remains challenging due to intrinsic properties of the brain such as shared inputs, network complexity, and external noise. This work by Saiki-Ishkawa et al. examines the functional hierarchy between premotor (PM) and primary motor (M1) cortices in mice during a directional reaching task. The authors find some evidence consistent with an asymmetric reciprocal influence between the regions, but overall, activity patterns were highly similar and equally predictive of one another. These results suggest that motor cortical hierarchy, though present, is not fully reflected in firing patterns alone.

      Strengths:

      Inferring functional hierarchies between brain regions, given the complexity of reciprocal and local connectivity, dynamic interactions, and the influence of both shared and independent external inputs, is a challenging task. It requires careful analysis of simultaneous recording data, combined with cross-validation across multiple metrics, to accurately assess the functional relationships between regions. The authors have generated a valuable dataset simultaneously recording from both regions at scale from mice performing a cortex-dependent directional reaching task.

      Using electrophysiological and silencing data, the authors found evidence supporting the traditionally assumed asymmetric influence from PM to M1. While earlier studies inferred a functional hierarchy based on partial temporal relationships in firing patterns, the authors applied a series of complementary analyses to rigorously test this hierarchy at both individual neuron and population levels, with robust statistical validation of significance.

      In addition, recording combined with brief optogenetic silencing of the other region allowed authors to infer the asymmetric functional influence in a more causal manner. This experiment is well designed to focus on the effect of inactivation manifesting through oligosynaptic connections to support the existence of a premotor to primary motor functional hierarchy.

      Subsequent analyses revealed a more complex picture. CCA, PLS, and three measures of predictivity (Granger causality, transfer entropy, and convergent cross-mapping) emphasized similarities in firing patterns and cross-region predictability. However, DLAG suggested an imbalance, with RFA capturing CFA variance at a negative time lag, indicating that RFA 'leads' CFA. Taken together these results provide useful insights for current studies of functional hierarchy about potential limitations in inferring hierarchy solely based on firing rates.

      While I would detail some questions and issues on specifics of data analyses and modeling below, I appreciate the authors' effort in training RNNs that match some behavioral and recorded neural activity patterns including the inactivation result. The authors point out two components that can determine the across-region influence - 1) the amount of inputs received and 2) the dependence on across-region input, i.e., the relative importance of local dynamics, providing useful insights in inferring functional relationships across regions.

      Weaknesses:

      (1) Trial-averaging was applied in CCA and PLS analyses. While trial-averaging can be appropriate in certain cases, it leads to the loss of trial-to-trial variance, potentially inflating the perceived similarities between the activity in the two regions (Figure 4). Do authors observe comparable degrees of similarity, e.g., variance explained by canonical variables? Also, the authors report conflicting findings regarding the temporal relationship between RFA and CFA when using CCA/PLS versus DLAG. Could this discrepancy be due to the use of trial-averaging in former analyses but not in the latter?

      We certainly agree that the similarity in firing patterns is higher in trial averages than on single trials, given the variation in single-neuron firing patterns across trials. Here, we were trying to examine the similarity of activity variance that is clearly movement dependent, as trial averages are, and to use an approach aligned with those applied in the existing literature. We would also agree that there is more that can be learned about interactions from trial-by-trial analysis. It is possible that the activity components identified by DLAG as being asymmetric somehow are not reflected strongly in trial averages. In our Discussion we offer another potential explanation that is based on other differences in what is calculated by DLAG and CCA/PLS.

      We also note here that all of the firing pattern predictivity analysis we report (Figure 6) was done on single trial data, and in all cases the predictivity was symmetric. Thus, our results in aggregate are not consistent with symmetry purely being an artifact of trial averaging.

      (2) A key strength of the current study is the precise tracking of forelimb muscle activity during a complex motor task involving reaching for four different targets. This rich behavioral data is rarely collected in mice and offers a valuable opportunity to investigate the behavioral relevance of the PM-M1 functional interaction, yet little has been done to explore this aspect in depth. For example, single-trial time courses of inter-regional latent variables acquired from DLAG analysis can be correlated with single-trial muscle activity and/or reach trajectories to examine the behavioral relevance of inter-regional dynamics. Namely, can trial-by-trial change in inter-regional dynamics explain behavioral variability across trials and/or targets? Does the inter-areal interaction change in error trials? Furthermore, the authors could quantify the relative contribution of across-area versus within-area dynamics to behavioral variability. It would also be interesting to assess the degree to which across-area and within-area dynamics are correlated. Specifically, can acrossarea dynamics vary independently from within-area dynamics across trials, potentially operating through a distinct communication subspace?

      These are all very interesting questions. Our study does not attempt to parse activity into components predictive of muscle activity and others that may reflect other functions. Distinct components of RFA and CFA activity may very well rely on distinct interactions between them.

      (3) While network modeling of RFA and CFA activity captured some aspects of behavioral and neural data, I wonder if certain findings such as the connection weight distribution (Figure 7C), across-region input (Figure 7F), and the within-region weights (Figure 7G), primarily resulted from fitting the different overall firing rates between the two regions with CFA exhibiting higher average firing rates. Did the authors account for this firing rate disparity when training the RNNs?

      The key comparison in Figure 7 is shown in 7F, where the firing rates are accounted for in calculating the across-region input strength. Equalizing the firing rates in RFA and CFA would effectively increase RFA rates. If the mean firing rates in each region were appreciably dependent on across-region inputs, we would then expect an off-setting change in the RFA→CFA weights, such that the RFA→CFA distributions in 7F would stay the same. We would also expect the CFA→RFA weights would increase, since RFA neurons would need more input. This would shift the CFA→RFA (blue) distributions up. Thus, if anything, the key difference in this panel would only get larger. 

      We also generally feel that it is a better approach to fit the actual firing rates, rather than normalizing, since normalizing the firing rates would take us further from the actual biology, not closer.

      (4) Another way to assess the functional hierarchy is by comparing the time courses of movement representation between the two regions. For example, a linear decoder could be used to compare the amount of information about muscle activity and/or target location as well as time courses thereof between the two regions. This approach is advantageous because it incorporates behavior rather than focusing solely on neural activity. Since one of the main claims of this study is the limitation of inferring functional hierarchy from firing rate data alone, the authors should use the behavior as a lens for examining inter-areal interactions.

      As we state above, we agree that examining interactions specific to movement-related activity components could reveal interesting structure in interregional interactions. Since it remains a challenge to rigorously identify a subset of neural activity patterns specifically related to driving muscle activity, any such analysis would involve an additional assumption. It remains unclear how well the activity that decoders use for predicting muscle activity matches the activity that actually drives muscle activity in situ.

      To address this issue, which related to one raised by Reviewer #3 below, we have added an additional paragraph to the Discussion (see “Manifestations of hierarchy in firing patterns”).

      Reviewer #3 (Public review):

      This study investigates how two cortical regions that are central to the study of rodent motor control (rostral forelimb area, RFA, and caudal forelimb area, CFA) interact during directional forelimb reaching in mice. The authors investigate this interaction using

      (1) optogenetic manipulations in one area while recording extracellularly from the other, (2) statistical analyses of simultaneous CFA/RFA extracellular recordings, and (3) network modeling.

      The authors provide solid evidence that asymmetry between RFA and CFA can be observed, although such asymmetry is only observed in certain experimental and analytical contexts.

      The authors find asymmetry when applying optogenetic perturbations, reporting a greater impact of RFA inactivation on CFA activity than vice-versa. The authors then investigate asymmetry in endogenous activity during forelimb movements and find asymmetry with some analytical methods but not others. Asymmetry was observed in the onset timing of movement-related deviations of local latent components with RFA leading CFA (computed with PCA) and in a relatively higher proportion and importance of cross-area latent components with RFA leading than CFA leading (computed with DLAG). However, no asymmetry was observed using several other methods that compute cross-area latent dynamics, nor with methods computed on individual neuron pairs across regions. The authors follow up this experimental work by developing a twoarea model with asymmetric dependence on cross-area input. This model is used to show that differences in local connectivity can drive asymmetry between two areas with equal amounts of across-region input.

      Overall, this work provides a useful demonstration that different cross-area analysis methods result in different conclusions regarding asymmetric interactions between brain areas and suggests careful consideration of methods when analyzing such networks is critical. A deeper examination of why different analytical methods result in observed asymmetry or no asymmetry, analyses that specifically examine neural dynamics informative about details of the movement, or a biological investigation of the hypothesis provided by the model would provide greater clarity regarding the interaction between RFA and CFA.

      Strengths:

      The authors are rigorous in their experimental and analytical methods, carefully monitoring the impact of their perturbations with simultaneous recordings, and providing valid controls for their analytical methods. They cite relevant previous literature that largely agrees with the current work, highlighting the continued ambiguity regarding the extent to which there exists an asymmetry in endogenous activity between RFA and CFA.

      A strength of the paper is the evidence for asymmetry provided by optogenetic manipulation. They show that RFA inactivation causes a greater absolute difference in muscle activity than CFA interaction (deviations begin 25-50 ms after laser onset, Figure 1) and that RFA inactivation causes a relatively larger decrease in CFA firing rate than CFA inactivation causes in RFA (deviations begin <25ms after laser onset, Figure 3). The timescales of these changes provide solid evidence for an asymmetry in the impact of inactivating RFA/CFA on the other region that could not be driven by differences in feedback from disrupted movement (which would appear with a ~50ms delay).

      The authors also utilize a range of different analytical methods, showing an interesting difference between some population-based methods (PCA, DLAG) that observe asymmetry, and single neuron pair methods (granger causality, transfer entropy, and convergent cross mapping) that do not. Moreover, the modeling work presents an interesting potential cause of "hierarchy" or "asymmetry" between brain areas: local connectivity that impacts dependence on across-region input, rather than the amount of across-region input actually present.

      Weaknesses:

      There is no attempt to examine neural dynamics that are specifically relevant/informative about the details of the ongoing forelimb movement (e.g., kinematics, reach direction). Thus, it may be preemptive to claim that firing patterns alone do not reflect functional influence between RFA/CFA. For example, given evidence that the largest component of motor cortical activity doesn't reflect details of ongoing movement (reach direction or path; Kaufman, et al. PMID: 27761519) and that the analytical tools the authors use likely isolate this component (PCA, CCA), it may not be surprising that CFA and RFA do not show asymmetry if such asymmetry is related to the control of movement details. 

      An asymmetry may still exist in the components of neural activity that encode information about movement details, and thus it may be necessary to isolate and examine the interaction of behaviorally-relevant dynamics (e.g., Sani, et al. PMID: 33169030).

      To clarify, we are not claiming that firing patterns in no way reflect the asymmetric functional influence that we demonstrate with optogenetic inactivation. Instead, we show that certain types of analysis that we might expect to reflect such influence, in fact, do not. Indeed, DLAG did exhibit asymmetries that matched those seen in functional influence (at least qualitatively), though other methods we applied did not.

      As we state above, we do think that there is more that can be gleaned by looking at influence specifically in terms of activity related to movement. However, if we did find that movement-related activity exhibited an asymmetry following functional influence, our results imply that the remaining activity components would exhibit an opposite asymmetry, such that the overall balance is symmetric. This would itself be surprising. We also note that the components identified by CCA and PLS do show substantial variation across reach targets, indicating that they are not only reflecting condition-invariant components. These analyses were performed on components accounting for well over 90% of the total activity variance, suggesting that both conditiondependent and condition-invariant components should be included.

      To address the concern about condition-dependent and condition-invariant components, we have added a sentence to the Results section reporting our CCA and PLS results: “Because our results here involve the vast majority of trial-averaged activity variance, we expect that they encompass both components of activity that vary for different movement conditions (condition-dependent), and those that do not (condition-invariant).” To address the general concerns about potential differences in activity components specifically related to muscle activity, we have also added an additional paragraph to the Discussion (see “Manifestations of hierarchy in firing patterns”).

      The idea that local circuit dynamics play a central role in determining the asymmetry between RFA and CFA is not supported by experimental data in this paper. The plausibility of this hypothesis is supported by the model but is not explored in any analyses of the experimental data collected. Given the focus on this idea in the discussion, further experimental investigation is warranted.

      While we do not provide experimental support for this hypothesis, the data we present also do not contradict this hypothesis. Here we used modeling as it is often used - to capture experimental results and generate hypotheses about potential explanation. We do feel that our Discussion makes clear where the hypothesis derives from and does not misrepresent the lack of experimental support. We expect readers will take our engagement with this hypothesis with the appropriate grain of salt. The imaginable experiments to support such a hypothesis would constitute another substantial study, requiring numerous controls - a whole other paper in itself.

      Recommendations for the authors:  

      Reviewer #1 (Recommendations for the authors):

      (1) There are a few small text/figure caption modifications that can be made for clarity of reading:

      (2) Unclear sentence in the second paragraph of the introduction: "For example, stimulation applied in PM has been shown to alter the effects on muscles of stimulation in M1 under anesthesia, both in monkeys and rodents."

      This sentence has been rephrased for clarity: “For example, in anesthetized monkeys34 and rodents35, stimulation in PM alters the effects of stimulation in M1 on muscles.”

      (3) The first section of the results presents the optogenetic manipulation. However, the critical control that tests whether this was strictly a local manipulation that did not affect cells in the other region is introduced only much later. It may be helpful to add a comment in this section noting that such a control was performed, even if it is explained in detail later when introducing the recordings.

      We have added the following to the first Results section: “we show below that direct optogenetic effects were only seen in the targeted forelimb area and not the other.”

      (4) Figure 1D - I imagine these averages are from a single animal, but this is not stated in the figure caption.

      “For one example mouse,” has been added to the beginning of the Figure 1D legend.

      (5) Figure 2F - N=6 is not stated in the panel's caption (though it can make it clearer), while it is stated in the caption of 2H.

      “n = 6 mice” has been added to the Figure 2F legend.

      (6) There's some inconsistency with the order of RFA/CFA in the figures, sometimes RFA is presented first (e.g., Figure 1D and 1F), and sometimes CFA is presented first (e.g., panels of Figure 2).

      We do not foresee this leading to confusion.

      (7) "As expected, the majority of recorded neurons in each region exhibited an elevated average firing rate during movement as compared to periods when forelimb muscles were quiescent (Figure 2D,E; Figure S1A,B)" - Figure S1A,B show histograms of narrow vs. wide waveforms, is this the relevant figure here?

      We apologize for the cryptic reference. The waveform width histograms were referred to here because they enabled the separation of narrow- and wide-waveform cells shown in Figure 2D,E. We have added the following clause to the referenced sentence to make this explicit:  “, both for narrow-waveform, putative interneurons and wide-waveform putative pyramidal neurons.”

      (8) Figure 2I caption - "The fraction of activity variance from 150 ms before reach onset to 150 ms after it that occurs before reach onset" - this sentence is not clear.

      The Figure 2I legend has been updated to “The activity variance in the 150 ms before muscle activity onset, defined as a fraction of the total activity variance from 150 ms before to 150 ms after muscle activity onset, for each animal (circles) and the mean across animals (black bars, n = 6 mice).”

      (9) Figure 4B-G - is this showing results across the 6 animals? Not stated clearly.

      Yes - the 21 sessions we had referred to are drawn from all six mice. We have updated the legend here to make this explicit.

      (10) DLAG analysis - is there any particular reasoning behind choosing four across-region and four within-region components?

      In actuality, we completed this analysis for a broad range of component numbers and obtained similar results in all cases. Four fell in the center of our range, and so we focused the illustrations shown in the figure on this value. In general, the number of components is arbitrary. The original paper from Gokcen et al. describes a method for identifying a lower bound on the number of distinct components the method can identify. However, this method yields different results for each individual recording session. For the comparisons we performed, we needed to use the same range of values for each session.

      (11) Figure 5A seems to show 11 across-session components, it's unclear from the caption but I imagine this should show 12 (4 components times 3 sessions?)

      As we state in the Methods, any across-region latent variable with a lag that failed to converge between the boundary values of ±200 ms was removed from the analysis. In the case illustrated in this panel, the lag for one of the components failed to converge and is not shown. We have now clarified this both in the relevant Results paragraph and in the figure legend.

      (12) Figure 5B - is each marker here the average variance explained by all across/within components that were within the specified lag criteria across sessions per mouse? In other words, what does a single marker here stand for?

      We apologize for the lack of clarity here. These values reflect the average across sessions for each mouse. We have updated the legend to make this explicit.

      Reviewer #2 (Recommendations for the authors):

      As I have addressed most of my major recommendations in the public review, I will use this section to include relatively minor points for the authors to consider.

      (1) The EMG data in Figure 1C shows distinct patterns across spouts, both in the magnitude and complexity of muscle activations. It would be interesting to investigate whether these differences in muscle activity lead to behavioral variations (e.g., reaction time, reach duration) and how they relate to the relative involvement of the two areas.

      We agree that it would be interesting to examine how the interactions between areas vary as behavior varies. While the differences between reaches here are limited, we have addressed this question for two substantially different motor behaviors (reaching and climbing) in a follow-up study that was recently preprinted (Kristl et al., biorxiv, 2025).

      (2) How do the authors account for the lingering impact of RFA inactivation on muscle activity, which persists for tens of milliseconds after laser offset? Could this effect be due to compensatory motor activity following the perturbation? A further illustration of how the raw limb trajectories and/or muscle activity are perturbed and recovered would help readers better understand the impact of motor cortical inactivation.

      To clarify the effects of inactivation on a longer timescale, we have added a new supplemental figure showing the plots from Figure 1D over a longer time window extending to 500 ms after trial onset (new Figure S1). Lingering effects do persist, at least in certain cases. In general, we find it hard to ascertain the source of optogenetic effects on longer timescales like this. On the shortest timescales, effects will be mediated by relatively direct connections between regions. However, on these longer timescales, effects could be due to broader changes in brain and behavioral state that can influence muscle activity. For example, attempts to compensate for the initial disturbance to muscle activity could cause divergence from controls on these longer timescales. Muscle tissue itself is also known to have long timescale relaxation dynamics, and it would not be surprising if the relevant control circuits here also had long timescales dynamics, such that we would not expect an immediate return to control when the light pulse ends. Because of this ambiguity, we generally avoid interpretation of optogenetic effects on these longer timescales.

      Reviewer #3 (Recommendations for the authors):

      (1) Page 9: ". We measured the time at which the activity state deviated from baseline preceding reach onset," - I cannot find how this deviation was defined (neither the baseline nor the threshold).

      We have added text to the Figure 2G legend that explicitly states how the baseline and activity onset time were defined.

      (2) Given the shape of the curves in Figure 2G, the significance of this result seems susceptible to slight modifications of what defines a baseline or a deviation threshold. For example, it looks like the circle for CFA has a higher y-axis value, suggesting the baseline deviance is higher, but it is unclear why that would be from the plot. If the threshold for deviation in neural activity state were held uniform between CFA and RFA is the difference still significant across animals?

      We have repeated the analysis using the same absolute threshold for each region. We used the higher of the two thresholds from each region. The difference remains significant. This is now described in the last paragraph of the Results section for Figure 2.

      (3) Since summed deviation of the top 3 PCs is used to show a difference in activity onset between CFA/RFA, but only a small proportion of variance is explained pre-movement (<2% in most animals), it seems relevant to understand what percentage of CFA/RFA neuron activity actually is modulated and deviates from baseline prior to movement and to show the distribution of activity onsets at the single neuron level in CFA/RFA. Can an onset difference only be observed using PCA? 

      Because many neurons have low firing rates, estimating the time at which their firing rate begins to rise near reach onset is difficult to do reliably. It is also true that not all neurons show an increase around onset - some show a decrease and others show no discernible change. Using PCs to measure onset avoids both of these problems, since they capture both increases and decreases in individual neuron firing rates and are much less noisy than individual neuron firing rates. 

      However, based on this comment, we have repeated this analysis on a single-neuron level using only neurons with relatively high average firing rates. Specifically, we analyzed neurons with mean firing rates above the 90th percentile across all sessions within an animal. Neurons whose activity never crossed threshold were excluded. Results matched those using PCs, with RFA neurons showing an earlier average activity onset time. This is now described in the last paragraph of the Results section for Figure 2.

      (4) It is stated that to study the impact of inactivation on CFA/RFA activity, only the 50 highest average firing rate neurons were used (and maybe elsewhere too, e.g., convergent cross mapping). It is unclear why this subselection is necessary. It is justified by stating that higher firing rate neurons have better firing rate estimates. This may be supportable for very low firing rate units that spike sorting tools have a hard time tracking, but I don't think this is supported by data for most of the distribution of firing rates. It therefore seems like the results might be biased by a subselection of certain high firing rate neuron populations. It would be useful to also compute and mention if the results for all neurons/neuron pairs are the same. If there is worry about low-quality units being those with low firing rates, a threshold for firing rate as used elsewhere in the paper (at least 1 spike / 2 trials) seems justified.

      The issue here is that as firing rates decrease and firing rate estimates get noisier, estimates of the change in firing rate get more variable. Here we are trying to estimate the fraction of neurons for which firing rates decreased upon inactivation of the other region. Variability in estimates of the firing rate change will bias this estimate toward 50%, since in the limit when the change estimates are entirely based on noise, we expect 50% to be decreases. As expected, when we use increasingly liberal thresholds for this analysis, the fraction of decreases trends closer to 50%. 

      As a consequence of this, we cannot easily distinguish whether higher firing rate neurons might for some reason have a greater tendency to exhibit decreases in firing compared to lower firing rate neurons. However, we see no positive reason to expect such a difference. We have added a sentence noting this caveat in interpreting our findings to the relevant paragraph of the Results.

      The lack of min/max axis values in Figure 3B-F makes it hard to interpret - are these neurons almost silent when near the bottom of the plot or are they still firing a substantial # of spikes?

      To aid interpretation of the relative magnitude of firing rate changes, we have added minimum firing rates for the averages depicted in Figure 3B,C,E and F to the legend. Our original thinking was that the plots in Figure 3G and H would provide an indication of the relative changes in firing.

      It would be interesting to know if the impact of optogenetic stimulation changed with exposure to the manipulation. Are all results presented only from the first X number of sessions in each animal? Or is the effect robust over time and (within the same animal) you can get the same results of optogenetic inactivation over time? This information seems critical for reproducibility.

      We have now performed brief optogenetic inactivations in several brain areas in several different behavioral paradigms, and have found that inactivation effects are stable both within and across sessions, almost surprisingly so. This includes cases where the inactivations were more frequent (every ~1.25 s on average) and more numerous (>15,000 trials per animal) than in the present manuscript. Thus we did not restrict our analysis here to the first X sessions or trials within a session. We have added additional plots as Figure S3T-AA showing the stability of optogenetic effects both within and across sessions.

      Given that it can be difficult to record from interneurons (as the proportion of putative interneurons in Figure S1 attests), the SALT analyses would be more convincing if a few recordings had been performed in the same region as optogenetic stimulation to show a "positive control" of what direct interneuron stimulation looks like. Could also use this to validate the narrow/wide waveform classification.

      We have verified that using SALT as we have in the present manuscript does detect vGAT+ interneurons directly responding to light. This is included in a recent preprint from the lab (Kristl et al., biorxiv, 2025). We (Warriner et al., Cell Reports, 2022) and others (Guo et al., Neuron, 2014) have previously used direct ChR2 activation to validate waveform-based classification.

      Simultaneous CFA/RFA recordings during optogenetic perturbation would also allow for time courses of inhibition to be compared in RFA/CFA. Does it take 25ms to inhibit locally, and the cross-area impact is fast, or does it inactivate very fast locally and takes ~25ms to impact the other region?

      Latencies of this sort are difficult to precisely measure given the statistical limits of this sort of data, but there does appear to be some degree of delay between local and downstream effects. We do not have a statistical foundation as of yet for concluding that this is the case. It will be interesting to examine this issue more rigorously in the future.

      Given the difference in the analytical methods, the authors should share data in a relatively unprocessed format (e.g., spike times from sorted units relative to video tracking + behavioral data), along with analysis code, to allow others to investigate these differences.

      We plan to post the data and code to our lab’s Github site once the Version of Record is online.

    1. Briefing : Utiliser l'IA dans le monde associatif avec Yann Ferguson (LaborIA)

      Source : Extraits de "Parlez-moi d’IA #67 Utiliser l'IA dans le monde associatif avec Yann Ferguson (LaborIA)", podcast animé par Jean-Philippe Clément, avec la participation de Yann Ferguson, chercheur et sociologue, directeur du laboratoire Laboria.

      Contexte : Ce briefing est basé sur des extraits d'une table ronde organisée par la Maison de la Vie Associative et Citoyenne du 13ème arrondissement de Paris, abordant les enjeux, risques et bonnes pratiques de l'utilisation de l'IA dans le monde associatif.

      Yann Ferguson, spécialiste de l'IA dans le monde du travail, y a partagé son expertise.

      Thèmes Principaux :

      • Origines et évolution de l'Intelligence Artificielle (IA).
      • Impact de l'IA sur le monde du travail, notamment dans le secteur associatif.
      • La crise récurrente de l'intelligence face aux nouvelles technologies.
      • Avantages et risques de l'IA générative (ChatGPT) et l'émergence du "Shadow AI".
      • Alternatives aux grands modèles d'IA et l'importance de l'Open Source et de l'IA frugale.
      • Utilisation de l'IA dans l'éducation et le développement des compétences humaines.

      Idées Clés et Faits Importants :

      Définition et histoire de l'IA :

      L'IA est avant tout une discipline de recherche formalisée à partir du milieu des années 1950.

      Le terme "Intelligence Artificielle" a été choisi dès 1955 pour son pouvoir accrocheur et marketing, permettant d'attirer des financements.

      Les fondateurs avaient pour ambition de créer des machines capables de réaliser des tâches nécessitant des processus mentaux de haut niveau, jusqu'alors mieux réalisées par les humains (raisonnement, organisation de la mémoire, pensée critique, apprentissage perceptuel).

      Citation : "l'intelligence artificielle c'est une discipline de recherche avant d'être des outils et des applications qui... a commencé à se formaliser à partir de la deuxième moitié des années 50 donc 1950."

      Citation : "Ils cherchaient un terme assez accrocheur... un peu qui avait une dimension marketing assez forte." Deux approches historiques de l'IA : Initialement, deux grandes voies ont été explorées :

      L'imitation du raisonnement humain (IA logique ou symbolique) : reproduire les logiques des experts face aux problèmes. Cette approche a dominé jusqu'aux années 2000.

      L'imitation de l'apprentissage humain (apprentissage machine - machine learning) : fabriquer des machines qui apprennent à partir d'exemples, comme les humains. Cette approche a connu un renouveau important depuis 2010, notamment grâce à l'avènement du Big Data.

      Citation : "Le premier chemin consiste à imiter le raisonnement humain... La deuxième manière qu'ils ont imaginé c'est plutôt d'imiter le l'apprentissage humain."

      Citation : "depuis 2010 on connaît un renouveau de l'intelligence artificielle mais plus autour de l'approche de limitation du raisonnement humain bien plus autour de l'approche de limitation de l'apprentissage humain."

      Citation : "une des raisons pour lesquelles on a une accélération... ça a été le big data vous savez toutes ces données que l'on retrouve sur internet."

      L'apprentissage profond (Deep Learning) et le problème de la boîte noire : Le Deep Learning, un courant du machine learning, a permis d'obtenir d'excellents résultats (conversation, reconnaissance d'images) mais souffre d'un problème d'opacité dans son fonctionnement, appelé "problème de l'explicabilité et de la boîte noire".

      Citation : "un courant du machine learning qui est le deep learning l'apprentissage profond... avec un problème c'est que la façon dont la machine parvient à apprendre est assez opaque."

      IA et le monde du travail : L'IA s'inscrit dans la révolution informatique, aux côtés de l'intelligence augmentée (l'ordinateur comme outil d'assistance) et de l'intelligence distribuée (réseau, Web, intelligence collective). L'IA, ou intelligence automatisée, est la machine capable de raisonner seule.

      Citation : "l'intelligence artificielle elle prend place dans la dans la révolution informatique à partir des années 50 et autour de cette de ces nouvelles machines les ordinateurs il y a en fait trois promesses d'intelligence Il y a l'intelligence augmentée... l'intelligence distribué... la troisième l'intelligence artificielle qui était plutôt l'intelligence automatisée."

      Les cinq scénarios du futur du travail face à l'IA : Yann Ferguson a identifié cinq grands récits sur l'impact de l'IA sur le travail :

      • Le travailleur remplacé : la machine prend la place de l'humain.

      • Le travailleur dominé : l'humain est commandé par la machine ou dépend de plateformes sans protection sociale ("uberisation").

      • Les travailleurs divisés : apparition d'une classe de travailleurs pauvres avec des emplois peu intéressants et mal payés.

      • Le travailleur augmenté : l'IA libère l'humain des tâches répétitives, lui permettant de se concentrer sur des tâches plus créatives et relationnelles. Le travailleur réhumanisé : Recentrage de l'humain sur des tâches dignes de son humanité.

      Citation : "j'avais proposé cinq grandes directions à ce récit... Il y a évidemment le récit du travailleur remplacé... la deuxième le deuxième élément un peu dystopique aussi hein c'est le travailleur dominé... 3ème scénario négatif c'est ce que j'avais appelé les travailleurs divisés...

      Il y avait quand même deux scénarios beaucoup plus optimistes Il y avait le scénario du travailleur euh du travailleur augmenté... Après le dernier c'est ce qu'on appelle le travailleur réhumanisé."

      Le rôle essentiel du secteur associatif ("tiers secteur non marchand") : Jeremy Rifkin, dans les années 90, anticipait que le logiciel remplacerait de nombreux emplois, mais que le "tiers secteur non marchand" (l'associatif), essentiel pour vivre ensemble et non adressé par les logiques marchandes, prendrait une part importante de notre activité.

      Citation : "il y a plein d'activités dont on a absolument besoin pour vivre ensemble Et il a appelé ça le tiers secteur non marchand."

      Citation : "l'associatif c'est tout ce dont on a besoin pour vivre ensemble mais que le marché n'adresse pas."

      L'impact de l'IA générative (ChatGPT) : La sortie de ChatGPT en novembre 2022 a transformé l'usage de l'IA, passant d'initiatives majoritairement patronales à une adoption par les employés eux-mêmes ("Shadow AI"). Plus de la moitié des Français utilisant l'IA au travail le font sans l'accord de leur manager.

      Citation : "Lia générative ça change tout... Avant ChatGPT les études montraient que il y avait entre 1 et 10 % des Français qui pouvaient dire j'ai travaillé avec lire Voilà au moins une fois avec Chat GPT on a dépassé les 20 %."

      Citation : "on est beaucoup plus dans moi employé j'utilise chat GPT et c'est mon employeur qui veut pas mais je le fais quand même Voilà on appelle ça le shadow AI."

      La crise récurrente de l'intelligence face aux technologies : L'histoire montre une résistance récurrente aux nouvelles technologies de l'intelligence, de l'écriture (critiquée par Socrate) à l'imprimerie, la télévision, internet et Wikipédia, toutes accusées de "nous rendre idiot" ou de détruire le lien social. Cette crainte se manifeste aujourd'hui avec l'IA.

      Citation : "Qu'est-ce qu'une technologie de l'intelligence c'est un un artefact qui va intervenir d'une façon ou d'une autre dans nos raisonnements Et un artefact très noble extrêmement valorisé dans les universités c'est l'écriture."

      Citation : "on a eu différents moments de crise de l'intelligence en lien avec les technologies de l'intelligence L'imprimerie a été une crise dans l'intelligence majeure... Lorsque j'étais jeune c'était la la télévision hein qui était qui nous rendait idiot... internet non seulement devait nous rendre idiot mais aussi de mettre devait mettre fin au lien social... Wikipédia... lorsque Tiag GPT est arrivé on a dit il cite pas ses sources."

      Citation : "on a une espèce de crise récurrente de l'intelligence qui amène souvent à considérer que la nouvelle technologie l'intelligence ça nous rend idiot." Perte de facultés versus développement de compétences supérieures : L'utilisation de technologies de l'intelligence peut entraîner une perte de certaines facultés (ex: mémorisation avec les smartphones), mais la question est de savoir si cette perte permet de se déplacer vers des facultés cognitives supérieures (raisonnement, interprétation).

      Citation : "il y a toujours une perte de faculté euh lorsqu'on ne mobilise pas la compétence... Montigne nous dit il vaut mieux une tête bien faite qu'une tête bien pleine."

      Le risque de la médiocrité généralisée par l'IA générative :

      L'IA générative peut rendre un travail "moyen" accessible sans effort, ce qui est gratuit et peut satisfaire le marché (y compris associatif).

      Le risque est d'accepter cette médiocrité intellectuelle au détriment d'un travail de meilleure qualité. Les experts sont mieux placés pour utiliser l'IA pour aller au-delà de la moyenne.

      Citation : "l' générative dont on parle beaucoup de ce soir elle peut facilement rendre tout le monde moyen sans effort." Citation : "quand vous êtes très performant dans votre sujet et bien vous êtes finalement un bien meilleur utilisateur de lien." Citation : "l'escroquerie c'est de dire que tout le monde peut tout faire avec liagénérative Ça c'est le mensonge." Citation : "le risque c'est plutôt cette espèce de satisfaction générale pour des choses de médiocre qualité intellectuelle."

      Alternatives Open Source et IA frugale : Face aux grands modèles propriétaires (comme ChatGPT d'OpenAI), il existe des alternatives Open Source dont le code est public et modifiable. Des modèles spécialisés et "frugaux" (moins consommateurs de ressources énergétiques) sont développés, permettant d'installer l'IA localement sur son ordinateur (ex: LM Studio).

      Des acteurs comme Perplexity proposent des modèles qui citent leurs sources (même si la vérification reste nécessaire). Des initiatives françaises comme Playas entraînent des modèles Open Source sur des corpus de données transparents.

      Citation : "il y a une une troisième voix qui est en train de s'ouvrir... J'ai il y a un acteur qui je trouve fait un travail fantastique en France qui s'appelle Playas qui est un acteur qui qui entraîne ses modèles ce qu'on appelle en open source."

      Citation : "Les solutions commencent à émerger... il n'y a pas que les grands modèles générals qui sont après on va revenir peut-être à un moment donné mais qui sont très très consommateurs de ressources d'énergie On peut il y a il y a désormais une école de développement de l'IA qui est euh très frugal très spé spécial très spécifique à des sujets."

      Citation : "Open AI est une solution privée où on ne connaît pas la source... Et il y a plein de modèles qui sont open source c'est-à-dire que le code il est publié en ligne."

      L'éthique et l'écologie de l'IA : La course au "scaling" (augmenter l'échelle des modèles) des grands acteurs est très énergivore.

      L'IA frugale, qui se concentre sur des tâches spécifiques et minimise les ressources, est une approche plus responsable. Une spécification AFNOR sur l'IA Frugale existe pour guider les développeurs.

      Citation : "J'ai pas pu m'empêcher de faire mon petit coup de gueule sur l'usage des modèles généraux énormes et leur fabrication euh versus les petits modèles frugo et et open source."

      Citation : "vous avez une spécification de l'AFNOR donc vraiment un truc officiel quand même qui s'appelle IA Frugal et qui donne des méthodes aux gens qui développent sur l'IA pour s'interroger sur la manière de le faire de la manière la plus la plus frugale."

      Utilisation de l'IA comme outil d'amélioration dans l'éducation :

      L'exemple de l'utilisation de l'IA pour aider une lycéenne à améliorer son devoir montre comment l'IA peut servir non pas à faire le travail à la place de l'humain, mais à le perfectionner et à développer des compétences critiques.

      Citation : "on a plutôt essayé de voir comment Lia pouvait l'aider à faire mieux qu'à faire moins."

      Conclusion Principale :

      L'IA représente une évolution majeure de l'intelligence, inscrite dans une histoire longue de technologies qui ont suscité des craintes récurrentes.

      Son impact sur le monde du travail, y compris dans le secteur associatif, est multiforme, avec des scénarios allant du remplacement à l'augmentation et à la réhumanisation du travail.

      L'avènement de l'IA générative a démocratisé son usage, mais soulève des questions éthiques, écologiques et de qualité.

      Il est crucial de ne pas se limiter aux grands modèles propriétaires et de considérer les alternatives Open Source et les approches d'IA frugale.

      L'enjeu n'est pas nécessairement de "déléguer l'intelligence humaine à une machine", mais de comprendre comment l'IA peut être un outil pour améliorer et développer les compétences humaines, notamment en s'appuyant sur les forces relationnelles et créatives de l'humain, particulièrement importantes dans le monde associatif.

      Être un "bon utilisateur" de l'IA, capable d'interpréter et de contextualiser ses réponses, est essentiel.

    1. Author response:

      We will revise the statements of novelty in the introduction by more clearly emphasizing how our model addresses gaps in the existing literature. In addition, we will clarify the description of the dispersal process. Briefly, we use the same dispersal gene β to represent the likelihood an individual will either leave or join a group, thereby quantifying both dispersal and immigration using the same parameter. Specifically, individuals with higher β are more likely to remain as floaters (i.e., disperse from their natal group to become a breeder elsewhere), whereas those with lower β are either more likely to remain in their natal group as subordinates (i.e., queue in a group for the breeding position) or join another group if they dispersed. Immigrants that join a group as a subordinate help and queue for a breeding position, as does any natal subordinate born into the group. To follow the suggestion of the referee and more fully explore the impact of competition between subordinates born in the group and subordinate immigrants, we will explore extending our model to allow dispersers to leave their natal group and join another as subordinates, by incorporating a reaction norm based on their age or rank (D = 1 / (1 + exp (β<sub>t</sub> * t – β<sub>0</sub>)) . This approach will allow individuals to adjust also their dispersal strategy to their competitiveness and to avoid kin competition by remaining as a subordinate in another group.

      We apologize that there was some confusion with terminology. We use the term “disperser” to describe individuals that disperse from their natal group. Dispersers can assume one of three roles: (1) they can migrate to another group as "subordinates"; (2) they can join another group as "breeders" if they successfully outcompete other candidates; or (3) they can remain as "floaters" if they fail to join a group. "Floaters" are individuals who persist in a transient state without access to a breeding territory, waiting for opportunities to join a group in an established territory. Therefore, dispersers do not work when they are floaters, but they may later help if they immigrate to a group as a subordinate. Consequently, immigrant subordinates have no inherent competitive advantage over natal subordinates (as step 2.2. “Join a group” is followed by step 3. “Help”, which occurs before step 5. “Become a breeder”). Nevertheless, floaters can potentially outcompete subordinates of the same age if they attempt to breed without first queuing as a subordinate (step 5) when subordinates are engaged in work tasks. We believe that this assumption is realistic and constitutes part of the costs associated with work tasks. However, floaters are at a disadvantage for becoming a breeder because: (1) floaters incur higher mortality than individuals within groups (eq. 3); and (2) floaters may only attempt to become breeders in some breeding cycles (versus subordinate groups members, who are automatically candidates for an open breeding position in the group in each cycle). Therefore, due to their higher mortality, floaters are rarely older than individuals within groups, which heavily influences dominance value and competitiveness. Additionally, any competitive advantage that floaters might have over other subordinate group members is unlikely to drive the kin selection-only results because subordinates would preferably choose defense tasks instead of work tasks so as not to be at a competitive disadvantage compared to floaters.

      We note that reviewers also mention that floaters often aren't usually high resource holding potential (RHP) individuals and, therefore, our assumptions might be unrealistic. As we explain above, floaters are not inherently at a competitive advantage in our model. In any case, empirical work in a number of species has shown that dispersers are not necessarily those of lower RHP or of lower quality. In fact, according to the ecological constraints hypothesis, one might predict that high quality individuals are the ones that disperse because only individuals in good condition (e.g., larger body size, better energy reserves) can afford the costs associated with dispersal (Cote et al., 2022). By adding a reaction norm approach to explore the role of age or rank in the revised version, we can also determine whether higher or lower quality individuals are the ones dispersing. We will address the issues of terminology and clarity of the relative competitive advantage of floaters versus subordinates, and also include more information in the Supplementary Tables (e.g., the number of floaters). As a side note, the “scramble context” we mention was an additional implementation that we decided to remove from the final manuscript, but we forgot to remove from Table 1 before submission.

      The reviewers also raised a question about asexual reproduction and relatedness more generally. As we showed in the Supplementary Tables and the section on relatedness in the SI (“Kin selection and the evolution of division of labor"), high relatedness does not appear to explain our results. In evolutionary biology generally and in game theory specifically (with the exception of models on sexual selection or sex-specific traits), asexual reproduction is often modelled because it reduces unnecessary complexity. To further study the effect of relatedness on kin structures more closely resembling those of vertebrates, however, we will create an additional “relatedness structure level”, where we will shuffle half of the philopatric offspring using the same method used to remove relatedness completely. This approach will effectively reduce relatedness structure by half and overcome the concerns with our decision to model asexual reproduction.

      Briefly, we will elaborate on the concept of division of labor and the tasks that cooperative breeders perform. In nature, multiple tasks are often necessary to successfully rear offspring. For example, in many cooperatively breeding birds, the primary reasons that individuals fail to produce offspring are (1) starvation, which is mitigated by the feeding of offspring, and (2) nest depredation, which is countered by defensive behavior. Consequently, both types of tasks are necessary to successfully produce offspring, and focusing solely on one while neglecting the other is likely to result in lower reproductive success than if both tasks are performed by individuals within the group. We simplify this principle in the model by maximizing reproductive output when both tasks are carried out to a similar extent, allowing for some flexibility from the mean. In response to the reviewer suggestion about making fecundity a function of work tasks and offspring survival as a function of defensive tasks, these are actually equivalent in model terms, as it’s the same whether breeders produce three offspring and two die, or if they only produce one. This represents, of course, a simplification of the natural context, where breeding unsuccessfully is more costly (in terms of time and energy investment) than not breeding at all, but this is approach is typically used in models of this sort.

      The scope of this paper was to study division of labor in cooperatively breeding species with fertile workers, in which help is exclusively directed towards breeders to enhance offspring production (i.e., alloparental care). Our focus is in line with previous work in most other social animals, including eusocial insects and humans, which emphasizes how division of labor maximizes group productivity. Other forms of “general” help are not considered in the paper, and such forms of help are rarely considered in cooperatively breeding vertebrates or in the division of labor literature, as they do not result in task partitioning to enhance productivity.

      How do we model help? Help provided is an interaction between H (total effort) and T (proportion of total effort invested in each type of task). We will make this definition clearer in the revised manuscript. Thank you for pointing out an error in Eq. 1. This inequality was indeed written incorrectly in the paper (but is correct in the model code); it is dominance rank instead of age (see code in Individual.cpp lines 99-119). We will correct this mistake in the revision.

      There was also a question about bounded and unbounded helping costs. The difference in costs is inherent to the nature of the different task (work or defense): while survival is naturally bounded, with death as the lower bound, dominance costs are potentially unbounded, as they are influenced by dynamic social contexts and potential competitors. Therefore, we believe that the model’s cost structure is not too different to that in nature.

      Thank you for your comments about the parameter landscape. It is important to point out that variations in the mutation rate do not qualitatively affect our results, as this is something we explored in previous versions of the model (not shown). Briefly, we find that variations in the mutation rates only alter the time required to reach equilibrium. Increasing the step size of mutation diminishes the strength of selection by adding stochasticity and reducing the genetic correlation between offspring and their parents. Population size could, in theory, affect our results, as small populations are more prone to extinction. Since this was not something we planned to explore in the paper directly, we specifically chose a large population size, or better said, a large number of territories (i.e. 5000) that can potentially host a large population.

      During the exploratory phase of the model development, various parameters and values were also assessed. However, the manuscript only details the ranges of values and parameters where changes in the behaviors of interest were observed, enhancing clarity and conciseness. For instance, variation in y<sub>h</sub> (the cost of help on dominance when performing “work tasks”) led to behavioral changes similar to those caused by changes in x<sub>h</sub> (the cost of help in survival when performing “defensive tasks”), as both are proportional to each other. Specifically, since an increase in defense costs raises the proportion of work relative to defense tasks, while an increase in the costs of work task has the opposite effect, only results for the variation of x<sub>h</sub> were included in the manuscript to avoid redundancy. We will make this clearer in the revision.

      Finally, following the advice from the reviewers, we will add the symbols of the variables to the figure axes, and clarify whether the values shown represent a genetic or phenotypic trait. In Figure 2, the x-axis is H and the y-axis is T. In Figure 3A, the subindex t in x-axis is incorrect; it should be subindex R (reaction norm to dominance rank instead of age), the y-axis is T. In Figure 3B, the x-axis is R, and the y-axis is T. All values of T, H and R are phenotypic expressed values (see Table 1). For instance, T values are the phenotypic expressed values from the individuals in the population according to their genetic gamma values and their current dominance rank at a given time point.

      References

      Cote, J., Dahirel, M., Schtickzelle, N., Altermatt, F., Ansart, A., Blanchet, S., Chaine, A. S., De Laender, F., De Raedt, J., & Haegeman, B. (2022). Dispersal syndromes in challenging environments: A cross‐species experiment. Ecology Letters, 25(12), 2675–2687.

    1. Despite pressures to conform to normative gender, gender remains inplay. While this sense of play may open possibilities, play with gender orplay with sexuality also raises anxieties and bias against transgender andgender nonconforming youth.

      In this passage, the author shares the challenges of and consequences of youth who do not conform to gender norms, specifically gender nonconforming or transgender youth. Gender play can extend the possibilities of self-expression, but it often triggers adult worries and institutional bias. The text reflects not only ignorance but also an attempt to contain youth identities. The schools expel the 14-year-old who painted her face and wore makeup, and there was no dress code. In this scenario, the school systems, which depend heavily on rigid gender norms, behave unjustifiably. Schools pay greater reaction to perceived threats to gender roles than to the rules that are actually being broken.

    2. For instance, a 14-year-old Florida student, who preferred not to share hissexual identity, was punished for wearing makeup to school (Sieczkowski,2013 ). The principal claimed the student was in violation of the dress code,but the dress code had no reference to makeup (Sieczkowski, 2013). Othersimilar situations, including a young woman wearing a suit (Esseks, 2010),a transgender student wanting to wear a dress (American Civil LibertiesUnion [ACLU], 2013), or simply cisgendered heterosexual students dressingin nonconforming ways,

      This passage refers to how school dress codes are enforced unfairly to reinforce traditional gender expectations even though the written policies themselves might not necessarily ban expressions such as the use of cosmetics or gender-nonconformist clothing. The examples show that students, both LGBTQ+ and non-LGBTQ+, will be disciplined for simply appearing in a way that subverts stereotypical masculine and feminine expectations. It represents a more fundamental problem in which student individuality is policed on the guise of "dress code infractions" in a strict enforcement of gender roles and hostility to student diversity of identity.

    1. How can you use it in your day-to-day life? After you’re done reading a blog post, go to another tab or open your notebook and try to write a few bullet points about the content, from memory. If you’re learning how to code, try to read a tutorial in a focused way, then apply what you learned without looking at the tutorial. Only check the tutorial when you feel stuck. When trying to memorise anything, ask a friend to quiz you so you can generate your own answers. Or create your own flashcards to test yourself.

      Implement active learning

    1. Note de Synthèse : Analyse du Rapport sur Parcoursup et Mon Master (Extrait)

      Date : 2025

      Objet : Examen des plateformes d'accès à l'enseignement supérieur (Parcoursup et Mon Master), focus sur les algorithmes, les bacheliers STMG et la région Nouvelle-Aquitaine.

      Source : Extraits du rapport "rapport parcoursup 2025-297809.pdf" du Comité éthique et scientifique de Parcoursup et Mon Master (CESPM).

      Introduction :

      Ce rapport, le 7ème du CESPM, offre un éclairage approfondi sur les mécanismes et les enjeux des plateformes d'accès à l'enseignement supérieur Parcoursup et Mon Master.

      Il s'appuie sur des analyses générales, des entretiens avec les acteurs de terrain et une étude de cas spécifique portant sur les bacheliers de la filière Sciences et technologies du management et de la gestion (STMG) et la région Nouvelle-Aquitaine.

      L'objectif est de garantir la transparence, l'équité et l'efficacité de ces processus d'admission.

      Thèmes Principaux et Idées Clés :

      Rôle et Fonctionnement des Algorithmes dans Parcoursup et Mon Master (Section 1.1, 1.2, 1.4):

      Clarification fondamentale : Le rapport insiste sur le fait que ce ne sont pas les plateformes ou leurs algorithmes qui prennent les décisions d'admission, mais bien les responsables et enseignants des formations réunis en commissions d'examen.

      Algorithme d'Appariement : L'algorithme gère la mise en relation entre les vœux des candidats et les classements des formations. Il ne classe pas les candidats lui-même mais utilise les classements fournis par les formations. Son code est publié pour assurer la transparence.

      Différences entre Parcoursup et Mon Master : Parcoursup applique des quotas (boursiers, géographiques pour les licences) qui peuvent modifier les classements initiaux des formations, dans un souci de réduction des inégalités sociales. Mon Master n'applique aucun quota. "Algorithmes Locaux" (Classement par les Formations) :

      Le rapport dénonce l'appellation impropre "algorithmes locaux". Les formations utilisent des outils (souvent des tableurs) basés sur des critères définis par les équipes pédagogiques pour pré-classer les candidatures, facilitant le travail de délibération des commissions. Transparence des Critères de Classement : Le Comité recommande vivement la publication précise et quantifiable des critères de classement utilisés par les formations. Cette transparence est jugée essentielle pour l'équité.

      Inégalités d'Accès : Le rapport affirme que les algorithmes d'appariement n'amplifient pas les inégalités sociales ou géographiques.

      Au contraire, les quotas dans Parcoursup visent à les réduire.

      Les inégalités d'accès existent mais sont liées à d'autres facteurs.

      Sélection à l'Entrée de l'Enseignement Supérieur (Section 1.3, 1.6):

      Formations Sélectives : Ces formations ne classent pas toujours tous les candidats, refusant certains d'emblée selon leurs critères.

      Le rapport critique le "malthusianisme" de certaines formations attractives qui ne classent pas suffisamment de candidats pour remplir leurs capacités d'accueil, entraînant une perte de places.

      Recommandation : Pour lutter contre cette pratique, le rapport suggère d'exiger une publication très précise des critères de refus ou d'obliger les formations à classer tous les candidats. Une autre idée serait de réduire la capacité d'accueil des formations qui ne remplissent pas sur plusieurs années.

      Accompagnement des Étudiants : La loi ORE a prévu des dispositifs "oui si" pour les licences non sélectives, mais rien n'est prévu pour les formations sélectives, même si certaines d'entre elles mettent en place un accompagnement.

      Utilisation des Données Publiques (Section 1.5): Richesse des Données : Les données publiques issues des plateformes sont une ressource précieuse pour analyser la réalité de l'accès à l'enseignement supérieur.

      Manque de Documentation : Le rapport souligne un manque de documentation précise pour de nombreux indicateurs (comme le "taux de poursuite"), rendant difficile l'interprétation et l'utilisation de ces données pour le pilotage.

      Recommandation : Améliorer la documentation des données publiques pour permettre des analyses robustes et éclairer les politiques publiques. Focus sur les Bacheliers STMG (Chapitre 2): Filière Hétérogène et Parfois Subie : La filière STMG accueille des élèves d'origines sociales plus défavorisées que les autres filières technologiques et générales. L'orientation en STMG est souvent plus importante que les demandes des familles et peut servir de "voie de délestage".

      Méconnaissance de la Filière : Le rapport pointe une méconnaissance des enseignements et des débouchés de la filière STMG par les professeurs et les parents, ce qui peut impacter négativement l'orientation.

      Difficultés d'Accès à l'Enseignement Supérieur : Les bacheliers STMG rencontrent des difficultés pour entrer dans l'enseignement supérieur, caractérisées par une "volatilité" importante, un sur cinq quittant Parcoursup après avoir reçu une proposition.

      Réussite en Études Supérieures : La réussite des bacheliers STMG en licence est faible. Elle est meilleure en formations courtes professionnalisées (BTS, BUT), mais reste inférieure à celle des autres bacheliers technologiques et généraux.

      Recommandations :Créer des parcours de licence adaptés aux bacheliers technologiques (notamment STMG). Mettre en place ou développer des parcours de consolidation et des dispositifs d'accompagnement en BUT et L1 pour favoriser la "persévérance" et éviter les décrochages précoces. Accès à l'Enseignement Supérieur en Nouvelle-Aquitaine (Chapitre 3):

      Impact de l'Éloignement Géographique : L'étude de la Nouvelle-Aquitaine, région étendue et rurale, montre l'impact significatif de l'éloignement des pôles d'enseignement supérieur sur les vœux et les admissions des candidats. Les candidats éloignés sont moins souvent admis dans les formations sélectives des grands pôles et plus souvent dans des formations courtes professionnalisantes plus proches.

      Contraintes de Transport et de Logement : Les contraintes financières liées au transport et au logement sont identifiées comme des facteurs majeurs d'inégalité. Le logement est particulièrement problématique dans les zones tendues de la côte atlantique.

      Critères d'Attribution des Logements Étudiants : Les critères actuels d'attribution des logements CROUS, trop axés sur les revenus, sont jugés inadaptés pour prendre correctement en compte la distance.

      Recommandations :Définir des critères spécifiques pour l'attribution des logements étudiants qui prennent mieux en compte les distances.

      Expérimenter l'attribution des logements étudiants par la procédure Parcoursup pour certains types de formations dans les régions rurales, à l'instar des internats de CPGE. Partager les internats dans les lycées qui offrent CPGE et BTS, et intégrer les demandes dans Parcoursup pour les BTS.

      Mon Master (Chapitre 4): * Plateforme Utile et Bien Acceptée : Mon Master est généralement bien perçue par les formations, facilitant le travail de classement des candidatures. Elle sert également d'observatoire pour l'adaptation de l'offre de formation. * Introduction de la Phase Complémentaire (PC) : L'ajout de la PC en 2024 est unanimement salué, permettant de mieux gérer les places vacantes et d'offrir de nouvelles opportunités aux candidats. * Sélection et Taux de Remplissage : L'entrée en master reste sélective. Des places vacantes persistent, avec des disparités selon les territoires et les mentions. Les candidats issus de BUT ou de licences professionnelles ont un taux d'admission plus faible que les titulaires de licence générale. * Saisine du Recteur : Le dispositif de saisine pour les titulaires d'une licence générale n'ayant reçu aucune proposition est détaillé. Le rapport recommande d'informer plus clairement les étudiants sur les restrictions de conditions pour cette saisine. * Périmètre de la Plateforme : Le rapport recommande de ne pas étendre la plateforme Mon Master aux formations ne conduisant pas à un Diplôme National de Master (DNM), mais d'y ajouter à titre informatif les masters qui ne recrutent qu'en formation continue. * Transparence et Équité : Le rapport réitère la nécessité de transparence sur les critères de sélection et la publication de rapports de jury, comme pour Parcoursup. * Alternance : Le recrutement en alternance via Mon Master présente des spécificités (calendrier, absence de classement des candidats). Le rapport suggère de permettre aux établissements de gérer la fongibilité des capacités d'accueil entre statut étudiant et apprentissage. * Nombre de Vœux : Le nombre élevé de vœux possibles sur Mon Master (15 par type de formation) alourdit le travail des commissions et peut inciter à des candidatures non mûries. * Sécurité et Protection des Données : Le code de l'algorithme de Mon Master est publié, ce qui est une avancée. Le rapport suggère de renforcer la sécurité des accès privilégiés par une authentification à double facteur et d'améliorer la remontée sécurisée des notes des candidats étudiant en France. * Partage d'Expérience entre Parcoursup et Mon Master : Le Comité recommande d'amplifier le partage d'expérience et de perspectives entre les deux plateformes et d'unifier la présentation de leurs algorithmes.

      Recommandations Clés du Rapport :

      Le rapport émet plusieurs recommandations concrètes pour améliorer les processus d'admission et l'accès à l'enseignement supérieur, notamment :

      • Améliorer la transparence des critères de classement et, pour les formations sélectives, les critères de refus (Recommandation 1).
      • Mieux documenter les données publiques pour permettre des analyses fines et éclairer les politiques publiques (Recommandation 2).
      • Créer des parcours de licence adaptés aux bacheliers technologiques (notamment STMG) et développer des dispositifs d'accompagnement en L1 et BUT pour favoriser la persévérance des STMG (Recommandations 4 et 5).
      • Définir des critères spécifiques pour l'attribution des logements étudiants prenant mieux en compte les distances et expérimenter l'attribution via Parcoursup (Recommandation 7).
      • Partager les internats dans les lycées offrant CPGE et BTS et intégrer les demandes dans Parcoursup pour les BTS (Recommandation 8).
      • Mieux informer les parents et les élèves sur Parcoursup, les filières d'études, les aides sociales et au logement (Recommandation 9).
      • Doter les établissements d'outils statistiques robustes et partagés pour suivre les parcours des jeunes (Recommandation 10).
      • Avertir clairement les étudiants des restrictions concernant la saisine du recteur sur Mon Master (Recommandation 14).
      • Ne pas ouvrir Mon Master aux formations ne conduisant pas à un DNM, mais ajouter les masters qui ne recrutent qu'en formation continue (Recommandation 15).
      • Permettre aux établissements de gérer la fongibilité entre les capacités d'accueil des formations en statut étudiant et en apprentissage sur Mon Master (Recommandation 20).
      • Amplifier le partage d'expérience et de perspectives entre Mon Master et Parcoursup et unifier la présentation des algorithmes (Recommandation 23).

      Conclusion du Comité :

      Le CESPM estime que ses travaux ont contribué à éclairer la décision publique pour l'amélioration de Parcoursup et espère faire de même pour Mon Master.

      Le rapport met en lumière le renoncement de nombreux lycéens, notamment STMG, à leurs aspirations en raison de l'éloignement géographique et d'une dévalorisation perçue de leur filière.

      L'amélioration de l'accès au logement étudiant, une meilleure information sur l'orientation et un suivi accru de la réussite des bacheliers STMG sont des pistes essentielles pour l'avenir.

      Le Comité prévoit de poursuivre son analyse de Mon Master l'année prochaine, avec un focus approfondi sur des disciplines spécifiques et la question de la saisine du recteur.

    1. We're at the point in humanity's development of computing infrastructure where the source code for program texts (e.g. a module definition) should be rich text and not just ASCII/UTF-8.

      Forget that I said "rich text" for a moment and pretend that I was just narrowly talking about the inclusion of, say, graphical diagrams in source code comments.

      Could we do this today? Answer: yes.

      "Sure, you could define a format, but what should it look like? You're going to have to deal with lots of competing proposals for how to actually encode those documents, right?" Answer: no, not really. We have a ubiquitous, widely supported format that is capable of encoding this and more: HTML.

      Now consider what else we could do with that power. Consider a TypeScript alternative that works not by inserting inline type annotations into the program text, but instead by encoding the type of a given identifier via the HTML class attribute.

      Now consider program parametrization where a module includes multiple options for the way that you use it, and you configure it as the programmer by opening up the module definition in your program editor, gesturing at the thing it is that you want to concretely specify, selecting one of those options, and have the program text for the module react accordingly—without erasing or severing the mechanism for configuration, so if another programmer wants to change the module parameters to satisfy some future need—or lift that module from your source tree and use it in another one for a completely different program—then they can reconfigure it with the same mechanism that you used.

    1. My demonstration is also shared at the end of this blog if you are interested in trying it for yourself!

      I was expecting a github repository with code, but sadly that wasn't the case.

    1. Author response:

      The following is the authors’ response to the original reviews

      Response to the public reviews:

      We are very pleased to see these positive reviews of our preprint.

      Reviewers 1 and 3 raise issues around PIP-PP1 interactions.

      (1) Role of the “RVxF-ΦΦ-R-W string”

      Most PIPs interact with the globular PP1 catalytic core through short linear interaction motifs (SLiMs) and Choy et al (PNAS 2014) previously showed that many PIPs interact with PP1 through conserved trio of SLiMs, RVxF-ΦΦ-R, which is also present in the Phactrs.

      Previous structural analysis showed the trajectory of the PPP1R15A/B, Neurabin/Spinphilin (PPP1R9A/B), and PNUTS (PPP1R10) PIPs across the PP1 surface encompasses not only the RVxF-ΦΦ-R trio, but also additional sequences C-terminal to it (Chen et al, eLife, 2015). This extended trajectory is maintained in the Phactr1-PP1 complex (Fedoryshchak et al, eLife (2020). Based on structural alignment we proposed the existence of an additional hydrophobic “W” SLiM that interacts with the PP1 residues I133 and Y134.

      The extended “RVxF-ΦΦ-R-W” interaction brings sequences C-terminal to the “W” SLiM into the vicinity of the hydrophobic groove that adjoins the PP1 catalytic centre. In the Phactr1/PP1 complex, these sequences remodel the groove, generating a novel pocket that facilitates sequence-specific substrate recognition.

      This raises the possibility that sequences C-terminal to the extended “RVxF-ΦΦ-R-W string” in the other complexes also confer sequence-specific substrate recognition, and our study aims to test this hypothesis. Indeed, the hydrophobic groove structures of the Neurabin/Spinophilin/PP1 and Phactr1/PP1 complexes differ significantly (Ragusa et al, 2010; see Fedoryshchak et al 2020, Fig2 FigSupp1).

      (2) Orientation of the W side chain

      Reviewer 1 points out that in the substrate-bound PP1/PPP1R15A/Actin/eIF2 pre-dephosphorylation complex the W sidechain is inverted with respect to its orientation in  PP1-PPP1R15B complex (Yan et al, NSMB 2021). The authors proposed that this may reflect the role of actin in assembly of the quaternary complex. This does not necessarily invalidate the notion that sequences C-terminal to the “W” motif might play a role in actin-independent substrate recognition, and we therefore consider our inclusion of the R15A/B fusions in our analysis to be reasonable.

      (3) Conservation of W

      The motif ‘W’ does not mandate tryptophan - Phactrs and PPP1R15A/B indeed have W at this position but Neurabin/spinophilin contain VDP, which makes similar interactions. Similarly the “RVxF” motifs in Phactr1, Neurabin/Spinophilin, PPP1R15A/B and PNUTS are LIRF, KIKF, KV(R/T)F and TVTW respectively.

      In our revision, we will present comparisons of the differentially remodelled/modified PP1 hydrophobic groove in the various complexes, discuss the different orientations of the tryptophan in the previously published PPP1R15A/PP1 and PPP1R15B/PP1 structures. We will also address the other issues raised by the referees.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Comments and suggestions for revisions

      (1) The authors do not provide strong evidence that the interactions of the 'W' of the RVxF- øø -R-W string with the hydrophobic groove of PP1 is conserved in PIPs. Whereas the RVxF motif is well conserved and validated since its discovery in 1997, as are the øø - (an extension of the RVxF motif), and the 'R', the conservation of the Trp residue in the RVxF-øø-R-W string is not conserved.

      We did not mean to imply that the W motif is conserved amongst all PIPs.

      Most PIPs interact with the globular PP1 catalytic core through short linear interaction motifs (SLiMs). Choy et al (PNAS 2014) previously showed that many PIPs interact with PP1 through a conserved trio of SLiMs, RVxF-ΦΦ-R, which is also present in the Phactrs.

      Previous structural analysis showed that the PPP1R15A/B, Neurabin/Spinophilin (PPP1R9A/B), and PNUTS (PPP1R10) PIPs share a trajectory across the PP1 surface that encompasses not only the RVxF-ΦΦ-R SLIMs, but also additional sequences C-terminal to the R SLIM (Chen et al, eLife, 2015). This trajectory is also shared by the Phactr1-PP1 complex (Fedoryshchak et al, eLife, 2020). Based on this structural alignment we proposed the existence of an additional hydrophobic “W” SLiM that interacts with the PP1 residues I133 and Y134 (See Fedoryshchak et al, 2020, Figure 1 figure supplement 2).

      Introduction, paragraph 2 is rewritten to make this clearer.

      The sequence and positions of W differ in amino acid type and position relative to the RVxF-øø-R string.

      The motif ‘W’ does not mandate tryptophan, it is our name for a common structurally aligned motif: although the Phactrs and PPP1R15A/B indeed have W at this position, Neurabin and spinophilin contain VDP, which nevertheless makes similar interactions. Similarly the _“_RVxF” motifs in Phactr1, Neurabin/Spinophilin, PPP1R15A/B and PNUTS are LIRF, KIKF, KV(R/T)F and TVTW respectively.

      In the Discussion the authors state that the hydrophobic groove of PP1 is remodelled by Neurabin. However, details of this are not described or shown in the manuscript.

      The shared trajectory determined by the RVxF-øø-R-W string brings the sequences C-terminal to the W SLIM into the vicinity of the PP1 hydrophobic groove. In the Phactr1/PP1 holoenzyme this generates a novel pocket required for substrate recognition (Fedoryshchak et al, 2020). These observations raised the possibility that sequences C-terminal to the “W” motif in the other RVxF-øø-R-W PIPs also play a role in substrate recognition.

      Introduction paragraph 3 now cites a new Figure 1-S2, which shows how the hydrophobic groove is remodelled in the various different PIP/PP1 complexes. A revised Figure 1A now indicates the hydrophobic residues defining the hydrophobic groove by grey shading.

      (2) To add to the confidence of the structure, the authors should include a 2Fo-Fc simulated annealing omit map, perhaps showing the R and W interactions of the RVxF-øø-R-W string.

      This is now included as new Figure 6 Figure supplement 1. Note that in Neurabin, the W motif is VDP, where the valine and proline sidechains interact similarly to the tryptophan (see also new Figure 1-S2G,H).

      We also add a new supplementary Figure 6-S1 comparing our PBM-liganded Neurabin PDZ domain with the previously published unliganded structure (Ragusa et al 2010).

      (3) Page 16. The authors state that spinophilin remodels the PP1 hydrophobic groove differently from Phactrs. Arguably spinophilin does not remodel the PP1 hydrophobic groove at all. There are no contacts between spinophilin and the PP1 hydrophobic groove in the spinophilin-PP1 structure, correlating with the absence of 'W" in the RVxF-øø-R-W string in spinophilin.

      The VDP sequence corresponding to the W motif in spinophilin and neurabin makes analogous contacts to those made by the W in Phactr1 (see Fedoryshchak et al 2020).

      Remodelling is meant in the sense of altering the structure of the major groove by bringing new sequences into its vicinity rather than necessarily directly interacting with it. The spinophilin/PP1 and Phactr/PP1 hydrophobic grooves are compared in new Figure 1-S2 (see also Fedoryshchak et al 2020, Figure 2 figure supplement 1)

      (4) Page 8. For the cell-based/proteomics-dephosphorylation assay in Figure 2, it isn't clear why there were no dephosphorylation sites detected for the PPP1R15A/B-PP1 fusion (except PPP6R1 S531 for PPP1R15B). One might have expected a correlation with PP1 alone. Does this imply that PPP1R15A/B are inhibiting PP1 catalytic activity? Was the activity tested in vitro?

      The R15A/B data are compared to average abundance of all the phosphosites in the dataset, including those of PP1.

      We have not tested for a general inhibitory effect of R15A/B on PP1 activity. Many PIPs including R15A/B do occlude one or more of the PP1 substrate groove and therefore generally act as inhibitors of PP1 activity against some potential substrates, while enhancing activities against others.

      Other points 

      (4) Figure S1: Colour sequence similarities/identities.

      Done

      (6) Figures: Structure figures lacked labels:

      Figure 1A, label PP1, Phactrs etc.

      Done

      Figure 6, label PP1, Neurabin, previous Neurabin structure (Fig. 6C), hydrophobic groove, PDZ domain, etc.

      Done

      (7) Statistical analysis. p values should be shown for data in:

      Figure 5.

      To avoid cluttering the Figure, a new sheet, “statistical significance” has been added to Supplementary Table 3, summarizing the analysis.

      Figure 1.

      Figure amended (now figure 1-S1).

      (8) Some inconsistency with labels, eg '34-WT' used in Fig. 5C, whereas '34A-WT' (better) in Methods.

      Now changed to 34A etc where used.

      (9) Page 6. PPP1R9A/B is not shown in Figure 1A and Figure S1A.

      PPP1R9A/B are Neurabin and spinophilin - now clarified in Introduction paragraph 2, Results paragraph 1, Discussion paragraph 1.

      (10) Page 7: lines 4, 'site' not 'side'.

      Done

      (11) Page 9: DTL and CAMSAP3 were found to be dephosphorylated in the PP1-Neurabin/spinophilin screen. Are these PDZ-binding proteins?

      Neither DTL nor CAMSAP3 contain C-terminal hydrophobic residues characteristic of classical PBMs. Sentence added in Discussion, paragraph 5

      (12) Page 12 and Figure 5 and S5: The synthetic p4E-BP1 and IRSp53WT peptides with PBM should be given more specific names to indicate the presence of the PBM.

      We have renamed 4E-BP1<sup>WT</sup> and IRSp53<sup>WT</sup> to 4E-BP1<sup>PBM</sup> and  IRSp53<sup>PBM</sup> respectively, emphasising the inclusion of the wildtype or mutated PBM from 4E-BP1 on these peptides.

      Text, Figure 5, and Figure S5 all revised accordingly.

      (13) Give PDB code for spinophilin-PP1 complex coordinates shown in Figure 6C.

      PDB codes for the various PIP/PP1 complexes now given in new Figure 1-S2 and revised Figure 6C.

      Reviewer #2 (Recommendations for the authors):

      The work undertaken by the authors is extensive and robust, however, I believe that some improvement in the writing and some detailed explanation of certain results sections would help with the presentation of the work and clarity for the readers.

      (1) The introduction should contain more information about the interaction between PP1 and Neurabin, given that this is the focus of the paper. This would give the reader the necessary background required to follow the paper.

      Introduction paragraph 2 revised to describe the different SLIMs in more detail. New Figure 1-S2 shows detail of the different remodelled hydrophobic grooves in the various PIP/PP1 complexes.

      (2) More information on PP1-IRSp53L460A has to be added before discussing results in S1B.

      Sentence explaining that IRSp53 L460 docks with the remodelled PP1 hydrophobic groove in the Phactr1/PP1 holoenzyme added in Results paragraph 2.

      (3) Page 6: "as expected, the +5 residue L460A mutation, which impairs dephosphorylation by the intact Phactr1/PP1 holoenzyme, impaired sensitivity to all the fusions, indicating that they recognise phosphorylated IRSp53 in a similar way (Figure S1B)". Statistics between IRSp53 and IRSp53L460A across PP1-PIPs need to be conducted before concluding the above. From the graph and the images, the impairment to dephosphorylation is not convincing.

      For each of the four PP1-Phactr fusions, the IRSp53 L460A peptide shows significantly less reactivity than the IRSp53WT peptide (p<0.05 for each fusion).

      Since the proteomics studes in Figure 2 show that the substrate specificity of the four PP1-Phactr1 fusions is virtually identical, we combined the data for the four different fusions. The IRSp53 L460A peptide shows significantly less reactivity than the IRSp53WT peptide in this analysis (p< 0.0001). This result shown in revised Figure S1B and legend.

      (4) mCherry-4E-BP1(118+A), in which an additional C-terminal alanine should still allow TOSmediated phosphorylation, but prevent PDZ interaction. Does 4EBP1 (118+A) actually prevent interaction between PP1-Neurabin? This interaction needs to be validated, especially since spinophilin was shown to bind to multiple regions of PP1.

      It is not clear what the referee is asking for here. The biochemical analysis in Figure 4C shows that the C-terminus of 4E-BP1 constitutes a classical PBM. The X-ray crystallography in Figure 6 confirms this, demonstrating H-bond interactions between the 4E-BP1 C-terminal carboxylate and main chain amides of L514, G515 and I516.

      We consider the possibility that the 4E-BP1(118+A) mutant inhibits the activity of PP1-neurabin via a mechanism other than direct blocking 4E-BP1 / PDZ interaction to be unlikely for the following reasons:

      (1) Addition of a C-terminal alanine will disrupt the PBM interaction because the extra residue sterically blocks access to the PBM-binding groove. This is the most parsimonious explanation, and is based on our solid structural and biochemical evidence that the 4E-BP1 C-terminus is a classical PBM.

      (2) Alphafold3 modelling predicts Neurabin PDZ / 4E-BP1 PBM interaction with high confidence (shown in Figure 6-S2E), but it does not predict any PDZ interaction with 4E-BP1(118+A). Note added in Figure 6-S2 legend.

      (3) Recognition of the 4E-BP1(118+A) mutation without loss of binding affinity would require that the mutant becapable of binding formally equivalent to recognition of an “internal” PDZ-binding peptide. Recognition of such “internal peptides” is dependent on their adopting a specifically constrained conformation, which typically requires reorganisation of the PDZ carboxylate-binding GLGF loop. Such “internal site” recognition typically involves more than one residue C-terminal to the conventional PDZ “0” position (see Penkert et al NSMB 2004, doi:10.1038/nsmb839; Gee et al JBC 1998, DOI: 10.1074/jbc.273.34.21980; Hillier et al 1999, Science PMID: 10221915).

      (5) It is nice to see that the various PP1-Phactr fusions have around 60% substrate overlap between them. Would it be possible to compare these results with previously published mass spec data of Phactr1XXX from the group? There is mention of some substrates being picked up, but a comparison much like in Figure 2E would be more informative about the extent to which the described method captures relevant information.

      This is difficult to do directly as the PP1-Phactr fusion data are from human cells while that in Fedoryshchak et al 2020 is from mouse.

      However, manual curation shows that of the 28 top hits seen in our previous analysis of Phactr1XXX in NIH3T3 cells, 18 were also detectable in the HEK293 system; of these, 13 were also detected as as PP1-Phactr fusion hits. Data summarised in new Figure 2-S1C. Text amended in Results, “Proteomic analysis...”, paragraph 2.

      (6) Figure 3D Why are the levels of pT70, pT37/46 and total protein in vector controls much lower as compared to 0nM Tet in PP1-Neurabin conditions? It is also weird that given total protein is so low, why are the pS65/101 levels high compared to the rest?

      We think it likely these phenomena reflect a low level expression of PP1-Neurabin expression in uninduced cells. Now noted in Figure 3D legend, basal PP1-Neurabin expression shown in new Figure 3-S1C. This alters the relative levels of the different species detected by the total 4E-BP1 antibody in favour of the faster migrating forms, which are less phosphorylated than the slower ones, and the total amount increases about 2-fold (Figure 3D, compare 0nM Tet lanes).

      The altered p65/101-pT70 ratio is also likely to reflect the leaky PP1-Neurabin expression, since the relative intensities of the various phosphorylated species are dependent on both the relative rates of phosphorylation and dephosphorylation. Expression of a phosphatase would therefore be expected to differentially affect the phosphorlyation levels of different sites according to their reactivity.

      (7) Figure 3E: Does inhibiting mTORC further reduce translation when PP1-Neurabin is expressed? If this is the case, this might suggest that they might not necessarily be mTORC inhibitors?

      We have not done this experiment. Since Rapamycin cannot be guaranteed to completely block 4E-BP1 phosphorylation, and PP1-Neurabin cannot be guaranteed to completely dephosphorylate 4E-BP1, any further reduction upon their combination would be hard to interpret.

      (8) Substrate interactions with the remodelled PP1 hydrophobic groove do not affect PP1-Neurabin specificity. Is there evidence that PP1-Neurabin remodels the hydrophobic groove? Is it not possible that Neurabin does not remodel the PP1 groove to begin with and hence there is no effect observed with the various mutants? If this is not the case, it should be explained in a bit more detail.

      Comparison of the Neurabin/PP1 and Phactr1/PP1 structures shows that the hydrophobic groove is remodelled differently in the two complexes. Now shown in new Figure 1-S2B,C,G.

      (9) Figure 5B has a lot of interesting information, which I believe has not been discussed at all in the results section.

      To help interpretation of the enzymology in Figure 5 we have renamed 4E-BP1WT and IRSp53WT to 4E-BP1PBM and IRSp53PBM respectively, emphasising the inclusion of the wildtype or mutated PBM from 4E-BP1 on these peptides. Text in Results, “PDZ domain interaction…”, paragraph 1, and Figures 5 and S5 revised accordingly.

      Why does the 4E-BP1Mut affect catalytic efficiency of PP1 alone when compared with WT, while no difference is observed with IRSp53WT and mutant?

      We do not understand the basis for the differential reactivity of 4E-BP1PBM and 4E-BP1MUT with PP1 alone; we suspect that it reflects the hydrophobicity change resulting from the MDI -> SGS substitution. However this is unlikely to be biologically significant as PP1 is sequestered in PIP-PP1 complexes.

      Importantly, the two PP1 fusion proteins behave consistently in this assay – the presence of the intact PBM increases reactivity with PP1-Neurabin, but has no effect on dephosphorylation by PP1-Phactr1.

      Why does PP1 alone not have a difference between IRSp53WT and mutant, while PP1-Neurabin does have a difference?

      This is due to the presence of the PBM in IRSp53WT (now renamed IRSp53PBM), which affects increases affinity for PP1 Neurabin, but not PP1 alone. Likewise, PP1-Phactr1, which does not possess a PDZ domain, is also unaffected by the integrity of the PBM.

      (7) “Strikingly, alanine substitutions at +1 and +2 in 4E-BP1WT increased catalytic efficiency by both fusions, perhaps reflecting changes at the catalytic site itself (Figure 5E, Figure S5E)”. This could be expanded upon, because this suggests a mechanism that makes the substrate refractory to PDZ/hydrophobic groove remodelling?

      We favour the idea that this reflects a requirement to balance dephosphorylation rates between the multiple 4E-BP1 phosphorylation sites, especially if multiple rounds of dephosphorylation occur for each PBM—PDZ interaction. Additional sentences added in Discussion paragraph 7.

      (8) Typographical errors and minor comments:

      a) PIPs can target PP1 to specific subcellular locations, and control substrate specificity through autonomous substrate-binding domains, occupation or extension of the substrate grooves, or modification of PP1 surface electrostatics.

      b) Phosphophorylation side site abundances within triplicate samples from the same cell line were comparable between replicates (Figure 2B).

      c) While the alanine substitutions had little effect, conversion of +4 to +6 to the IRSp534E-BP1 sequence LLD increased catalytic efficiency some 20-fold (Figure 5C, Figure S5C). 

      d) Figure 3E labels are not clear. The graph can be widened to make the labels of the conditions clearer.

      All corrected

      Reviewer #3 (Recommendations for the authors):

      This was a very well-written manuscript.

      However, I was looking for a summary mechanistic figure or cartoon to help me navigate the results.

      I noted a few typos in the text.

      New summary Figure 5-S2 added, cited in results, and discussed in Discussion paragraph 6,7.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      This article presents a meta-analysis that challenges established abundance-occupancy relationships (AORs) by utilizing the largest known bird observation database. The analysis yields contentious outcomes, raising the question of whether these findings could potentially refute AORs.

      We thank the Reviewer for their positive comments.

      Strengths:

      The study employed an extensive aggregation of datasets to date to scrutinize the abundance-occupancy relationships (AORs).

      We thank the Reviewer for their positive comments.

      Weaknesses:

      While the dataset employed in this research holds promise, a rigorous justification of the core assumptions underpinning the analytical framework is inadequate. The authors should thoroughly address the correlation between checklist data and global range data, ensuring that the foundational assumptions and potential confounding factors are explicitly examined and articulated within the study's context.

      We thank the Reviewer for these comments. We agree that more justification and transparency is needed of the core assumptions that form the foundation of our methods. In our revised version, we have taken the following steps to achieve this:

      - Altered the title to be more explicit about the core assumptions, which now reads: “Local-scale relative abundance is decoupled from global range size”

      - We have added more details on why and how we treat global range size as a measure of ‘occupancy.’

      - We have added a section that discusses the limitations of using eBird relative abundance

      Reviewer #2 (Public Review):

      Summary:

      The goal is to ask if common species when studied across their range tend to have larger ranges in total. To do this the authors examined a very large citizen science database which gives estimates of numbers, and correlated that with the total range size, available from Birdlife. The average correlation is positive but close to zero, and the distribution around zero is also narrow, leading to the conclusion that, even if applicable in some cases, there is no evidence for consistent trends in one or other direction.

      We thank the Reviewer for these comments.

      Strengths:

      The study raises a dormant question, with a large dataset.

      We thank the Reviewer for these comments. We intended to take a longstanding question and attempt to apply novel datasets that were not available mere decades ago. While we do not imply that we have ‘solved’ the question, we hope this work highlights the potential for further interrogation using these large datasets.

      Weaknesses:

      This study combines information from across the whole world, with many different habitats, taxa, and observations, which surely leads to a quite heterogeneous collection.

      We agree that there is a heterogeneous collection of data across many habitats, taxa, and observations. However, rather than as a weakness, we see this as a significant strength. Our work assumes we are averaging over this variability to assess for a large-scale pattern in the relationship - something that was potentially a limitation of previous work, as these large datasets were often focused on particular contexts (e.g., much work focused solely on the UK), which we believe could limit some of the generalizability of the previous work. However, the reviewer makes a fair point in regard to the heterogeneity of data collection. We have now added some text in the discussion which is explicit about this - see the new section named “Potential limitations of current work and future work –-although our findings challenge some long-held assumptions about the consistency of the abundance-occupancy relationship, our work only deals with interspecific AORs among birds, synthesizing observations of potentially heterogeneous locations, context and quality”.

      First, scale. Many of the earlier analyses were within smaller areas, and for example, ranges are not obviously bounded by a physical barrier. I assume this study is only looking at breeding ranges; that should be stated, as 40% of all bird species migrate, and winter limitation of populations is important. Also are abundances only breeding abundances or are they measured through the year? Are alien distributions removed?

      Second, consider various reasons why abundance and range size may be correlated (sometimes positively and sometimes negatively) at large scales. Combining studies across such a large diversity of ecological situations seems to create many possibilities to miss interesting patterns. For example:

      (1) Islands are small and often show density release.

      See comment below.

      (2) North temperate regions have large ranges (Rapoport's rule) and higher population sizes than the tropics.

      See comment below.

      (3) Body size correlates with global range size (I am unsure if this has recently been tested but is present in older papers) and with density. For example, cosmopolitan species (barn owl, osprey, peregrine) are relatively large and relatively rare.

      See comment below.

      (4) In the consideration of alien species, it certainly looks to me as if the law is followed, with pigeon, starling, and sparrow both common and widely distributed. I guess one needs to make some sort of statement about anthropogenic influences, given the dramatic changes in both populations and environments over the past 50 years.

      See comment below. We also added a sentence in the methods that highlighted we did not remove alien ranges and provided reasons why. Still, we do acknowledge the dramatic changes in populations and environments over the past 50 years (see the new section  “Potential limitations of current work and futur work”)

      (5) Wing shape correlates with ecological niche and range size (e.g. White, American Naturalist). Aerial foraging species with pointed wings are likely to be easily detected, and several have large ranges reflecting dispersal (e.g. barn swallow).

      We agree that all of the points above are interesting data explorations. As said above, our main purpose was to highlight the potential for further interrogation using these large datasets. However, we have added some additional text in the discussion that explicitly mentions/encourages these additional data explorations. We hope people will pick up on the potential for these data and explore them further.

      Third, biases. I am not conversant with ebird methodology, but the number appearing on checklists seems a very poor estimate of local abundance. As noted in the paper, common species may be underestimated in their abundance. Flocking species must generate large numbers, skulking species few. The survey is often likely to be in areas favorable to some species and not others. The alternative approach in the paper comes from an earlier study, based on ebird but then creating densities within grids and surely comes with similar issues.

      We agree that if we were interested in the absolute abundance of a given species, the local number on an eBird checklist would be a poor representation. However, our study aims not to estimate absolute abundance but to examine relative abundance among species on each checklist. By focusing on relative abundance, we leverage eBird data's strengths in detecting the presence and frequency of species across diverse locations and times, thereby capturing community composition trends that can provide meaningful insights despite individual checklist biases. This approach allows us to assess the comparative prominence of species in the community as reported by the observer, providing a consistent metric of relative abundance. Despite detectability biases, the structure of eBird checklists reflects the observer’s encounter rates with each species under similar conditions, offering a valuable snapshot of relative species composition across sites and times. The key to our assumption is that these biases discussed are not directional and, therefore, random throughout the sampling process, which would translate to no ‘real’ bias in our effect size of interest.

      Range biases are also present. Notably, tropical mountain-occupying species have range sizes overestimated because holes in the range are not generally accounted for (Ocampo-Peñuela et al., Nature Communications). These species are often quite rare, too.

      We thanks the reviewer for pointing to this issue and reference. We included a discussion on these biases in our limitations section and reference Ocampo-Peñuela et al. to emphasize the need for improved spatial resolution in range data for more accurate AOR assessments.”More precise range-size estimates would also improve the accuracy of AOR assessments, since species range data are often overestimated due to the failure to capture gaps in actual distributions ”

      Fourth, random error. Random error in ebird assessments is likely to be large, with differences among observers, seasons, days, and weather (e.g. Callaghan et al. 2021, PNAS). Range sizes also come with many errors, which is why occupancy is usually seen as the more appropriate measure.

      If we consider both range and abundance measurements to be subject to random error in any one species list, then the removal of all these errors will surely increase the correlation for that list (the covariance shouldn't change but the variances will decrease). I think (but am not sure) that this will affect the mean correlation because more of the positive correlations appear 'real' given the overall mean is positive. It will definitely affect the variance of the correlations; the low variance is one of the main points in the paper. A high variance would point to the operation of multiple mechanisms, some perhaps producing negative correlations (Blackburn et al. 2006).

      We agree random errors can affect estimates, but as we wrote above, random errors, regardless of magnitudes, would not bias estimates. After accounting for sampling error (a part of random errors), little variance is left to be explained as we have shown in the MS. This suggests that many of the random errors were part of the sampling errors. And this is where meta-analysis really shines.

      On P.80 it is stated: "Specifically, we can quantify how AOR will change in relation to increases in species richness and sampling duration, both of which are predicted to reduce the magnitude of AORs" I haven't checked the references that make this statement, but intuitively the opposite is expected? More species and longer durations should both increase the accuracy of the estimate, so removing them introduces more error? Perhaps dividing by an uncertain estimate introduces more error anyway. At any rate, the authors should explain the quoted statement in this paper.

      It would be of considerable interest to look at the extreme negative and extreme positive correlations: do they make any biological sense?

      Extremely high correlations would not make any biological sense if these observations were based on large sample sizes. However, as shown in Figure 2, all extreme correlations come from small sample sizes (i.e., low precision), as sampling theory expects (actually our Fig 2 a text-book example of the funnel shape). Therefore, we do not need to invoke any biological explanations here.

      Discussion:

      I can see how publication bias can affect meta-analyses (addressed in the Gaston et al. 2006 paper) but less easily see how confirmation bias can. It seems to me that some of the points made above must explain the difference between this study and Blackburn et al. 2006's strong result.

      We agree. Now, we extended an explanation of why confirmation bias could result in positive AOR. Yet, we point out confirmation bias is a very common phenomena which we cite relevant citations in the original MS. The only way to avoid confirmation bias is to conduct a study blind but this is not often possible in ecological work.

      “Meta-research on behavioural ecology identified 79 studies on nestmate recognition, 23 of which were conducted blind. Non-blind studies confirmed a hypothesis of no aggression towards nestmates nearly three times more often. It is possible that confirmation bias was at play in earlier AOR studies.”

      Certainly, AOR really does seem to be present in at least some cases (e.g. British breeding birds) and a discussion of individual cases would be valuable. Previous studies have also noted that there are at least some negative and some non-significant associations, and understanding the underlying causes is of great interest (e.g. Kotiaho et al. Biology Letters).

      We agree. And yes, we pointed out these in our introduction.

      Reviewer #3 (Public Review):

      Summary:

      This paper claims to overturn the longstanding abundance occupancy relationship.

      Strengths:

      (1) The above would be important if true.

      (2) The dataset is large.

      We have clarified this point by changing the title to emphasize that we do not suggest overturning AORs entirely but instead provide a refined view of the relationship at a global scale. Our results suggest a weaker and more context-dependent AOR than previously documented. We hope our revised title and additional clarifications in the text convey our intent to contribute to a more nuanced understanding rather than a whole overturning of the AOR framework.

      Weaknesses:

      (1) The authors are not really measuring the abundance-occupancy relationship (AOR). They are measuring abundance-range size. The AOR typically measures patches in a metapopulation, i.e. at a local scale. Range size is not an interchangeable notion with local occupancy.

      We have refined this in our revision to be more explicitly focused on global range size. However, we note that the classic paper by Bock and Richlefs (1983, Am Nat) also refers to global (species entire) range size in the context of the AOR. Importantly, Bock and Richlefs pointed out the importance of using species’ entire ranges; without such uses, there will be sampling artifacts creating positive AORs when using arbitrary geographical ranges, which were used in some studies of AORs. So we highlight that our work is well in line with the previous work, allowing us to question the longstanding macroecological work. One of the issues of AOR has been how to define occupancy and global range size, which provides a relatively ambiguous measure, which is why we used this measure.

      (2) Ebird is a poor dataset for this. The sampling unit is non-standard. So abundance can at best be estimated by controlling for sampling effort. Comparisons across space are also likely to be highly heterogenous. They also threw out checklists in which abundances were too high to be estimated (reported as "X"). As evidence of the biases in using eBird for this pattern, the North American Breeding Bird Survey, a very similar taxonomic and geographic scope but with a consistent sampling protocol across space does show clear support for the AOR.

      Yes, we agree the sampling unit is non-standard. However, this is a significant strength in that it samples across much heterogeneity (as discussed in response to Reviewer 2, above). We were interested in relative abundance and not direct absolute abundance per se, which is accurate, especially since we did control for sampling effort.

      We appreciate the reviewer’s attention to our data selection criteria. We excluded checklists containing ‘X’ entries to minimize biases in our abundance estimates. The 'X' notation is often used for the most common species, reflecting the observer's identification of presence without specifying a count. This approach was chosen to avoid disproportionately inflating presence data for these abundant species, which could distort the relative abundance calculations in our analysis. By excluding such checklists, we aimed to retain consistency and ensure that local abundance estimates were representative across all species on each checklist. We have revised our manuscript to clarify this methodological choice and hope this explanation addresses the reviewer’s concern. We modified our text in the methods to make the entries ‘X’ clearer (see the Method section).

      (3) In general, I wonder if a pattern demonstrated in thousands of data sets can be overturned by findings in one data set. It may be a big dataset but any biases in the dataset are repeated across all of those observations.

      Overturning a major conclusion requires careful work. This paper did not rise to this level.

      We appreciate the reviewer’s caution regarding broad conclusions based on a single dataset, even one as large as eBird. Our intention was not to definitively overturn the abundance-occupancy relationship (AOR) but to re-evaluate it with the most extensive and globally representative dataset currently available. We recognise that potential biases in citizen science data, such as observer variation, may influence our findings, and we have taken steps to address these in our methodology and limitations sections. We see this work as a contribution to an ongoing discourse, suggesting that AOR may be less universally consistent than previously believed, mainly when tested with large-scale citizen science data. We hope this study will encourage additional research that tests AORs using other expansive datasets and approaches, further refining our understanding of this classic macroecological relationship. However, we have left our broad message about instigating credible revolution and also re-examining ecological laws.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) The investigation focuses solely on interspecific relationships among birds; thus, the extrapolation of these conclusions to broader ecological contexts requires further validation.

      We have now added this point to our new section: “Although our findings challenge some long-held assumptions about the consistency of the abundance-occupancy relationship, our work only deals with interspecific AORs among birds, so we hope this work serves as a foundation for further investigations that utilize such comprehensive datasets.”

      (2) The rationale for combining data from eBird - a platform predominantly representing individual observations from urban North America - with the more globally comprehensive BirdLife International database needs to be substantiated. The potential underrepresentation of global abundance in the eBird checklist data could introduce a sampling bias, undermining the foundational premises of AORs.

      We agree with the limitation of ebird sampling coverage, but it should not bias our results. In statistical definitions, bias is directional, and if not directional, it will become statistical noise, making it difficult to detect the signal. In fact, our meta-analyses adjust what statisticians call sampling bias and it is the strength of meta-analysis.

      (3) In the full mixed-effect model, checklist duration and sampling variance (inversely proportional to sample size N) are treated as fixed effects. However, these variables are likely to be negatively correlated, which could introduce multicollinearity, inflating standard errors and diminishing the statistical significance of other factors, such as the intercept. This calls into question the interpretation of insignificance in the results.

      Multicollinearity is an issue with sample sizes. For example, with small datasets, correlations of 0.5 could be an issue, and such an issue would usually show up as a large SE. We do not have such an issue with ~ 17 million data points. Please refer to this paper.

      Freckleton, Robert P. "Dealing with collinearity in behavioural and ecological data: model averaging and the problems of measurement error." Behavioral Ecology and Sociobiology 65 (2011): 91-101.

      (4) The observed low heterogeneity may stem from discrepancies in sampling for abundance versus occupancy, compounded by uncertainties in reporting behavior.

      If we assume everybody underreports common species or overreports rare species, this could happen. However, such an assumption is unlikely. If some people report accurately (but not others), we should see high heterogeneity, which we do not observe).  We have touched upon this point in our original MS.

      (5) The contribution and implementation of phylogenetic comparative analysis remain ambiguous and were not sufficiently clarified within the study.

      We need to add more explanation for the global abundance analysis

      “To statistically test whether there was an effect of abundance and occupancy at the macro-scale, we used phylogenetic comparative analysis.  This analysis also addresses the issue of positive interspecific AORs potentially arising from not accounting for phylogenetic relatedness among species examined ”

      (6) The use of large N checklists could skew the perceived rarity or commonality of species, potentially diminishing the positive correlation observed in AORs. A consistent observer effect could lead to a near-zero effect with high precision.

      Regardless of the number of N species in checklists (seen in Fig 2), correlations are distributed around zero. This means there is nothing special about large N checklists. 

      (7) The study should acknowledge and discuss any discrepancies or deviations from previous literature or expected outcomes.

      We felt we had already done this as we discussed the previous meta-analysis and what we expected from this meta-analysis.  Nevertheless, we have added some relevant sentences in the new version of MS.

      In addition to these major points, there are several minor concerns:

      (1) Figure 2B lacks discussion, and the metric for the number of observations is not clarified. Furthermore, the labeling of the y-axis appears to be incorrect.

      Thank you very much for pointing out this shortcoming. Now, the y-axis label has been fixed and we mention 2B in the main text.

      (2) The study should provide a clear, mathematical expression of the multilevel random effect models for greater transparency.

      Many thanks for this point, and now we have added relevant mathematical expressions in Table S6.

      (3) On Line 260, the term "number of species" should be refined to "number of species in a checklist," ideally represented by a formula for precision.

      This ambiguity has been mended as suggested.

      Please provide the data and R code linked to the outputs.

      The referee must have missed the link (https://github.com/itchyshin/AORs) in our original MS. In addition to our GitHub repository link, we now have added a link to our Zenodo repository (https://doi.org/10.5281/zenodo.14019900).

      Reviewer #3 (Recommendations For The Authors):

      The authors cite Rabinowitz's 7 forms of rarity paper as a suggestion that previous findings also break the AOR. In fact empirical studies of the 7 forms of rarity typically find that all three forms of rareness vs commonness are heavily correlated (e.g. Yu & Dobson 2000).

      We thank the reviewer for drawing attention to Yu & Dobson (2000) and similar studies that find positive correlations among the axes of rarity. Ref 3 is correct in that Rabinowitz’s (1981) framework does not require that local abundance and geographic range size be uncorrelated for every species; instead, it highlights conceptual scenarios where a species may be common locally yet have a restricted distribution (or vice versa).

      Empirical analyses such as Yu & Dobson (2000) show that, on average, these axes can be correlated, which may align with conventional AOR findings in some taxonomic groups. However, Rabinowitz’s key insight was that exceptions do occur, so these exceptions demonstrate that strong positive AORs may not be universally applicable. Our results do not claim that Rabinowitz’s framework “breaks” the AOR outright; instead, we use it to underscore that local abundance can, in principle, be “decoupled” from global occupancy.  Whether the correlation found by Yu & Dobson (2000) implies a positive AOR, requires a detailed simulation study, which is an interesting avenue for future research. 

      Thus, citing Rabinowitz serves to highlight the potential heterogeneity and complexity of abundance–occupancy relationships rather than to refute every positive correlation reported in the literature. Our findings suggest that when examined at large spatiotemporal scales (with unbiased sampling), the overall AOR signal may be less robust than traditionally believed. This is consistent with Rabinowitz’s view that local abundance and global range can vary along independent axes. Now we added

      “Although studies using her framework found positive correlations between species range and local abundance.”

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript assesses the utility of spatial image correlation spectroscopy (ICS) for measuring physiological responses to DNA damage. ICS is a long-established (~1993) method, similar to fluorescence correlation spectroscopy, for deriving information about the fluorophore density that underlies the intensity distributions of images.

      The revisions to the current manuscript have improved the understanding of the strengths and limitations of the spatial ICS method. In particular, since the measurements are obtaining complementary information to traditional focus counting, one does not expect a simple linear relationship between the quantities obtained by ICS and by immunostaining. The explanations are satisfactory to me and, I expect, to the interested reader.

      Additionally, I am satisfied with the code availability now that it is placed on Github.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      This manuscript assesses the utility of spatial image correlation spectroscopy (ICS) for measuring physiological responses to DNA damage. ICS is a long-established (~1993) method similar to fluorescence correlation spectroscopy, for deriving information about the fluorophore density that underlies the intensity distributions of images. The authors first provide a technical but fairly accessible background to the theory of ICS, then compare it with traditional spot-counting methods for its ability to analyze the characteristics of γH2AX staining. Based on the degree of aggregation (DA) value, the authors then survey other markers of DNA damage and uncover some novel findings, such as that RPA aggregation inversely tracks the sensitivity to PARP inhibitors of different cell lines.

      The need for a more objective and standardized tool for analyzing DNA damage has long been felt in the field and the authors argue convincingly for this. The data in the manuscript are in general well-supported and of high quality, and show promise of being a robust alternative to traditional focus counting. However, there are a number of areas where I would suggest further controls and explanations to strengthen the authors' case for the robustness of their ICS method.

      Strengths:

      The spatial ICS method the authors describe and demonstrate is easy to perform and applicable to a wide variety of images. The DDR was well-chosen as an arena to showcase its utility due to its well-characterized dose-responsiveness and known variability between cell types. Their method should be readily useable by any cell biologist wanting to assess the degree of aggregation of fluorescent tags of interest.

      Weaknesses:

      The spatial ICS method, though of longstanding history, is not as intuitive or well-known as spot-based quantitation. While the Theory section gives a standard mathematical introduction, it is not as accessible as it could be. Additionally, the values of TNoP and DA shown in the Results are not discussed sufficiently with regard to their physical and physiological interpretation.

      We agree that a major limitation in adaption of this approach is a deeper understanding of the theory and results. We have updated the theory section to include further discussion (Page 4 line 132)

      The correlation of TNoP with γH2AX foci is high (Figure 2) and suggestive that the ICS method is suitable for measuring the strength of the DDR. The authors correctly mention that the number of spots found using traditional means can vary based on the parameters used for spot detection. They contrast this with their ICS detection method; however, the actual robustness of spatial ICS is not given equal consideration.

      We found it difficult to give equal consideration of robustness to ICS. The major limitation of traditional approaches is proper selection of an intensity threshold that is necessary to define and separate foci from background intensity. However, ICS does not employ a threshold, therefore we could not test different thresholding applications in ICS as we did with traditional methods. In our view the absence of the need for a threshold is profoundly advantageous. The only inputs we employ in the ICS analysis are used to segment cell nuclei, yet these have no impact on the ICS calculation and are necessary for any analysis of the DDR.

      Reviewer #2 (Public review):

      Summary:

      Immunostaining of chromatin-associated proteins and visualization of these factors through fluorescence microscopy is a powerful technique to study molecular processes such as DNA damage and repair, their timing, and their genetic dependencies. Nonetheless, it is well-established that this methodology (sometimes called "foci-ology") is subject to biases introduced during sample preparation, immunostaining, foci visualization, and scoring. This manuscript addresses several of the shortcomings associated with immunostaining by using image correlation spectroscopy (ICS) to quantify the recruitment of several DNA damage response-associated proteins following various types of DNA damage.

      The study compares automated foci counting and fluorescence intensity to image correlation spectroscopy degree of aggregation study the recruitment of DNA repair proteins to chromatin following DNA damage. After validating image correlation spectroscopy as a reliable method to visualize the recruitment of γH2AX to chromatin following DNA damage in two separate cell lines, the study demonstrates that this new method can also be used to quantify RPA1 and Rad51 recruitment to chromatin following DNA damage. The study further shows that RPA1 signal as measured by this method correlates with cell sensitivity to Olaparib, a widely-used PARP inhibitor.

      Strengths:

      Multiple proof-of-concept experiments demonstrate that using image correlation spectroscopy degree of aggregation is typically more sensitive than foci counting or foci intensity as a measure of recruitment of a protein of interest to a site of DNA damage. The sensitivity of the SKOV3 and OVCA429 cell lines to MMS and the PARP inhibitors Olaparib and Veliparib as measured by cell viability in response to increasing amounts of each compound is a valuable correlate to the image correlation spectroscopy degree of aggregation measurements.

      Weaknesses:

      The subjectivity of foci counting has been well-recognized in the DNA repair field, and thus foci counts are usually interpreted relative to a set of technical and biological controls and across a meaningful time period. As such:

      (1) A more detailed description of the numerous prior studies examining the immunostaining of proteins such as γH2AX, RAD51, and RPA is needed to give context to the findings presented herein.

      We apologize for not providing enough detail. We have added further references and discussion. γH2AX foci counting, in particular, has been used in thousands of previous studies. (Pages 18 line 513 and 517)

      (2) The benefits of adopting image correlation spectroscopy should be discussed in comparison to other methods, such as super-resolution microscopy, which may also offer enhanced sensitivity over traditional microscopy.

      Thank you for raising this point. We have added this discussion (page 19 line 553). The limiting factor that ICS addresses is the partition coefficient of signal in a foci or cluster versus outside the cluster. Super-resolution will not necessarily improve this unless it is resolved down to single molecule counting. However, one would still need to evaluate how to define a cluster or foci in the background of non-cluster distribution.

      (3) Additional controls demonstrating the specificity of their antibodies to detection of the proteins of interest should be added, or the appropriate citations validating these antibodies included.

      We have added text stating that we only use validated antibodies (page 6 line 193). One thing to note is that we are measuring differences between treatment conditions, thus, if an antibody has non-specific labeling of proteins of cellular structures that do not change upon treatment, our approach would overcome this limitation.

      Reviewer #3 (Public review):

      Summary:

      This paper described a new tool called "Image Correlation Spectroscopy; ICS) to detect clustering fluorescence signals such as foci in the nucleus (or any other cellular structures). The authors compared ICS DA (degree of aggregation) data with Imaris Spots data (and ImageJ Find Maxima data) and found a comparable result between the two analyses and that the ICS sometimes produced a better quantification than the Imaris. Moreover, the authors extended the application of ICS to detect cell-cycle stages by analyzing the DAPI image of cells. This is a useful tool without the subjective bias of researchers and provides novel quantitative values in cell biology.

      Strengths:

      The authors developed a new tool to detect and quantify the aggregates of immunofluorescent signals, which is a center of modern cell biology, such as the fields of DNA damage responses (DDR), including DNA repair. This new method could detect the "invisible" signal in cells without pre-extraction, which could prevent the effect of extracted materials on the pre-assembled ensembles, a target for the detection. This would be an alternative method for the quantification of fluorescent signals relative to conventional methods.

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      Major comments:

      (1) The ICS theory section is essential and based on an excellent review from one of the authors. It would benefit greatly from a diagram showing where the quantities 𝒈(𝟎, 𝟎), 𝝎𝟎, and 𝒈inf come from in the 2D Gaussian fit, ideally for two cases where these quantities differ (i.e., how they correspond to different DA or TNoP values). In my opinion, this addition would greatly increase the manuscript's accessibility for DDR researchers. The citation of the review at the beginning would also be a plus.

      We have added the review citation at the front of the theory section (page 3 line 87).We have highlighted where g(0,0), the most critical measurement for determination of TNoP and DA, derives from in Figure 2D. However, it is difficult to describe all the curve fit parameters in an image as they have some interdependency on each other and thus labeling one in a single image would not independently capture how they might be observed in a different curve fit.

      (2) The TNoP measured in Figure 2 is a quantity about 2000-3000 times greater than the number of "traditionally detected" foci by both methods and the linear relations have very low Y intercepts. Can the authors comment explicitly on the physical interpretation of this number - are 2 to 3 thousand independent particles present within each "focus" detected by traditional means? If so, then what might one "particle" correspond to? (a single secondary antibody or fluorophore? a nucleosome?). In a similar vein, the X intercepts lie at around 25 foci, meaning that in images with fewer than that number of foci detected by ImageJ or Imaris, the ICS method should detect zero TNoP - is this in line with the authors' predictions? Is it possible that a first-order line fit is not the most appropriate relation between the two methods?

      We apologize for our brevity here. Since DA proved to be a more useful metric we did not spend much effort discussing TNoP. TNoP correlates to the number of clustered particles, or non-diffuse fluorophores. TNoP is the inverse of the number of individual particles per nucleus, but the value is not a direct measure of foci. If a sample had no clustering at all, the number of individual particles would be at a maximum and the TNoP would be at a minimum. However, as fluorophores cluster, the number of individual particles (i.e. non-clustered fluorophores) decreases, which increases the TNoP value. Therefore, TNoP has a correlation to the number of foci detected through traditional measurements, as we found here. Yet, TNoP is a relative measurement and cannot be compared across different conditions. Similar to foci counting, TNoP is unable to factor the size or intensity of each cluster, thus DA is a more appropriate quantification of the DNA damage response.

      The value of TNoP is dependent on the fitted point spread function and the area of the nucleus. The y=0 intercept of TNoP is defined by the optical setup and is not expected to necessarily go through x=0. Intriguingly, other groups have found that some foci identified through traditional measurements are actually clusters of multiple smaller foci, thus the concept of what a foci represents is difficult to interpret. Thus, here we aimed to show a general correlation of TNoP with foci count through traditional methods to reflect how ICS is similar to foci counting, then employed DA to overcome the limitations of defining a foci.

      We have tried to clarify this in the text (page 8, line 266)

      (3) Some suggestions to address the robustness of ICS:

      For a given sample (i.e. one segmented nucleus), the calculation of DA and TNoP should be similar between different images of that same nucleus taken at different times, similar to how the number of traditionally detected foci would be fairly invariant. In particular, it should be shown that these values are not just scaling with the higher normalized intensity seen in stronger DDR responses. In the same vein, the linear relationship between TNoP and "foci" should not change even if the confocal settings are slightly different (i.e., higher/lower illumination intensity) as long as the condition stipulated by the authors in the Discussion holds ("ICS can be implemented on any fluorescence image as long as the square relative fluorescence intensity fluctuations are detectable above noise fluctuations."). To show, as the title states, that spatial ICS is a robust tool, it would be desirable to demonstrate this with a series of images of the same cell at the same or varying excitation intensities.

      Thank you for your suggestions. Indeed, the calculation will be the same over sequential images of the same cell. Observations of dose dependent DA that does not correlate with intensity for RPA1 and RAD51 results (Fig. S5) directly demonstrates that DA does not just scale with intensity.

      We would not expect the TNoP to change with confocal setting, however we show in Figure 1 that the number of foci does indeed change with intensity settings as captured by thresholds. Therefore, any interpretation of TNoP vs. foci count would be very difficult to make at different microscope settings. To ensure we are fairly comparing ICS to existing analysis we keep the settings the same and measure changes between conditions.

      (4) More information is needed on how intensity normalization was performed. The Methods states "Measurements across experiments were normalized by the control in each dataset." The DMSO (0mM drug) plots all appear to have a mean of 1.0, so it appears the values for each set of control nuclei were divided by their own mean, and then the values for each set of experimental nuclei were divided by the mean value of all 3 controls as an aggregate; is this correct?

      We apologize for not being more clear. Thank you for raising this point. We normalized data to a control from each experimental group. Thus, in figures 3,4 and 5 data were collected over multiple experiments with one control per experiment and each treatment condition included in each experiment. Therefore, we normalized each result to the corresponding control from that imaging session. However, in Figure 8 we ran experiments at much higher throughput with multiple controls per experiment, thus the data were normalized to the overall average of the controls, which is why the control averages are not all at a value of 1. We have clarified this in the text. (Page 7 line 218).

      (5) Some more information about the ICS analysis should be given if the full code is not provided - in particular, how the nucleus mask was implemented on the "signal" channel (were the edges abruptly set to zero or was a window function introduced to avoid edge effects in the discrete FFT?

      Thank you for raising this point. We have added the code to GitHub - github.com/ dubachLab/ics. The signal region was established by simply applying the nuclear mask from the DAPI channel to the IF channel. Each region is padded with average intensity value at the edges for 2x the dimensions of the ROI to remove edge effects in the FFT.

      Minor comments:

      (1) Figure 3, 4, 5: I think it would aid figure readability if channels were labeled in the images themselves, not just in the legend.

      Thank you for the suggestion, we tried doing this and struggle to fit a label with the layout of the images. We were also concerned about interpretation of data in each column and the potential to assign data to each figure if they were so prominently labeled.

      (2) Supplemental Figures are mislabeled; the order given in the legends is S1, S2, S3, S2, S3. S4 is called out in the main text where it should be S5.

      Thank you for catching this error. We have made the necessary corrections. S4 contains data on cellular response to the drugs, while S5 contains intensity data in response to MMS.

      (3) It should be stated for each Figure what kind of microscopy was performed - I assume that it is confocal for everything except when widefield is explicitly stated, but for clarity please add this information.

      Indeed, this is correct, we have indicated which microscopy was used for each figure.

      (4) The MATLAB code and full (uncropped) Western blots should be provided as supplemental data if possible.

      We have included a GitHub link for the code and un-cropped western blots.

      (5) The p values from significance tests should indicate whether multiple comparisons correction was necessary (if suggested by Prism) and performed.

      Apologies for a lack of clarity but this was not necessary, significance was calculated vs. the next lower dose (e.g. 10 micromolar vs. 1 micromolar). We have clarified this in the methods (page 7 line 221).

      Reviewer #2 (Recommendations for the authors):

      Major points:

      In addition to the weaknesses noted above, to encourage widespread adoption of this method, the authors should make the tools that they used for their analysis publicly available. In a few instances (e.g., compare Figures 3J and 3L), other methods outperform DA. It would be meaningful to discuss when especially DA may be a better measure than others (such as intensity or number of foci).

      We have made code available on Github. We expect results, such as those in Figures 3J and 3L where intensity is significantly higher at the highest concentration but DA is not are reflective of the underlying biology and this may be interpreted differently under different experimental conditions. Imaris spots (Fig. 3K) also does not capture a significant increase at the highest dose of olaparib, suggesting that intensity may raise but it doesn’t not generate more foci. These results are likely highly dependent on the mechanism of olaparib at such a high concentration and the DDR response. We are hesitant to draw biological conclusions from these results and instead would like to highlight the capacity of ICS to evaluate the DDR, therefore we don’t want to make any broad comments about different applications.

      Minor points:

      (1) Pg. 12: "We used MMS to induce DNA damage in SKOV3 and OVCA429 cells. As expected, normalized intensity for RPA1 and RAD51 values (Figure S5) did not display a dose dependence on MMS concentration."

      Please provide a citation for the claim that RPA1 and RAD51 normalized intensities do not display a dose dependence on MMS concentration.

      These were data that we generated. We were not expecting an intensity change as that would presumably require increased protein generation in response to MMS, compared to gH2AX where the phospho-specific H2AX is generated in the DDR.

      (2) Pg. 12: "Similar to RPA1, RAD51 does not form distinguishable foci in the nuclei in cells without preextraction (Fig. 5)." Please provide a citation for this claim.

      We did not do pre-extraction and our results don’t produce changes in distinguishable foci. We provided citations discussing how, without pre extraction, foci formation for these proteins is not obvious (REF 38 and 39).

      (3) I noted that the authors cite one paper [38] apparently showing that RPA and Rad51 do not always form foci, however, this is in the C. elegans germline in response to micro irradiation, therefore I am not sure that it is applicable to human cells.

      We apologize for referencing a paper on C elegans. Most papers looking at RPA and RAD51 in the DDR use pre-extraction as it seems necessary to observe foci. Therefore, there are not as many papers, that we could find, that do not use pre-extraction. Reference 39 is in Hela cells.

      Reviewer #3 (Recommendations for the authors):

      Major points:

      (1) Page 8, the second paragraph: In the Result section, it is better to describe how the authors carried out immuno-staining (without pre-extract subtraction) and ICS briefly, although the method is described in detail in the Method section.

      Thank you for the suggestion, we have added this description (page 8, line 259)

      (2) In Figure 5K-P: The authors analyzed "invisible" RAD51 foci on the image (Fig. 5L, M, O, and P) without pre-extraction. As a control experiment, it is useful to check whether pre-extraction would provide "visible" RAD51 foci and to examine the similar MMS concentration dependency shown in Figure 5R (or 5T). This would strengthen the power of the ICS analysis.

      Thank you for the suggestion. In our hands, pre-extraction is extremely subjective. We have tried performing pre-extraction but find highly variable results depending on conditions. Therefore, we did not include any pre-extraction here. We expect that performing these experiments may or may not agree with results in Figure 5 largely because we are unable to achieve repeatable pre-extraction foci counting.

      (3) Figure 6D (and 6C) looks very interesting. It would be important to show the interpretation of this correlation shown in the graph. Although the authors argued that ICS analysis results shown in the graph could provide new insight into the DDR (page 14, last line 5), as shown in another part, it is important to carry out the same analysis by using Imaris Spots. Moreover, it is interesting to apply the analysis to RAD51 foci (shown in Figure 5), given that the PARPi effect is enhanced in the absence of RAD51mediated recombination.

      We completely agree that this analysis may generate interesting results to help interpret the DDR response to PARP inhibition. These experiments are part of an ongoing follow up study where we extend the use of ICS to other parts of the DDR and investigate protein clustering across several proteins with impact on PARPi response. Therefore, since the focus of this manuscript is introducing ICS as a tool to study the DDR, we believe that omitting those data here does not deter from the central points of the manuscript. We including results in Figure 6 because we wanted to show how ICS could impact DDR research. Furthermore, combined with our advances shown in Figures 7 and 8, we are currently working on adapting ICS to be high-throughput and much simpler than Imaris spots for handling large datasets needed to generate results like those in Figure 6.

      Minor points:

      (1) Figure 1I, blue arrows: These showed an area with a higher background. Because of a low magnification, it is very hard to see the difference from the other areas of the background. It is better to show a magnified image of the representative region with a higher background.

      We hope that readers can see the higher intensity in the diffuse area. We attempted to construct a zoomed in area, but that either blocked a significant portion of the nonzoomed image or added complexity to the figure. We have noted that images in Figure S1 are larger and more obviously capture an increase in background intensity.

      (2) Figure 2 legend, line 5, the same as "A)": This should be "B".

      Here, the number of independent particle clusters is intended to be the same as A, the difference is that the independent particles are clusters in C and individual fluorophores in A.

      (3) Page 9, the first paragraph, last line, foci formation, and foci composition: These should be "focus formation and focus composition".

      We have changed this.

      (4) Page 15, the first paragraph, line 5, palbociclib, camptothecin, or etoposide: please explain what kinds of the drugs are.

      We have added that these drugs cause cells to stall at different cell cycle stages. Explaining the drugs would take considerable room in the text.

      (5) Page 16, the first paragraph, line 1, bleomycin: Please explain what this drug is.

      Similar to above, we have stated that this drug causes DNA damage, going into detail would take several sentences.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public reviews:

      Reviewer #1:

      (1) This manuscript introduces a useful curation pipeline of antibody-antigen structures downloaded from the PDB database. The antibody-antigen structures are presented in a new database called AACDB, alongside annotations that were either corrected from those present in the PDB database or added de-novo with a solid methodology. Sequences, structures, and annotations can be very easily downloaded from the AACDB website, speeding up the development of structure-based algorithms and analysis pipelines to characterize antibody-antigen interactions. However, AACDB is missing some key annotations that would greatly enhance its usefulness.

      Here are detailed comments regarding the three strengths above:

      I think potentially the most significant contribution of this database is the manual data curation to fix errors present in the PDB entries, by cross-referencing with the literature. However, as a reviewer, validating the extent and the impact of these corrections is hard, since the authors only provided a few anecdotal examples in their manuscript.

      I have personally verified some of the examples presented by the authors and found that SAbDab appears to fix the mistakes related to the misidentification of antibody chains, but not other annotations.

      (a) "the species of the antibody in 7WRL was incorrectly labeled as "SARS coronavirus B012" in both PDB and SabDab" → I have verified the mistake and fix, and that SAbDab does not fix is, just uses the pdb annotation.

      (b) "1NSN, the resolution should be 2.9 , but it was incorrectly labeled as 2.8" → I have verified the mistake and fix, and that sabdab does not fix it, just uses the PDB annotation.

      (c) "mislabeling of antibody chains as other proteins (e.g. in 3KS0, the light chain of B2B4 antibody was misnamed as heme domain of flavocytochrome b2)" → SAbDab fixes this as well in this case.

      (d) "misidentification of heavy chains as light chains (e.g. both two chains of antibody were labeled as light chain in 5EBW)" → SAbDab fixes this as well in this case.

      I personally believe the authors should make public the corrections made, and describe the procedures - if systematic - to identify and correct the mistakes. For example, what was the exact procedure (e.g. where were sequences found, how were the sequences aligned, etc.) to find mutations? Was the procedure run on every entry?

      We appreciate the reviewer’s valuable feedback. Our correction procedures combined manual curation with systematic sequence analysis. While most metadata discrepancies were resolved through cross-referencing original literature, we implemented a structured approach for identifying mutations in specific cases. For PDB entries labeled as variants (e.g., "Bevacizumab mutant" or "Ipilimumab variant Ipi.106") where the "Mutation(s)" field was annotated as "NO," we retrieved the canonical therapeutic antibody sequence from Thera-SAbDab, then performed pairwise sequence alignment against the PDB entry using BLAST program to identified mutated residues.

      This procedure was not applied to all entries, as mutations are context-dependent. Therapeutic antibodies have well-defined reference sequences, enabling systematic alignment. For antibodies lacking unambiguous wild-type references (e.g., research-grade or non-therapeutic antibodies), mutation annotations were directly inherited from the PDB or literature.

      All corrections have been publicly archived in AACDB. We have added a detailed discussion of this issue in the section “2.3 Metadata” of revised manuscript.

      (2) I believe the splitting of the pdb files is a valuable contribution as it standardizes the distribution of antibody-antigen complexes. Indeed, there is great heterogeneity in how many copies of the same structure are present in the structure uploaded to the PDB, generating potential artifacts for machine learning applications to pick up on. That being said, I have two thoughts both for the authors and the broader community. First, in the case of multiple antibodies binding to different epitopes on the same antigen, one should not ignore the potentially stabilizing effect that the binding of one antibody has on the complex, thereby enabling the binding of the second antibody. In general, I urge the community to think about what is the most appropriate spatial context to consider when modeling the stability of interactions from crystal structure data. Second, and in a similar vein, some antigens occur naturally as homomultimers - e.g. influenza hemagglutinin is a homotrimer. Therefore, to analyze the stability of a full-antigen-antibody structure, I believe it would be necessary to consider the full homo-trimer, whereas, in the current curation of AACDB with the proposed data splitting, only the monomers are present.

      We sincerely appreciate the reviewer’s insightful comments regarding the splitting of PDB files and we appreciate the opportunity to address the reviewer’s thoughtful concerns.

      Firstly, when two antibodies bind to distinct epitopes on the same antigen, we would like to clarify that this scenario can be divided into two cases based on the experimental context: Case1: When two antibodies bind to distinct epitopes on the same antigen, and their complexes are determined in separate structures. For example, SAR650984 (PDB: 4CMH) and daratumumab (PDB: 7DHA) target CD38 at non-overlapping epitopes. These two antibody-antigen complexes were determined independently, and their structures do not influence each other. Case 2 : When the crystal structure contains a ternary complex with two antibodies and an antigen, as in the example of 6OGE discussed in Section 2.2 of our manuscript. After reviewing the original literature, the experiment confirmed that the order of Fab binding does not affect the formation of the ternary complex, and the binding of one antibody does not enhance the binding of the other. This supports the rationale for splitting 6OGE into two separate structures. However, we acknowledge that not all ternary complexes in the PDB provide such detailed experimental descriptions in their original literature. We agree with the reviewer that in some cases, one antibody may stabilize the structure to facilitate the binding of a second antibody. For instance, in 3QUM, the 5D5A5 antibody stabilizes the structure, enabling the binding of the 5D3D11 antibody to human prostate-specific antigen. Such sandwich complexes are indeed valuable for identifying true epitopes and paratopes. Importantly, splitting the structure does not alter the interaction sites.

      Secondly, we fully agree with the reviewer that for antigens that naturally exist as homomultimers (e.g., influenza hemagglutinin as a homotrimer), the full multimeric structure should be considered when analyzing stability. In such cases, users can directly utilize the original PDB structures provided in their multimeric form. Our splitting approach is intended to provide an additional option for cases where monomeric analysis is sufficient or preferred, but it does not preclude the use of the original multimeric structures when necessary.

      (3) I think the manuscript is lacking in justification about the numbers used as cutoffs (1A^2 for change in SASA and 5A for maximum distance for contact) The authors just cite other papers applying these two types of cutoffs, but the underlying physico-chemical reasons are not explicit even in these papers. I think that, if the authors want AACDB to be used globally for benchmarks, they should provide direct sources of explanations of the cutoffs used, or provide multiple cutoffs. Indeed, different cutoffs are often used (e.g. ATOM3D uses 6A instead of 5A to determine contact between a protein and a small molecule https://datasets-benchmarks-proceedings.neurips.cc/paper/2021/hash/c45147dee729311ef5b5c3003946c48f-Abstract-round1.html). I think the authors should provide a figure with statistics pertaining to the interface atoms. I think showing any distribution differences between interface atoms determined according to either strategy (number of atoms, correlation between change in SASA and distance...) would be fundamental to understanding the two strategies. I think other statistics would constitute an enhancement as well (e.g. proportion of heavy vs. light chain residues).

      Some obvious limitations of AACDB in its current form include:

      AACDB only contains entries with protein-based antigens of at most 50 amino acids in length. This excludes non-protein-based antigens, such as carbohydrate- and nucleotide-based, as well as short peptide antigens.

      AACDB does not include annotations of binding affinity, which are present in SAbDab and have been proven useful both for characterizing drivers of antibody-antigen interactions (cite https://www.sciencedirect.com/science/article/pii/S0969212624004362?via%3Dihub) and for benchmarking antigen-specific antibody-design algorithms (cite https://www.biorxiv.org/content/10.1101/2023.12.10.570461v1)).

      We thank the reviewer for raising this critical point about the cutoff values used in AACDB. In the current study, the selection of the threshold value is very objective; the threshold chosen in the manuscript is summarized based on existing literature, and we have provided more literature support in the manuscript. The criteria for defining interacting amino acids in established tools, typically do not set the ΔSASA exceed 1 Å2 and the distance exceed 6 Å. While our manuscript emphasizes widely accepted thresholds for consistency with prior benchmarks, AACDB explicitly provides raw ΔSASA and distance values for all annotated residues. Users can dynamically filter the data from downloaded files by excluding entries exceeding their preferred thresholds (e.g., selecting 5Å instead of 6Å). This ensures adaptability to diverse research needs. In the revised version, we reset the distance threshold to 6 Å and calculated the interacting amino acids in order to give the user a wider range of choices. In the section “3.2 Database browse and search” of revised manuscript, we provide a description of the flexible choice of thresholds for practical use.

      Furthermore, distance and ΔSASA are two distinct metrics for evaluating interactions. Distance directly quantifies spatial proximity between atoms, reflecting physical contacts such as van der Waals interactions or hydrogen bonds, and is ideal for identifying direct spatial adjacency. ΔSASA, on the other hand, measures changes in solvent accessibility of residues during binding, capturing the contribution of buried surfaces to binding free energy. Even for residues not in direct contact, reduced SASA due to conformational changes may indicate indirect functional roles.

      As demonstrated through comparisons on the detailed information pages, the sets of interacting amino acids defined by these two methods differ by only a few residues, with no significant variation in their overall distributions. However, since interaction patterns vary significantly across different complexes, analyzing residue distributions across all structures using both criteria is not feasible.

      We thank the reviewer for highlighting these limitations. AACDB currently focuses on protein-based antigens ≤50 amino acids to prioritize structural consistency, which excludes non-protein antigens and shorter peptides. While affinity annotations are critical for benchmarking antibody design tools, these data were not integrated in this release due to insufficient data verification caused by internal team constraints. We acknowledge these gaps and plan to expand antigen diversity and incorporate affinity metrics in future updates.

      Reviewer #2:

      Summary:

      Antibodies, thanks to their high binding affinity and specificity to cognate protein targets, are increasingly used as research and therapeutic tools. In this work, Zhou et al. have created, curated, and made publicly available a new database of antibody-antigen complexes to support research in the field of antibody modelling, development, and engineering.

      Strengths:

      The authors have performed a manual curation of antibody-antigen complexes from the Protein Data Bank, rectifying annotation errors; they have added two methods to estimate paratope-epitope interfaces; they have produced a web interface that is capable of both effective visualisation and of summarising the key useful information in one page. The database is also cross-linked to other databases that contain information relevant to antibody developability and therapeutic applications.

      Weaknesses:

      The database does not import all the experimental information from PDB and contains only complexes with large protein targets.

      Thank you for the valuable feedback. As previously responded to Reviewer 1, due to limitations within our team, comprehensive data integration from PDB has not been achieved in the current version. We acknowledge the significance of expanding the database to encompass a broader range of experimental information and complexes with diverse target sizes. Regrettably, immediate updates to address these limitations are not feasible at this time. Nevertheless, we are committed to enhancing the database in upcoming upgrades to provide users with a more comprehensive and inclusive resource

      Recommendations for the authors:

      Reviewer #1:

      (1) Line 194: "produce" → "produced"

      We thank the reviewer for the feedback. We have checked the grammar and spelling carefully in the revised manuscript.

      (2) As mentioned in the public review, I think adding binding affinity annotations would greatly enhance the use cases for the database.

      We thank the reviewer for the suggestion. As the response in “Public review”. Due to team constraints, these data are not integrated into this release but are being collated. We recognize these gaps and plan to expand antigenic diversity and incorporate affinity metrics in future updates.

      (3) I think adding a visualization of interface atoms and contacts on an entry's webpage would be useful for someone exploring specific entries. It also would be useful if the authors provided a pymol command to select interface residues since that's a procedure any structural biologist is likely to do.

      We sincerely appreciate the reviewer’s constructive suggestions. In response to the request for enhanced visualization and accessibility of interface residue information, we have implemented the following improvements: (1) Web Interface Visualization. On the entry-specific webpage, we have added an interactive visualization window that highlights the antigen-antibody interaction interface using distinct colors. The interaction interface visualization has been incorporated into Figure 5 of the revised manuscript, with a detailed description. (2) PyMOL Command Accessibility. The “Help” page now provides step-by-step PyMOL commands to select and visualize interface residues.

      (4) I think the authors should provide headers to the files containing interface residues according to the change-in-SASA criterion, as they do for those computed according to contact. This would avoid unnecessary confusion - however slight - and make parsing easier. I was initially confused by the meaning of the last column, though after a minute I understood it to be the change in SASA.

      We thank the reviewer for providing such detailed feedback. We thank the reviewer for the comment and the suggestion. We have provided headers for the files of the interacting residues defined by ΔSASA.

      (5) Line 233: "AACDB's data processing pipeline supports mmCIF files" → The meaning and implications of this statement are not obvious to me, and are mentioned nowhere else in the paper. Do you mean that in AACDB there are structure entries that the RCSB PDB database only has in mmCIF file format, and not .pdb format? So, effectively, there are some entries in AACDB that are not in any other antibody-specific database?I checked and, as of Dec 3rd, 2024, there are 41 structures in AACDB that are NOT in SAbDab. Manually checking 5 of those 41 structures, none are mmCIF-only structures.

      We thank the reviewer for the valuable comment. Because of the size of the structures within certain entries, representing them in a single PDB format data file is not feasible due to the excessive number of atoms and polymer chains they contain. As a result, PDB stores these structures in “mmcif” format files. In AACDB, 47 entries, such as 7SOF, 7NKT, 7B27, and 6T9D, are only available in the “mmCIF” format from the PDB. The “.pdb” and “.cif” files contain atomic coordinates in distinct text formats, and the segmentation of these structure files is automatically conducted based on manually annotated antibody-antigen chains. To accommodate this, we have incorporated these considerations into our file processing pipeline, thereby enabling a fully automated file segmentation process. Additionally, we employed Naccess to calculate interatomic distances. However, since this software only accepts .pdb format files as input, we also converted all split .cif files into .pdb format within our fully automated pipeline. We apologize for the lack of clarity in the original manuscript and have included a more detailed explanation in the "2.2 PDB Splitting" section of the revised manuscript.

      Reviewer #2:

      (1) In SabDab and PDB, experimental binding affinities are also reported: could the authors comment on whether they also imported this information and double-checked it against the original paper? If it wasn't imported, that might discourage some users and should be considered as an extension for the future.

      We thank the reviewer for the comment and the suggestion. As the response in “Public review”. Due to current resource constraints, quantitative affinity data has not been incorporated into this release but is undergoing systematic curation. We explicitly recognize these limitations and propose a two-pronged strategy for future iterations: (1) broadening antigen diversity coverage through expanded structural sampling, and (2) integrating quantitative binding affinity measurements. In the Discussion section, we have included description outlining the planned enhancements.

      (2) Line 49-50: the references mentioned in connection to deep learning methods for antibody-antigen predictions seem a bit limited given the amount of articles in this field, with 3 of 4 references on one method only (SEPPA), could the authors expand this list to reflect a bit more the state of the art?

      We thank the reviewer for the suggestion. We agree that more relevant studies should be listed and therefore more references are provided in the revised manuscript.

      When mentioning the limitations of the existing databases, it feels a bit that the criticism is not fully justified. For instance:

      Line 52-53: could the authors elaborate on the reasons why such an identification is challenging? (Isn't it possible to make an efficient database-filtered search? Or rather, should one highlight that a more focussed resource is convenient and why?)

      Thank you for feedback. In this study, the keywords "antibody complex," "antigen complex," and "immunoglobulin complex," were employed during data collection. PDB returned over 30,000 results, of which only one-tenth met our criteria after rigorous filtering. This demonstrates that keyword searches, while useful, inherently limit result precision and introduce substantial redundancy, likely due to the PDB's search mechanism. That’s why we illustrated the significant challenges in identifying antibody-antigen complexes from general protein structures in the PDB.

      Line 55: reading the website http://www.abybank.org/abdb/, it would be fairer to say that the web interface lacks updates, as the database and the code have gone through some updates. Could the authors provide a concrete example of the reason why: 'The AbDb database currently lacks proper organization and management of this valuable data.'?

      We thank the reviewer for highlighting this issue. In our original manuscript, the statement that the AbDb database "lacks proper organization and management" was based on the absence of explicit statement regarding data updates on its official website at the time of submission, even though internal updates to its content may have occurred. We fully respect the long-standing contributions of AbDb to antibody structural research, and our comments were solely directed at the specific state of the database at that time. As the reviewer noted, following the release of our preprint, we have also taken note of AbDb's recent updates. To reflect the latest developments and avoid potential misinterpretation, we have revised the original statement in revised manuscript.

      Also 'this rapid updating process may inadvertently overlook a significant amount of information that requires thorough verification,': it's difficult for me to understand what this means in practice. Could the authors clarify if they simply mean that SabDab collects information from PDB and therefore tends to propagate annotation errors from there? If yes, I think it's enough to state it in these terms, and for sure I agree that the reason is that correcting these annotation errors requires a substantial amount of work.

      We thank the reviewer for providing such detailed feedback on the manuscript. We acknowledge that SabDab represents a highly valuable contribution to the field, and its rapid update mechanism has significantly advanced related research areas. However, as stated by the reviewer, we aim to clarify that SabDab primarily relies on automated metadata extraction from the PDB for annotation, and its rapid update process inherently inherits raw data from upstream sources. According to their paper, manual curation is only applied when the automated pipeline fails to resolve structural ambiguities. This workflow—dependent on PDB annotations with limited manual verification—may propagate errors provided by PDB. Examples include species misannotation and mutation status misinterpretation. We fully agree with the reviewer's observation that correcting errors in such cases necessitates labor-intensive manual curation, which is a core motivation for our study.

      Line 86: why 'Structures that consisted solely of one type of antibody were excluded'? Why exclude complexes with antigens shorter than 50 amino acids? These complexes are genuine antibody-antigen complexes.

      We thank the reviewer for the valuable question. The AACBD database is dedicated to curating structural data of antigen-antibody complexes. Structures featuring only a single antibody type are classified as free antibodies and systematically excluded from the database due to the absence of protein-bound partners. During data screening , we retained sequences shorter than 50 amino acids by categorizing them as peptides rather than eliminating them outright. The current release exclusively encompasses complexes with protein-based antigens. Meanwhile, complexes involving peptide, haptens, and nucleic acid antigens are undergoing systematic curation, with planned inclusion in future updates to broaden antigen category representation.

      Line 96 needs a capital letter at the beginning.

      Line 107: 'this would generate' → 'this generates' (given it is something that has been implemented, correct?).

      Line 124: missing an 'of'.

      Line 163: inspiring by -> inspired by.

      Thank you for feedback. All of the above grammatical or spelling errors have been revised in the manuscript.

      Line 109-111: apart from the example, it would be good to spell out the general rule applied to anti-idiotypic antibodies.

      We thank the reviewer for the valuable feedback. For anti-idiotypic antibodies complex. the partner antibody is treated as a dual-chain antigen, , necessitating individual evaluation of heavy chain and light chain interactions with the anti-idiotypic component. We have given a general rule for anti-idiotypic antibodies in section “2.2 PDB splitting” of revised manuscript.

      Line 155-159: could the authors provide references for the two choices (based on sasa and any-atom distance) that they adopted to define interacting residues?

      We thank the reviewer for the comment and the suggestion. As the same as the response to reviewer #1 in Public review. The interacting residues definition and the threshold chosen in the manuscript is summarized based on existing literature. We have added additional references for support in section “1.Introduction”. Our resource does not provide a fixed amino acid list. Instead, all interacting residues are explicitly documented alongside their corresponding ΔSASA (solvent-accessible surface area changes) and intermolecular distances, allowing researchers to flexibly select residue pairs based on customized thresholds from downloadable datasets. Furthermore, aligning with widely adopted criteria in current literature—where interactions are defined by ΔSASA >1 Ų and atomic distances <6 Å, we have recalibrated our analysis in the revised version. Specifically, we replaced the previous 5 Å distance threshold with a 6 Å cutoff to recalculate interacting residues.

      Line 176-178: could the authors re-phrase this sentence to clarify what they mean by 'change in the distribution'?

      We thank the reviewer for the suggestion. Our search was conducted with an end date of November 2023. However, Figure 3B includes an entry dated 2024. Upon reviewing this record, we identified that the discrepancy arises from the supersession of the 7SIX database entry (originally released in December 2022) by the 8TM1 version in January 2024. This version update explains the apparent chronological inconsistency. We regret any lack of clarity in our original description and have revised the corresponding section in the manuscript to explicitly clarify this change of database.

      Caption Figure 3: please spell out all the acronyms in the figure. Provide the date when the last search was performed (i.e., the date of the last update of these statistics).

      We thank the reviewer for the comment. We have systematically expanded all acronyms and included update dates for statistics in the legend of Figure 3. Corresponding changes have also been made to the statistical pages on the website.

      Finally, it would be advisable to do a general check on the use of the English language (e.g. I noted a few missing articles). In Figure 5 DrugBank contains typos.

      We sincerely appreciate the reviewer's meticulous attention to linguistic precision. We have corrected the typographical error in Figure 5 and conducted a comprehensive review of the entire manuscript to ensure accuracy and clarity.

  3. inst-fs-iad-prod.inscloudgate.net inst-fs-iad-prod.inscloudgate.net
    1. The school’s dress code stipulates that clothes should “not distract or interfere with the educa-tional environment . . .

      The idea that girls’ clothing could be a “distraction” implies that boys’ inability to focus is the responsibility of girls, shifting blame onto female students and reinforcing sexist double standards. The enforcement of these rules disproportionately impacts Latinas, who are already sexualized by cultural stereotypes.

    2. As several Latinos argued, “women can wear clothes that guys can’t” and pointed to a Latina classmate wearing a t-shirt with a woman and gun on it. One Latina student quickly corrected, “there are dress codes for prom. We can’t wear low-cut dresses or ones that are cut down the back”. Her response was met with a chorus of affi rming “yea’s” from her Latina classmates. A student later shared how, before she could enter the school’s prom, school offi cials required her to cover her chest with a shawl because they believed her dress was “too revealing”. Thus, whereas the young men in the class critiqued a double standard where women are perceived to be granted more liberty, some women feel limited by the school’s requirement that they cover parts of their bodies.

      This greatly explains how many men think it's unfair that a woman could get away with wearing a shirt with a woman and gun, where if they wore the same shirt, they probably would get the same reaction from the school as their counterpart. But there are so many regulations that they DON'T have to think about because they only align to women, and those are only felt by the people they restrain. Now I do think having a dress code isn't a bad thing, but there are many schools that abuse the rules and twist it to sexualize clothes young girls should be able to wear normally.

    3. Thus, the objectifi cation of female bodies and the implementation of school dress codes are not neutral

      This has given me some new ideas about the objectization of students in schools. Behind such requirements and policies lies actually the division and control of gender and race, which intensifies the control of gender and leads to a vicious circle. This reminds me of school uniforms. In China, all students are required to wear uniform clothing. I think this has some special differences from the dresscode. What China demands is collectivism that is for all genders, but the dress code for girl is a kind of materialization of women themselves.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      The authors investigate how the viscoelasticity of the fingertip skin can affect the firing of mechanoreceptive afferents and they find a clear effect of recent physical skin state (memory), which is different between afferents. The manuscript is extremely well-written and well-presented. It uses a large dataset of low threshold mechanoreceptive afferents in the fingertip, where it is particularly noteworthy that the SA-2s have been thoroughly analyzed and play an important role here. They point out in the introduction the importance of the non-linear dynamics of the event when an external stimulus contacts the skin, to the point at which this information is picked up by receptors. Although clearly correlated, these are different processes, and it has been very well-explained throughout. I have some comments and ideas that the authors could think about that could further improve their already very interesting paper. Overall, the authors have more than achieved their aims, where their results very much support the conclusions and provoke many further questions. This impact of the previous dynamics of the skin affecting the current state can be explored further in so many ways and may help us to better understand skin aging and the effects of anatomical changes of the skin.

      At the beginning of the Results, it states that FA-2s were not considered as stimuli did not contain mechanical events with frequency components high enough to reliably excite them. Was this really the case, did the authors test any of the FA-2s from the larger dataset? If FA-2s were not at all activated, this is also relevant information for the brain to signal that it is not a relevant Pacinian stimulus (as they respond to everything). Further, afferent receptive fields that were more distant to the stimulus were included, which likely fired very little, like the FA-2s, so why not consider them even if their contribution was low?

      Thank you for bringing this up, we have now clarified in the text that while FA-2s did respond at a low rate during the experiment, their responses were not reliably driven by the force stimuli. In the Methods section we have included the following text:

      “Initially, 10 FA-2 neurons were also included in the analysis. But their responsiveness during the experiment was remarkably low, and unlike the other neuron types, their responses were rarely affected by force stimuli. Specifically, only one of the observed FA-2 neurons responded during the force protraction phases. Due to the lack of clear stimulus-driven responses, FA-2 neurons were subsequently excluded from further analysis.”

      One question that I wondered throughout was whether you have looked at further past history in stimulation, i.e. not just the preceding stimulus, but 2 or 3 stimuli back? It would be interesting to know if there is any ongoing change that can be related back further. I do not think you would see anything as such here, but it would be interesting to test and/or explore in future work (e.g. especially with sticky, forceful, or sharp indentation touch). However, even here, it could be that certain directions gave more effects.

      This is a very interesting question! A discernible effect from the previous stimulus could persist at the end of the current stimulation (see Figure 4C), potentially influencing the next one—a 2-stimuli-back effect. Unfortunately, our experimental design did not allow for rigorous testing of this effect. While all possible pairs of stimulus directions were included in immediately consecutive trials, this was not the case for pairs separated by additional trials. Hence, the combination of a likely weak effect and limited variation in history precluded a thorough analysis of a 2-stimuli-back effect. Future work should delve into the time course of the viscoelastic effect in greater detail.

      Did the authors analyze or take into account the difference between receptive field locations? For example, did afferents more on the sides have lower responses and a lesser effect of history?

      An investigation into the potential impact of the relationship between the receptive field location on the fingertip skin and the primary contact site of the stimulus surface revealed no discernible influence for SA-1 and SA-2 neurons. In contrast, FA-1 neurons, particularly those predominantly sensitive to the previous stimulation or displaying mixed sensitivity, exhibited a tendency to terminate near the primary stimulation site. We have added these observations to the text:

      “We found no straightforward relationship between a neuron's sensitivity to current and previous stimulation and its termination site in fingertip skin. Specifically, there was no statistically significant effect of the distance between a neuron's receptive field center and the primary contact site of the stimulus surface on whether neurons signaled current, prior, or mixed information for SA-1 (Kruskal-Wallis test H(2)=3.86, p= 0.15) or SA-2 neurons (H(2)=0.75, p=0.69). However, a significant difference emerged for FA-1 neurons (H(2)=8.66, p=0.01), indicating that neurons terminating closer to the stimulation site on the flat part of the fingertip were more likely to signal past or mixed information.”

      Was there anything different in the firing patterns between the spontaneous and non-spontaneously active SA-2s? For example, did the non-spontaneous show more dynamic responses?

      The firing patterns of both spontaneously and non-spontaneously active SA-2 neurons shared similarities in terms of adaptation and range of firing rate modulation in response to force stimuli, i.e., ‘dynamic response’. The distinction lay in the pattern of modulation of the firing rate associated with stimulus presentations. For spontaneously active SA-2 neurons, this modulation occurred around a significant background discharge, implying that a force stimulus could either decrease or increase the firing rate, depending on how it deformed the fingertip. This characteristic is well illustrated by the firing pattern of the neuron depicted in the lower panels of Figure 3D. Conversely, in non-spontaneously active SA-2 neurons, a force stimulus could only induce an increase in the firing rate or no change. Although the neuron depicted in the upper panels of Figure 3D exhibited some background activity, it serves to exemplify this characteristic. In the text, we have elucidated the dynamics of the SA-2 neuron response by highlighting that force stimulation can either decrease or increase the firing rate in neurons with spontaneous activity through the following addition/change:

      “This increased variability was most evident during the force protraction phase where most neurons exhibited the most intense responses. Increased variability was also observed in instances where the dynamic response to force stimulation involved a decrease in the firing rate (lower panels of Figure 3D). This phenomenon was observed in SA-2 neurons that maintained an ongoing discharge during intertrial periods (cf. Fig. 2A). In these cases, the response to a force stimulus constituted a modulation of the firing rate around the background discharge, signifying that a force stimulus could either decrease or increase the firing rate depending on the prevailing stimulus direction.”

      Were the spontaneously active SA-2 afferents firing all the time or did they have periods of rest - and did this relate to recent stimulation? Were the spontaneously active SA-2s located in a certain part of the finger (e.g. nail) or were they randomly spread throughout the fingertip? Any distribution differences could indicate a more complicated role in skin sensing.

      SA-2 neurons, in general, are well-known for undergoing significant post-stimulation depression (e.g., Knibestöl and Vallbo, 1970; Chambers et al., 1972; Burgess and Perl, 1973). In our force stimulations, this post-excitatory depression manifested as a reduced or absent response during the latter part of the stimulus retraction period for stimuli in directions that markedly excited the neuron. The excitability recovered when the fingertip relaxed during the subsequent intertrial period, and for "spontaneously active" neurons, the firing resumed (see examples in Figure 7A). Furthermore, some “spontaneously active” neurons could be silenced or exhibit a near-silent period during force stimulation for certain force directions, while the spontaneous firing returned during the upcoming intertrial period when the fingertip shape recovered (for example, see responses to stimulation in the proximal and especially ulnar directions in the top panel in Figure 7A).

      Regarding the location of the receptive field centres of spontaneously active and non-spontaneously active SA-2 neurons on the fingertip we did not observe any obvious spatial segregation. To illustrate this, we have revised Figure 1A by color-marking SA-2 neurons that exhibited ongoing activity in intertrial periods, and the figure caption has been modified accordingly:

      “Figure 1. Experimental setup. A. Receptive field center locations shown on a standardized fingertip for all first-order tactile neurons included in the study, categorized by neuron type. Purple symbols denote spontaneously active SA-2 neurons exhibiting ongoing activity without external stimulation.”

      Did the authors look to see if the spontaneous firing in SA-2s between trials could predict the extent to which the type 1 afferents encode the proceeding stimulus? Basically, does the SA-2 state relate to how the type 1 units fire?

      We found no clear indications that the responses of FA-1 and SA-1 could be readily anticipated based on the firing patterns of SA-2 neurons.

      In the discussion, it is stated that "the viscoelastic memory of the preceding loading would have modulated the pattern of strain changes in the fingertip differently depending on where their receptor organs are situated in the fingertip". Can the authors expand on this or make any predictions about the size of the memory effect and the distance from the point of stimulation?

      We have explored this topic further in the text, referring to recent studies modeling essential aspects of fingertip mechanics. However, in our view, current models lack the capability to predict the specific nature sought by the reviewer. These models should include a detailed understanding of the intricate networks of collagen fibers anchoring the pulp tissue at the distal phalangeal bone and the nail. They should also consider potential inherent directional preferences of the receptor organs, attributed to their microanatomy. The text modifications are as follows:

      “In addition to the receptor organ locations, the variation in sensitivity among neurons to fingertip deformations in response to both previous and current loadings would stem from the fingertip’s geometry and its complex composite material properties. Possible inherent directional preferences of the receptor organs, attributed to their microanatomy, could also be significant. However, mechanical anisotropy, particularly within the viscoelastic subcutaneous tissue of the fingertip induced by intricately oriented collagen fiber strands forming fat columns in the pulp (Hauck et al., 2004), are likely to play a crucial role. This anisotropy would shape the dynamic pattern of strain changes at neurons' receptor sites, intricately influencing a neuron's sensitivity not only to current but also to preceding loadings. Indeed, recent modeling efforts suggest that such mechanical anisotropy strongly influences the spatiotemporal distribution of stresses and strains across the fingertip (Duprez et al., 2024).”

      Relatedly, we have included additional text to provide a more comprehensive explanation of the “bulk deformation” of the fingertip that occurs during the loadings:

      “As pressure increases in the pulp, the pulp tissue bulges at the end and sides of the fingertip. Simultaneously, the tangential force component amplifies the bulging in the direction of the force while stretching the skin on the opposite side.”

      In the discussion, it would be good if the authors could briefly comment more on the diversity of the mechanoreceptive afferent firing and why this may be useful to the system.

      The diversity in responses among neurons is instrumental in enhancing the information transmitted to the brain by averting redundancy in information acquisition. This diversity thereby contributes to an overall increase in information. We've included a brief statement, along with several references, underscoring this concept:

      "The resulting diversity in the sensitivities of neurons might enhance the overall information collected and relayed to the brain by the neuronal population, facilitating the discrimination between tactile stimuli or mechanical states of the fingertip (see Rongala et al., 2024; Corniani et al., 2022; Tummala et al., 2023, for more extensive explorations of this idea)."

      Also, the authors could briefly discuss why this memory (or recency) effect occurs - is it useful, does it serve a purpose, or it is just a by-product of our skin structure? There are examples of memory in the other senses where comparisons could be drawn. Is it like stimulus adaptation effects in the other senses (e.g. aftereffects of visual motion)?

      We have expanded the concluding paragraph of the discussion, specifically delving into the question of whether the mechanical memory effect serves a deliberate purpose or is simply an incidental byproduct of our skin structure:

      “In any case, the viscoelastic deformability of the fingertips plays a pivotal role in supporting the diverse functions of the fingers. For example, it allows for cushioned contact with objects featuring hard surfaces and allows the skin to conform to object shapes, enabling the extraction of tactile information about objects' 3D shapes and fine surface properties. Moreover, deformability is essential for the effective grasping and manipulation of objects. This is achieved, among other benefits, by expanding the contact surface, thereby reducing local pressure on the skin under stronger forces and enabling tactile signaling of friction conditions within the contact surface for control of grasp stability. Throughout, continuous acquisition of information about various aspects of the current state of the fingertip and its skin by tactile neurons is essential for the functional interaction between the brain and the fingers. In light of this, the viscoelastic memory effect on tactile signaling of fingertip forces can be perceived as a by-product of an overall optimization process within prevailing biological constraints.”

      One point that would be nice to add to the discussion is the implications of the work for skin sensing. What would you predict for the time constant of relaxation of fingertip skin, how long could these skin memory effects last? Two main points to address here may be how the hydration of the skin and anatomical skin changes related to aging affect the results. If the skin is less viscoelastic, what would be the implications for the firing of mechanoreceptors?

      It is likely that the time constant depends to some extent on mechanical factors of the skin, which will likely change due to age or environmental factors. However, while these questions are intriguing, they fall outside the scope of the current study and we are not aware of studies that have addressed these issues directly in experiments either.

      How long does it take for the effect to end? Again, this will likely depend on the skin's viscoelasticity. However, could the authors use it in a psychophysical paradigm to predict whether participants would be more or less sensitive to future stimuli? In this way, it would be possible to test whether the direction modifies touch perception.

      Time constants for tissue viscoelasticity have been estimated to extend up to several seconds (see citations in the introduction). While direct perceptual effects could indeed be explored through psychophysical experimental paradigms, we are currently unaware of any studies specifically addressing the type of effect described in this study. In addition to the statement that, concerning manipulation and haptic tasks, "to our knowledge, a possible influence of fingertip viscoelasticity on task performance has not been systematically investigated," we have now also addressed tactile psychophysical tasks conducted during passive touch with the following sentence in the text:

      “Similarly, there is a lack of systematic investigation of potential effects of fingertip viscoelasticity on performance in tactile psychophysical tasks conducted during passive touch.”

      Reviewer #2 (Public Review):

      Summary:

      The authors sought to identify the impact skin viscoelasticity has on neural signalling of contact forces that are representative of those experienced during normal tactile behaviour. The evidence presented in the analyses indicates there is a clear effect of viscoelasticity on the imposed skin movements from a force-controlled stimulus. Both skin mechanics and evoked afferent firing were affected based on prior stimulation, which has not previously been thoroughly explored. This study outlines that viscoelastic effects have an important impact on encoding in the tactile system, which should be considered in the design and interpretation of future studies. Viscoelasticity was shown to affect the mechanical skin deflections and stresses/strains imposed by previous and current interaction force, and also the resultant neuronal signalling. The result of this was an impaired coding of contact forces based on previous stimulation. The authors may be able to strengthen their findings, by using the existing data to further explore the link between skin mechanics and neural signalling, giving a clearer picture than demonstrating shared variability. This is not a critical addition, but I believe would strengthen the work and make it more generally applicable.

      Strengths:

      - Elegant design of the study. Direct measurements have been made from the tactile sensory neurons to give detailed information on touch encoding. Experiments have been well designed and the forces/displacements have been thoroughly controlled and measured to give accurate measurements of global skin mechanics during a set of controlled mechanical stimuli.

      - Analytical techniques used. Analysis of fundamental information coding and information representation in the sensory afferents reveals dynamic coding properties to develop putative models of the neural representation of force. This advanced analysis method has been applied to a large dataset to study neural encoding of force, the temporal dynamics of this, and the variability in this.

      Weaknesses:

      - Lack of exploration of the variation in neural responses. Although there is a viscoelastic effect that produces variability in the stimulus effects based on prior stimulation, it is a shame that the variability in neural firing and force-induced skin displacements have been presented, and are similarly variable, but there has been no investigation of a link between the two. I believe with these data the authors can go beyond demonstrating shared variability. The force per se is clearly not faithfully represented in the neural signal, being masked by stimulation history, and it is of interest if the underlying resultant contact mechanics are.

      Thank you for this suggestion. We have added a new section investigating the link between skin deformation and neural firing in more depth via a simple neural model. Please see our answer below in the ‘Recommendations’ section for further details.

      Validity of conclusions:

      The authors have succeeded in demonstrating skin viscoelasticity has an impact on skin contact mechanics with a given force and that this impacts the resultant neural coding of force. Their study has been well-designed and the results support their conclusions. The importance and scope of the work is adequately outlined for readers to interpret the results and significance.

      Impact:

      This study will have important implications for future studies performing tactile stimulation and evaluating tactile feedback during motor control tasks. In detailed studies of tactile function, it illustrates the necessity to measure skin contact dynamics to properly understand the effects of a force stimulus on the skin and mechanoreceptors.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (Very) minor comments

      - The authors say at the beginning of the Results that, "The fourth type of tactile neurons in the human glabrous skin, fast adapting type II neurons...". Although generally written that there are four types of afferent in the glabrous skin, it would be better to state that these are low-threshold A-beta myelinated mechanoreceptive afferents, at least one time, as there are other types of afferent in the glabrous skin that respond to mechanical stimulation (e.g. low and high threshold C-fibers).

      This is now clarified at the start of the Results section:

      “We recorded action potentials in the median nerve of individual low-threshold A-beta myelinated first-order human tactile neurons innervating the glabrous skin of the fingertip…”

      - Fig. 3: Could you add '(N)' as the measurement of force for Fig. 3A for Fz, Fy, and Fz? Also, please change 'Data was recorded' to 'Data were recorded' in the legend.

      Fixed.

      - At the beginning of the Methods, you say that your study conforms to the Declaration of Helsinki, which actually requires pre-registration in a database. If you did not pre-register your study, please can you add '... in accordance with the Declaration of Helsinki, apart from pre-registration in a database'.

      Thanks for making us aware of this. We have added the suggested qualifier to the ethics statement.

      Reviewer #2 (Recommendations For The Authors):

      The neural representation/encoding of the actual displacement vectors would be a useful addition to the analyses. These vectors have been demonstrated to systematically change with the condition in the irregular series (Figure 2E) and will thus significantly act on the dynamics of induced mechanical changes in the skin with a given interaction force. Thus, it could be examined how the neurons code the magnitude of displacements as well as their direction. An evaluation of the extent to which the imposed displacement magnitudes are encoded in the neural responses would be a useful addition in explaining the signalling of the force events and how the central nervous system decodes these. Evaluating an alternative displacement encoding for comparison to pure force encoding may reveal more about how contact events are represented in the tactile system, which must decode these variable afferent signals to reconstruct a percept of the interaction. It could then be explored how the central nervous system may then scale the dynamic afferent responses based on the background viscoelastic state likely to be present in the SA-II afferent signals (Figure 7) for a context in which to evaluate the dynamic contact forces. This may of course be a complex relationship for the type-I afferents, where the underlying mechanical events evoking the firing (microslips not represented in global forces) have not been measured here. Such a model could be more widely applicable, as the skin viscoelasticity and displacement magnitudes are a straightforward measurement metric and could perhaps be used as a better proxy for neural signalling. This would allow the investigation of a wider variety of forces, and the study of the timing of the viscoelastic effect, both of which have been fixed here. This would give the work a broader impact, rather than just highlighting that this effect produces variability, it could reveal if this mechanical feature is structured in the neural representation. The categorical encoding/decoding tested here is specific to the stimuli used (magnitudes, intervals), but there is the possibility that this may be more generally applicable (within the bounds of forces/speeds) if the underlying basis of the variability in the signalling produced by the viscoelasticity is identified. Since the time course of the viscoelasticity has not been measured here (fixed forces and intervals), further study is required to fully understand the implications this has for a wider variety of situations.

      We agree that a better understanding of how the mechanical deformations are reflected in the resulting spike trains would be valuable. While ultimately a full understanding will need precise measurements of skin deformation across the whole fingertip to account for mechanical propagation to mechanoreceptor locations, relating the deformations at the contact location with neural firing patterns directly can provide useful hints into which aspects of deformation are encoded and how. To this end, we ran a new analysis that aimed to predict the time-varying neural responses directly from the recorded mechanical movements of the contactor.

      Below we have reproduced the new results and methods text along with the additional figures for this analysis. Note that we have also added text in the Discussion to interpret these findings in the context of our other results.

      New section in Results titled Predicting neural responses from contactor movements: “The similarity in the history-dependent variation in neural firing and fingertip deformation at a given force stimulus suggests that neuronal firing is determined by how the fingertip deforms rather than the applied force itself. However, this similarity does not clarify the relationship between fingertip deformation dynamics and neural signaling. To investigate further, we fit cross-validated multiple linear regression models to evaluate how well distinct aspects of contactor movement could predict the time-varying firing rates of individual neurons during the protraction phases of the irregular sequence. The models used predictors based on (1) the three-dimensional position of the contactor, (2) its three-dimensional velocity, (3) a combination of position and velocity signals, and, finally, (4) position and velocity signals along with all possible two-way interactions between them, capturing potentially complex relationship between fingertip deformations and neural signaling.

      Comparing the variance explained (R<sup>2</sup>) by each regression model for each neuron type revealed clear differences between the models (Figure 5A). A two-way mixed design ANOVA, with regression model as within-group effects and neuron type as a between-group effect revealed a main effect of model on variance explained (F(3,462) = 815.5, p < 0.001, η<sub>p</sub><sup>2</sup> = 0.84). Model prediction accuracy overall increased with the number of predictors, with the two-way interaction model outperforming all others (p < 0.001 for all comparisons, Tukey’s HSD). Additionally, a significant main effect of neuron type (F(2,154) = 29.8, p < 0.001, η<sub>p</sub><sup>2</sup> = 0.28) and a significant interaction between regression model and neuron type were observed (F(6,462) = 50.8, p < 0.001, η<sub>p</sub><sup>2</sup> = 0.40).

      For neuron type, model predictions were most accurate for SA-2 neurons, followed by SA-1 neurons, with FA-1 neurons showing the lowest accuracy (p < 0.003 for all comparisons, Tukey’s HSD). The interaction between model and neuron type revealed distinct patterns. For SA-1 and SA-2 neurons, position-only and velocity-only models had similar prediction accuracy (p ≥ 0.996, Tukey’s HSD) with no significant differences between these neuron types (p ≥ 0.552, Tukey’s HSD). FA-1 neurons performed poorly with the position-only model but showed higher accuracy with the velocity-only model (p < 0.001, Tukey’s HSD) and better than SA-1 neurons (p = 0.006, Tukey’s HSD). Models combining position and velocity predictors (without interactions) surpassed both position-only and velocity-only models for SA-1 and SA-2 neurons (p < 0.001, Tukey’s HSD). Overall, the differences between neuron types broadly match their tuning to static and dynamic stimulus properties.

      The two-way interaction model, accounting for most variance in neural responses, produced mean R<sup>2</sup> values of 0.75 for FA-1, 0.88 for SA-1, and 0.91 for SA-2 neurons (Figure 5A). To evaluate the contribution of the different predictors, we ranked them using the permutation feature importance method, focusing on the six most important ones. Regression analyses using only these variables explained almost all of the variance explained by the full model, with a median R<sup>2</sup> reduction of just 0.055 across all neurons. Across all neuron types, at least half included all three velocity components (dPx, dPy, dPz) among the top six, with FA-1 neurons showing the highest prevalence (Figure 5B). Interactions between normal position (Pz) and each velocity component were also frequently observed, while interactions involving tangential position and velocity components were less common. Interactions among velocity components were relatively well represented, followed by interactions limited to position components. Position signals were generally less represented, except for normal position (Pz) in slowly adapting neurons, where it appeared in 50% of SA-1 and 68% of SA-2 neurons. Despite these broad trends, important predictors varied widely across ranks even within a given neuron class (see Figure 5-figure supplement 1), and even the most frequent variables appeared in only a subset of cases, suggesting broad variability in sensitivity across neurons.”

      New methods paragraph titled Predicting time-varying firing rates from skin deformations:

      “This analysis was conducted in Python (v3.13) with pandas for data handling, numpy for numerical operations, and scikit-learn for model fitting and evaluation.

      To assess how well individual neurons' time-varying firing rates could be predicted from simultaneous contactor movements, we fitted multiple linear regression models (see Khamis et al., 2015, for a similar approach}. This analysis focused on the force protraction phase of the irregular sequence, where neurons were most responsive and sensitive to stimulation history. Data from 100 ms before to 100 ms after the protraction phase (between -0.100 s and 0.225 s relative to protraction onset) were included for each trial. Neurons were included if they fired at least two action potentials during the force protraction phase and the following 100 ms in at least five of the 25 trials. This ensured sufficient variability in firing rates for meaningful regression analysis, resulting in 68 SA-1, 38 SA-2, and 51 FA-1 neurons being included.

      Contractor position signals digitized at 400 Hz were linearly interpolated to 1000 Hz. Instantaneous firing rates, derived from action potentials sampled at 12.8 kHz, were resampled at 1000 Hz to align with position signals. A Gaussian filter (σ = 10 ms, cutoff ~16 Hz) was applied to the firing rate as well as to the position signals before differentiation. To account for axonal conduction (8–15 ms) and sensory transduction delays (1–5 ms), firing rates were advanced by 15 ms to align approximately with independent variables.

      Regressions were performed using scikit-learn's Ridge and RidgeCV regressors, which apply L2 regularization to mitigate overfitting. Hyperparameter tuning for the regularization parameter (alpha) was performed using GridSearchCV with a predefined range (0.001–1000.0), incorporating five-fold cross-validation to select the best value. To minimize overfitting risks, model performance was further validated with independent five-fold cross-validation (KFold), and R<sup>2</sup> scores were computed using cross_val_score.

      We constructed four linear regression models with increasing complexity: (1) Position-only, using three-dimensional contactor positions (Px, Py, Pz); (2) Velocity-only, using three-dimensional velocities (dPx, dPy, dPz); (3) Combined, including all position and velocity signals (6 predictors); and (4) Interaction, including all signals and their two-way interactions (21 predictors). All features were standardized using StandardScaler to improve regularization and model convergence. PolynomialFeatures generated second-order interaction terms for the interaction model. Feature importance was evaluated with permutation_importance, and simpler models were built using the most important features. These models were validated through cross-validation to assess retained explanatory power.”

      Minor:

      - It would be useful to add a brief description of the material aspects of the contactor tip to the methods (as per Birznieks 2001).

      We have added the following statement:

      “To ensure that friction between the contactor and the skin was sufficiently high to prevent slips, the surface was coated with silicon carbide grains (50–100 μm), approximating the finish of smooth sandpaper.”

      - The axes labelling on Figure 3A and legend description is ambiguous, probably placing the Px, Py, and Pz labels on the far left axes and the Fx, Fy, and Fz on the right side of the far right axes would make this clearer.

      Label placement has been improved along with some other minor fixes.

      - For the quasi-static phase analysis, the phrase "absence of loading" used in reference to the interstimulus period and SA-II afferents does not seem to be a correct description. The finger is still loaded (at least in the normal direction), with a magnitude of imposed displacement that counteracts the viscoelastic force exerted by the skin mechanics of the fingertip. Although there is a zero net-force load, a mechanical stimulus is still being actively applied to the skin.

      We have changed the wording throughout the text and now consistently refer either to the “interstimulus period” directly or to an “absence of externally applied stimulation” to avoid confusion.

    1. Document de Briefing : Examen des Enjeux de l'Éducation dans la Région Hauts-de-France

      Source : Extraits de "🔴 Xavier Bertrand sur la réponse de l’État face aux violences dans les établissements scolaires"

      Date :2025

      Intervenant Principal : Xavier Bertrand, Président du Conseil Régional des Hauts-de-France

      Contexte : Audition dans le cadre d'une commission d'enquête parlementaire sur la réponse de l'État face aux violences dans les établissements scolaires.

      L'audition se concentre notamment sur le contrôle exercé par les autorités, le financement des établissements scolaires (publics, privés sous contrat et hors contrat), et spécifiquement sur le cas du lycée privé Averroès de Lille dont le contrat d'association avec l'État a été résilié.

      Principaux Thèmes Abordés :

      Financement des Établissements Scolaires par la Région Hauts-de-France :

      • Montants : Xavier Bertrand détaille les subventions versées par la région pour l'année scolaire en cours.
      • Dotation globale de fonctionnement (établissements publics et privés sous contrat) : 90,2 millions d'euros.
      • Subventions pour les actions éducatives : 3,2 millions d'euros.
      • Politiques volontaires (prêts à taux zéro, garanties d'emprunt, etc.) : 10 millions d'euros (suspendus ou en année blanche pour les établissements privés sous contrat à la prochaine rentrée).
      • Dépenses assurées en maîtrise d'ouvrage région (maintenance) : un peu plus de 5 millions d'euros.
      • Distinction Public/Privé :Établissements publics : Les 90,2 millions d'euros (dotation globale) s'appliquent également aux établissements publics.
      • Établissements privés sous contrat : 50 millions d'euros font l'objet de dépenses obligatoires. 10 millions d'euros concernent des politiques non obligatoires (politiques volontaires/facultatives).
      • Établissements Privés Hors Contrat : La région Hauts-de-France ne subventionne actuellement aucun établissement privé hors contrat. Xavier Bertrand n'interdit pas cette possibilité par principe si cela présente un intérêt avéré pour la région, nécessitant un débat en assemblée plénière. La position de la région sur le hors contrat est connue, et aucune demande de subvention de la part de ces établissements n'a été reçue. Xavier Bertrand a été sollicité pour visiter de tels établissements mais a refusé, expliquant ne pas avoir de relations établies avec eux.
      • Subventions Facultatives (10 millions d'euros) : Ces subventions relèvent d'un projet et d'une volonté politique. Elles servent principalement à financer des travaux ou des achats d'équipement (numérique, rénovation énergétique, etc.).

      L'attribution se fait souvent via des protocoles d'accord, notamment avec l'enseignement diocésain, et peut prendre la forme d'avances remboursables, de prêts à taux zéro ou d'annuités d'emprunt.

      La loi Falloux limite ces subventions à 10% des dépenses annuelles pour certains types d'établissements (sauf pour l'enseignement technique, professionnel, agricole et les maisons familiales rurales - loi Acier de 1919). Un contrôle sur pièces et sur place est systématiquement effectué pour vérifier l'engagement des travaux avant le versement des fonds. Ces subventions facultatives ne concernent pas la vie scolaire ou l'animation éducative, mais du bâtiment et du technique.

      Rôle et Compétences de la Région en matière de Contrôle et d'Inspection :

      • Absence d'Association et d'Information : Le Conseil Régional n'est ni associé ni informé lorsque l'État effectue un contrôle ou une inspection dans les lycées (académique, administratif, audit des finances publiques). Xavier Bertrand souhaiterait que l'information soit systématique et que les services régionaux dédiés aux lycées y soient associés, surtout si cela touche aux compétences régionales (fonctionnement financier, gestion des agents).
      • Personnel Non-Enseignant : La question de l'inspection du personnel non enseignant (relevant de la responsabilité de la région) est soulevée. Bien que ces agents fassent partie de la communauté éducative, Xavier Bertrand souligne qu'il n'a pas l'autorité fonctionnelle sur eux, ce qu'il considère "anormal" et méritant l'attention du législateur. Il estime qu'une inspection générale serait fondée à s'intéresser à ce personnel, mais insiste sur la nécessité d'une collaboration étroite avec les services régionaux.

      • Contrôle d'Honorabilité du Personnel : La région procède systématiquement aux contrôles préalables autorisés par la loi pour le recrutement du personnel (casier judiciaire B2). Cependant, elle ne fait pas partie des employeurs qui peuvent demander une attestation d'honorabilité pour certaines fonctions réglementées. Xavier Bertrand est demandeur de cette possibilité, le B2 étant le document utilisé pour vérifier le passé judiciaire, un casier judiciaire rendant impossible de travailler auprès des jeunes.

      • Le Cas du Lycée Privé Averroès de Lille : Ce cas est central dans l'audition et illustre les problématiques de contrôle et de relations entre l'État et la Région.

      • Contexte : Averroès est le deuxième établissement sous contrat en France dont le contrat d'association a été rompu (en 2023). Cette rupture est possible selon l'article L442-10 du code de l'éducation, à l'initiative de l'État ou sur demande d'une collectivité.

      • Rôle de Xavier Bertrand : Xavier Bertrand se considère comme un "lanceur d'alerte" dans ce dossier depuis novembre 2017, date à laquelle il a sollicité l'État (le rectorat puis le ministère de l'éducation nationale) pour savoir si les conditions du contrat d'association étaient toujours réunies. Il a suspendu le versement du forfait d'externat (dépense obligatoire) à partir de novembre 2019, dans l'attente des conclusions des inspections et audits diligentés par l'État, et ce malgré plusieurs condamnations par le tribunal administratif. Il assume pleinement cette décision politique, la considérant comme le seul moyen d'alerter.

      • Demande de Résiliation du Contrat : Bien que le code de l'éducation l'autorise à demander la résiliation au préfet, Xavier Bertrand a choisi de ne pas le faire, arguant qu'il respecte les principes de responsabilité définis par la loi. Il souhaiterait cependant que la Région ait plus de responsabilités et de compétences sur de telles questions.

      • Commission de Concertation : La résiliation intervient après avis de la commission de concertation (article R442-64 du code de l'éducation), composée notamment de trois conseillers régionaux. Xavier Bertrand a participé à la réunion de cette commission en novembre 2023, invité par le préfet pour être entendu et présenter la position de la Région. Il a quitté la salle avant le vote. Il ne pense pas avoir enfreint l'article du code, étant invité pour exposer sa position et non pour voter. Il a assumé dire que pour lui, Averroès n'était "pas le cas d'un lycée républicain", ce qui, selon le rapporteur, a fortement influencé le vote de la commission en faveur de la résiliation.

      Motivations et Chronologie :

      Xavier Bertrand détaille la chronologie de ses actions et motivations :

      • 2015 : Visite de l'établissement après les attentats, témoignage de soutien aux victimes par l'établissement.
      • 2017 : Saisine du recteur pour clarifier la place du religieux (rapport de 2015). Absence de réponse.
      • 2019 : Révélations dans "Qatar Papers" sur des financements étrangers. Saisine du ministre de l'éducation nationale. Le ministre envisage une inspection et un audit financier. Suspension du forfait d'externat.
      • 2020 : Réception du rapport d'inspection confirmant un prêt remboursé par un don de Qatar Charity. Saisine à nouveau du ministre pour obtenir des précisions sur les conditions du don et le rôle d'une mosquée ayant reçu des financements étrangers. Pas de réponse précise.
      • 2023 : Rapport de la Chambre Régionale des Comptes (CRC) du 30 juin 2023. Le préfet convoque la commission de concertation, qui vote la résiliation. Le tribunal administratif annule la résiliation, et le gouvernement fait appel.

      Décalage avec le Rapport de l'Inspection Générale :

      Le rapporteur souligne que le rapport de l'Inspection Générale de 2020 concluait que l'établissement respectait "parfaitement les valeurs de la République". Xavier Bertrand conteste cette conclusion, se basant sur le rapport de la CRC qui, selon lui, met en évidence des contenus d'enseignement contraires aux valeurs républicaines (cours d'éthique musulmane basé sur les 40 hadiths de l'imam An-Nawawi). Il exprime son trouble que la CRC ait vu ce que l'IGESR n'aurait pas vu, estimant que l'attitude de l'Éducation Nationale revenait à dire "circulez, il n'y a rien à voir".

      • Arguments de la CRC : Xavier Bertrand cite explicitement les points soulevés par la CRC dans son rapport : proximité de l'association Averroès avec Musulmans de France (lié aux Frères Musulmans), présence d'une personne liée à ce mouvement dans les organes décisionnels, financement étranger significatif (1,9 million d'euros) provenant de divers pays (Koweït, Arabie Saoudite, Bahreïn), et le contenu du cours d'éthique musulmane jugé en contradiction avec les valeurs de la République (interdiction pour une femme de se faire ausculter par un homme, évitement de la mixité, prohibition de l'apostasie, prééminence de la loi divine).

      Autres Cas et Proactivité de la Région :

      • La Région n'a pas été saisie d'autres cas de lycées sous sa responsabilité qui l'auraient amenée à se poser les mêmes questions.
      • Concernant le cas de Riomont (qui ne concerne pas un lycée géré ou financé par la Région), Xavier Bertrand a saisi la procureure de la République de Lille en janvier 2024 pour attirer son attention et exprimer le soutien de la Région aux victimes. Il est disposé à rencontrer le collectif de victimes.
      • Il a également mentionné l'établissement Notre Dame de Riyaumont - Saint-Jean de Bosco à Liin, précisant que la région ne l'a jamais financé.
      • La Région a mené des actions contre le harcèlement scolaire (appels à projet, cellules d'intervention rapide), notamment une initiative avec le procureur d'Amiens pour une cellule d'intervention immédiate dans la Somme incluant un délégué du procureur. Il reconnaît cependant la difficulté à intégrer la dimension spécifique du handicap dans ces dispositifs.

      Recommandations et Souhaits pour l'Avenir :

      • Xavier Bertrand souhaite une coopération renforcée et une information systématique des collectivités locales (Régions pour les lycées, Départements pour les collèges) lors des inspections et contrôles de l'État dans les établissements scolaires qu'elles financent, particulièrement sur les aspects financiers et de gestion des personnels régionaux.
      • Il est demandeur de la possibilité pour les Régions de solliciter une attestation d'honorabilité pour le personnel qu'elles recrutent.
      • Il estime que les collectivités territoriales devraient être davantage associées aux commissions de concertation pour la résiliation des contrats d'association.
      • Plus globalement, il plaide pour une place plus importante des Régions dans la gestion et la protection des lycées, estimant qu'elles pourraient intervenir plus efficacement sur l'environnement sécuritaire (à l'intérieur et autour du lycée), sur les dimensions psychologiques et physiques des violences, même si le contenu pédagogique et la direction relèvent de l'État.

      Citations Clés :

      • "Aujourd'hui, ça n'est pas le cas non plus et que les services régionaux y soient associés notamment ceux dédiés au lycée je trouve que ce serait effectivement une bonne chose à partir du moment où ça va toucher les compétences régionales que ce soit le fonctionnement financier la gestion des agents régionaux." (Xavier Bertrand sur l'association de la région aux inspections de l'État)
      • "Je pense que d'ailleurs à un moment ou un autre il serait quand même utile que le législateur se penche sur cette question [l'autorité fonctionnelle sur les agents régionaux dans les lycées]. C'est effectivement bien souvent l'agent comptable qui donne des instructions Mais en tout état de cause moi président de région je n'ai absolument pas l'autorité fonctionnelle sur ces agents." (Xavier Bertrand sur son manque d'autorité sur le personnel régional)
      • "Je pense que si la région pouvait parce qu'on est en contact avec des jeunes pouvoir établir une attestation enf pardon pouvoir solliciter une attestation d'honorabilité je suis pas preneur je suis demandeur." (Xavier Bertrand sur l'attestation d'honorabilité)
      • "clairement je le dis et je l'assume ces subventions facultatives relèvent donc bel et bien d'un projet et d'une volonté politique." (Xavier Bertrand sur les subventions facultatives)
      • "J'ai avant tout et surtout sollicité l'État depuis novembre 2017 pour savoir si oui ou non les conditions étaient réunies pour qu'un contrat d'association entre l'État et ce lycée soit toujours accordé (...) et je n'ai eu de cesse de me considérer comme un lanceur d'alerte et si c'était à refaire je referais tout depuis le départ." (Xavier Bertrand sur son rôle dans l'affaire Averroès)
      • "Au fond ne pas verser le forfait externa au fond c'est rompre le contrat de l'association qui garantit le financement euh de la part de la région aussi de cet établissement Ma question est est très simple Pourquoi ne pas être allé peut-être au bout donc de cette démarche que vous avez engagé en suspendant le versement du forfait externa en demandant au préfet comme le code de l'éducation là encore le permettre de de rompre ce contrat" (Rapporteur questionnant la stratégie de Xavier Bertrand)
      • "on peut financer je vous cite je le dis l'enseignement libre et je continuerai à vouloir le faire mais on peut le faire pour des lycées qui sont vraiment républicains et pour moi ce n'est pas le cas du lycée aouest" (Citation rapportée des propos de Xavier Bertrand lors de la commission de concertation)
      • "Je pense avoir montré depuis effectivement des années quelle avait été la position de la région que j'ai assumé notamment devant les interpellations de monsieur le rapporteur Concernant le lycée Jean-Paul II il faut savoir une chose c'est qu'il y a une enquête préliminaire donc une instruction judiciaire qui est clairement ouverte Je crois que le proviseur a également démissionné et qu'encore une fois permettez-moi de le dire avec cet exemple vous apportez de l'eau à mon moulin comme le disait madame la présidente tout à l'heure elle me questionnait là-dessus Oui je souhaiterais qu'on soit avisé des enquêtes qui sont faites ou des inspections parce que c'est ça vos références Oui Oui Non non mais je je voulais pas vous interrompre mais vous pouvez lire médi pouvez connaître pas besoin d'être averti un article" (Xavier Bertrand sur d'autres cas potentiels et la nécessité d'être informé des inspections)
      • "cette commission d'enquête j'ai pas besoin de le dire mais elle est particulièrement importante parce qu'elle touche à ce pour qui on est prêt à faire effectivement tous les efforts cette question des violences (...) Le vrai problème que nous avons nous les conseils régionaux c'est qu'au final on finance les établissements la structure le personnel d'entretien de restauration mais on a beau être paraît-il chez nous nous ne le sommes pas vraiment" (Xavier Bertrand en conclusion sur l'importance de la commission et les limites des compétences régionales)

      Points de Discussion Potentiels pour les Auditions Futures :

      • Approfondir le manque d'autorité fonctionnelle des Présidents de Région sur le personnel qu'ils financent dans les lycées.
      • Explorer la faisabilité et l'opportunité d'étendre la possibilité de demander une attestation d'honorabilité aux Régions.
      • Examiner les raisons du décalage entre les conclusions du rapport de l'IGESR et celles du rapport de la CRC dans l'affaire Averroès.
      • Évaluer l'efficacité actuelle des inspections et audits de l'État et les moyens de les améliorer, y compris la question de l'information et de l'association des collectivités locales.
      • Discuter des modalités d'intervention des collectivités locales dans la lutte contre le harcèlement, particulièrement pour les élèves en situation de handicap.
      • Analyser les critères et le processus de subventionnement des établissements privés sous contrat et hors contrat par les Régions et l'opportunité d'une plus grande formalisation.
      • Ce document de briefing résume les points clés de l'audition, mettant en lumière les préoccupations de Xavier Bertrand concernant le contrôle de l'État, le financement des établissements, et les limites actuelles des compétences régionales face aux enjeux de sécurité et de respect des valeurs républicaines dans le système éducatif. Le cas Averroès sert d'étude de cas détaillée pour illustrer ces défis.
    1. Synthèse de la réunion : Élisabeth Guigou et représentants de Départements de France sur les violences en milieu scolaire

      Date de la réunion: 2025

      Objectif: Examiner la prise en compte par l'institution judiciaire (Mme Guigou, ancienne Ministre de la Justice) et le rôle des départements (Départements de France) des violences en milieu scolaire, avec un focus particulier sur les cas de violences sexuelles commises par des adultes.

      Participants:

      • Élisabeth Guigou: Ancienne Ministre de la Justice (1997-2000)
      • Représentants de Départements de France: François Sauvadet (Président), Florence d'Abin (Vice-présidente), Anne-Sophie Abgral (Directrice générale adjointe), Yves Lebreton (Directeur général), Paul-Étienne Koffman (Conseiller éducation), Bris Lacourieux (Conseiller relation avec le Parlement), Steven Brunetta (membre du cabinet de Mme Davin).

      Principaux thèmes abordés:

      • Évolution de la prise en compte des violences sexuelles en milieu scolaire par la justice: Témoignage de Mme Guigou sur son action en tant que Ministre de la Justice, notamment l'élaboration et le vote de la loi du 17 juin 1998.
      • Le cas de l'affaire Caricar (Bétarram): Analyse des flux d'information au sein du Ministère de la Justice à l'époque des faits et la question de la transmission des informations entre la justice et le Ministère de l'Éducation Nationale.
      • Rôle des départements en matière de protection de l'enfance: Clarification des compétences et des pratiques des départements, en particulier concernant les violences en milieu scolaire.
      • Gestion des informations préoccupantes (IP) et des signalements: Examen des procédures, des difficultés rencontrées et des axes d'amélioration dans la transmission des informations entre les différents acteurs (enseignants, CRIP, éducation nationale, justice).

      Idées et faits les plus importants:

      • Action de Mme Guigou (1997-2000): Mme Guigou a souligné l'importance qu'elle a accordée à la lutte contre les violences sexuelles sur les enfants dès son arrivée au Ministère de la Justice. Elle a trouvé un projet de loi existant mais a décidé de le réorienter pour inclure la protection des victimes et revoir l'approche centrée uniquement sur les auteurs. Cette démarche a abouti à la loi du 17 juin 1998.
      • Citation: "J'ai trouvé en juin 97 j'étais loin d'imaginer ce que nous savons maintenant 28 ans après et ce que nous avons découvert petit à petit mais j'ai trouvé un projet de loi qui avait été préparé par mon prédécesseur et qui traitait de la lutte contre les les auteurs de crimes sexuel de de délit ou de crime sexuel je me suis dit que c'était un sujet dont il fallait s'emparer sans tarder c'est en juin 97..."
      • Citation: "...nous avons élaboré un projet de loi qui a été voté vous pourrez comprendre à quel point c'est quand même enfin j'en étais contente en 1 an ça a donné lieu à la loi du 17 juin 1998 sur euh la prévention la répression des infractions sexuelles ainsi qu'à la protection des mineurs victimes."
      • Cette loi visait notamment à prévenir et réprimer les infractions sexuelles et à mieux protéger les mineurs victimes. Une circulaire a suivi pour préconiser l'enregistrement des premières déclarations des enfants afin d'éviter qu'ils n'aient à répéter leur traumatisme.
      • Citation: "...une circulaire dans laquelle je préconisais je demandais qu'on procède à l'enregistrement des premières déclarations des enfants victimes d'abus sexuel pour qu'il n'ait pas à répéter le récit de leur traumatisme évidemment c'était encore plus douloureux donc voilà alors je pourrais revenir sur le contelé de la loi si vous voulez ce qui me semble intéressant de remarquer parce que c'était c'est assez révélateur de l'état d'esprit de l'époque c'est que cette loi elle est passée inaperçu elle est passée sous les radars médiatiques..."
      • Indépendance de la justice et flux d'information sous Mme Guigou: Mme Guigou a mis en place une rupture avec les pratiques antérieures en instaurant une règle stricte de non-interférence politique dans les affaires individuelles. Elle a indiqué que les procureurs généraux étaient encouragés à communiquer les affaires sensibles à la Direction des affaires criminelles et des grâces (DACG), et c'était au directeur de la DACG de décider de la transmission au cabinet du Ministre.
      • Citation: "quand je suis arrivée au ministère j'ai dit tout de suite ce ministère ne sera plus le ministère des affaires il sera le ministère du droit et donc j'ai fait savoir publiquement à l'Assemblée au Sénat au conseil des ministres et naturellement au au membres du corps judiciaire et parajudiciaire que désormais nous opérer une rupture nette avec les pratiques qui étaient séculaires quand même de absolument tous les gouvernements précédents..."
      • Citation: "j'avais demandé à ce que les rapports arrivent directement à la direction des affaires criminelles et des grâces c'était le directeur qui était chargé d'apprécier si oui ou non ces rapports méritait d'être porté à mon attention c'est-à-dire d'être envoyé ou communiqué ou bien dont la teneur était communiquée aux membres de mon cabinet voilà c'est comme ça que ça se passait..."
      • Le cas Caricar et la non-information de Mme Guigou: Mme Guigou a affirmé n'avoir aucun souvenir d'avoir été informée de l'affaire Bétarram (Caricar) à l'époque, malgré la transmission de courriers par le procureur général de Pau à la DACG. Elle a appris l'existence de ces documents récemment, dans le cadre de cette audition et de sollicitations médiatiques.
      • Citation: "Je dois vous dire tout de suite que je n'ai aucun souvenir d'avoir été informé de l'affaire de l'institution BRAM Bouquin..."
      • Citation: "...dans mon souvenir parce que j'ai certainement eu communication euh on m'a certainement on a dû m'informer en tout cas on a informé la direction des affaires criminales des grasses des enquêtes préliminaires ou des ouvertures d'instruction lorsqu'il y avait les plaintes et lorsqu'on pouvait soupçonner de de teles enfin des infractions à la loi tout simplement notamment lorsqu'il s'agissait de crime mais ne m'était communiqué que les affaires qui pouvaient d'effrayer si vous voulez sur lesquelles j'étais susceptible d'être interrogé en réalité voilà c'est-à-dire soit que il y a un écho médiatique important soit qu' ell elles aient été estimées de nature systémique..."

      • Absence d'obligation légale de transmission justice-Éducation Nationale avant 2016: Mme Guigou a expliqué que, selon la loi en vigueur à son époque, il n'était pas possible pour le Ministre de la Justice ou les magistrats de transmettre des informations couvertes par le secret professionnel et le secret de l'instruction au Ministère de l'Éducation Nationale, même en cas de suspicion de violences graves en milieu scolaire. Cette possibilité n'a été introduite qu'avec la loi de 2016 (article 11-2 du Code de procédure pénale), avec des précautions importantes.

      • Citation: "je vais malheureusement vous répondre non et ça a été le cas jusque en 2016 jusqu'à une loi de 2016 pourquoi ? parce que en tout cas il ne pouvait pas être question ni pour l'administration du ministère de la justice ni pour mon cabinet ni pour moi de déroger à l'application de la loi et la loi nous imposait à la fois de respecter le secret professionnel s'agissant de moi le secret de l'instruction s'agissant des magistrats et la présomption d'innocence..."

      • Citation: "C'est vrai que cette ceci a été assoupli en 2016 donc j'ai j'ai pris connaissance de ça à l'occasion de de cette invitation que vous que vous me faites j'ai recherché comment ça avait évolué et j'ai vu que dans cette loi de 2016 madame Tobira à l'époque et madame Valkassem avait ensemble annoncé que dorén avant il pouvait y avoir dans certaines conditions madame Tobéin insistait sur le respect de la préstation d'innocence il pouvait y avoir communication du ministère de la justice au ministère de l'éducation nationale d'information concernant des enquêtes préliminaires..."

      • Rôle et limites des départements en matière de protection de l'enfance: Les représentants de Départements de France ont rappelé que la compétence principale des départements en matière de protection de l'enfance concerne l'intrafamilial. Ils sont responsables des collèges sur les aspects fonctionnels (bâtiments, personnel technique, restauration) et financent les établissements privés sous contrat dans certaines limites.

      La protection de l'enfant en danger relève de l'autorité parentale en premier lieu, puis des services départementaux en cas de défaillance.

      • Citation: "nous avons la charge des collèges mais mais on est une charge des collèges des départements sur le plan strictement fonctionnel c'est-à-dire c'est le bâtis scolaire ce sont nos personnels techniques et puis la la restauration l'exception des établissements privés sauf exception pour lequel la restauration relève de l'établissement..."
      • Citation: "le premier responsable lorsque il n'est pas défaillant et que en l'occurrence c'est le l'autorité parentale est assumée par les départements le premier protecteur de l'enfant c'est le parent et je veux le redire tout simplement donc si un enfant subit des violences au sein d'un établissement scolaire c'est le c'est le détenteur de l'autorité parentale qui doit le protéger en prémilieux parce que c'est lui qui l'exerce..."
      • Flux d'information entre l'Éducation Nationale et les départements: Les violences commises par des adultes ayant autorité en milieu scolaire relèvent de la responsabilité de l'Éducation Nationale, qui est censée saisir directement l'autorité judiciaire (signalement Article 40) et en adresser une copie au président du département pour information.
      • Citation: "et donc les violences qui font l'objet de cette commission d'enquête ne relèvent pas de la de la compétence des départements puisque nous n'exerçons pas d'autorité sur les personnels d'abord sur les enfants évidemment parce qu'ils ont des parents et en plus sur les personnels de ces établissements quels que soient d'ailleurs les établissements en dehors pour les établissements publics hein..."
      • Citation: "en cas de violence l'éducation nationale et les fondé à nous adresser en direct un signalement ou en tout cas un direct en signal à l'autorité judiciaire elle peut saisir elle-même l'autorité judiciaire lorsqu'il y a des faits qui ont été commis par l'enseignant une personne qui est autorité dans l'établissement elle fondé adresser en direct un signalement à l'autorité judiciaire elle doit en adresser une copie au président du département concerné pour son pour son information en tant que chef de fil de la protection de l'enfance..."
      • Difficultés et pistes d'amélioration dans la gestion des informations préoccupantes (IP): Les départements constatent une hausse des IP. Il existe une sensibilisation croissante de l'Éducation Nationale aux signalements. Cependant, des difficultés persistent, notamment des délais dans la transmission des informations (parfois transmises juste avant les vacances scolaires) et un manque de clarté pour certains enseignants sur les procédures de signalement (distinction entre Article 40 et IP).
      • Citation: "on note une hausse significative en ce moment des informations préoccupantes reçues par les cripes je vous l'ai dit nous avons aussi de plus en plus d'enfants à protéger sans doute qu'il y a une prise de conscience qui est salvatrice mais qui voilà qui qui nécessite une réponse c'est beaucoup plus adapté..."
      • Citation: "...est-ce que c'est parce que les enseignants se sentent bien seuls parfois ? est-ce qu'il craent la réaction des parents ou les représaillent lorsqu'il y a un signalement ? donc le signal avant les vacances voilà donc on a signalé cela aussi à l'éducation nationale..."
      • Rôle des CRIP en cas d'IP atypique: En cas d'IP reçue par la CRIP concernant une violence en milieu scolaire qui devrait normalement relever de l'Éducation Nationale et de l'Article 40, le département procède en principe à un signalement Article 40 auprès du parquet et informe le DSDEN (ou l'inspection académique). Il y a un dialogue pour vérifier les faits et la connaissance par l'Éducation Nationale. Le rôle du département est aussi de voir si les parents sont informés et en capacité d'agir, et de les accompagner si nécessaire.
      • Citation (Florence d'Abin): "...si nous sommes réceptacle d'une information quelle qu'elle soit via la cryppe évidemment ça peut être via la cryppe mais quel qu'elle soit puisque au titre de l'article 40 en tant que fonctionnaire et à forcerie en tant que fonctionnaire départementaux chargé de la protection de l'enfance Nous sommes chargés de signaler au parquet toute information et donc si le cas que vous citez par exemple ce qui se passerait... c'est que le collègue qui reçoit l'information donc à la cri tous les jours toutes les informations qui arrivent sont lues... si c'est urgent on traite en priorité dans dans un délai de 24 heures on va saisir le parquet et donc le professionnel qui verrait ça il appellerait immédiatement l'éducation nationale et il dirait voilà est-ce que vous avez besoin d'un soutien pour formaliser euh le signalement puisque normalement effectivement le signalement doit être formalisé par eux..."
      • Besoin de formation et de clarification des procédures: Un besoin de formation des enseignants, particulièrement ceux du premier degré, sur les procédures de signalement et le rôle des différents acteurs a été souligné pour sécuriser les professionnels et améliorer la circulation de l'information.
      • Citation (Florence d'Abin): "je vais vous rejoindre madame la rapporteur sur euh l'information ou le besoin de sécuriser aussi les enseignants les professeurs et peut-être différentes lectures à la fois les professeurs du premier degré et les professeurs du second degré... on voit bien qu'il y a une faille... sur le besoin spécifique des enseignants du premer degré à être formé acculturé sur le sujet des informations préoccupantes comment ça doit fonctionner ? Vers qui je me tourne ma chef d'établissement mon chef d'établissement et qu'est-ce qui en découle pour justement éviter ce que pointait le président Sauvadet à savoir l'influx des informations préoccupantes souvent fin juin début juillet..."
      • Fragilité de la médecine scolaire: La faiblesse de la médecine scolaire en France a été soulignée comme un enjeu majeur pour le suivi de la santé des enfants et la prévention.

      Points de divergence / Nuances:

      • Mme Guigou a exprimé sa surprise et son regret de ne pas avoir été informée de l'affaire Caricar à l'époque, tout en réaffirmant que cela ne l'aurait pas conduite à donner d'instructions individuelles mais potentiellement à émettre une circulaire de politique pénale générale sur le sujet.

      • Les représentants de Départements de France ont insisté sur le fait que la responsabilité première des violences en milieu scolaire incombe à l'Éducation Nationale, même s'ils agissent en cas d'IP pour ne pas laisser une situation de danger sans réponse.

      Questions en suspens / Recommandations implicites:

      • Comment assurer une transmission fluide et systématique des informations entre tous les acteurs concernés (Éducation Nationale, justice, départements, familles) en cas de violences en milieu scolaire ?
      • Comment mieux former et sécuriser les enseignants dans les procédures de signalement ?
      • Comment renforcer la médecine scolaire et la prise en charge de la santé mentale des jeunes ?
      • Faut-il revoir ou préciser les textes réglementaires pour clarifier les rôles et obligations de chacun, notamment concernant la transmission d'informations sensibles ? (La loi de 2016 a été un pas, mais son application et ses limites doivent être évaluées).
      • Faut-il harmoniser les systèmes d'alerte et de protection en cas de danger dans les établissements scolaires ?

      Conclusion:

      • Ces auditions ont mis en lumière la complexité des circuits d'information et des responsabilités en matière de violences en milieu scolaire.

      Si des progrès ont été réalisés dans la prise de conscience et la réponse judiciaire (loi de 1998, loi de 2016), des dysfonctionnements persistent dans la transmission des informations et la coordination entre les différents acteurs.

      Les départements, bien que leur compétence principale soit l'intrafamilial, jouent un rôle essentiel dans la protection de l'enfance et sont souvent en première ligne face aux informations préoccupantes.

      Des efforts de formation, de clarification des procédures et de renforcement des liens entre les institutions semblent nécessaires pour mieux protéger les enfants victimes de violences.

      Prochaines étapes:

      • La commission d'enquête continuera ses travaux en auditionnant d'autres acteurs pour approfondir ces questions et formuler des propositions concrètes.
    1. Reviewer #1 (Public review):

      Summary:

      This manuscript by Harris and Gallistel investigates how the rate of learning and strength of conditioned behavior post learning depend on the various temporal parameters of Pavlovian conditioning. They replicate results from Gibbon and Balsam (1981) in rats to show that the rate of learning is proportional to the ratio between the cycle duration and the cue duration. They further show that the strength of conditioned behavior post learning is proportional to the cue duration, and not the above ratio. The overall findings here are interesting, provide context to many conflicting recent results on this topic, and are supported by reasonably strong evidence. Nevertheless, there are some major weaknesses in the evidence presented for some of the stronger claims in the manuscript.

      Strengths:

      This manuscript has many strengths including a rigorous experimental design, several different approaches to data analysis, careful consideration of prior literature, and a thorough introduction and discussion. The central claim-that animals track the rates of events in their environment, and that the ratio of two rates determine the rate of learning-is supported with solid evidence.

      Weaknesses:

      Despite the above major strengths, some key aspects of the paper need major improvement. These are listed below.

      (1) A key claim made here is that the same relationship (including the same parameter) describes data from pigeons by Gibbon and Balsam (1981) and the rats in this study. I think the evidence for this claim is weak as presented here. First, the exact measure used for identifying trials to criterion makes a big difference in Fig 3. As best as I understand, the authors do not make any claims about which of these approaches is the "best" way. Second, the measure used for identifying trials to criterion in Fig 1 appears different from any of the criteria used in Fig 3. If so, to make the claim that the quantitative relationship is one and the same in both datasets, the authors need to use the same measure of learning rate on both datasets and show that the resultant plots are statistically indistinguishable. Currently, the authors simply plot the dots from the current dataset on the plot in Fig 1 and ask the readers to notice the visual similarity. This is not at all enough to claim that both relationships are the same. In addition to the dependence of the numbers on the exact measure of learning rate used, the plots are in log-log axis. Slight visual changes can mean a big difference in actual numbers. For instance, between Fig 3 B and C, the highest information group moves up only "slightly" on the y-axis but the difference is a factor of 5. The authors need to perform much more rigorous quantification to make the strong claim that the quantitative relationships obtained here and in Gibbon and Balsam 1981 are identical.

      (2) Another interesting claim here is that the rates of responding during ITI and the cue are proportional to the corresponding reward rates with the same proportionality constant. This too requires more quantification and conceptual explanation. For quantification, it would be more convincing to calculate the regression slope for the ITI data and the cue data separately and then show that the corresponding slopes are not statistically distinguishable from each other. Conceptually, I am confused why the data used to the test the ITI proportionality come from the last 5 sessions. Specifically, if the model is that animals produce response rates during the ITI (a period with no possible rewards) based on the overall rate of rewards in the context, wouldn't it be better to test this before the cue learning has occurred? Before cue learning, the animals would presumably only have attributed rewards in the context to the context and thus, produce overall response rates in proportion to the contextual reward rate. After cue learning, the animals could technically know that the rate of rewards during ITI is zero. Why wouldn't it be better to test the plotted relationship for ITI before cue learning has occurred? Further, based on Fig 1, it seems that the overall ITI response rate reduces considerably with cue learning. What is the expected ITI response rate prior to learning based on the authors' conceptual model? Why does this rate differ pre and post cue learning? Finally, if the authors' conceptual framework predicts that ITI response rate after cue learning should be proportional to contextual reward rate, why should the cue response rate be proportional to cue reward rate instead of cue reward rate plus contextual reward rate?

      (3) I think there was a major conceptual disconnect between the gradual nature of learning shown in Figs 7 and 8 and the information theoretic model proposed by the authors. To the extent that I understand the model, the animals should simply learn the association once the evidence crosses a threshold (nDKL > threshold) and then produce behavior in proportion to the expected reward rate. If so, why should there be a gradual component of learning as shown in these figures? In terms of the proportional response rule to rate of rewards, why is it changing as animals go from 10% to 90% of peak response? I think the manuscript would be much strengthened if these results are explained within the authors' conceptual framework. If these results are not anticipated by the authors' conceptual framework, please do explicitly state this in the manuscript.

      (4) I find the idea stated in the Conclusion section that any model considering probability of reinforcement cannot be correct because it doesn't have temporal units to be weak. I think the authors might mean that existing models based on probability do not work and not that no possible model can work. For any point process, the standard mathematical treatment of continuous time is to compute the expected count of events as p*dt where p is the probability of occurrence of the event in that time bin and dt is an infinitesimal time bin. There is obviously a one-to-one mapping between probability of an event in a point process and its rate. Existing models use an arbitrary time bin/trial and thus, I get the authors' argument in the discussion. However, I think their conclusion is overstated.

      (5) The discussion states that the mutual information defined in equation 1 does not change during partial reinforcement. I am confused by this. The mean delay between reinforcements increases in inverse proportion to the probability of reinforcement, but doesn't the mean delay between cue and next reinforcement increase by more than this amount (next reinforcement is greater than or equal to the cue-to-cue interval away from the cue for many trials)? Why is this ratio invariant to partial reinforcement?

      Comments on revisions:

      Update following revision

      (1) This point is discussed in more detail in the attached file, but there are some important details regarding the identification of the learned trial that require more clarification. For instance, isn't the original criterion by Gibbon et al. (1977) the first "sequence of three out of four trials in a row with at least one response"? The authors' provided code for the Wilcoxon signed rank test and nDkl thresholds looks for a permanent exceeding of the threshold. So, I am not yet convinced that the approaches used here and in prior papers are directly comparable. Also, there's still no regression line fitted to their data (Fig 3's black line is from Fig 1, according to the legends). Accordingly, I think the claim in the second paragraph of the Discussion that the old data and their data are explained by a model with "essentially the same parameter value" is not yet convincing without actually reporting the parameters of the regression. Related to this, the regression for their data based on my analysis appears to have a slope closer to -0.6, which does not support strict timescale invariance. I think that this point should be discussed as a caveat in the manuscript.

      (2) The authors report in the response that the basis for the apparent gradual/multiple step-like increases after initial learning remains unclear within their framework. This would be important to point out in the actual manuscript. Further, the responses indicating the fact that there are some phenomena that are not captured by the current model would be important to state in the manuscript itself.

      (3) There are several mismatches between results shown in figures and those produced by the authors' code, or other supplementary files. As one example, rat 3 results in Fig 11 and Supplementary Materials don't match and neither version is reproduced by the authors' code. There are more concerns like this, which are detailed in the attached review file.

    2. Author response:

      The following is the authors’ response to the original reviews

      ANALYTICAL

      (1) A key claim made here is that the same relationship (including the same parameter) describes data from pigeons by Gibbon and Balsam (1981; Figure 1) and the rats in this study (Figure 3). The evidence for this claim, as presented here, is not as strong as it could be. This is because the measure used for identifying trials to criterion in Figure 1 appears to differ from any of the criteria used in Figure 3, and the exact measure used for identifying trials to criterion influences the interpretation of Figure 3***. To make the claim that the quantitative relationship is one and the same in the Gibbon-Balsam and present datasets, one would need to use the same measure of learning on both datasets and show that the resultant plots are statistically indistinguishable, rather than simply plotting the dots from both data sets and spotlighting their visual similarity. In terms of their visual characteristics, it is worth noting that the plots are in log-log axis and, as such, slight visual changes can mean a big difference in actual numbers. For instance, between Figure 3B and 3C, the highest information group moves up only "slightly" on the y-axis but the difference is a factor of 5 in the real numbers. Thus, in order to support the strong claim that the quantitative relationships obtained in the Gibbon-Balsam and present datasets are identical, a more rigorous approach is needed for the comparisons.

      ***The measure of acquisition in Figure 3A is based on a previously established metric, whereas the measure in Figure 3B employs the relatively novel nDKL measure that is argued to be a better and theoretically based metric. Surprisingly, when r and r2 values are converted to the same metric across analyses, it appears that this new metric (Figure 3B) does well but not as well as the approach in Figure 3A. This raises questions about why a theoretically derived measure might not be performing as well on this analysis, and whether the more effective measure is either more reliable or tapping into some aspect of the processes that underlie acquisition that is not accounted for by the nDKL metric.

      Figure 3 shows that the relationship between learning rate and informativeness for our rats was very similar to that shown with pigeons by Gibbon and Balsam (1981). We have used multiple criteria to establish the number of trials to learn in our data, with the goal of demonstrating that the correspondence between the data sets was robust. In the revised Figure 3, specifically 3C and 3D, we have plotted trials to acquisition using decision criterion equivalent to those used by Gibbon and Balsam. The criterion they used—at least one peck at the response key on at least 3 out of 4 consecutive trials—cannot be directly applied to our magazine entry data because rats make magazine entries during the inter-trial interval (whereas pigeons do not peck at the response key in the inter-trial interval). Therefore, evidence for conditioning in our paradigm must involve comparison between the response rate during CS and the baseline response rate, rather than just counting responses during the CS. We have used two approaches to adapt the Gibbon and Balsam criterion to our data. One approach, plotted in Figure 3C, uses a non-parametric signed rank test for evidence that the CS response rate exceeds the pre-CS response rate, and adopting a statistical criterion equivalent to Gibbon and Balsam’s 3-out-of-4 consecutive trials (p<.3125). The second method (Figure 3D) estimates the nDkl for the criterion used by Gibbon and Balsam and then applies this criterion to the nDkl for our data. To estimate the nDkl of Gibbon and Balsam’s data, we have assumed there are no responses in the inter-trial interval and the response probability during the CS must be at least 0.75 (their criterion of at least 3 responses out of 4 trials). The nDkl for this difference is 2.2 (odds ratio 27:1). We have then applied this criterion to the nDkl obtained from our data to identify when the distribution of CS response rates has diverged by an equivalent amount from the distribution of pre-CS response rates. These two analyses have been added to the manuscript to replace those previously shown in Figures 3B and 3C.

      (2) Another interesting claim here is that the rates of responding during ITI and the cue are proportional to the corresponding reward rates with the same proportionality constant. This too requires more quantification and conceptual explanation. For quantification, it would be more convincing to calculate the regression slope for the ITI data and the cue data separately and then show that the corresponding slopes are not statistically distinguishable from each other. Conceptually, it is not clear why the data used to test the ITI proportionality came from the last 5 conditioning sessions. What were the decision criteria used to decide on averaging the final 5 sessions as terminal responses for the analyses in Figure 5? Was this based on consistency with previous work, or based on the greatest number of sessions where stable data for all animals could be extracted?

      If the model is that animals produce response rates during the ITI (a period with no possible rewards) based on the overall rate of rewards in the context, wouldn't it be better to test this before the cue learning has occurred? Before cue learning, the animals would presumably only have attributed rewards in the context to the context and thus, produce overall response rates in proportion to the contextual reward rate. After cue learning, the animals could technically know that the rate of rewards during ITI is zero. Why wouldn't it be better to test the plotted relationship for ITI before cue learning has occurred? Further, based on Figure 1, it seems that the overall ITI response rate reduces considerably with cue learning. What is the expected ITI response rate prior to learning based on the authors' conceptual model? Why does this rate differ from pre and post-cue learning? Finally, if the authors' conceptual framework predicts that ITI response rate after cue learning should be proportional to contextual reward rate, why should the cue response rate be proportional to the cue reward rate instead of the cue reward rate plus the contextual reward rate?

      A single regression line, as shown in Figure 5, is the simplest possible model of the relationship between response rate and reinforcement rate and it explains approximately 80% of the variance in response rate. Fixing the log-log slope at 1 yields the maximally simple model. (This regression is done in the logarithmic domain to satisfy the homoscedasticity assumption.) When transformed into the linear domain, this model assumes a truly scalar relation (linear, intercept at the origin) and assumes the same scale factor and the same scalar variability in response rates for both sets of data (ITI and CS). Our plot supports such a model. Its simplicity is its own motivation (Occam’s razor).

      If separate regression lines are fitted to the CS and ITI data, there is a small increase in explained variance (R<sub>2</sub> = 0.82). These regression lines have been added to the plot in the revised manuscript (Figure 5). We leave it to further research to determine whether such a complex model, with 4 parameters, is required. However, we do not think the present data warrant comparing the simplest possible model, with one parameter, to any more complex model for the following reasons:

      · When a brain—or any other machine—maps an observed (input) rate to a rate it produces (output rate), there is always an implicit scalar. In the special case where the produced rate equals the observed rate, the implicit scalar has value 1. Thus, there cannot be a simpler model than the one we propose, which is, in and of itself, interesting.

      · The present case is an intuitively accessible example of why the MDL (Minimum Description Length) approach to model complexity (Barron, Rissanen, & Yu, 1998; Grünwald, Myung, & Pitt, 2005; Rissanen, 1999) can yield a very different conclusion from the conclusion reached using the Bayesian Information Criterion (BIC) approach. The MDL approach measures the complexity of a model when given N data specified with precision of B bits per datum by computing (or approximating) the sum of the maximum-likelihoods of the model’s fits to all possible sets of N data with B precision per datum. The greater the sum over the maximum likelihoods, the more complex the model, that is, the greater its measured wiggle room, it’s capacity to fit data. Recall that von Neuman remarked to Fermi that with 4 parameters he could fit an elephant. His deeper point was that multi-parameter models bring neither insight nor predictive power; they explain only post-hoc, after one has adjusted their parameters in the light of the data. For realistic data sets like ours, the sums of maximum likelihoods are finite but astronomical. However, just as the Sterling approximation allows one to work with astronomical factorials, it has proved possible to develop readily computable approximations to these sums, which can be used to take model complexity into account when comparing models. Proponents of the MDL approach point out that the BIC is inadequate because models with the same number of parameters can have very different amounts of wiggle room. A standard illustration of this point is the contrast between logarithmic model and power-function model. Log regressions must be concave; whereas power function regressions can be concave, linear, or convex—yet they have the same number of parameters (one or two, depending on whether one counts the scale parameter that is always implicit). The MDL approach captures this difference in complexity because it measures wiggle room; the BIC approach does not, because it only counts parameters.

      · In the present case, one is comparing a model with no pivot and no vertical displacement at the boundary between the black dots and the red dots (the 1-parameter unilinear model) to a bilinear model that allows both a change in slope and a vertical displacement for both lines. The 4-parameter model is superior if we use the BIC to take model complexity into account. However, 4-parameter has ludicrously more wiggle room. It will provide excellent fits—high maximum likelihood—to data sets in which the red points have slope > 1, slope 0, or slope < 0 and in which it is also true that the intercept for the red points lies well below or well above the black points (non-overlap in the marginal distribution of the red and black data). The 1-parameter model, on the other hand, will provide terrible fits to all such data (very low maximum likelihoods). Thus, we believe the BIC does not properly capture the immense actual difference in the complexity between the 1-parameter model (unilinear with slope 1) to the 4-parameter model (bilinear with neither the slope nor the intercept fixed in the linear domain).

      · In any event, because the pivot (change in slope between black and red data sets), if any, is small and likewise for the displacement (vertical change), it suffices for now to know that the variance captured by the 1-parameter model is only marginally improved by adding three more parameters. Researchers using the properly corrected measured rate of head poking to measure the rate of reinforcement a subject expects can therefore assume that they have an approximately scalar measure of the subject’s expectation. Given our data, they won’t be far wrong even near the extremes of the values commonly used for rates of reinforcement. That is a major advance in current thinking, with strong implications for formal models of associative learning. It implies that the performance function that maps from the neurobiological realization of the subject’s expectation is not an unknown function. On the contrary, it’s the simplest possible function, the scalar function. That is a powerful constraint on brain-behavior linkage hypotheses, such as the many hypothesized relations between mesolimbic dopamine activity and the expectation that drives responding in Pavlovian conditioning (Berridge, 2012; Jeong et al., 2022; Y.  Niv, Daw, Joel, & Dayan, 2007; Y. Niv & Schoenbaum, 2008).

      The data in Figures 4 and 5 are taken from the last 5 sessions of training. The exact number of sessions was somewhat arbitrary but was chosen to meet two goals: (1) to capture asymptotic responding, which is why we restricted this to the end of the training, and (2) to obtain a sufficiently large sample of data to estimate reliably each rat’s response rate. We have checked what the data look like using the last 10 sessions, and can confirm it makes very little difference to the results. We now note this in the revised manuscript. The data for terminal responding by all rats, averaged over both the last 5 sessions and last 10 sessions, can be downloaded from https://osf.io/vmwzr/

      Finally, as noted by the reviews, the relationship between the contextual rate of reinforcement and ITI responding should also be evident if we had measured context responding prior to introducing the CS. However, there was no period in our experiment when rats were given unsignalled reinforcement (such as is done during “magazine training” in some experiments). Therefore, we could not measure responding based on contextual conditioning prior to the introduction of the CS. This is a question for future experiments that use an extended period of magazine training or “poor positive” protocols in which there are reinforcements during the ITIs as well as during the CSs. The learning rate equation has been shown to predict reinforcements to acquisition in the poor-positive case (Balsam, Fairhurst, & Gallistel, 2006).

      (3) There is a disconnect between the gradual nature of learning shown in Figures 7 and 8 and the information-theoretic model proposed by the authors. To the extent that we understand the model, the animals should simply learn the association once the evidence crosses a threshold (nDKL > threshold) and then produce behavior in proportion to the expected reward rate. If so, why should there be a gradual component of learning as shown in these figures? In terms of the proportional response rule to the rate of rewards, why is it changing as animals go from 10% to 90% of peak response? The manuscript would be greatly strengthened if these results were explained within the authors' conceptual framework. If these results are not anticipated by the authors' conceptual framework, this should be explicitly stated in the manuscript.

      One of us (CRG) has earlier suggested that responding appears abruptly when the accumulated evidence that the CS reinforcement rate is greater than the contextual rate exceeds a decision threshold (C.R.  Gallistel, Balsam, & Fairhurst, 2004). The new more extensive data require a more nuanced view. Evidence about the manner in which responding changes over the course of training is to some extent dependent on the analytic method used to track those changes. We presented two different approaches. The approach shown in Figures 7 and 8 (now 6 and 7), extending on that developed by Harris (2022), assumes a monotonic increase in response rate and uses the slope of the cumulative response rate to identify when responding exceeds particular milestones (percentiles of the asymptotic response rate). This analysis suggests a steady rise in responding over trials. Within our theoretical model, this might reflect an increase in the animal’s certainty about the CS reinforcement rate with accumulated evidence from each trial. While this method should be able to distinguish between a gradual change and a single abrupt change in responding (Harris, 2022) it may not distinguish between a gradual change and multiple step-like changes in responding and cannot account for decreases in response rate.

      The other analytic method we used relies on the information theoretic measure of divergence, the nDkl (Gallistel & Latham, 2023), to identify each point of change (up or down) in the response record. With that method, we discern three trends. First, the onset tends to be abrupt in that the initial step up is often large (an increase in response rate by 50% or more of the difference between its initial value and its terminal value is common and there are instances where the initial step is to the terminal rate or higher). Second, there is marked within-subject variability in the response rate, characterized by large steps up and down in the parsed response rates following the initial step up, but this variability tends to decrease with further training (there tend to be fewer and smaller steps in both the ITI response rates and the CS response rate as training progresses). Third, the overall trend, seen most clearly when one averages across subjects within groups is to a moderately higher rate of responding later in training than after the initial rise. We think that the first tendency reflects an underlying decision process whose latency is controlled by diminishing uncertainty about the two reinforcement rates and hence about their ratio. We think that decreasing uncertainty about the true values of the estimated rates of reinforcement is also likely to be an important part of the explanation for the second tendency (decreasing within-subject variation in response rates). It is less clear whether diminishing uncertainty can explain the trend toward a somewhat greater difference in the two response rates as conditioning progresses. It is perhaps worth noting that the distribution of the estimates of the informativeness ratio is likely to be heavy tailed and have peculiar properties (as witness, for example, the distribution of the ratio of two gamma distributions with arbitrary shape and scale parameters) but we are unable at this time to propound an explanation of the third trend.

      (4) Page 27, Procedure, final sentence: The magazine responding during the ITI is defined as the 20 s period immediately before CS onset. The range of ITI values (Table 1) always starts as low as 15 s in all 14 groups. Even in the case of an ITI on a trial that was exactly 20 s, this would also mean that the start of this period overlaps with the termination of the CS from the previous trial and delivery (and presumably consumption) of a pellet. It should be indicated whether the definition of the ITI period was modified on trials where the preceding ITI was < 20 s, and if any other criteria were used to define the ITI. Were the rats exposed to the reinforcers/pellets in their home cage prior to acquisition?

      There was an error in the description provided in the original text. The pre-CS period used to measure the ITI responding was 10 s rather than 20 s. There was always at least a 5-s gap between the end of the previous trial and the start of the pre-CS period. The statement about the pre-CS measure has been corrected in the revised manuscript.

      (5) For all the analyses, the exact models that were fit and the software used should be provided. For example, it is not necessarily clear to the reader (particularly in the absence of degrees of freedom) that the model discussed in Figure 3 fits on the individual subject data points or the group medians. Similarly, in Figure 6 there is no indication of whether a single regression model was fit to all the plotted data or whether tests of different slopes for each of the conditions were compared. With regards to the statistics in Figure 6, depending on how this was run, it is also a potential problem that the analyses do not correct for the potentially highly correlated multiple measurements from the same subjects, i.e. each rat provides 4 data points which are very unlikely to be independent observations.

      Details about model fitting have been added to the revision. The question about fitting a single model or multiple models to the data in Figure 6 (now 5) is addressed in response 2 above. In Figure 5, each rat provides 2 behavioural data points (ITI response rate and CS response rate) and 2 values for reinforcement rate (1/C and 1/T). There is a weak but significant correlation between the ITI and CS response rates (r = 0.28, p < 0.01; log transformed to correct for heteroscedasticity). By design, there is no correlation between the log reinforcement rates (r = 0.06, p = .404).

      CONCEPTUAL

      (1) We take the point that where traditional theories (e.g., Rescorla-Wagner) and rate estimation theory (RET) both explain some phenomenon, the explanation in terms of RET may be preferred as it will be grounded in aspects of an animal's experience rather than a hypothetical construct. However, like traditional theories, RET does not explain a range of phenomena - notably, those that require some sort of expectancy/representation as part of their explanation. This being said, traditional theories have been incorporated within models that have the representational power to explain a broader array of phenomena, which makes me wonder: Can rate estimation be incorporated in models that have representational power; and, if so, what might this look like? Alternatively, do the authors intend to claim that expectancy and/or representation - which follow from probabilistic theories in the RW mould - are unnecessary for explanations of animal behaviour?***

      It is important for the field to realize that the RW model cannot be used to explain the results of Rescorla’s (Rescorla, 1966; Rescorla, 1968, 1969) contingency-not-pairing experiments, despite what was claimed by Rescorla and Wagner (Rescorla & Wagner, 1972; Wagner & Rescorla, 1972) and has subsequently been claimed in many modelling papers and in most textbooks and reviews (Dayan & Niv, 2008; Y. Niv & Montague, 2008). Rescorla programmed reinforcements with a Poisson process. The defining property of a Poisson process is its flat hazard function; the reinforcements were equally likely at every moment in time when the process was running. This makes it impossible to say when non-reinforcements occurred and, a fortiori, to count them. The non-reinforcements are causal events in RW algorithm and subsequent versions of it. Their effects on associative strength are essential to the explanations proffered by these models. Non-reinforcements—failures to occur, updates when reinforcement is set to 0, hence also the lambda parameter—can have causal efficacy only when the successes may be predicted to occur at specified times (during “trials”). When reinforcements are programmed by a Poisson process, there are no such times. Attempts to apply the RW formula to reinforcement learning soon foundered on this problem (Gibbon, 1981; Gibbon, Berryman, & Thompson, 1974; Hallam, Grahame, & Miller, 1992; L.J. Hammond, 1980; L. J. Hammond & Paynter, 1983; Scott & Platt, 1985). The enduring popularity of the delta-rule updating equation in reinforcement learning depends on “big-concept” papers that don’t fit models to real data and discretize time into states while claiming to be real-time models (Y. Niv, 2009; Y. Niv, Daw, & Dayan, 2005).

      The information-theoretic approach to associative learning, which sometimes historically travels as RET (rate estimation theory), is unabashedly and inescapably representational. It assumes a temporal map and arithmetic machinery capable in principle of implementing any implementable computation. In short, it assumes a Turing-complete brain. It assumes that whatever the material basis of memory may be, it must make sense to ask of it how many bits can be stored in a given volume of material. This question is seldom posed in associative models of learning, nor by neurobiologists committed to the hypothesis that the Hebbian synapse is the material basis of memory. Many—including the new Nobelist, Geoffrey Hinton— would agree that the question makes no sense. When you assume that brains learn by rewiring themselves rather than by acquiring and storing information, it makes no sense.

      When a subject learns a rate of reinforcement, it bases its behavior on that expectation, and it alters its behavior when that expectation is disappointed. Subjects also learn probabilities when they are defined. They base some aspects of their behavior on those expectations, making computationally sophisticated use of their representation of the uncertainties (Balci, Freestone, & Gallistel, 2009; Chan & Harris, 2019; J. A. Harris, 2019; J.A. Harris & Andrew, 2017; J. A. Harris & Bouton, 2020; J. A. Harris, Kwok, & Gottlieb, 2019; Kheifets, Freestone, & Gallistel, 2017; Kheifets & Gallistel, 2012; Mallea, Schulhof, Gallistel, & Balsam, 2024 in press).

      (2) The discussion of Rescorla's (1967) and Kamin's (1968) findings needs some elaboration. These findings are already taken to mean that the target CS in each design is not informative about the occurrence of the US - hence, learning about this CS fails. In the case of blocking, we also know that changes in the rate of reinforcement across the shift from stage 1 to stage 2 of the protocol can produce unblocking. Perhaps more interesting from a rate estimation perspective, unblocking can also be achieved in a protocol that maintains the rate of reinforcement while varying the sensory properties of the US (Wagner). How does rate estimation theory account for these findings and/or the demonstrations of trans-reinforcer blocking (Pearce-Ganesan)? Are there other ways that the rate estimation account can be distinguished from traditional explanations of blocking and contingency effects? If so, these would be worth citing in the discussion. More generally, if one is going to highlight seminal findings (such as those by Rescorla and Kamin) that can be explained by rate estimation, it would be appropriate to acknowledge findings that challenge the theory - even if only to note that the theory, in its present form, is not all-encompassing. For example, it appears to me that the theory should not predict one-trial overshadowing or the overtraining reversal effect - both of which are amenable to discussion in terms of rates.

      I assume that the signature characteristics of latent inhibition and extinction would also pose a challenge to rate estimation theory, just as they pose a challenge to Rescorla-Wagner and other probability-based theories. Is this correct?

      The seemingly contradictory evidence of unblocking and trans-reinforcer blocking by Wagner and by Pearce and Ganesan cited above will be hard for any theory to accommodate. It will likely depend on what features of the US are represented in the conditioned response.

      RET predicts one-trial overshadowing, as anyone may verify in a scientific programming language because it has no free parameters; hence, no wiggle room. Overtraining reversal effects appear to depend on aspects of the subjects’ experience other than the rate of reinforcement. It seems unlikely that it can proffer an explanation.

      Various information-theoretic calculations give pretty good quantitative fits to the relatively few parametric studies of extinction and the partial-reinforcement extinction effect (see Gallistel (2012, Figs 3 & 4); Wilkes & Gallistel (2016, Fig 6) and Gallistel (2025, under review, Fig 6). It has not been applied to latent inhibition, in part for want of parametric data. However, clearly one should not attribute a negative rate to a context in which the subject had never been reinforced. An explanation, if it exists, would have to turn on the effect of that long period on initial rate estimates AND on evidence of a change in rate, as of the first reinforcement.

      Recommendations for authors:

      MINOR POINTS

      (1) It is not clear why Figure 3C is presented but not analyzed, and why the data presented in Figure 4 to clarify the spread of the distribution of the data observed across the plots in Figure 3 uses the data from Figure 3C. This would seem like the least representative data to illustrate the point of Figure 4. It also appears that the data plotted in Figure 4 corresponds to Figure 3A and 3B rather than the odds 10:1 data indicated in the text.

      Figures 3 has changed as already described. The data previously plotted in Figure 4 are now shown in 3B and corresponds to that plotted in Figure 3A.

      (2) Log(T) was not correlated with trials to criterion. If trials to criterion is inversely proportional to log(C/T) and C is uncorrelated with T, shouldn't trials to criterion be correlated with log(T)? Is this merely a matter of low statistical power?

      Yes. There is a small, but statistically non-significant, correlation between log(T) and trials to criterion, r = 0.35, p = .22. That correlation drops to .08 (p = .8) after factoring out log(C/T), which demonstrates that the weak correlation between log(T) and trials to criterion is based on the correlation between log(t) and log(C/T).

      (3) The rationale for the removal of the high information condition samples in the Fig 8 "Slope" plot to be weak. Can the authors justify this choice better? If all data are included, the relationship is clearly different from that shown in the plot.

      We have now reported correlations that include those 3 groups but noted that the correlations are largely driven by the much lower slope values of those 3 groups which is likely an artefact of their smaller number of trials. We use this to justify a second set of correlations that excludes those 3 groups.

      (4) The discussion states that there is at most one free parameter constrained by the data - the constant of proportionality for response rate. However, there is also another free parameter constrained by data-the informativeness at which expected trials to acquisition is 1.

      I think this comment is referring to two different sets of data. The constant of proportionality of the response rate refers to the scalar relationship between reinforcement rate and terminal response rate shown in Figure 5. The other parameter, the informativeness when trials to acquisition equals 1, describes the intercept of the regression line in Figure 1 (and 3).

      (5) The authors state that the measurement of available information is not often clear. Given this, how is contingency measurable based on the authors' framework?

      (6) Based on the variables provided in Supplementary File 3, containing the acquisition data, we were unable to reproduce the values reported in the analysis of Figure 3.

      Figure 3 has changed, using new criteria for trials to acquisition that attempt to match the criterion used by Gibbon and Balsam. The data on which these figures are based has been uploaded into OSF.

      GRAPHICAL AND TYPOGRAPHICAL

      (1) Y-axis labels in Figure 1 are not appropriately placed. 0 is sitting next to 0.1. 0 should sit at the bottom of the y-axis.

      If this comment refers to the 0 sitting above an arrow in the top right corner of the plot, this is not misaligned. The arrow pointing to zero is used to indicate that this axis approaches zero in the upward direction. 0 should not be aligned to a value on the axis since a learning rate of zero would indicate an infinite number of learning trials. The caption has been edited to explain this more clearly.

      (2) Typo, Page 6, Final Paragraph, line 4. "Fourteen groups of rats were trained with for 42 session"

      Corrected. Thank you.

      (3) Figure 3 caption: Typo, should probably be "Number of trials to acquisition"?

      This change has now been made. The axis shows reinforcements to acquisition to be consistent with Gibbon and Balsam, but trials and number of reinforcements are identical in our 100% reinforcement schedule.

      (4) Typo Page 17 Line 1: "Important pieces evidence about".

      Correct. Thank you.

      (5) Consider consistent usage of symbols/terms throughout the manuscript (e.g. Page 22, final paragraph: "iota = 2" is used instead of the corresponding symbol that has been used throughout).

      Changed.

      (6) Typo Page 28, Paragraph 1, Line 9: "We used a one-sample t-test using to identify when this".

      This section of text has been changed to reflect the new analysis used for the data in Figure 3.

      (7) Typo Page 29, Paragraph 1, Line 2: "problematic in cases where one of both rates are undefined" either typo or unclear phrasing.

      “of” has been corrected to “or”

      (8) Typo Page 30: Equation 3 appears to have an error and is not consistent with the initial printing of Equation 3 in the manuscript.

      The typo in initial expression of Eq 3 (page 23) has been corrected.

      (9) Typo Page 33, Line 5: "Figures 12".

      Corrected.

      (10) Typo Page 34, Line 10: "and the 5 the increasingly"? Should this be "the 5 points that"?

      Corrected.

      (11) Typo Page 35, Paragraph 2: "estimate of the onset of conditioned is the trial after which".

      Corrected.

      (12) Clarify: Page 35, final paragraph: it is stated that four-panel figures are included for each subject in the Supplementary files, but each subject has a six-panel figure in the Supplementary file.

      The text now clarifies that the 4-panel figures are included within the 6-panel figures in the Supplementary materials.

      (13) It is hard to identify the different groups in Figure 2 (Plot 15).

      The figure is simply intended to show that responding across seconds within the trial is relatively flat for each group. Individuation of specific groups is not particularly important.

      (14) It appears that the numbering on the y-axis is misaligned in Figure 2 relative to the corresponding points on the scale (unless I have misunderstood these values and the response rate measure to the ITI can drop below 0?).

      The numbers on the Y axes had become misaligned. That has now been corrected.

      (15) Please include the data from Figure 3A in the spreadsheet supplementary file 3. If it has already been included as one of the columns of data, please consider a clearer/consistent description of the relevant column variable in Supplementary File 1.

      The data from Figure 3 are now available from the linked OSF site, referenced in the manuscript.

      (16) Errors in supplementary data spreadsheets such that the C/T values are not consistent with those provided in Table 1 (C/T values of 4.5, 54, 180, and 300 are slightly different values in these spreadsheets). A similar error/mismatch appears to have occurred in the C/T labels for Figures (e.g. Figure 10) and the individual supplementary figures.

      The C/T values on the figures in the supplementary materials have been corrected and are now consistent with those in Table 1.

      (17) Currently the analysis and code provided at https://osf.io/vmwzr/ are not accessible without requesting access from the author. Please consider making these openly available without requiring a request for authorization. As such, a number of recommendations made here may already have been addressed by the data and code deposited on OSF. Apologies for any redundant recommendations.

      Data and code are now available in at the OSF site which has been made public without requiring request.

      (18) Please consider a clearer and more specific reference to supplementary materials. Currently, the reader is required to search through 4 separate supplementary files to identify what is being discussed/referenced in the text (e.g. Page 18, final line: "see Supplementary Materials" could simply be "see Figure S1").

      We have added specific page numbers in references to the Supplementary Materials.

    1. #!/usr/bin/env npx ts-node // TypeScript code Whether this always works in macOS is unknown. There could be some magic with node installing a shell command shim (thanks to @DaMaxContext for commenting about this). This doesn't work in Linux because Linux distros treat all the characters after env as the command, instead of considering spaces as delimiting separate arguments. Or it doesn't work in Linux if the node command shim isn't present (not confirmed that's how it works, but in any case, in my testing, it doesn't work in Linux Docker containers). This means that npx ts-node will be treated as a single executable name that has a space in it, which obviously won't work, as that's not an executable.
    1. Author Response:

      eLife assessment

      This is a valuable initial study of cell type and spatially resolved gene expression in and around the locus coeruleus, the primary source of the neuromodulator norepinephrine in the human brain. The data are generated with cutting-edge techniques, and the work lays the foundation for future descriptive and experimental approaches to understand the contribution of the locus coeruleus to healthy brain function and disease. However, due to small sample size and the need for additional confirmatory data, the data only incompletely support the main conclusions presented here. With the strengthening of the analyses, this paper, and the associated web application, will be of great interest to neuroscientists working on arousal-based behaviors and neurological and neuropsychiatric phenotypes.

      Thank you for the assessment and comments. Overall, the majority of the issues raised by the reviewers relate either directly or indirectly to limitations of the sample size that precluded further optimization of protocols and expansion of the dataset. We fully acknowledge the limited sample size in this dataset and aim to be transparent about the limitations of the study. This is the first report of snRNA-seq and spatially-resolved transcriptomics in the human locus coeruleus (LC). The LC is a very small nucleus, located deep within the brainstem, which is extremely challenging to study due to its small size, difficult to access location, and the very small number of norepinephrine (NE) neurons located within the nucleus, which were of prime interest for this study. We note that this study represents our initial attempt to molecularly and spatially characterize cell types within the human LC. We note that we did not have significant, established funding from extramural sources dedicated to this study, and tissue resources for the LC are difficult to ascertain, contributing to the small sample size in this initial study. We acknowledge that there are limitations in sample size as well as data quality. Findings from this study will be used to inform, improve, and optimize future and ongoing experimental design, as well as technical and analytical workflows for larger-scale studies. As brought up by one of the reviewers, this field is still in its infancy -- pilot experimentation in new brain regions is labor-intensive and these sequencing approaches remain costly. Moreover, due to the small size and difficulties in dissecting, tissue resources from the human brain in this area are a highly limited resource. Hence, notwithstanding limitations, in our view it is important to release the data for community access at this time. Specific responses to the reviewers’ comments are provided point-by-point in the following sections.

      Reviewer #1 (Public Review):

      Weber et al. collect locus coeruleus (LC) tissue blocks from 5 neurotypical European men, dissect the dorsal pons around the LC and prepare 2-3 tissue sections from each donor on a slide for 10X spatial transcriptomics. […] The authors transparently present limitations of their work in the discussion, but some points discussed below warrant further attention.

      Specific comments:

      1) snRNAseq:

      a. Major concerns with the snRNAseq dataset are A) the low recovery rate of putative LC-neurons in the snRNAseq dataset, B) the fact that the LC neuron cluster is contaminated with mitochondrial RNA, and C) that a large fraction of the nuclei cannot be assigned to a clear cell type (presumably due to contamination or damaged nuclei). The authors chose to enrich for neurons using NeuN antibody staining and FACS. But it is difficult to assess the efficacy of this enrichment without images of the nuclear suspension obtained before FACS, and of the FACS results. As this field is in its infancy, more detail on preliminary experiments would help the reader to understand why the authors processed the tissue the way they did. It would be nice to know whether omitting the FACS procedure might in fact result in higher relative recovery of LC-neurons, or if the authors tried this and discovered other technical issues that prompted them to use FACS.

      Thank you for these comments. We agree these are valid concerns in assessing the data quality and validity of the findings from the snRNA-seq dataset. We will respond to these concerns here to the best of our ability, but in some cases, we do not have definitive answers since comparison data are not yet available for this region. In particular, we were limited in resources for this initial study -- some of the results of the study and issues that we identified in attempting to molecularly profile cells in the human LC were surprising to us, and we intend to generate additional samples and troubleshoot these issues to improve data quality and increase recovery in future work. However, these experiments are (i) expensive, (ii) time- and labor-intensive, and (iii) the tissue for this region is limited and difficult to ascertain. Given the extremely small size of the LC, the tissue resource is quickly depleted. For this study, we had fixed resources and made best-guess decisions on how to proceed with the experimental design, based on our experience with snRNA-seq in other human brain regions (Tran and Maynard et al. 2021). However, the LC is a unique region, and our experiences with this dataset will guide us to make technical adjustments in future studies. Due to the limitations in the tissue resources and the lack of data currently available to the community, we wanted to share these results immediately while acknowledging the limitations of the study as we work to increase our resource availability to expand molecular and spatial profiling studies in this region of the human brain.

      Regarding the reviewer’s concern that our choice to use FANS to enrich for neurons could have potentially led to more damage and contributed to the low recovery rate of LC-NE neurons and the mitochondrial contamination -- we do not have a definitive answer to this question, since we did not perform a direct comparison with non-sorted data. As noted above, our limited tissue resource dictated that we could not do both. We made the decision to enrich for neurons based on our previous experience with identifying relatively rare populations in other brain regions (e.g. nucleus accumbens and amygdala; Tran and Maynard et al. 2021). Based on this previous work, our rationale was that without neuronal enrichment, we could potentially miss the LC-NE population, given the relative scarcity of this neuronal population. The low recovery rate and relatively lower quality / contamination issues may be due to technical issues that lead to LC-NE neurons being more susceptible to damage during nuclear preparation and sorting. We agree that directly comparing to data prepared without NeuN labeling and sorting is reasonable, as the additional perturbations may indeed contribute to cell damage. As mentioned in the discussion, we do not have a definitive answer to the reasons for increased mitochondrial contamination and we suspect that multiple technical factors may contribute -- including the relatively large size and increased fragility of LC-NE neurons. We agree that systematically optimizing the preparation to attempt to increase recovery rate and decrease mitochondrial contamination are important avenues for future work.

      b. It is unclear what percentage of cells that make up each cluster.

      We will add this information in the clustering heatmaps or as a supplementary plot in a revised version of the manuscript.

      c. The number of subjects used in each analysis was not always clear. Only 3 subjects were used for snRNAseq, and one of them only yielded 4 LC-nuclei. This means the results are essentially based on n=2. The authors report these numbers in the corresponding section, but the first sentence of the results section (and Figure 1C specifically!) create the impression that n=5 for all analyses. Even for spatial transcriptomics, if I understood it correctly, 1 sample had to be excluded (n=4).

      This is correct. We will update the figures and text in a revised version of the manuscript to make this limitation (small sample size) more clear, and to further emphasize that the intention of this study is to provide initial data to help determine next steps and best practices for a larger scale and more comprehensive study on this region, especially given the limited availability of tissue resources and currently limited data resources available for this region.

      2) Spatial transcriptomics:

      a. It is not clear to me what the spatial transcriptomics provides beyond what can be shown with snRNAseq, nor how these two sets of results compare to each other. It would be more intuitive to start the story with snRNAseq and then try to provide spatial detail using spatial transcriptomics. The LC is not a homogeneous structure but can be divided into ensembles based on projection specificity. Spatial transcriptomics could - in theory - offer much-needed insights into the spatial variation of mRNA profiles across different ensembles, or as a first step across the spatial (rostral/caudal, ventral/dorsal) extent of the LC. The current analyses, however, cannot address this issue, as the orientation of the LC cannot be deduced from the slices analyzed.

      We understand the point of the reviewer. However, we structured the manuscript in this format due to our aims of creating a data resource for the community as well as being transparent about the limitations of our study. Our experiments began with the spatial experiments on the tissue blocks because this (i) helped orient ourselves to the region, and (ii) provided guidance for how best to score the tissue blocks for the snRNA-seq experiments to maximize recovery of LC-NE neurons. Therefore, we also decided to present the results in this sequence.

      The spatial data also provides more information in that the measurements are from nuclei, cytoplasm, and cell processes (instead of nuclei only). This is one of the main differences / advantages between the platforms at this level of spatial resolution. As noted above, we were also working with a finite tissue resource -- if we ran snRNA-seq first and captured no neurons, the tissue block would be depleted. Due to the logistics / thickness of the required tissue sections for Visium and snRNA-seq respectively, running Visium first allowed us to ensure that we could collect data from both assays.

      Regarding a point raised below on why we only ran snRNA-seq on a subset of the donors -- this was due to resource depletion and not enough available tissue remaining on the tissue blocks to run the assay. We have conducted extensive piloting in other brain regions on the amount (mg) of tissue that is needed from various sized cryosections, and the LC is particularly difficult since these are small tissue blocks and the extent of the structure is small. Hence, in some of the subjects, we did not have sufficient tissue available for the snRNA-seq assay.

      We agree with the reviewer that spatial studies could, in future work, offer needed and important information about expression profiles across the spatial axes (rostral/caudal, ventral/dorsal) of the LC. Our study provides us with insight about optimizing the dissections for spatial assays, as well as bringing to light a number of technical and logistical issues that we had not initially foreseen. For example, during the course of this study and parallel, ongoing work in other small, challenging brain regions, we have now developed a number of specialized technical and logistical strategies for keeping track of orientation and mounting serial sections from the same tissue block onto a single spatial array, which is extremely technically challenging. We are now well-prepared for addressing these issues in future studies with larger numbers of donors and samples, e.g. spaced serial sections across the extent of the LC to make these types of insights. Due to the rarity of the tissue, limited availability of information in this region, and high expense of conducting these studies, we want to share this initial data with the community immediately. We also note that in addition to the 10x Genomics Visium platform, which lacks cellular and sub-cellular resolution, many new and exciting spatial platforms are entering the market, which may be able to address questions in very small regions such as the LC at higher spatial resolution.

      b. Unfortunately, spatial transcriptomics itself is plagued by sampling variability to a point where the RNAscope analyses the authors performed prove more powerful in addressing direct questions about gene expression patterns. Given that the authors compare their results to published datasets from rodent studies, it is surprising that a direct comparison of genes identified with spatial transcriptomics vs snRNAseq is lacking (unless this reviewer missed this comparison). Supplementary Figure 17 seems to be a first step in that direction, but this is not a gene-by-gene comparison of which analysis identifies which LC-enriched genes. Such an analysis should not compare numbers of enriched genes using artificial cutoffs for significance/fold-change, but rather use correlations to get a feeling for which genes appear to be enriched in the LC using both methods. This would result in one list of genes that can serve as a reference point for future work.

      We agree this is a good suggestion, and will add additional computational analyses to address this point in a revised version of the manuscript.

      c. Maybe the spatial transcriptomics could be useful to look at the peri-LC region, which has generated some excitement in rodent work recently, but remains largely unexplored in humans.

      We agree this is an excellent suggestion -- assessing cross-species comparisons related to convergence, especially, of GABAergic cell populations in the human LC is of high interest. We note that these types of extensions are exactly the reason why we have provided the publicly accessible web app (R/Shiny app, which includes the ability to annotate regions). We hope that others will use these apps for specialized topics they are interested in. As discussed above, we note that our initial dissections precluded the ability to keep track of the exact orientation of our tissue sections on the Visium arrays with respect to their location within the brainstem, so definitive localization of this region across subjects is difficult in our current study. However, it is possible, for example, to investigate whether there is a putative peri-LC region that is densely GABAergic that is homologous with the GABAergic peri-LC region in rodents. We also raise attention to a recent preprint by Luskin and Li et al. (2022), who apply snRNA-seq and spatially-resolved transcriptomics to molecularly define both LC and peri-LC cell types in mice -- in a revised version of our manuscript, we will extend our computational analyses of inhibitory neuronal subtypes in our data (Supplementary Figures 13, 16) to directly compare with those identified in this study in more detail. As noted above, we we have now developed a number of specialized technical and logistical strategies for keeping track of orientation of sections from the tissue block onto a single spatial array, and we feel that combined with optimized dissection strategies for this region and the guide of RNAscope for GABAergic markers on serial sections, that annotating the peri-LC region on spatial arrays in future studies will be possible.

      3) The comparison of snRNAseq data to published literature is laudable. Although the authors mention considerable methodological differences between the chosen rodent work and their own analyses, this needs to be further explained. The mouse dataset uses TRAPseq, which looks at translating mRNAs associated with ribosomes, very different from the nuclear RNA pool analyzed in the current work. The rat dataset used single-cell LC laser microdissection followed by microarray analyses, leading to major technical differences in terms of tissue processing and downstream analyses. The authors mention and reference a recent 10x mouse LC dataset (Luskin et al, 2022), however they only pick some neuropeptides from this study for their analysis of interneuron subtypes (Figure S13). Although this is a very interesting part of the manuscript, a more in-depth analysis of these two datasets would be very useful. It would likely allow for a better comparison between mouse and human, given that the technical approach is more similar (albeit without FACS), and Luskin et al have indicated that they are willing to share their data.

      As noted above, we plan to extend our comparisons with the dataset from Luskin and Li et al. (2022) in a revised version of the manuscript, which will provide a more in-depth cross-species comparison. In addition, we also note that there are some additional recent studies using TRAPseq of LC-NE neurons in a functional context, i.e. treatment vs. control experiments or in model systems (e.g. Iannitelli et al. 2023), which provide new opportunities for understanding disease context using in-depth cross-species comparisons. By providing our dataset and reproducible code, we will enable others to adapt and extend these types of comparisons (i.e. TRAPseq of LC-NE neurons or LC snRNA-seq following functional manipulations or in the context of disease or behavioral models) in the future.

      4) Statements in the manuscript about the unexpected identification of a 5-HT (serotonin) cell-cluster seem somewhat contradictory. Figure S14 suggests that 5-HT markers are expressed in the LC-regions just as much as anywhere else, but the RNAscope image in Figure S15 suggests spatial separation between these two populations. And Figure S17 again suggests almost perfect overlap between the LC and 5HT clusters. Maybe I misunderstood, in which case the authors should better clarify/explain these results.

      In our view, the most likely scenario is that the 5-HT neurons come from contamination from the dorsal raphe nucleus based on spatial separation from the RNAscope images, which we agree are more definitive. As mentioned above, since we do not have definitive documentation for the tissue sections in terms of orientation, it is difficult to say with clarity that the regions are the dorsal raphe and which sub-portion of the dorsal raphe they are. This initial study has now allowed us to optimize and improve our dissection strategy and approaches for retaining documentation of the orientation of the tissue sections from their intact position within the brainstem as they move from cryosection to placement on the array, which will enable us to better annotate regions with definitive anatomical information with respect to the rostral/caudal and dorsal/ventral axes in future experiments. Given that there are reports in the rodent that 5-HT markers have been identified in LC-NE neurons (Iijima 1993; Iijima 1989), and taking into account the technical limitations in our study, we felt that it was premature to definitively conclude in the manuscript that we were sure these signals arose from the dorsal raphe. We will update this language in a revised version of the manuscript to ensure that these limitations are clear (referring to Supplementary Figures S14-15, S17).

      Reviewer #2 (Public Review):

      The data generated for this paper provides an important resource for the neuroscience community. The locus coeruleus (LC) is the known seed of noradrenergic cells in the brain. Due to its location and size, it remains scarcely profiled in humans. Despite the physically minute structure containing these cells, its impact is wide-reaching due to the known neuromodulatory function of norepinephrine (NE) in processes like attention and mood. As such, profiling NE cells has important implications for most neurological and neuropsychiatric disorders. This paper generates transcriptomic profiles that are not only cell-specific but which also maintain their spatial context, providing the field with a map for the cells within the region.

      Strengths:

      Using spatial transcriptomics in a morphologically distinct region is a very attractive way to generate a map. Overlaying macroscopic information, i.e. a region with greater pigmentation, with its corresponding molecular profile in an unbiased manner is an extremely powerful way to understand the specific cellular and molecular composition of that brain structure.

      The technologies were used with an astute awareness of their limitations, as such, multiple technologies were leveraged to paint a more complete and resolved picture of the cellular composition of the region. For example, the lack of resolution in the spatial transcriptomic platform was compensated by complementary snRNA-seq and single molecule FISH.

      This work has been made publicly available and accessible through a user-friendly application such that any interested researcher can investigate the level of expression of their gene of interest within this region.

      Two important implications from this work are 1) the potential that the gene regulatory profiles of these cells are only partially conserved across species, humans, and rodents, and 2) that there may be other neuromodulatory cell types within the region that were otherwise not previously localized to the LC

      Weaknesses:

      Given that the markers used to identify cells are not as specific as they need to be to definitively qualify the desired cell type, the results may be over-interpreted. Specifically, TH is the primary marker used to qualify cells as noradrenergic, however, TH catalyzes the synthesis of L-DOPA, a precursor to dopamine, which in turn is a precursor for epinephrine and norepinephrine suggesting some of the cells in the region may be dopaminergic and not NE cells. Indeed, there are publications to support the presence of dopaminergic cells in the LC (see Kempadoo et al. 2016, Takeuchi et al., 2016, Devoto et al. 2005). This discrepancy is further highlighted by the apparent lack of overlap per given Visium spots with TH, SCL6A2, or DBH. While the single-nucleus FISH confirms that some of the cells in the region are noradrenergic, others very possibly represent a different catecholamine. As such it is suggested that the nomenclature for the cells be reconsidered.

      We appreciate the reviewer’s comment, and are aware of the reports suggesting the potential presence of dopaminergic cells in the LC. We initially had the same thought as the reviewer when we observed Visium spots in the spatial data with lack of overlap between TH, SLC6A2, and DBH as well as single nuclei in the snRNA-seq data with lack of overlap between TH, SLC6A2, and DBH. This surprising result was exactly why we performed the smFISH/RNAscope experiment with these three marker genes. Given known issues with read depth and coverage in the 10x Genomics assays, we wanted to better understand if this was a technical limitation in the sequencing coverage, or rather a true biological finding. The RNAscope data showed very clearly that nearly every cell body we looked at had co-localization of these three marker genes. We included an image from a single capture array of one tissue section in Supplementary Figure 11, but could, in a revised version of the manuscript, provide additional examples to illustrate how conclusive the images were by visualization. As such, we were quite convinced that the lack of overlap on Visium spots and in single nuclei in the snRNA-seq data was more likely related to technical issues with sequencing coverage, rather than a biological finding. We also note that we checked for the presence of the dopamine transporter, SLC6A3, and as can be appreciated in the iSEE web app for the snRNA-seq data or the R/Shiny web app for the Visium data, there is virtually no expression of SLC6A3 in the dataset, which in our view provides additional evidence against the possibility that there are substantial quantities of dopaminergic cells in this human LC dataset. We will include supplementary plots showing the lack of SLC6A3 expression in a revised version of the manuscript.

      The authors are unable to successfully implement unsupervised clustering with the spatial data, this greatly reduces the impact of the spatial technology as it implies that the transcriptomic data generated in the study did not have enough resolution to identify individual cell types.

      The reviewer is correct -- this is a fundamental limitation of the 10x Genomics Visium platform, i.e. the spatial resolution captures multiple cells per spot (e.g. around 1-10 cells per spot in human brain tissue). We note that new spatial platforms now provide cellular resolution (e.g. Vizgen MERSCOPE, 10x Genomics Xenium, 10x Genomics Visium HD), which will help address this in future work. However, many of these cellular-resolution in situ sequencing platforms have the limitation that they do not quantify genome-wide expression, and instead require users to select a priori gene panels to investigate. This is a problem if no genome-wide reference datasets are available. Hence, despite the limited spatial resolution of the Visium platform, this dataset is useful precisely for helping investigators choose gene panels for higher-resolution platforms or higher-order smFISH multiplexing.

      We also applied spatial clustering (using BayesSpace; Zhao et al. 2021) to attempt to segment the LC regions within the Visium samples in a data-driven manner as an alternative to the manual annotations, which was unsuccessful (and hence we relied on the manually annotated regions for downstream analyses) (Supplementary Figure S5). However, this is a different application of unsupervised clustering, which is separate from the task of identifying cell types.

      The sample contribution to the results is highly unbalanced, which consequently, may result in ungeneralizable findings in terms of regional cellular composition, limiting the usefulness of the publicly available data.

      We acknowledge the limitations of the work due to the small/unbalanced sample sizes. As mentioned above for Reviewer 1, this was an initial study in this region -- results of which will inform our (and hopefully others’) experimental design and approach to molecular profiling in this difficult to access brain region. Overall, this study was executed with finite tissue and financial resources and was intended to uncover limitations and help develop best practices and design workflows for future studies with larger numbers of donors and samples. Given the limited data availability for this brain region, we wanted to make this dataset available for the research community immediately. In addition, we note that making this genome-wide dataset available will help inform targeted gene panel design for higher-resolution platforms (e.g. 10x Genomics Xenium).

      This study aimed to deeply profile the LC in humans and provide a resource to the community. The combination of data types (snRNA-seq, SRT, smFISH) does in fact represent this resource for the community. However, due to the limitations, of which, some were described in the manuscript, we should be cautious in the use of the data for secondary analysis. For example, some of the cellular annotations may lack precision, the cellular composition also may not reflect the general population, and the presence of unexpected cell types may represent the accidental inclusion of adjacent regions, in this case, serotonergic cells from the Raphe nucleus.

      We agree, and have attempted to explain these limitations in the manuscript. We will clarify the language regarding the interpretation of the annotated cell populations and unexpected cell types, and the limited sample sizes, in a revised version of the manuscript.

      Nonetheless having a well-developed app to query and visualize these data will be an enormous asset to the community especially given the lack of information regarding the region in general.

      Reviewer #3 (Public Review):

      […] This study has many strengths. It is the first reported comprehensive map of the human LC transcriptome, and uses two independent but complementary approaches (spatial transcriptomics and snRNA-seq). Some of the key findings confirmed what has been described in the rodent LC, as well as some intriguing potential genes and modules identified that may be unique to humans and have the potential to explain LC-related disease states. The main limitations of the study were acknowledged by the authors and include the spatial resolution probably not being at the single cell level and the relatively small number of samples (and questionable quality) for the snRNA-seq data. Overall, the strengths greatly outweigh the limitations. This dataset will be a valuable resource for the neuroscience community, both in terms of methodology development and results that will no doubt enable important comparisons and follow-up studies.

      Major comments:

      Overall, the discovery of some cells in the LC region that express serotonergic markers is intriguing. However, no evidence is presented that these neurons actually produce 5-HT.

      The reviewer is correct that we did not provide any additional evidence to show that these neurons actually produce 5-HT. As noted above in the response to Reviewer 1, in our view, the most likely explanation is that these neurons are from dorsal raphe contamination on the tissue section. However, due to technical and logistical limitations in this study, we could not definitively say this because we did not clearly track the orientation of the tissue sections, and we did not have remaining tissue sections from all donor tissue blocks to repeat RNAscope experiments. For some of the donors, where we had remaining tissue sections to go back to repeat RNAscope experiments after completion of the snRNA-seq and Visium assays, we could see clear separation of the LC region / LC-NE neuron core from where putative 5-HT neurons were located (Supplementary Figure 15). However, we did not have sufficient tissue resources to map this definitively in all donors, and the orientation and anatomy of each tissue block were not fully annotated.

      Due to the lack of clarity, and the fact that there have been reports that LC-NE neurons express serotonergic markers (Iijima 1993; Iijima 1989), we felt that it was premature to definitively declare that these putative 5-HT neurons that we identified were definitively from the raphe. We will clarify the language around this discrepancy in a revised version of the manuscript to ensure that these limitations are clearly described.

      Concerning the snRNA-seq experiments, it is unclear why only 3 of the 5 donors were used, particularly given the low number of LC-NE nuclear transcriptomes obtained, why those 3 were chosen, and how many 100 um sections were used from each donor. It is also unclear if the 295 nuclei obtained truly representative of the LC population or whether they are just the most "resilient" LC nuclei that survive the process.

      As discussed above for Reviewer 1, the reason we included only 3 of the 5 donors for the snRNA-seq assays was due to the tissue availability on the tissue blocks. We will clarify the language in a revised version of the manuscript to make this limitation more clear. We will also include additional details in the Methods section on the number of 100 μm sections used for each donor (which varied between 10-15, approximating 60-80 mg of tissue).

      The LC displays rostral/caudal and dorsal/ventral differences, including where they project, which functions they regulate, and which parts are vulnerable in neurodegenerative disease (e.g. Loughlin et al., Neuroscience 18:291-306, 1986; Dahl et al., Nat Hum Behav 3:1203-14, 2019; Beardmore et al., J Alzheimer's Dis 83:5-22, 2021; Gilvesy et al., Acta Neuropathol 144:651-76, 2022; Madelung et al., Mov Disord 37:479-89, 2022). It was not clear which part(s) of the LC was captured for the SRT and snRNAseq experiments.

      As discussed above for Reviewer 1, a limitation of this study was that we did not record the orientation of the anatomy of the tissue sections, precluding our ability to annotate the tissue sections with the rostral/caudal and dorsal/ventral axis labels. We agree with the reviewer that additional spatial studies, in future work, could offer needed and important information about expression profiles across the spatial axes (rostral/caudal, ventral/dorsal) of the LC. Our study provides us with insight about optimizing the dissections for spatial assays, as well as bringing to light a number of technical and logistical issues that we had not initially foreseen. For example, during the course of this study and parallel, ongoing work in other, small, challenging regions, we have now developed a number of specialized technical and logistical strategies for keeping track of orientation and mounting serial sections from the same tissue block onto a single spatial array, which is extremely technically challenging. We are now well-prepared for addressing these issues in future studies with larger numbers of donors and samples in order to make these types of insights.

      The authors mention that in other human SRT studies, there are typically between 1-10 cells per expression spot. I imagine that this depends heavily on the part of the brain being studied and neuronal density, but it was unclear how many LC cells were contained in each expression spot.

      The reviewer is correct that we did not include this information in the manuscript. We attempted to apply a computational method to count nuclei contained in each gene expression spot based on analyzing the histological H&E images (VistoSeg; Tippani et al. 2022), which we have developed and previously applied in data from the dorsolateral prefrontal cortex (DLPFC) (Maynard and Collado-Torres et al. 2021). Based on the segmentation using this workflow we observe that the counts in this region are similar to what we observed in the DLPFC, i.e., typically between 1-10 LC cells per expression spot, with approximately 1-2 LC-NE neurons (which are characterized by their large size) per expression spot. However, these analyses had several technical issues related to the images themselves, the relatively large size and pigmentation of LC-NE neurons, and parameter settings that had been optimized for different brain regions. We are currently optimizing this analysis workflow for these images to provide more accurate estimates of cell counts per spot to give readers additional context on the number of nuclei per spot in the annotated LC regions and outside the LC regions in a revised version of the manuscript.

      Regarding comparison of human LC-associated genes with rat or mouse LC-associated genes (Fig. 2D-F), the authors speculate that the modest degree of overlap may be due to species differences between rodents and human and/or methodological differences (SRT vs microarray vs TRAP). Was there greater overlap between mouse and rat than between mouse/rat and human? If so, that is evidence for the former. If not, that is evidence for the latter. Also would be useful for more in-depth comparison with snRNA-seq data from mouse LC: https://www.biorxiv.org/content/10.1101/2022.06.30.498327v1.

      We will investigate this question and discuss this in updated results in a revised version of the manuscript.

      The finding of ACHE expression in LC neurons is intriguing, especially in light of work from Susan Greenfield suggesting that ACHE has functions independent of ACH metabolism that contributes to cellular vulnerability in neurodegenerative disease.

      We thank the reviewer for pointing this out. We were very surprised too by the observed expression of SLC5A7 and ACHE in the LC regions (Visium data) and within the LC-NE neuron cluster (snRNA-seq data), coupled with absence of other typical cholinergic marker genes (e.g. CHAT, SLC18A3), and we do not have a compelling explanation or theory for this. Hence, the work of Susan Greenfield and colleagues suggesting non-cholinergic actions of ACHE, particularly in other catecholaminergic neurons (e.g. dopaminergic neurons in the substantia nigra) is very interesting. We will include references to this work and how it could inform interpretation of this expression in a revised version of the manuscript (Greenfield 1991; Halliday and Greenfield 2012).

      High mitochondrial reads from snRNA-seq can indicate lower quality. It was not clear why, given the mitochondrial read count, the authors are confident in the snRNA-seq data from presumptive LC-NE neurons.

      We will include additional analyses to further investigate and/or confirm this finding (e.g. comparing sum of UMI counts / number of detected genes and mitochondrial percentage per nucleus for this population to confirm data quality) in additional supplementary figures in a revised version of the manuscript.

      References

      • Greenfield (1991), A noncholinergic action of acetylcholinesterase (AChE) in the brain: from neuronal secretion to the generation of movement, Cellular and Molecular Neurobiology, 11, 1, 55-77.

      • Halliday and Greenfield (2012), From protein to peptides: a spectrum of non-hydrolytic functions of acetylcholinesterase, Protein & Peptide Letters, 19, 2, 165-172.

      • Iannitelli et al. (2023), The neurotoxin DSP-4 dysregulates the locus coeruleus-norepinephrine system and recapitulates molecular and behavioral aspects of prodromal neurodegenerative disease, eNeuro, 10, 1, ENEURO.0483-22.2022.

      • Iijima K. (1989), An immunocytochemical study on the GABA-ergic and serotonin-ergic neurons in rat locus ceruleus with special reference to possible existence of the masked indoleamine cells. Acta Histochema, 87, 1, 43-57.

      • Iijima K. (1993), Chemocytoarchitecture of the rat locus ceruleus, Histology and Histopathology, 8, 3, 581-591.

      • Luskin A.T., Li L. et al. (2022), A diverse network of pericoerulear neurons control arousal states, bioRxiv (preprint).

      • Maynard and Collado-Torres et al. (2021), Transcriptome-scale spatial gene expression in the human dorsolateral prefrontal cortex, Nature Neuroscience, 24, 425-436.

      • Tippani et al. (2022), VistoSeg: processing utilities for high-resolution Visium/Visium-IF images for spatial transcriptomics data, bioRxiv (preprint).

      • Tran M.N., Maynard K.R. et al. (2021), Single-nucleus transcriptome analysis reveals cell-type-specific molecular signatures across reward circuitry in the human brain, Neuron, 109, 3088-3103.

      • Zhao E. et al. (2021), Spatial transcriptomics at subspot resolution with BayesSpace, Nature Biotechnology, 39, 1375-1384.

    1. Note de Synthèse : Audition de Najat Vallaud-Belkacem sur les Violences dans les Établissements Scolaires

      Source : Extraits de l'audition de Najat Vallaud-Belkacem, ancienne ministre de l'Éducation Nationale, par la commission d'enquête parlementaire sur les violences dans les établissements scolaires.

      Date de l'audition : 2025.

      Objet : Analyse de la prise en compte des violences exercées par des adultes ayant autorité sur des enfants en milieu scolaire, identification des failles en matière de signalement, prise en charge des victimes et gestion des auteurs.

      Principaux Thèmes Abordés :

      • Gestion des cas de violence pendant son mandat (2014-2017) : Modalités de remontée des informations, création d'outils dédiés.

      • Le logiciel "Fait Établissement" : Création, périmètre (public vs privé sous contrat), limites et évolution envisagée.

      • Relations et contrôles des établissements privés sous contrat : Nature des contrôles existants, débat sur l'élargissement du périmètre de contrôle de l'État.
      • L'affaire de Villefontaine et la loi de 2016 : Le dysfonctionnement de la transmission d'informations entre la justice et l'Éducation Nationale, la réponse législative mise en place.
      • Vérification des casiers judiciaires des personnels : Le travail inédit entrepris sous son ministère pour les personnels publics et privés, sa poursuite.
      • La libération de la parole des victimes : Fragilité de ce processus et nécessité de mesures pérennes.
      • Le rôle du SGEC (Secrétariat Général de l'Enseignement Catholique) : Interactions avec le ministère, influence.
      • Formation et bien-être des personnels de l'éducation nationale : Leur importance pour la détection et le signalement des violences

      Idées et Faits les Plus Importants :

      • Prise de conscience et émoi : L'ancienne ministre exprime son "émotion et [sa] solidarité aux victimes de Betharam et des autres établissements privés où la parole s'est récemment libérée", soulignant l'aspect "fléau systémique" des violences.
      • Mécanismes de remontée d'informations existants et améliorations :Pendant son mandat, son cabinet recevait les remontées de "faits graves" des directeurs de cabinet des recteurs.
      • Des courriers de particuliers (parfois de proches des auteurs) étaient également reçus.
      • Le bien-être des élèves était une priorité de son mandat. Des circulaires et des réunions avec les recteurs insistaient sur la remontée de tous les faits.
      • Création du logiciel "Fait Établissement" : Ce fut une réponse à une "lacune" constatée, l'absence d'un "logiciel adapté pour que ces effets puisse être remonté établissement par établissement". Cet outil, toujours en fonction, permet de remonter les faits graves, de compiler des typologies de violence anonymisées pour la prévention, et de demander de l'appui.
      • Le périmètre du logiciel "Fait Établissement" et les établissements privés sous contrat :Lors de sa création, l'outil s'adressait "à tous les établissements publics comme privés".
      • Cependant, la distinction réside dans la capacité de l'État à imposer son utilisation. Ce qui relève de la "vie scolaire" dans les établissements privés sous contrat est lié à leur "caractère propre", limitant l'action des pouvoirs publics. L'ancienne ministre est favorable à un changement de législation pour y remédier.
      • Elle note que "souvent vous allez avoir des établissements privés qui vont préférer mettre en place des dispositifs maison", lesquels "ne sont pas propices à la libération de la parole des élèves".
      • L'intégration des établissements privés sous contrat au logiciel n'était donc pas obligatoire à l'époque, même si certains l'utilisaient ("certains établissements privés sous contrat je ne suis pas en train de dire qu'aucun d'eux ne le fait").
      • La ministre actuelle a annoncé vouloir élargir le périmètre via un décret, ce que l'ancienne ministre approuve.
      • Contrôle des établissements privés :Elle rappelle que les établissements privés sous contrat sont soumis à trois types de contrôle selon le code de l'éducation : "un contrôle pédagogique, un contrôle budgétaire et un contrôle administratif".
      • "Hors de ces trois types de contrôle aujourd'hui dans les textes il n'existe pas de possibilité pour les pouvoirs publics de vérifier encore une fois que sur les questions de lutte contre le harcèlement de lutte contre les violences faites aux enfants... et bien des mesures soient prises ou respectées". Elle estime qu'il faut "changer la loi dans ce sens-là".
      • Elle reconnaît que le contrôle des établissements privés sous contrat "semble inexistant" pendant son mandat, comme il l'était avant et après.
      • Elle regrette de ne pas avoir eu le temps d'aller plus loin après les travaux menés sur les établissements hors contrat (référentiel, formation d'inspecteurs), dont la logique aurait dû s'étendre au sous-contrat.
      • L'affaire de Villefontaine (2015) et la loi de 2016 :Cette affaire (un enseignant condamné en 2008 pour détention d'images pédopornographiques qui continuait d'enseigner) a révélé un "manque de connaissance au sein de l'éducation nationale de condamnation passée d'un enseignant".
      • "Ce qui est extrêmement grave à ce moment-là" est que l'Éducation Nationale n'avait pas eu connaissance de cette condamnation, malgré des circulaires antérieures demandant la transmission par le parquet.
      • L'enquête conjointe IGENR/IGSJ a constaté que la transmission était "totalement hasardeuse".
      • La loi de 2016 : L'objectif était de "corriger" cette situation en obligeant le parquet à transmettre l'information de la mise en examen. La loi a instauré un "cadre juridique précis" pour "sécuriser les magistrats" et "créer un cadre applicable pour tous".
      • Un équilibre a été recherché entre "protection des enfants" et "respect de la présomption d'innocence et de la vie privée des personnes mises en cause".
      • La transmission avant condamnation devait être entourée de garanties (appréciation de l'autorité judiciaire, limitation des infractions, support écrit, confidentialité, information de la personne concernée, information sur l'issue définitive, effacement si non-culpabilité).
      • Elle estime que l'absence de cadre légal avant 2016 "bridait le plus souvent les magistrats en question et que par peur d'être mis en cause les parquets se dispensaient d'informer".
      • Concernant la critique selon laquelle la loi de 2016 aurait pu réduire le champ d'action du procureur (en limitant l'information au stade de l'enquête/information judiciaire et non de la garde à vue), elle maintient que la loi a "posé un cadre" et que l'équilibre trouvé était "bienvenu sur ce point". Elle est toutefois "à l'écoute" d'une possible évolution si cela ressortait des consultations, mais insiste sur la nécessité d'un "cadre clair, unifié, précis connu de tous".
      • Elle précise que ce cadre "autorisait bien l'ensemble des administrations publiques à contrôler le bulletin numéro 2 du casier judiciaire des agents en contact habituel avec des mineurs en cours de carrière", y compris dans les ministères de l'Agriculture et de la Défense, ainsi que le périscolaire.
      • Vérification du stock de casiers judiciaires :Un travail "complètement inédit" a été entrepris : "la vérification des antécédents judiciaires de tous nos personnel non seulement pour l'avenir donc dans une logique de flux mais aussi sur le passé donc dans une logique de stock".
      • 850 000 dossiers B2 et Fijaï ont été passés au crible à raison de 3000 dossiers par jour.
      • Ce travail a conduit à "26 radiations et fin de contrat".
      • Les enseignants du privé étaient "naturellement" concernés par cette vérification.
      • L'ancienne ministre ne sait pas si ce travail sur le stock a été poursuivi après son départ en mai 2017.
      • Elle explique que l'absence de transmission systématique par la justice (plutôt que de laisser la charge à l'Éducation Nationale) s'explique par "un manque de moyens considérable" de la justice française (greffiers, outils informatiques, etc.).
      • Relations avec le SGEC :Il n'y avait pas de "dîner mensuel" ou de "proximité" ou "promiscuité" particulière avec le SGEC sous son mandat.
      • Le SGEC était traité "comme tous les interlocuteurs de [son] ministère", reçu "régulièrement" pour évoquer les réformes ("programmes, réforme du collège, questions de mixité sociale, questions de valeur de la République").
      • Sa position était que "l'enseignement privé ne devait pas être considéré comme extérieur aux missions de service public de l'éducation".
      • Elle ne se souvient pas que le SGEC ait montré une "réticence" directe à l'égard du caractère obligatoire de "Fait Établissement" pour le privé, mais estime que le SGEC n'était pas forcément suivi par tous les établissements de son réseau.
      • Les rencontres avec le SGEC n'avaient pas forcément d'ordre du jour ou de compte-rendu public systématique, comme pour les autres interlocuteurs, y compris les organisations syndicales.
      • La libération de la parole et l'omerta :Elle n'a pas été "surprise" par la révélation des violences sexuelles sur mineurs (citant les travaux de la CIVIS et de la CIAS), mais par "l'ampleur la gravité la durée de ces faits et la silenciation le phénomène d'omerta total" concernant des établissements comme Betaram.
      • Elle est frappée par la "fragilité" de la libération de la parole, rappelant que des plans de lutte contre la pédocriminalité existaient à la fin des années 90, mais qu'une "parenthèse s'est refermée" (mentionnant l'affaire Outreau et la remise en question de la parole des enfants).
      • "Il faudrait cette fois-ci prendre les mesures qui s'imposent pour éviter que cette parenthèse de libération de la parole ne se referme trop vite".
      • Elle insiste sur le fait que le problème n'est pas tant les "canaux" de signalement (comme le logiciel "Fait Établissement") que la "silenciation". Il faut installer une "culture de cette parole là".
      • Elle reconnaît le problème du "cloisonnement entre les administrations" qui peut empêcher les informations de remonter au bon niveau.
      • Rôle des personnels et de leur formation :Elle cite une enquête montrant une aggravation de la victimation chez les professionnels de l'éducation, suggérant que des "climats dégradés entre adultes" peuvent miner leur capacité à détecter, écouter et signaler.
      • Elle insiste sur la nécessité de "que les personnels de l'éducation nationale soient bien traités pour se sentir bien dans leur mission" et d'être "bien formés sur toutes ces questions-là".
      • Elle regrette la réduction de la "formation continue offerte aux enseignants", estimant que cela affecte leur capacité à "voir les signaux".

      Points de Tension ou Zones de Débat :

      • Le périmètre du logiciel "Fait Établissement" et son caractère obligatoire pour le privé sous contrat : Le débat sur les moyens légaux pour l'imposer, distincts de la simple recommandation.
      • Les pouvoirs réels du SGEC : La question de son autorité sur les établissements de son réseau et l'absence de cadre formel et public pour les échanges avec le ministère.
      • L'application de la loi de 2016 sur la transmission d'informations par la justice : La question de savoir si la formalisation a pu limiter le partage d'informations à des stades précoces (garde à vue), potentiellement au détriment de la protection immédiate des enfants.
      • La poursuite de la vérification du stock de casiers judiciaires après 2017 : Une interrogation sur la continuité de cette mesure cruciale.
      • La distinction entre libération de la parole et capacité d'écoute/détection : L'ancienne ministre met l'accent sur la culture de la parole, tandis que les rapporteurs soulignent la nécessité d'apprendre "à entendre à écouter à décrypter".

      Conclusion Provisoire :

      • L'audition de Najat Vallaud-Belkacem met en lumière les efforts entrepris sous son ministère pour améliorer la gestion des violences en milieu scolaire, notamment par la création d'outils de signalement (Fait Établissement) et un cadre légal pour la transmission d'informations judiciaires (Loi de 2016), suite à des dysfonctionnements graves révélés par des affaires comme celle de Villefontaine.

      Elle souligne les limites rencontrées, en particulier concernant les établissements privés sous contrat où la capacité de l'État à imposer des mesures est restreinte par la législation actuelle.

      L'ancienne ministre insiste également sur la fragilité de la libération de la parole des victimes et sur le rôle crucial de la formation et du bien-être des personnels pour la détection et le signalement des violences.

      Des questions subsistent sur l'application effective et la poursuite de certaines mesures après son départ, ainsi que sur l'articulation entre les différents ministères et réseaux d'établissements.

    1. Author response:

      The following is the authors’ response to the original reviews

      We thank the public reviewers and editors for their insightful comments on the manuscript. We have made the following changes to address their concerns and think the resulting manuscript is stronger as a result. Specifically, we have 1) added RNA FISH data of specific STB-2 and STB-3 RNA markers to confirm their distribution changes between STB<sup>in</sup> and STB<sup>out</sup> TOs, 2) removed language throughout the text that refer to STB-3 as a terminally differentiated nuclear subtype, and 3) generated CRISPR-mediated knock-outs of two genes identified by network analysis and validated their rolse in mediating STB nuclear subtype gene expression.

      Reviewer #1 (Public review): 

      Strengths: 

      The study offers a comprehensive SC- and SN-based characterization of trophoblast organoid models, providing a thorough validation of these models against human placental tissues. By comparing the older STB<sup>in</sup> and newer STB<sup>out</sup> models, the authors effectively demonstrate the improvements in the latter, particularly in the differentiation and gene expression profiles of STBs. This work serves as a critical resource for researchers, offering a clear delineation of the similarities and differences between TO-derived and primary STBs. The use of multiple advanced techniques, such as high-resolution sequencing and trajectory analysis, further enhances the study's contribution to the field. 

      Thank you for your thoughtful review—we appreciate your recognition of our efforts to comprehensively validate trophoblast organoid models and highlight key advancements in STB differentiation and gene expression.

      Weaknesses: 

      While the study is robust, some areas could benefit from further clarification. 

      (1) The importance of the TO model's orientation and its impact on outcomes could be emphasized more in the introduction. 

      We agree that TO orientation may significantly influence STB nuclear subtype differentiation. As the STB is critical for both barrier formation and molecular transport in vivo, lack of exposure to the surrounding media in STB<sup>in</sup> TOs in vitro could compromise these functions and the associated environmental cues that influence STB nuclear differentiation. We have added text to the introduction to highlight this point (lines 117-120).

      (2) The differences in cluster numbers/names between primary tissue and TO data need a clearer explanation, and consistent annotation could aid in comparison. 

      Thank you for highlighting that the comparisions and cluster annotations need clarification. In Figure 1, we did not aim to directly compare CTB and STB nuclear subtypes between TOs and tissue. Each dataset was analyzed independently, with clusters determined separately and with different resolutions decided via a clustering algorithm (Zappia and Oshlack, 2018). For example, for the STB, this approach identified seven subtypes in tissue but only two in TOs, making direct comparison challenging. To address this challenge, we integrated the SN datasets from TOs and tissue in Figure 6. This integration allowed us to directly compare gene expression between the sample types and examine the proportions within each STB subtype. Similarly, in Figure 2, direct comparison of individual CTB or STB clusters across the separate datasets is challenging (Figures 2A-C) due to differences in clustering. To overcome this, we integrated the datasets to compare cluster gene expression and relative proportions (Figures 2D-E). Nonetheless, to address the reviewers concern we have added text to the results section to clarify that subclusters of CTB and STB between datasets should not be directly compared until the datasets are integrated in Figure 2D-E and Figure 6 (lines 166-167).

      (3) The rationale for using SN sequencing over SC sequencing for TO evaluations should be clarified, especially regarding the potential underrepresentation of certain trophoblast subsets. 

      This is an important point as the challenges of studying a giant syncytial cell are often underappreciated by researchers that study mononucleated cells. We have added text to the introduction to clarify why traditional single cell RNA sequencing techniques were inadequate to collect  and characterize the STB (lines 91-93).

      (4) Additionally, more evidence could be provided to support the claims about STB differentiation in the STB<sup>out</sup> model and to determine whether its differentiation trajectory is unique or simply more advanced than in STB<sup>in</sup>. 

      Our original conclusion that STB<sup>out</sup> nuclei are more terminally differentiated than STB<sup>in</sup> was based on two observations: (1) STB<sup>out</sup> TOs exhibit increased expression of STB-specific pregnancy hormones and many classic STB marker genes and (2) STB<sup>out</sup> nuclei show an enrichment of the STB-3 nuclear subtype, which appears at the end of the slingshot pseudotime trajectory. However, upon consideration of the reviewer comments, we agree that this evidence is not sufficient to definitively distinguish if STB<sup>out</sup> nuclei are more advanced or follow a unique differentiation trajectory dependent on new environmental cues. Pseudotime analyses provided only a predictive framework for lineage tracing, and these predictions must be experimentally validated. Real-time tracking of STB nuclear subtypes in TOs would require a suite of genetic tools beyond the scope of this study. Therefore, to address the reviewers' concerns we have removed language suggesting that STB-3 is a terminally differentiated subtype or that STB<sup>out</sup> nuclei are more differentiated than STB<sup>in</sup> nuclei throughout the text until the discussion. Therein we present both our original hypothesis (that STB nuclei are further differentiated in STB<sup>out</sup>) and alternative explanations like changing trajectories due to local environmental cues (lines 619-625).

      Reviewer #2 (Public review): 

      Strengths: 

      (1) The use of SN and SC RNA sequencing provides a detailed analysis of STB formation and differentiation. 

      (2) The identification of distinct STB subtypes and novel gene markers such as RYBP offers new insights into STB development. 

      Thank you for highlighting these strengths—we appreciate your recognition of our use of SN and SC RNA sequencing to analyze STB differentiation and the discovery of distinct STB subtypes and novel gene markers like RYBP.

      Weaknesses: 

      (1) Inconsistencies in data presentation. 

      We address the individual comments of reviewer 2 later in this response.

      (2) Questionable interpretation of lncRNA signals: The use of long non-coding RNA (lncRNA) signals as cell type-specific markers may represent sequencing noise rather than true markers. 

      We appreciate the reviewer’s attention to detail in noticing the lncRNA signature seen in many STB nuclear subtypes. However, we disagree that these molecules simply represent sequencing noise. In fact, may studies have rigorously demonstrated that lncRNAs have both cell and tissue specific gene expression (e.g., Zhao et al 2022, Isakova et al 2021, Zheng et al 2020). Further, they have been shown to be useful markers of unique cell types during development (e.g., Morales-Vicente et al 2022, Zhou et al 2019, Kim et al 2015) and can enhance clustering interpretability in breast cancer (Malagoli et al 2024). Many lncRNAs have also been demonstrated to play a functional role in the human placenta, including H19, MEG3, and MEG8 (Adu-Gyamfi et al 2023) and differences are even seen in nuclear subtypes in trophoblast stem cells (Khan et al 2021). Therefore, we prefer to keep these lncRNA signatures included and let future researchers test their functional role.

      To improve the study's validity and significance, it is crucial to address the inconsistencies and to provide additional evidence for the claims. Supplementing with immunofluorescence staining for validating the distribution of STB_in, STB_out, and EVT_enrich in the organoid models is recommended to strengthen the results and conclusions. 

      Each general trophoblast cell type (CTB, STB, EVT) has been visualized by immunofluorescence by the Coyne laboratory in their initial papers characterizing the STB<sup>in</sup>, STB<sup>out</sup>, and EVT<sup>enrich</sup> models (Yang et al, 2022 and 2023). We agree that it is important to validate the STB nuclear subtypes found in our genomic study. However, one challenge in studying a syncytia is that immunofluorescence may not be a definitive method when the nuclei share a common cytoplasm. This is because protein products from mRNAs transcribed in one nucleus are translated in the cytoplasm and could diffuse beyond sites of transcription. Therefore, RNA fluorescence in situ hybridization (RNA-FISH) is instead needed. While a systematic characterization of the spatial distribution of the many marker genes found each subtype is outside the scope of this study, we include RNA-FISH of one STB-2 marker (PAPPA2) and one STB-3 marker (ADAMTS6) in Figure 3F-G and Supplemental Figure 3.3. This demonstrates there is an increase in STB-2 marker gene expression in STB<sup>in</sup> TOs and an increase in STB-3 marker gene expression in STB<sup>out</sup> TOs. 

      Reviewer #3 (Public review):  

      The authors present outstanding progress toward their aim of identifying, "the underlying control of the syncytiotrophoblast". They identify the chromatin remodeler, RYBP, as well as other regulatory networks that they propose are critical to syncytiotrophoblast development. This study is limited in fully addressing the aim, however, as functional evidence for the contributions of the factors/pathways to syncytiotrophoblast cell development is needed. Future experimentation testing the hypotheses generated by this work will define the essentiality of the identified factors to syncytiotrophoblast development and function. 

      We thank the reviewer for their thoughtful assessment, constructive feedback, and encouraging comments. We acknowledge that the initial manuscript primarily presented analyses suggesting correlations between RYBP and other factors identified in the gene network analysis and STB function. Understanding how gene networks in the STB are formed and regulated is a long-term goal that will require many experiments with collaborative efforts across multiple research groups.

      Nonetheless, to address this concern we have knocked out two key genes, RYBP and AFF1, in TOs using CRISPR-Cas9-mediated gene targeting. Bulk RNA sequencing of STB<sup>in</sup> TOs from both wild-type (WT) and knockout strains revealed that deletion of either gene caused a statistically significant decrease in the expression of the pregnancy hormone human placental lactogen and an increase in the expression of several genes characteristic of the oxygen-sensing STB-2 subtype, including FLT-1, PAPPA2, SPON2, and SFXN3. These findings demonstrate that knocking out RYBP or AFF1 results in an increase in STB-2 marker gene expression and therefore play a role in inhibiting their expression in WT TOs (Figure 5D-E and supplemental Figure 5.2). We also note that this is the first application of CRISPR-mediated gene silencing in a TO model.

      Future work will visualize the distribution of STB nuclear subtypes in these mutants and explore the mechanistic role of RYBP and AFF1 in STB nuclear subtype formation and maintenance. However, these investigations fall outside the scope of the current study.

      Localization and validation of the identified factors within tissue and at the protein level will also provide further contextual evidence to address the hypotheses generated. 

      We agree that visualizing STB nuclear subtype distribution is essential for testing the many hypotheses generated by our analysis. To address this, we have included RNA-FISH experiments for two STB subtype markers (PAPPA2 for STB-2 and ADAMTS6 for STB-3) in TOs. These experiments reveal an increase in PAPPA2 expression in STB<sup>in</sup> TOs and an increase in ADAMTS6 expression in STB<sup>out</sup> TOs (Figure 3F-G and Supplemental Figure 3.3). Genomic studies serve as powerful hypothesis generators, and we look forward to future work—both our own and that of other researchers—to validate the markers and hypotheses presented from our analysis.

      Recommendations for the authors: 

      Reviewing Editor Comments: 

      We strongly encourage the authors to further strengthen the study by addressing all reviewers' comments and recommendations, with particular attention to the following key aspects:

      (1) Clarifying the uniqueness of the STB differentiation trajectory between STB<sup>in</sup> and STB<sup>out</sup>, and determining whether STB<sup>out</sup> represents a more advanced stage of differentiation compared to STB<sup>in</sup>. It is also important to specify which developmental stage of placental villi the STB<sup>out</sup> and STB<sup>in</sup> are simulating. 

      We have revised the manuscript to remove definitive language claiming that STB-3 represents a terminally differentiated subtype or that STB<sup>out</sup> nuclei are more differentiated than STB<sup>in</sup> nuclei. Instead, we now present our hypothesis and alternative explanations in the discussion (lines 619-625), and emphasize the need for experimental validation of pseudotime predictions to test these hypotheses.

      (2) Utilizing immunofluorescence to validate the distribution of cell types in the organoid models. 

      The Coyne lab has previously performed immunofluorescence of CTB and STB markers in STB<sup>in</sup> and STB<sup>out</sup> TOs (Yang et al 2023). The syncytial nature of STBs complicates immunofluorescence-based validation of the STB nuclear subtypes due translating proteins all sharing a single common cytoplasm and therefore being able to diffuse and mix. Instead, we performed RNA-FISH for two STB subtype markers (PAPPA2, STB-2 and ADAMTS6, STB-3), which showed subtype-specific nuclear enrichment in STB<sup>in</sup> and STB<sup>out</sup> TOs, respectively (Figure 3F-G and Supplemental Figure 3.3).  

      (3) Addressing concerns regarding the use of lncRNA as cell marker genes. Employing canonical markers alongside critical TFs involved in differentiation pathways to perform a more robust cell-type analysis and validation is recommended.  

      As discussed in detail above, we maintain that lncRNAs are valuable markers, supported by their demonstrated roles in cell and tissue specificity and placental function. These signatures provide important insights and hypotheses for future research, and we have clarified this rationale in the revised manuscript.

      Reviewer #1 (Recommendations for the authors): 

      (1) The authors have presented an extensive SC- and SN-based characterization of their improved trophoblast TO model, including a comparison to human placental tissues and the previous TO iteration. In this way, the authors' work represents an invaluable resource for investigators by providing thorough validation of the TO model and a clear description of the similarities and differences between primary and TO-derived STBs. I would suggest that the authors reshape the study to further highlight and emphasize this aspect of the study. 

      We thank the reviewer for their thoughtful recommendation and agree that our datasets will serve as an invaluable resource for comparing in vitro models to in vivo gene expression. However, extensive validation is required to make definitive conclusions about the extent to which these systems mirror one another and where they diverge. For this reason, in this manuscript, we have focused on characterizing STB subtypes to provide a foundational understanding of the model and this poorly characterized subtype.

      (2) Introduction, Paragraph 3: What is the importance of orientation for the trophoblast TO model? The authors may consider removing some of the less important methodologic details from this paragraph and including more emphasis on why their TO model is an improvement. 

      Text has been added to this paragraph to highlight the importance of outward facing STB orientation, which is essential to mirror the STB’s transport function in vitro (lines 118-120).

      (3) Results, Figure 1: In addition to the primary placental tissue plots showing all cell populations, it may be useful to have side-by-side versions of similar plots showing only the trophoblast subsets, so that the primary and TO data could be more easily compared visually. 

      This has been implemented and added to the Supplemental Figure 1.4.

      (4) Results, Figure 1: In simple terms, what is the reason for ending up with different cluster numbers/names from the primary tissue and TO? Would it be possible to apply the same annotation to each (at least for trophoblast types) and thus allow direct comparison between the two? 

      As described above, each dataset was separately analyzed and clusters determined with an algorithm to determine the optimal clustering resolution. Therefore, the number of clusters between each dataset cannot be directly compared until the SN TO and tissue datasets are integrated together in Figure 6. We have added text to the manuscript to make it clear that they should not be compared except for in bulk number until this point (230-232).

      (5) Results, Figure 2: For subsequent evaluation of different in vitro TO conditions, did the authors use only SN sequencing because they wanted to focus on STB? Based on Figure 1, it seems some CTB subsets would be underrepresented if using only SN. Given that the authors look at both STB and CTB in their different TOs, is this an issue? 

      The CTB clusters that showed the greatest divergence between SC and SN datasets were those associated with mitosis and the cell cycle, likely due to nuclear envelope breakdown interfering with capture by the 10x microfluidics pipeline. While cytoplasmic gene expression provides valuable insights into CTB function, our manuscript focuses on the STB starting from Figure 2. Since the STB is captured exclusively by the SN dataset, we concentrated on this approach to streamline our analysis.

      (6) Results, Figure 3: What do the authors consider to be the primary contributing factors for why the STB subsets display differential gene expression between STB<sup>in</sup> and STB<sup>out</sup>? Is this due primarily to the cultural conditions and/or a result of the differing spatial arrangement with CTBs? 

      This is an intriguing question that is challenging to disentangle because the culture conditions are integral to flipping the orientation. The two primary factors that differ between STB<sup>in</sup> and STB<sup>out</sup> TOs are the presence of extracellular matrix in STB<sup>in</sup> and direct exposure to the surrounding media in STB<sup>out</sup>. We believe these environmental cues play a significant role in shaping the gene expression of STB subsets. Fully disentangling this relationship would require a method to alter the TO orientation without changing the culture conditions. While this is an exciting direction for future research, it falls outside the scope of the present study.

      (7) Results, Figure 4: The authors' analysis indicates that the STB nuclei from the STB<sup>out</sup> TO are likely "more differentiated" than those in STB<sup>in</sup> TO. Could the authors provide some qualitative or quantitative support for this? Is the STB<sup>out</sup> differentiated phenotype closer to what would be observed in a fully formed placenta? 

      As discussed earlier, we agree with the reviewers that this claim should be removed from the text outside of the discussion.

      (8) Results, Figure 5: Based on the trajectory analysis, do the authors consider that the STB from STB<sup>out</sup> TO are simply further along the differentiation pathway compared to those from STB<sup>in</sup> TO, or do the STB from STB<sup>out</sup> TO follow a differentiation pathway that is intrinsically distinct from STB<sup>in</sup> TO? 

      We think the idea of an intrinsically distinct pathway is a fascinating alternative hypothesis and have added it into the discussion. We do not find the pseudotime currently allows us to answer this question without additional experiments, so we have removed claims that the STB<sup>out</sup> STB nuclei are further along the differentiation pathway.

      (9) Results, Figure 6: A notable difference between the STB<sup>out</sup> TO and the term tissue is that the CTB subsets are much more prevalent. Is this simply a scale difference, i.e. due to the size of the human placenta compared to the limited STB nuclei available in the STB<sup>out</sup> TO? Or are there other contributing factors? 

      The proportion of CTB to STB nuclei in our term tissue (9:1) aligns with expectations based on stereological estimates. We believe the relatively low number of CTB nuclei in our dataset is due to the need for a larger sample size to capture more of this less abundant cell type. Since the primary focus of this paper is on STB, and we analyzed over 4,000 STB nuclei, we do not view this as a limitation. However, future studies utilizing SN to investigate term tissue should account for the abundance of STB nuclei and plan their sampling carefully to ensure sufficient representation of CTB nuclei if this is a desired focus.

      Reviewer #2 (Recommendations for the authors): 

      (1) The color annotations for cell types in Figure 2 are inconsistent between the different panels, and the term "Prolif" in Figure 2E is not explained by the authors. 

      We chose colors to enhance visibility on the UMAP. We do not wish readers to make direct comparisons between the different CTB or STB subtypes of the sample types until the datasets are integrated in Figure 2D. This is because an algorithm for the clustering resolution has been chosen independently for each dataset. Cluster proportions are better compared in the integrated datasets in Figure 2D. We have added text to the results section to make this clear to the reader (lines 166-167).

      (2) In Figure 3 and Supplementary Figures 1.3, the authors frequently present long non-coding RNA (lncRNA) signals as cell type-specific markers in the bubble plots. These signals are likely sequencing noise and may not accurately represent true markers for those cell types. It is recommended to revise this interpretation. 

      As referenced above, there are many examples of lncRNAs that have biological and pathological significance in the placenta (H19, Meg3, Meg8) and lncRNAs often have cell type specific expression that can enhance clustering. We prefer to keep these signatures included and let future researchers determine their biological significance.

      (3) In Figure 3C, the authors performed pathway enrichment analysis on the STB subtypes after integrating STB_in and STB_out organoids. The enrichment of the "transport across the blood-brain barrier" pathway in the STB-3 subtype does not align with the current understanding of STB cell function. Please provide corresponding supporting evidence. Additionally, please verify whether the other functional pathways represent functions specific to the STB subtypes. 

      Interestingly, many of the genes categorized under “transport across the blood-brain barrier” are transporters shared with “vascular transport.” These include genes involved in the transport of amino acids (SLC7A1, SLC38A1, SLC38A3, SLC7A8), molecules essential for lipid metabolism (SLC27A4, SLC44A1), and small molecule exchange (SLC4A4, SLC5A6). Given that the vasculature, the STB, and the blood-brain barrier all perform critical barrier functions, it is unsurprising that molecules associated with these GO terms are enriched in the STB-3 subtype, which expresses numerous transporter proteins. Since the transport of materials across the STB is a well-established function, we have not included additional supporting evidence but have clarified the genes associated with this GO term in the text (lines 392-394 and supplemental Table 9).

      (4) The pseudotime heatmap in Figure 4B is not properly arranged and is inconsistent with the differentiation relationships shown in Figure 4A. It is recommended to revise this. 

      We are uncertain which aspect of the heatmap in Figure 4A is perceived as inconsistent with Figure 4B. One distinction is that pseudotime in Figure 4A is normalized from 0 to 100 to fit the blue-to-yellow-to-red color scale, whereas in Figure 4B, the color scale is not normalized and the color bar ranging from white to red. This difference reflects our intent to simplify Figure 4B-C, as the abundance of color between cell types and gene expression changes required a streamlined representation to ensure the figure remained clear and easy to interpret. This is classically done in the field and consistent with the default code in the slingshot package.

      (5) In Figures 4C and 4D, although RYBP is highly expressed in STB, it is difficult to support the conclusion that RYBP shows the most significant expression changes. It is recommended to provide additional evidence. 

      The claim that RYBP exhibits the most significant expression changes was based on p-value ordering of genes associated with pseudotime via the associationTest function in slingshot and not with immunofluorescence data. The text has been revised to make this distinction clear (lines 390-393).

      (6) In Figure 4E, staining for CTB marker genes is missing, and in Figure 4F, CYTO is difficult to use as a classical STB marker. It is recommended to use the CGBs antibody from Figure 4E as a STB marker for staining to provide evidence.  

      We have revised the Figure 5B-C to use e-Cadherin as a CTB marker gene in TOs and CGB antibody as a marker of STB.

      In tissue, however, obtaining a good STB marker that does not overlap with the RYBP antibody (rabbit) in term tissue is difficult as the STB downregulates hCG expression closer to term to initiate contractions. SDC1 is often used but only labels the plasma membrane so does not help in distinguishing the STB cytoplasm. We have added an image of cytokeratin, e-Cadherin, and the STB marker ENDOU to validate that our current approach with e-Cadherin and cytokeratin allows us to accurately distinguish between CTB and STB cells.

      (7) The velocity results in Figure 5A do not align with the differentiation relationships between cells and contradict the pseudotime results presented in Figure 4 by the authors. 

      The reviewer raises an interesting observation regarding the velocity map in Figure 5A, which appears to show a bifurcation into two STB subtypes. This observation aligns with similar findings reported in tissue by our colleagues (Wang et al., 2024). However, given the low number of CTB cells in our tissue dataset, we were cautious about making definitive conclusions about pseudotime without a larger sample size. Notably, the RNA velocity map closely resembles the pseudotime trajectory in TOs, with CTB transitioning into the CTB-pf subtype and subsequently into the STB. One potential explanation for discrepancies between tissue and TOs is the difference in nuclear age: nuclei in tissue can be up to nine months old, whereas those in TOs are only hours or days old. It is possible that the lineage in TOs could bifurcate if cultured for longer than 48 hours, but our current dataset captures only the early stages of the STB differentiation process. While exploring these hypotheses is fascinating, they are beyond the scope of this current study.  

      Reviewer #3 (Recommendations for the authors): 

      Amazing work - I greatly enjoyed reading the manuscript. Here are a few questions and suggestions for consideration: 

      Evidence presented throughout the results sections hints that the organoids may represent an earlier stage of placental development compared to the term. Increased hCG gene expression is observed, but as noted expression is decreased in term STB. STB:CTB ratios are also higher at term compared to the first trimester, etc. It was difficult to conclude definitively based on how data is presented in Fig 6 and discussed. Maybe there is no clear answer. Perhaps the altered cell type ratios in the organoid models (e.g., few STB in EVT enrich conditions) impact recapitulation of the in vivo local microenvironment signaling. As such, can the authors speculate on whether cell ratios could be strategically leveraged to model different gestational time points? 

      Along these same lines, syncytiotrophoblast in early implantation (before proper villi development) is often described as invasive and later at the tertiary villi stage defined by hormone production, barrier function, and nutrient/gas exchange. Do the authors think the different STB subtypes captured in the organoid models represent different stages/functions of syncytiotrophoblast in placental development? 

      Minor Comments 

      (1) Please clarify what the third number represents in the STB:CTB ratio (e.g., 1:3:1 and 2:5:1). EVT? 

      The first number is a decimal point and not a colon (ie 1.3 and 2.5). Therefore these numbers are to be read as the STB:CTB ratio is 1.3 to 1 or 2.5 to 1.

      (2) Could consider co-localizing RYBP in term tissue with a syncytio-specific marker like CGB used for organoids (Fig 4F). 

      We addressed this concern in comment 6 to reviewer 2.

      (3) Recommend defining colors-which colors represent which module in Figure 5C in the legend and main body text. I see the labels surrounding the heatmap in 5B, but defining colors in text (e.g. cyan, magenta, etc.) would be helpful. Do the gray circles represent targets that don't belong to a specific module? Are the bolded factor names based on a certain statistical cutoff/defining criteria or were they manually selected? 

      The text of both the results and figure legends has been revised to clarify these points.

      (4) Data Availability: It would be helpful to provide supplemental table files for analyses (e.g., 5C to list the overlapping relationships in TGs for each TF/CR (5C) and 3E/6F to list DEG genes in comparisons). 

      Supplemental files for each analysis have been added (Supplemental Table 8-14). In addition, the raw and processed data is available on GEO and we have created an interactive Shiny App so people without coding experience can interact with each dataset (lines 917-919).

      (5) “...and found that each sample expressed these markers (Figure 6D), suggesting..." Consider clarifying "these". 

      Text has been added to refer to a few of these marker genes within the text (line 540).

      Citations

      (1) Zappia L, Oshlack A. Clustering trees: a visualization for evaluating clusterings at multiple resolutions. GigaScience. 2018;7(7):giy083. PMCID: PMC6057528

      (2) Zhou J, Xu J, Zhang L, Liu S, Ma Y, Wen X, Hao J, Li Z, Ni Y, Li X, Zhou F, Li Q, Wang F, Wang X, Si Y, Zhang P, Liu C, Bartolomei M, Tang F, Liu B, Yu J, Lan Y. Combined Single-Cell Profiling of lncRNAs and Functional Screening Reveals that H19 Is Pivotal for Embryonic Hematopoietic Stem Cell Development. Cell Stem Cell. 2019;24(2):285-298.e5. PMID: 30639035

      (3) Malagoli G, Valle F, Barillot E, Caselle M, Martignetti L. Identification of Interpretable Clusters and Associated Signatures in Breast Cancer Single-Cell Data: A Topic Modeling Approach. Cancers. 2024;16(7):1350. PMCID: PMC11011054

      (4) Adu-Gyamfi EA, Cheeran EA, Salamah J, Enabulele DB, Tahir A, Lee BK. Long non-coding RNAs: a summary of their roles in placenta development and pathology†. Biol Reprod. 2023;110(3):431–449. PMID: 38134961

      (5) Zheng M, Hu Y, Gou R, Nie X, Li X, Liu J, Lin B. Identification three LncRNA prognostic signature of ovarian cancer based on genome-wide copy number variation. Biomed Pharmacother. 2020;124:109810. PMID: 32000042

      (6) Khan T, Seetharam AS, Zhou J, Bivens NJ, Schust DJ, Ezashi T, Tuteja G, Roberts RM. Single Nucleus RNA Sequence (snRNAseq) Analysis of the Spectrum of Trophoblast Lineages Generated From Human Pluripotent Stem Cells in vitro. Front Cell Dev Biol. 2021;9:695248. PMCID: PMC8334858

      (7) Isakova A, Neff N, Quake SR. Single-cell quantification of a broad RNA spectrum reveals unique noncoding patterns associated with cell types and states. Proc Natl Acad Sci United States Am. 2021;118(51):e2113568118. PMCID: PMC8713755

      (8) Morales-Vicente DA, Zhao L, Silveira GO, Tahira AC, Amaral MS, Collins JJ, Verjovski-Almeida S. Singlecell RNA-seq analyses show that long non-coding RNAs are conspicuously expressed in Schistosoma mansoni gamete and tegument progenitor cell populations. Front Genet. 2022;13:924877. PMCID: PMC9531161

      (9) Kim DH, Marinov GK, Pepke S, Singer ZS, He P, Williams B, Schroth GP, Elowitz MB, Wold BJ. Single-Cell

      Transcriptome Analysis Reveals Dynamic Changes in lncRNA Expression during Reprogramming. Cell Stem Cell. 2015;16(1):88–101. PMCID: PMC4291542

      (10) Yang L, Liang P, Yang H, Coyne CB. Trophoblast organoids with physiological polarity model placental structure and function. bioRxiv. 2023;2023.01.12.523752. PMCID: PMC9882188

    1. Note de Synthèse : La Lutte pour le Droit de Vote des Femmes Européennes

      Source : Extraits du documentaire "Citoyennes ! | Documentaire complet LCP"

      Introduction

      • Le documentaire "Citoyennes !" retrace l'histoire complexe et souvent tumultueuse de l'accès des femmes européennes au droit de vote, depuis la Révolution française jusqu'à la fin du XXe siècle.

      Il souligne que cette histoire est loin d'avoir été un "long fleuve tranquille", marquant une lutte acharnée menée par des groupes de femmes (avec le soutien de quelques hommes) pour faire reconnaître leur statut de citoyennes à part entière.

      Le documentaire explore les raisons de la longue hésitation des pays européens à accorder ce droit, la manière dont les femmes "citoyennes sans citoyenneté" ont participé à la vie politique et sociale, et les figures marquantes de ce combat.

      Thèmes Principaux et Idées Clés

      Une Lutte Ancienne et Persistante :

      La participation des femmes à la vie publique et politique remonte bien avant la Révolution française, avec des figures historiques comme Jeanne d'Arc (bien que récupérée par le roman national) et des centaines d'autres femmes combattantes au Moyen-Âge.

      À partir de la fin du XVIIe et début du XVIIIe siècle, les salons littéraires et scientifiques tenus par des aristocrates et grandes bourgeoises offrent un espace de débat où les femmes peuvent s'exprimer sur un pied d'égalité avec les hommes.

      La Révolution Française : Un Moment Crucial mais Limité :

      La Révolution française marque l'émergence des premières militantes au sens propre du terme, souvent issues des milieux populaires et de la petite bourgeoisie, comme les "dames de la halle" ou les "Poissardes".

      Elles sont les premières pétitionnaires à plaider la cause du peuple, bien que leur mouvement soit plus axé sur des besoins fondamentaux comme "du pain pour le peuple, du pain pour leurs enfants" que sur un féminisme politique.

      Les "citoyennes tricoteuses", assistant aux procès, symbolisent la présence féminine dans l'espace public, même sans droit de parole initialement.

      Elles sont dépeintes de manière négative dans la propagande royaliste.

      Des figures marquantes comme Olympe de Gouge ("l'autrice de la déclaration des droits de la femme, qui est vraiment le premier manifeste féminin"), guillotinée pour ses idées, montrent l'engagement politique radical mais aussi ses dangers.

      D'autres militantes comme Manon Roland, Théroigne de Méricourt, Claire Lacombe et Pauline Léon plaident notamment pour le droit des femmes à l'autodéfense et à s'armer face aux violences.

      Cependant, la Révolution française n'apporte pas de gains significatifs pour les femmes en termes de droits politiques. S'ensuit un "grand renfermement des femmes" qui préfigure la révolution conservatrice napoléonienne.

      Le Code Civil de Napoléon (1804) : La Légalisation de l'Enfermement Domestique :

      Le Code Civil de 1804 établit un cadre légal qui confine les femmes à la sphère domestique, limitant leur liberté et leurs droits.

      "on va même créer un cadre légal à cet enfermement des femmes qui vont être dès lors complètement cantonné à la sphère domestique [...] ce qui va être traduit en fait en texte avec le code civil de Napoléon en en 1804". Ce code est également la traduction des mœurs et coutumes de l'époque.

      L'Émergence du Mouvement Suffragiste au XIXe Siècle :

      La Révolution de 1830 ouvre les consciences, et des femmes commencent à s'organiser en groupes en France, en Angleterre et dans d'autres pays européens.

      Les suffragistes militent pour deux droits fondamentaux : le droit de vote ("le suffrage") et le droit d'être élue. Jeanne Deroin est la première Française à se présenter à une élection législative en 1849.

      Le sujet du droit de vote des femmes est mis sur la table à partir de 1848, avec des auteurs féminins et masculins qui s'en emparent dans la presse.

      Divergences et Soutiens au XIXe Siècle :

      George Sand, figure féminine majeure du XIXe siècle, bien que provocatrice sur le plan des mœurs ("s'habille en pantalon qui fume le cigare qui revendique des amours homosexuels lesbiennes et cetera"), est contre le vote des femmes, estimant qu'elles ne sont pas assez éduquées et que la priorité doit être les droits civils.

      Elle craint que leur vote ne soit influencé par leur mari ou l'Église.

      Victor Hugo, en revanche, est un fervent défenseur du droit des femmes, exprimant son soutien publiquement, notamment lors d'un banquet républicain en 1872.

      Figures Pionnières du Féminisme :

      Hubertine Auclert est la première militante à se déclarer "féministe" en France.

      Elle rejoint la Ligue française pour le droit des femmes en 1877 et milite pour le droit de vote, l'indépendance financière, la séparation des biens et la féminisation de certains mots.

      Elle se présente symboliquement aux élections législatives en 1910. "Une République qui maintiendra les femmes dans une condition d'infériorité ne pourra pas faire les hommes égaux".

      Le Mouvement Britannique des Suffragettes : Radicalisme et Action Directe :

      Organisées depuis le milieu du XIXe siècle, les femmes britanniques, inspirées notamment par les campagnes de John Stuart Mill pour le droit de vote des femmes, intensifient leur action.

      En 1897, plusieurs groupes se fédèrent pour former la National Union of Women's Suffrage Societies (NUWSS). John Stuart Mill dépose régulièrement des propositions de loi pour le vote des femmes au Parlement, sans succès.

      Lassées du pacifisme, les suffragettes, menées par Emeline Pankhurst et ses filles, adoptent des méthodes plus radicales au début du XXe siècle.

      Leur slogan est "Des actes, pas des mots".

      Elles mènent des actions directes, parfois violentes :

      incendies, explosions, bris de vitres, coupure de fils téléphoniques, incendie de boîtes aux lettres. Ces actions les rendent impopulaires en Europe.

      Elles recourent à la grève de la faim en prison, entraînant des gavages forcés ("aujourd'hui on appellerait ça un viol finalement puisque c'est l'intrusion violente d'un corps étranger dans le corps d'une femme sans son consentement"). Ces violences institutionnelles choquent l'opinion publique.

      L'action la plus spectaculaire est celle d'Émilie Davidson, qui se sacrifie lors du derby de Londres en 1913 pour attirer l'attention sur leur cause.

      "Ces pétroleuses iront jusqu'au suicide politique comme Émilie Davidson qui n'hésite pas en 1913 à se faire piétiner par un cheval de l'écurie du roi George 5 lors du derby de Londres".

      Les Premiers Pays à Accorder le Droit de Vote : Nouvelle-Zélande et Finlande :

      La Nouvelle-Zélande est le premier pays au monde à accorder le droit de vote aux femmes en 1893.

      La Finlande est le premier pays en Europe à accorder le droit de vote et d'éligibilité aux femmes en 1906. "Cela a été possible grâce à des femmes des régions rurales et des villes qui se sont réunies et qui ont dit 'Nous avons une opportunité. Nous sommes en train d'écrire l'histoire.'"

      Une hypothèse avancée pour cette précocité est que ces pays étaient des nations jeunes, imprégnées d'une culture des affaires où le pouvoir politique était peut-être moins central que le pouvoir économique.

      En Finlande, la lutte pour l'indépendance vis-à-vis de l'empire russe a créé un sentiment d'unité, et les femmes jouaient déjà des rôles importants dans la société. Le principe de solidarité a justifié leur participation politique.

      La Première Guerre Mondiale : Un Tournant Ambigü : La déclaration de guerre en 1914 suspend les actions militantes en Europe, les femmes participant à l'effort de guerre (les "munitionnettes").

      Certains hommes proposent le "vote des morts" pour que les femmes puissent représenter les soldats tombés au combat, reconnaissant symboliquement leur présence dans l'espace public.

      Ce vote est adopté au Royaume-Uni et en Belgique.

      Au Royaume-Uni, le droit de vote est accordé en 1918 à certaines catégories de femmes de plus de 30 ans ("soit chef de famille soit épouse de chef de famille propriétaire d'un bien immobilier d'une valeur d'au moins 5 livres par an ou diplômé de l'université").

      Ce geste est présenté comme une récompense pour leur contribution à l'effort de guerre, occultant les années de lutte des suffragettes. Le suffrage égal à celui des hommes n'est obtenu qu'en 1928.

      Dans d'autres pays (Russie, Allemagne, pays d'Europe centrale), le droit de vote est accordé aux femmes après 1918, souvent en lien avec des changements de régime et l'éclatement des empires.

      Les Années de l'Entre-Deux-Guerres : Progrès et Résistances :

      La majorité des pays européens accordent le droit de vote aux femmes entre les deux guerres, parfois sous conditions.

      La France reste l'un des "irréductibles" qui résistent. Plusieurs propositions de loi sont bloquées par le Sénat, qui maintient une vision traditionnelle du rôle des femmes. Un sénateur décrit les mains des femmes comme "admirables et toutes semblables à des oiseaux", faites pour être baisées, pas pour voter.

      Louise Weiss emerge comme une figure clé dans les années 1930, utilisant des méthodes radicales et modernes pour bousculer l'inertie politique (candidatures symboliques, manifestations burlesques, distribution de tracts lors d'événements publics).

      "Elle avait finalement des modes d'action hyper moderne qu'on a pu voir ensuite utilisé par des groupes comme Greenpeace ou les Femon".

      Le Front Populaire (1936) : Une Ouverture Limitée : Pour la première fois, trois femmes entrent au gouvernement sous le Front Populaire, en tant que sous-secrétaires d'État.

      C'est une avancée symbolique. Cependant, Léon Blum, chef du gouvernement, craint le vote des femmes, pensant qu'elles pencheraient vers un vote catholique.

      Il demande aux femmes ministres de renoncer à la question du droit de vote.

      Cette expérience met en évidence la forte hostilité au vote des femmes au sein même du camp majoritaire (communistes, radicaux-socialistes).

      La Seconde Guerre Mondiale et l'Obtention du Vote en France (1944) :

      La déclaration de guerre en 1939 relègue à nouveau le droit de vote des femmes au second plan.

      À partir de 1943, la France Libre à Alger prépare le futur régime. Le Général de Gaulle signe l'ordonnance du 21 avril 1944 accordant aux femmes majeures le droit d'être élues et électrices, sans restriction. "C'est absent des débats. Ça va arriver tardivement en fait dans le discours de du général de Gaulle".

      Le vote est accordé en 1944, et les femmes peuvent voter pour la première fois aux élections municipales et législatives en 1945. La participation des femmes à la Résistance et leur présence accrue dans l'économie de guerre ont contribué à cette décision. "Il est indispensable que les femmes votent, une nécessité absolue, normale, un droit".

      La France, patrie des droits de l'homme, arrive tardivement en comparaison d'autres pays européens. Une explication avancée est que le changement de régime (fin de la IIIe République et passage à un nouveau régime) a créé une opportunité, comme dans d'autres pays.

      La forte présence de figures féminines allégoriques (Marianne) dans l'espace symbolique républicain est vue comme une compensation pour leur absence dans l'espace démocratique réel. "l'omniprésence de ces figures féminines dans l'ordre symbolique républicain c'est une façon de faire oublier leur absence dans l'espace public démocratique".

      La Suisse : Un Cas Particulier de Retardataire : La Suisse est l'un des derniers pays au monde à accorder le droit de vote aux femmes. "C'est vraiment pas une histoire glorieuse".

      Plus de 50 votations ont lieu entre 1919 et 1990 sur le sujet.

      Trois raisons expliquent ce retard : Le système politique de démocratie semi-directe, où le peuple (masculin) est consulté et vote "non" à de nombreuses reprises.

      Une société relativement conservatrice, influencée par le modèle germanique, où la place des femmes est traditionnellement au foyer.

      L'épargne de la Suisse par les guerres mondiales, qui n'a donc pas eu la même dynamique de reconnaissance du rôle des femmes que d'autres pays.

      Le mouvement suffragiste suisse privilégie des méthodes pacifiques, de pétitions et de persuasion, se démarquant de l'action violente des suffragettes anglaises.

      L'avocate Antoinette Quinche est une figure marquante, argumentant que la Constitution suisse de 1848 garantit l'égalité des droits pour tous les Suisses (hommes et femmes).

      Malgré l'obtention tardive du droit de vote, de nombreuses femmes ont mis du temps à oser voter, ne sachant pas toujours comment utiliser ce droit.

      Les Combats Post-Suffrage : Droits Civils et Vagues Féministes :

      L'obtention du droit de vote en 1944 en France ne marque pas la fin de la lutte pour les droits des femmes.

      Les droits civils, notamment pour les femmes mariées, doivent encore être obtenus. Il faut attendre 1965 et 1970 pour que les femmes mariées acquièrent l'autonomie sur leur compte en banque et ne nécessitent plus l'autorisation de leur mari pour travailler ou gérer leurs biens.

      "On va devoir attendre 1965 et 1970 pour qu'il y ait des réformes qui viennent donner un statut d'adulte aux femmes mariées et aux femmes mères". L'autorité parentale remplace l'autorité paternelle en 1970.

      Le documentaire évoque les différentes "vagues" féministes :

      La première vague (début du XXe siècle) axée sur le droit de vote.

      La deuxième vague (années 1970) axée sur la libération des corps et l'égalité des sexes.

      La troisième vague (années 1990) marquée par l'approche intersectionnelle, reconnaissant le cumul des formes de domination (genre, race, classe, etc.).

      Le mouvement #MeToo est présenté comme une évolution majeure, un mouvement mondial d'une ampleur inédite, bien que principalement circonscrit au monde occidental.

      Le combat féministe actuel est de plus en plus perçu comme un "combat humaniste", incluant la question des masculinités et la nécessité de libérer les hommes des stéréotypes de performance et de non-émotion. Conclusion

      L'histoire du droit de vote des femmes en Europe est une épopée longue et difficile, marquée par la persévérance de militantes qui ont dû affronter l'hostilité, l'humiliation, l'emprisonnement et même la violence d'État.

      Si l'obtention du droit de vote a constitué une victoire majeure, elle n'était qu'une étape dans la quête d'une pleine égalité, les droits civils ayant dû être arrachés ultérieurement dans de nombreux pays.

      La diversité des tactiques militantes, des approches nationales (radicalisme anglo-saxon versus approche réformiste suisse) et des temporalités souligne la complexité de ce processus historique.

      Les combats actuels continuent d'élargir la réflexion, intégrant les questions d'intersectionnalité et de redéfinition des masculinités dans une perspective humaniste plus large.

      La France, malgré son rôle dans la Révolution et sa représentation symbolique des femmes (Marianne), a été un des derniers pays à accorder ce droit fondamental, un paradoxe qui interroge.

    1. model, a de novo prediction model, a combination of library search and de novo prediction, and MS2 clustering—to estimate the number of unique structures. Our methods suggest that the number of unique compounds in the metabolomics dataset alone may already surpass existing estimates of plant chemical diversity. Finally, we project these findings across the entire plant kingdom, conservatively estimating that the total plant chemical space likely spans millions, if not more, with the vast majority still unexplored.

      Reviewer name: Kohulan Rajan Reviewer Comments: Review: Defining the limits of plant chemical space: challenges and estimations This work presents an important contribution to understanding the chemical diversity of plants through a systematic analysis combining metabolomics data and literature mining. The authors address a question in the field and employs multiple complementary approaches to estimate the size of the plant chemical space. Here are my few suggestions and question to the authors to clarify, 1. When introducing an abbreviation one could use caption letters "Natural Products (NP)" 2. There is no list of abbreviations in the document, so introduce them first and then use them. There may be some readers who are unfamiliar with the terms COCONUT and LOTUS. 3. Is there any prior work using similar combined metabolomics/literature approaches to estimate plant chemical space? If so, these should be cited. If not, please state this explicitly to highlight the novelty of your method. 4. Cite SMILES 5. While the paper describes the use of 'literature datasets,' it appears that only existing databases (COCONUT and LOTUS) are being utilized. It would be helpful if authors could clarify whether any direct literature mining was conducted. If not, consider revising terminology to more accurately reflect the use of curated databases rather than primary literature sources. 6. Great to see the data and code openly shared on both Zenodo and GitHub. I also find the GitHub repository very useful with regard to all the provided notebooks. To maximize reusability, please consider adding a detailed "How to Use" section to the README that guides others in replicating or building upon this work. 7. The different clustering thresholds (0.7 vs 0.8) lead to notably different estimates. Could you discuss which threshold might be more appropriate for this specific application to plant metabolomics data?

    2. The plant kingdom, encompassing nearly 400,000 known species, produces an immense diversity of metabolites, including primary compounds essential for survival and secondary metabolites specialized for ecological interactions. These metabolites constitute a vast and complex phytochemical space with significant potential applications in medicine, agriculture, and biotechnology. However, much of this chemical diversity remains unexplored, as only a fraction of plant species have been studied comprehensively. In this work, we estimate the size of the plant chemical space by leveraging large-scale metabolomics and literature datasets. We begin by examining the known chemical space, which, while containing at most several hundred thousand unique compounds, remains sparsely covered. Using data from over 1,000 plant species, we apply various mass spectrometry-based approaches—a formula prediction

      Reviewer name: Carlos RodrÃ-guez-López Reviewer Comments In the reviewed manuscript, Chloe Engler Hart et al. utilize different approaches to estimate the size of plant chemical space through analysis of publicly available datasets of mass spectrometry-based metabolomics. The authors tackle this issue by using data from ca. 2,000 LC-MS runs, and different formula predictors and structure annotation algorithms, and extrapolate to the estimated number of plant species. While the approach is useful at estimating structural variation, and the collected data and here-published source code can certainly be of use to the plant metabolomics community, I consider the manuscript requires modifications before it can be recommended for publication. Particularly, the language of the article should more accurately reflect the nature of this estimate; for example, mentions of the approach being "the most accurate estimate possible" (p.8, section 3.2) are not supported, and throughout the article, mentions of the calculation as a "conservative estimate" are not consistent with the approaches used, beyond formula prediction. E.g. it is mentioned that the MS2 curve being lower than formula prediction suggests that the curves may be conservative without further clarification on why this might be the case and not, e.g., a product of estimates dispersion. The authors mention that since they identify most limitations (in table 2, p. 13) are underestimations (again, with limited or no explanation) their estimate is conservative. Since no effect size can be calculated on these limitations, this statement is not true; e.g. if the approach is missing half of molecules due to extraction, and another half due to tissue coverage (total, ¼), but overestimating the plateau of plant chemical diversity by 100-fold, even if more factors underestimate the chemical space, the effect size of the latter would be dominant by far. I recommend the authors to change mentions of this estimate being a conservative approach, and instead clearly mention that this is a fragmentation-based estimate, or a similar term that better reflects the nature of the figure. Similarly, assumptions on the models should be explicitly stated, along with their limitations. The authors, for example, rely on CID induced fragmentation, and they mention that the estimate "[relies] on the predominant adduct ([M+H]+)" (p.15) and thus "this likely underestimates the true chemical diversity, as other adduct forms" (p.15). It should be stated that this is an assumption: the authors do not have evidence for the adducts being [M+H]+, which is nigh impossible with the available data, they are assuming all features are [M+H]+ adducts. This carries the implicit assumption that fragmentation mechanisms will be the same for all MS2 spectra and thus structural diversity can be estimated through MS2 clusters. It is unclear how this would yield an underestimation, as the authors claim, but rather yields an overestimation, as fragmentation of [M+H]+ and e.g. [M+Na]+ adducts of the same molecule would yield different fragmentation patterns, given the former favors charge migration dependent mechanisms compared to the latter. Thus, since the authors consider all features to be [M+H]+, two adducts of the same molecule might be considered as different moieties, given that fragmentation patterns will differ, even if no difference exists. On the same vein, since similarity thresholds of the MS2Mol algorithm are essential for the estimation of diversity, the authors should clearly state how are they calculated in text, not by reference, along with potential limitations. Finally, I believe the work would greatly benefit from including data on phylogenetics of the samples, adding diversity estimates to their sample and extrapolation data. If, for example, most of the 400,000 plant species are phylogenetically distant from the sampled species, then the reader can reasonably assume that this might be an underestimation of chemical diversity when presented with the evidence. If, on the other hand, the original sample has more diversity than the total number of plant species, this might not be the case. In any case, all of the relevant assumptions should be clearly stated. Minor note: One of the main arguments for extrapolating the diversity estimate into the rest of the plants comes from Figure 3D, where increasing MS1 adducts increases with number of samples; it would greatly help explaining the difference seen between species if the authors clarify the tissues sampled per species. E.g. if the species that only doubles the number of features contains only aerial and vegetative tissue, compared to the species that increases 6fold which might include root or reproductive tissue, etc. This might also help the authors in justifying the extrapolation of the estimate.

    1. Welcome to Python Online Compiler!Key features:• Run Python code directly in your browser• Install packages with pip• Share your code with others• Multiple tabs for different files• Light and dark themes

      Say goodbye to Python setup hassles! Code directly in your browser with zero installation needed. Write your code, hit RUN, and see results instantly. Our clean, powerful editor gives you syntax highlighting, multiple tabs, and easy package installation with pip.

      Need to share your work? One click generates a link anyone can use to view and run your code. Switch between light and dark themes, customize your layout, and code Python anywhere, anytime. Try our random code examples to get started quickly.

      Python Online - the simplest way to write, run, and share Python code!

    1. recommendation algorithm may base its decisions on how users engage with content, the biases of users play into what gets boosted by the algorithm

      Since the bias of users can play a part in what the algorithm pushes out/recommends to people, how might this worsen if there is already bias present in the code? How might marginalized groups struggle further because of this bias?

    1. Civil law systems tend to not have binding case law

      Civil Law: "Code-based law." Main source of law: Written codes and statutes passed by legislators. Judges apply the law to cases but don’t create binding precedents.

    1. Consider impact vs. intent. For example, consequentialism only cares about the impact of an action. How do you feel about the importance of impact and intent in the design of recommendation algorithms?

      I think impact matters most, even well‑intentioned algorithms can fuel outrage or misinformation if their real‑world effects aren’t tracked. So platforms should watch actual outcomes not just code, they should allow users to easily flag unwanted content and potentially test tweaks on small groups first. This way by focusing on impact and keeping good intent, recommendations can help more than they harm.

    1. constituted an effective prohibition of all depictions of queerness in Hollywood film

      since this is no longer the case (the hays code in 1933), is it important to use our power, our = specifically queer directors, to make depictions of queerness accurate (not stereotypical) and artistic (to display an important message through these depictions)

    Annotators

    1. NOTE D'INFORMATION

      Objet : Examen des modalités de contrôle et de prévention des violences dans les établissements scolaires - Audition de représentants du Ministère de l'Éducation Nationale, de l'Enseignement Supérieur et de la Recherche.

      • Date : [Insérer la date de l'audition - non précisée dans l'extrait]

      • Source : Extraits de "🔴 L'éducation nationale face aux violences dans les établissements scolaires" (Transcription d'une audition parlementaire).

      Résumé Exécutif :

      • Cette audition parlementaire a réuni des représentants de plusieurs directions du ministère de l'Éducation Nationale pour aborder les enjeux du contrôle et de la prévention des violences en milieu scolaire, avec une attention particulière portée aux établissements privés sous contrat.

      Les discussions ont mis en lumière la complexité de la chaîne de signalement et de traitement des violences, le rôle crucial mais perfectible de l'inspection (académique et générale), les défis liés au contrôle des établissements privés sous contrat (notamment en termes de moyens et de cadre juridique), et les procédures de suivi des violences commises par les personnels.

      La question de la laïcité et du dialogue avec le Secrétariat Général de l'Enseignement Catholique (SGEC) a également été soulevée.

      Les représentants du ministère ont exprimé leur bouleversement face aux témoignages de victimes et leur volonté d'améliorer les procédures existantes.

      Thèmes Principaux :

      • Organisation du Ministère et Partage des Responsabilités : Clarification des rôles de la DGESCO (pédagogie, éducation, protection de l'enfance), DGRH (gestion RH public), DAF (finances, réglementaire, RH privé), Direction des Affaires Juridiques (DAJ) et du Haut Fonctionnaire de Défense et de Sécurité (HFDS) dans le traitement des signalements et la prévention des violences.

      • Procédure de Signalement des Violences : Description de la chaîne de signalement depuis le chef d'établissement/directeur d'école (premières mesures, signalement au procureur/CRIIP, numéro harcèlement) jusqu'au niveau départemental (DSDEN/Rectorat) et national (bureau de la veille et de l'alerte du HFDS). Utilisation de l'application "fait établissement".

      • Rôle et Méthodes de l'Inspection : Examen des critères de déclenchement des inspections (académique vs. générale), des méthodes d'entretien (panel représentatif, appel à témoins, entretiens hors établissement), de l'analyse et de la restitution des témoignages. Illustration avec l'inspection du collège Stanislas.

      • Contrôle des Établissements Privés Sous Contrat : Présentation du plan de contrôle, des moyens alloués (ETP), des défis liés à son déploiement (notamment l'utilisation des ETP, l'élaboration d'un guide de contrôle, les discussions sur le périmètre du contrôle - vie scolaire, pastorale).

      • Dialogue avec les Réseaux d'Établissements Privés : Questionnement sur la base juridique et la profondeur du dialogue entre le ministère et les têtes de réseau (en particulier le SGEC), ainsi que sur les enjeux de laïcité soulevés par ce dialogue.

      • Suivi des Violences Commises par les Personnels : Présentation des statistiques disponibles (via l'application "fait établissement" et les enquêtes disciplinaires de la DGAFP) concernant les violences (sexuelles, physiques, verbales, moeurs) impliquant des personnels de l'enseignement public. Difficulté à centraliser et à suivre les suites judiciaires et administratives pour tous les cas.

      Articulation des Procédures

      Administratives/Disciplinaires et Judiciaires : Discussion sur l'indépendance des procédures, la prise de mesures conservatoires (suspension), et les délais liés à l'attente des décisions judiciaires. Difficulté de communication fluide entre le ministère et les parquets. * Prévention des Violences : Abord des outils et des programmes de prévention, notamment le programme d'éducation à la vie affective et sexuelle (EVARS), la formation des personnels (enseignants, sociaux, santé), et les dispositifs de recueil de la parole des élèves.

      Idées et Faits Importants / Citations :

      • Bouleversement face aux témoignages de victimes : La DGESCO, au nom du ministère, exprime un profond bouleversement face aux témoignages des associations de victimes, soulignant la gravité des faits révélés. Elle appelle à un "examen profond et critique de nos procédures".

      • "nous avons été profondément bouleversés par ces témoignages et nous voulons bien sûr exprimer tout notre soutien aux victimes et aux associations de de victimes Euh la gravité des faits révélés

      On conduit la ministre à annoncer un certain nombre d'actions sans attendre sur lesquelles nous pourrons également revenir à la fois des actions pour faciliter le recueil de la parole le traitement des signalements et euh le renforcement des contrôles

      En tout état de cause même si bien sûr il est il s'agit de responsabilité d'abord individuelle d'agresseur de d'enfants qu'il conviendra de déterminer et autant que c'est possible de poursuivre Euh et s'il ne faut pas substituer la responsabilité de de l'État à celle de de des agresseurs de ou des de ceux qui ont commis des crimes et des délits notre administration considère que la gravité des des faits révélés exige de nous tous un examen profond et critique de nos procédures de ce qui fonctionne de ce qui ne fonctionne pas..." (DGESCO)

      • Chaîne de Signalement : La responsabilité première incombe au chef d'établissement/directeur d'école, qui doit prendre des mesures immédiates de protection et signaler les faits via l'application "fait établissement". Les services départementaux et académiques apportent leur soutien (psychologues, équipes de sécurité) et peuvent signaler les cas graves au ministère.

      • "La chaîne de signalement au ministère elle est elle relève donc du services du haut fonctionnaire de défense et de sécurité Euh c'est en cas de d'alerte d'un d'un personnel sur une situation de violence le la responsabilité première c'est celle du chef d'établissement du directeur d'école qui ont la responsabilité générale d'assurer la sécurité des personnes au sein euh de leur de leurs établissement Et donc en cas de dès qu'ils sont saisis de ce type de signalement ou d'alerte ils doivent d'abord prendre les premières mesures de protection des victimes..." (HFDS adjoint)

      • Application "fait établissement" : Outil de recueil rapide et structuré des signalements, rappelant les étapes indispensables (saisine du procureur, CRIIP). Permet la transmission instantanée des faits graves (niveau 2 et 3) aux niveaux supérieurs.

      • "signaler immédiatement les faits via une application qui s'appelle l'application fait établissement qui permet plusieurs choses qui permet d'abord un recueil rapide et facilité du signalement avec un système de case à cocher donc permet de structurer la façon dont le signalement est rédigé la façon dont il est dont il est transmis qui permet aussi de rappeler quelques étapes indispensables quelques réflexes à avoir" (HFDS adjoint)

      • Inspection de Stanislas : L'inspection générale a mené une inspection approfondie suite à des articles de presse sur un "climat homophobe systémique". Le rapport a conduit à un article 40 contre un parent et a mis en garde l'établissement sur un "climat très fortement genré" et des inégalités de traitement, demandant une révision du règlement intérieur. Cependant, la DGESCO confirme qu'il n'y a pas eu de conclusion à une "homophobie systémique" ou "institutionnelle" dans les témoignages recueillis par les inspecteurs, malgré la lecture de PV d'audition par les parlementaires semblant contredire cette affirmation et faisant état de multiples témoignages d'homophobie.

      • "Nous n'avons extrait aucun élément caractérisé qui permettait de dire que l'établissement avait un comportement homophobe." (DGESCO, citant sa déclaration antérieure) "je rappelez que page 14 dans le rapport il en est question et que dans ce dans cette l'analyse qui a été faite par les inspecteurs des généraux dont je rappelle qu'ils travaillent collégialement et qu'ils sont ensuite revus par ce que nous appelons entre nous un référent hein qui travaille avec eux pour essayer d'expliciter et de mettre au jour l'ensemble des des analyses ou des conclusions qu'ils peuvent tirer de ces témoignages dans ce travail conjoint et collégial Il a été considéré que dans la relation quiit l'élève au préfet on nétait pas en mesure de de savoir..." (DGESCO, tentant d'expliquer l'analyse des inspecteurs)

      • Méthodes d'Inspection et Recueil de la Parole : L'inspection générale dispose d'un "Vademecum" et les inspecteurs sont formés à l'enquête administrative et au recueil de la parole, y compris des mineurs (accompagnement possible). Le témoin peut relire et corriger son témoignage. Les inspecteurs travaillent en équipe pour croiser les témoignages et établir une analyse collégiale.

      • "Les les enquêtes administratives ont fait l'objet d'un vader Mécom euh qui est un guide de l'inspection de de l'enquête administrative à destination de l'ensemble des inspecteurs généraux qui sont eux-mêmes formés à l'enquête administrative... Ils sont formés au recueil de la parole parce que vous avez raison de le dire C'est une des grosses difficultés d'arriver à libérer la parole d'un élève" (DGESCO)

      • Plan de Contrôle des Établissements Privés Sous Contrat : Un plan de contrôle est en cours d'élaboration pour systématiser les contrôles, qui étaient auparavant rares et souvent limités aux signalements. Un guide de contrôle a été élaboré. 60 ETP ont été alloués mais ont été largement utilisés pour le contrôle des établissements hors contrat et de l'instruction en famille.

      • "en terme de contrôle des établissements privés sous contrat euh en dehors de la méthodologie de de l'inspection générale il y avait peu de précédents peu de méthodes et peu de formations sur le sujet et que les académies étaient très demandeuses d'avoir un cadre qui rappelle les textes les objectifs et les spécificités de l'enseignement privé" (DAF) "Les académies avaient bien recruté les renforts mais elle les avait essentiellement utilisé euh à purer en quelque sorte le stock de contrôle qu'elle devait encore faire sur le hors contrat et sur l'instruction en famille" (DAF)

      • Dialogue avec le SGEC sur le Plan de Contrôle : La DAF a recommandé un échange sur les principes du plan de contrôle, mais pas une coconstruction. Cependant, la note de la ministre prévoit une présentation du plan de contrôle aux instances de dialogue locales incluant les représentants de l'enseignement catholique. Ce dialogue a soulevé des incompréhensions du SGEC sur le champ de contrôle (vie scolaire) et la nature des enseignements religieux (obligatoire vs. facultatif).

      • "Ne pas coprogrammer construire les contrôles avec ceux qui vont être contrôlés ça me paraît en effet de l'ordre de l'évidence" (Rapporteur) "Le dialogue avec LeGEC a a relevé un certain nombre d'incompréhensions de sa part sur le contenu du guide sur deux points qu'il nous avait signalé comme étant extrêmement important pour lui Le point de notre de la capacité juridique de l'État à étendre son contrôle sur la vie scolaire... Et le deuxième sujet c'était le sujet de la catéchèse de la pastorale et de l'instruction de la des enseignements de culture religieuse" (DAF)

      • Base Juridique du Dialogue avec le SGEC : Le dialogue avec le SGEC n'est pas prévu par la loi mais constitue une pratique facilitant l'échange d'informations et la fixation de priorités, bien que n'impliquant pas un partage de compétences.

      • "Non non effectivement il n'apparaît pas dans les textes Après il y a pas besoin d'une loi pour dialoguer C'est quelque part heureux Euh mais le le dialogue est d'ailleurs pas du tout institutionnalisé et ce n'est pas du tout comme ça a été très bien expliqué une forme de négociation ou de concertation institutionnelle prévue par les textes C'est simplement une pratique" (DAJ) "Je crois pas qu'on ait jamais interprété l'article 2 la loi 1905 comme interdisant tout dialogue avec le culte Donc je je crois qu'il y a pas de tout ce sujet au regard de cette disposition là" (DAJ)

      • Statistiques sur les Violences Impliquant les Personnels (Enseignement Public) : En 2023-2024, 1198 faits mettant en cause des personnels ont été recensés via "fait établissement", se décomposant en 24% de violences sexuelles (280 cas), 38% de violences physiques (461 cas) et 38% de violences verbales (457 cas). Les données sur les suites judiciaires ne sont pas centralisées au niveau national. Les données sur les sanctions disciplinaires sont disponibles via une enquête annuelle pour la fonction publique (chiffres 2021-2023 fournis pour les trois catégories pertinentes).

      • "Pour la précédente année scolaire oui c'est environ 2000 Ça c'est des chiffres qu'on pourra vous donner dans le cadre du questionnement On en a euh les violences mettant en cause des personnels en 2023-2024 par le biais de cette application fait établissement on en a recensé pardon 1198 1200 qui se décompose en 24 % de violence sexuelle ça fait 280 38 % de violence physique 461 et 38 % de violence verbale 457" (HFDS adjoint)

      • Suivi des Mesures Disciplinaires : Les autorités académiques sont compétentes pour les mesures conservatoires et les sanctions disciplinaires (pour les cas moins graves). La DGRH intervient pour les sanctions les plus élevées (groupe 3 et 4) et assure un suivi des situations transmises par les académies suite à une circulaire de 2016. Un suivi des sanctions disciplinaires prises dans l'enseignement public est effectué.

      • "La première c'est que cette circulaire nous demande d'apporter un regard croisé entre l'académie et le ministère pour pouvoir juger du bon quantum de sanction qui va être donc infligé au personnel concerné Et ensuite cette circulaire impose effectivement également au services académique de transmettre expost l'ensemble des dossiers au ministère pour en assurer le suivi" (DGRH)

      • Retrait de Contrat d'Association : Le code de l'éducation prévoit la fin du contrat en cas de "manquement grave au pluriel". Le régime juridique est peu éclairé par la jurisprudence. Deux seuls cas récents de rupture de contrat (Averroès, Al-Kindi) ont été cités.

      • "S'agissant euh du privé sous-contrat le le code de l'éducation prévoit que il peut être mis fin au contrat en cas de manquement grave au pluriel" (DAJ) "les seules retraites de contrat sont les retraits de contrat de d'Averos et de d'Alkind à ce stade" (DAF)

      • Élargissement de "fait établissement" au Privé Sous Contrat : Le projet d'étendre l'application "fait établissement" aux établissements privés sous contrat est ancien (discussions depuis 2019-2020) mais la décision récente de la ministre d'État vise à le rendre obligatoire par décret, nécessitant une modification du droit. Les discussions antérieures avec le SGEC reposaient sur le volontariat.

      • "l'annonce de l'extension de fait établissement au l'enseignement privé sous contrat avait été officialisé dans un un plan d'avril 2024 Donc euh nous nous y avons travaillé durant toute l'année 2024 à partir du mois de janvier jusqu'au mois de novembre avec euh des contact avec euh le ZGEC" (HFDS adjoint) "Ce que la ministre d'État a décidé c'est de changer le droit euh pour faire que fait établissement soit euh opposable euh et obligatoirement utilisé par l'ensemble des établissements Ça c'est très important parce que comme on le disait en début de l'audition nous avons que des remontées très partielles les des des signalements venant des établissements privés sous contrat pour cette raison parce que ils ne sont pas outillés de la même façon que les établissements publics Alors je revenir sur cette chronologie pardon moi je dispose du d'une note qui est commandée à la Dage en 2019 et qui porte un premier regard sur la possibilité juridique donc d'étendre l'application fait établissement aux établissements privés sous contrat bien antérieur à à l'année 2024 que vous avez mentionné..." (Rapporteur)

      • Prévention et Éducation à la Sexualité : Le programme EVARS s'inscrit dans un plan plus large de protection de l'enfance. Des formations pour les enseignants et les personnels sociaux/santé sont prévues pour le repérage et le signalement des élèves victimes. Le recueil de la parole est crucial, avec un renvoi aux professionnels (assistants sociaux, services sociaux en faveur de l'enfant).

      • "D'abord dire qu'il s'inscrit dans un plan plus large hein euh de protection de l'enfance euh dans lequel il y a déjà euh une page euh réservée sur Éduscol à l'ensemble des situations où euh la le recueil de la parole peut être peut être un élément à la fois euh d'ailleurs euh de professionnalisation des enseignants et puis un point de vigilance" (DGESCO) "Donc l'enseignant est là pour relever le signalement qui lui a été fait et ensuite il travaille avec notamment le service social en faveur de l'enfant pour prendre en compte cette parole désigner les suites qu'il doit qui doivent lui être données et en faire le signalement à la fois à son chef d'établissement et ensuite à son au directeur académique des services de l'éducation nationale" (DGESCO)

      Points de Discussion et Défis Identifiés :

      • La capacité du ministère à centraliser et à suivre de manière exhaustive les suites (administratives, disciplinaires, judiciaires) de tous les signalements de violence impliquant des personnels, en particulier dans l'enseignement privé sous contrat où le suivi est moins centralisé.

      • Les délais dans l'élaboration et la mise en œuvre des outils de contrôle et de signalement (ex: "fait établissement" étendu au privé sous contrat).

      • L'hétérogénéité des pratiques de contrôle entre académies.

      • L'interprétation de la gravité des faits et son lien avec les décisions de sanction (retrait de contrat, procédure disciplinaire).

      • Le positionnement des chefs d'établissement/directeurs d'école comme point d'entrée unique pour les signalements via "fait établissement" et le risque potentiel de frein à la libération de la parole ou de biais dans le signalement.

      • La question de la laïcité et la pertinence du dialogue approfondi avec les têtes de réseau confessionnelles dans l'élaboration de politiques de contrôle.

      • La nécessité de renforcer les dispositifs de recueil de la parole des élèves, en particulier dans le cadre de programmes comme l'EVARS.

      Conclusion (du document source) :

      • L'audition a mis en évidence la complexité des enjeux liés à la lutte contre les violences en milieu scolaire et au contrôle des établissements.

      Les efforts pour améliorer les procédures et les outils sont en cours, mais des défis importants subsistent, notamment en matière de centralisation du suivi, de déploiement des moyens de contrôle, et de clarification des rôles et des responsabilités.

      Le dialogue avec les différents acteurs, y compris les établissements privés, est jugé nécessaire, mais son cadre et ses implications juridiques nécessitent une attention particulière. La commission d'enquête poursuivra ses travaux pour approfondir ces questions.

    1. DOCUMENT DE BRIEFING : AUDITION D'ÉLISABETH BORNE SUR L'ENSEIGNEMENT SCOLAIRE

      • Date : [Date de l'audition - non spécifiée dans le texte, insérer la date réelle si connue] Source : Extraits de l'audition de Madame Élisabeth Borne, Ministre d'État, Ministre de l'Éducation Nationale, de l'Enseignement Supérieur et de la Recherche, devant l'Assemblée Nationale.
      • Résumé Exécutif :
      • L'audition de Madame Élisabeth Borne a porté principalement sur l'enseignement scolaire, les questions relatives à l'enseignement supérieur et à la recherche étant reportées. La Ministre a réaffirmé son ambition d'élever le niveau des élèves et de garantir l'égalité des chances, en s'appuyant sur une approche de terrain. Plusieurs points clés ont été abordés, notamment l'inclusion des élèves en situation de handicap, la sécurité des établissements scolaires, la revalorisation du métier d'enseignant, et la mise en œuvre de programmes spécifiques comme l'éducation à la vie affective, relationnelle et sexuelle (EVARS). Des députés de divers groupes politiques ont soulevé des préoccupations concernant les fermetures de classes, le manque d'enseignants et d'accompagnants, l'évaluation du niveau des élèves, et l'instabilité ministérielle.
      • Points Clés et Thèmes Majeurs :
      • Inclusion des Élèves en Situation de Handicap :
      • Déploiement des Pôles d'Appui à la Scolarité (PAS) : Annonce de 500 PAS en 2025, étendant l'expérimentation actuelle dans quatre départements. Un bilan de la phase initiale est en cours et sera transmis.
      • Préoccupations sur le Nombre d'Heures d'Accompagnement : Inquiétude exprimée quant à une possible réduction artificielle du nombre moyen d'heures d'accompagnement par élève pour l'adapter aux ressources disponibles, en particulier lorsque la détermination ne dépend plus des MDPH (Maisons Départementales des Personnes Handicapées).
      • Impact du Maintien des Postes d'Enseignants sur les PAS : La non-suppression des 4000 postes d'enseignants doit permettre d'accélérer le déploiement des PAS. Question posée sur le nombre de postes fléchés vers ces pôles.
      • Postes d'AESH Supplémentaires : Les 2000 postes d'AESH (Accompagnants des Élèves en Situation de Handicap) supplémentaires de la loi de finances 2025 seraient redirigés vers la pause méridienne, désormais à la charge de l'État. Confirmation et répartition souhaitées.
      • Réponse de la Ministre sur l'Inclusion : Effort "inédit" pour recruter des AESH et améliorer leurs conditions de travail (34 000 postes créés depuis 2017, 2000 cette année). Les PAS visent à une approche plus réactive et globale. Nécessité de "travailler la main dans la main entre l'éducation nationale et le secteur médico-social". Déploiement des dispositifs ULIS (Unités Localisées pour l'Inclusion Scolaire).
      • Amélioration des Conditions des AESH : Revalorisation des rémunérations (20% depuis 2017), passage en CDI après 3 ans, et prise en charge de la pause méridienne via la loi Viale pour augmenter le temps de travail et la rémunération. Poursuite de la réflexion sur la reconnaissance de l'expérience et les parcours professionnels.
      • Application de la Loi Viale : La Ministre est "parfaitement consciente que la loi Viale n'est pas appliquée" et annonce la signature d'un décret pour accélérer la bascule de la prise en charge de la rémunération des AESH sur le temps méridien.
      • Collaboration Médico-Sociale : "important qu'on travaille évidemment la main dans la main avec tout le secteur médico-social si on veut que cette école inclusive se déoie se déploie bien et dans l'intérêt des enfants". Nécessité de déterminer l'accueil adapté pour chaque élève, y compris ceux relevant d'un établissement médico-social.
      • Situation du Premier Degré et Fermetures de Classes :
      • Préoccupations Locales : Inquiétude face aux fermetures d'écoles et de classes, en particulier en zones rurales et urbaines. Demande de clarification sur les critères de fermeture pour la rentrée 2025.
      • Impact de la Baisse Démographique : La baisse démographique est une réalité. Nécessité d'anticiper plutôt que subir les évolutions démographiques et les besoins éducatifs.
      • Approche de la Ministre : Chaque décision doit être adaptée aux réalités locales sur la base d'un diagnostic partagé, associant les parlementaires et s'appuyant sur l'Observatoire des dynamiques rurales. Souhait de renforcer le partenariat État-collectivités territoriales.
      • Vision Pluriannuelle de la Carte Scolaire : Volonté de mettre en œuvre une vision pluriannuelle de la carte scolaire, comme annoncé précédemment, pour permettre aux élus locaux, parents et enseignants d'avoir de la visibilité et de travailler sur des solutions adaptées (RPI, territoires éducatifs ruraux). "On doit quand même tirer les conséquences d'une situation particulièrement favorable en terme de taux d'encadrement à Paris".
      • Géographie Prioritaire : Des analyses sont en cours pour actualiser la géographie prioritaire et mieux faire coïncider les réseaux d'éducation prioritaire avec les quartiers prioritaires de la ville. Nécessité d'analyser les conséquences sur les territoires, personnels et élèves avant d'engager la réforme.
      • Éducation à la Vie Affective, Relationnelle et Sexuelle (EVARS) :
      • Engagement et Déploiement : Programme devant être déployé à la rentrée 2025, avec un engagement à assurer son effectivité. Nécessité de prévoir les séances dans les emplois du temps des élèves et professeurs.
      • Formation et Rémunération des Enseignants : Nécessité de former et rémunérer correctement les professeurs qui dispenseront ces heures. Questions sur les professeurs concernés, le financement et le planning de formation. Recours possible à des associations partenaires.
      • Réponse de la Ministre : Le programme EVARS sera mis en œuvre par les professeurs et personnels de l'Éducation Nationale. Au premier degré, il relève des professeurs des écoles ; au second degré, les équipes éducatives définiront les modalités. La formation débutera en mars 2025 avec les référents académiques, complétée par des formations en ligne et 2 jours de formation présentielle à partir de la rentrée 2025.
      • Programme Adopté à l'Unanimité : La Ministre souligne que le programme a été adopté à l'unanimité par les organisations syndicales, représentants de parents et associations familiales, y voyant "une reconnaissance du travail qui a été mené".
      • Approche Progressive : Il s'agit d'un enseignement progressif, élaboré par des spécialistes. Éducation à la vie affective et relationnelle au premier degré, intégrant la sexualité au second degré. Importance de parler de l'intimité du corps pour lutter contre les agressions sexuelles sur mineurs.
      • Notion d'Identité de Genre : Il s'agit de la notion d'identité de genre telle que figurant dans le Code Pénal, proscrivant les discriminations en fonction du sexe, de l'identité de genre ou de l'orientation sexuelle.
      • Niveau des Élèves et Évaluation :
      • Constat des Évaluations Internationales (PISA) : Les résultats de PISA 2022 montrent une baisse significative du niveau en mathématiques.
      • "Choc des Savoirs" : Mesures inspirées du programme éducatif de Marine Le Pen, telles que les groupes de niveau au collège et la transformation du brevet en examen de passage, mais qui sont considérées par certains comme mises à l'arrêt ou incohérentes.
      • Préoccupations sur le Niveau en Lecture : Rapport parlementaire soulignant que 45% des élèves entrant en 6e n'atteignent pas la vitesse de lecture attendue en CM2. Propositions pour limiter l'influence du numérique, privilégier la méthode syllabique et repenser la formation initiale des enseignants.
      • Réponse de la Ministre sur l'Élévation du Niveau : "personne ne peut se satisfaire des évaluations". Détermination à "élever le niveau des élèves", en agissant d'abord au premier degré (dédoublement des classes). Déploiement de plans Math et Français pour conforter la formation des professeurs des écoles. Poursuite des mesures engagées, notamment les groupes de besoins au collège. Évaluation en cours pour identifier les bonnes pratiques.
      • Brevet : Il ne s'agit pas de supprimer le caractère obligatoire du brevet, mais de maintenir qu'il n'est pas un examen de passage en seconde. En cas d'échec au brevet, des mesures de remédiation sont envisagées (prépas secondes, stages de réussite, soutien). Le niveau d'exigence du brevet est renforcé (nouvel équilibre contrôle continu/épreuves finales, nouveau mode de calcul du contrôle continu). Création de la mention "très bien avec félicitations du jury".
      • Évaluations Internationales (TIMSS) : Constat d'un retard en mathématiques, potentiellement lié à l'âge moyen des élèves participant en France. La Ministre examinera avec la direction statistique du ministère le biais potentiel et la pertinence du niveau de test.
      • Sécurité et Climat Scolaire :
      • Restauration de l'Autorité : "Élever le niveau d'exigence ne peut se faire sans restaurer l'autorité dans l'école". Un climat scolaire apaisé est la première condition de la réussite. Fermeté face aux comportements inacceptables.
      • Lutte Contre le Harcèlement et Protection des Professeurs : Chantiers à poursuivre avec détermination. Renforcement des moyens alloués à la vie scolaire (CPE, assistants d'éducation).
      • Sécurisation des Établissements et Abords : Poursuite des investissements. Nécessité de travailler en lien avec la police et la justice. Importance de la collaboration avec les familles et leur responsabilisation. "Il faut un peu dans à l'image de ce qui peut se faire dans les cités éducatives qu'on fasse tous bloc autour de l'école".
      • Tragédie de l'Essonne : Référence à un drame récent, soulignant l'urgence de sécuriser les abords des établissements et le chemin de l'école. Travail engagé avec les ministres de l'Intérieur et de la Justice.
      • Revalorisation du Métier de Professeur et Formation :
      • Constat de Déclassement et d'Abandon : Sentiment de déclassement et parfois d'abandon des professeurs. Nécessité de "regarder en face les réalités du métier" et d'en "tirer toutes les conséquences".
      • Réforme Ambitieuse de la Formation Initiale : Engagement à lancer une réforme ambitieuse. Philosophie générale d'un recrutement dès la fin de la Licence 3 avec 2 années de professionnalisation.
      • Accompagnement Tout au Long de la Carrière : Nécessité de mieux accompagner les professeurs, redynamiser les parcours (en particulier en milieu de carrière), et offrir de nouvelles perspectives.
      • Visibilité des Choix Pédagogiques : Redonner de la visibilité et de la lisibilité. Les réformes ne doivent pas intervenir dans l'urgence mais être guidées par l'intérêt des élèves, professeurs et de la communauté éducative.
      • Attractivité et Rémunération : Référence aux salaires considérés comme trop bas. Rappel des revalorisations déjà intervenues (aucun professeur ne démarre en dessous de 2121€ net/mois). Revalorisation de 11% entre avril 2022 et janvier 2024. Engagement d'un travail sur les milieux de carrière. Le salaire de début de carrière est désormais supérieur à la moyenne de l'OCDE.
      • Confiance dans les Enseignants et Fonctionnement Moins Vertical : Nécessité d'envoyer un message puissant de soutien, reconnaissance et confiance. Donner plus d'autonomie et de capacité d'initiative.
      • Autres Sujets Abordés :
      • Orientation : Doit être choisie, expliquée, anticipée et pensée en lien avec l'enseignement supérieur et le monde professionnel. Renforcer le lien collège-lycée-enseignement supérieur.
      • Budget du Ministère : Augmentation significative du budget du ministère (68,4 milliards en 2017 à 87,10 milliards en 2024). Malgré cela, des députés estiment que les besoins ne sont pas couverts.
      • Instabilité Ministérielle : L'instabilité (sept ministres en peu de temps) est critiquée pour son manque de cap clair et cohérent. Nécessité d'une politique à l'échelle d'une génération.
      • Déconcentration et Autonomie des Établissements : Question sur la possibilité de davantage de déconcentration et d'autonomie pour les établissements et les chefs d'établissement. La Ministre y est favorable, plaidant pour "faire confiance à nos chefs d'établissement" et "laisser des marges de manœuvre".
      • Rythmes Scolaires : Maintien du nombre d'heures d'enseignement quel que soit le rythme. Extinction du fond de solidarité pour le développement des activités périscolaires (FDAP).
      • Pass Culture : Gel temporaire de la part collective pour des raisons budgétaires (dépassement des prévisions). Les demandes pré-réservées ont été validées. Inspection en cours pour évaluer l'utilisation des fonds et la pertinence des contenus. Intention d'examiner l'accès au Pass Culture pour les élèves en établissements médico-sociaux.
      • Stage de Seconde : Bilan de la première session et aménagements pour 2025. Difficultés d'accès aux stages dans certains territoires. Ambiguïté sur la validation des stages linguistiques. Volonté d'ouvrir le plus possible les possibilités de stage.
      • Financement des Écoles Privées : Difficultés rencontrées par certaines communes dans le financement des écoles privées sous contrat, avec un coût par élève dans le public qui augmente mécaniquement. Demande de réviser la circulaire de 2012.
      • Situation de Saint-Martin : Résultats scolaires inférieurs aux moyennes, liés à l'insuffisante prise en compte des caractéristiques locales, en particulier linguistiques. Nécessité d'adapter les méthodes pédagogiques et les procédures d'affectation des personnels.
      • Enseignement des Langues et Cultures de l'Antiquité : Baisse du nombre d'élèves, en particulier au collège. Retravail sur les programmes pour mieux valoriser le lien antiquité-monde moderne. Réflexion sur une spécialité unique grec/latin au lycée.
      • Idées et Faits Importants avec Citations :
      • Ambition Ministérielle : "mon ambition est à la fois simple et exigeante élever le niveau des élèves et garantir une réelle égalité des chances".
      • Approche de Terrain : "une approche commune des politiques publiques celle qui s'appuie sur la réalité de terrain pour agir c'est une méthode que je revendique".
      • Rôle du Parlement : "le Parlement dans le domaine de l'enseignement comme dans toutes les politiques publiques a une mission essentielle évaluer ces politiques publiques et ce faisant nous aider à mieux les piloter".
      • Inclusion des Élèves Handicapés : "naturellement pas de mettre moins de moyens au service des élèves en situation de handicap mais on sait que beaucoup de parents peuvent être on va dire freinés par la lourdeur des démarches administratives". Les PAS permettent "d'avoir une approche plus réactive d'avoir une prise en charge plus globale à l'échelle d'un territoire".
      • Revalorisation des Enseignants : "aucun professeur ne démarre sa carrière en dessous de 2121 € net par mois c'est une revalorisation comme il n'y en avait pas existé depuis 30 ans".
      • Climat Scolaire : "un climat scolaire apaisé est la première condition de la réussite". "la violence quelle qu'en soit la forme n'a pas sa place dans l'école de la République". "l'école ne peut ne peut pas être un lieu où l'on a peur".
      • Situation des AESH : "le travail qu'elle mène elle je dis elle parce que ce sont souvent des femmes est essentiel au côté de nos élèves". "personne ne peut se satisfaire des niveaux de rémunération qui sont notamment lié au fait que beaucoup des aesh n'ont pas un emploi à temps plein".
      • Carte Scolaire Rurale : "il faut absolument qu'on puisse travailler sur une vision pluannuelle".
      • Éducation à l'IA : "l'intelligence artificielle l'apprentissage du numérique la nouvelle la maîtrise des nouvelles technologies n'est pas une option ce sont des leviers fondamentaux pour les élèves".
      • EVARS : "il s'agit d'une éducation à la vie affective et relationnelle dans le premier degré et c'est une éducation qui intègre la sexualité dans le second degré". "apprendre à un jeune que son corps lui appartient et que son intimité doit être respecté et le faire à l'école si c'est ça n'est pas suffisamment expliqué à la maison mais ça me semble quelque chose d'important".
      • Principales Préoccupations Soulevées par les Députés :
      • Incohérence et Manque de Vision : Critiques concernant l'instabilité ministérielle et les revirements (ex: "choc des savoirs"), perçus comme un manque de cap stratégique.
      • Fermetures de Classes et Manque de Moyens : Forte inquiétude face aux fermetures de classes, en particulier dans le premier degré et en milieu rural, et au manque perçu de moyens alloués à l'éducation.
      • Pénurie et Non-Remplacement des Enseignants : Situation alarmante du manque d'enseignants et du non-remplacement, affectant directement les élèves.
      • Conditions de Travail des Personnels : Dégradation des conditions de travail des enseignants, AESH et AED.
      • Niveau des Élèves : Préoccupations récurrentes sur la baisse du niveau, notamment en mathématiques et en lecture.
      • Sécurité : Urgence de sécuriser les établissements et leurs abords suite à des événements tragiques.
      • Éducation à la Vie Affective, Relationnelle et Sexuelle : Questions sur le déploiement effectif du programme, la formation des enseignants, et les inquiétudes de certains parents.
      • Politisation et Laïcité : Signalement de cas de politisation en classe et préoccupations sur le respect de la laïcité.
      • École Privée : Critiques sur les facilités accordées au secteur privé et la possible dégradation de l'école publique en comparaison.
      • Pass Culture : Problèmes d'application de la part collective et exclusion de certaines catégories d'élèves.
      • Réponses de la Ministre Face aux Critiques :
      • La Ministre a défendu les efforts budgétaires ("15 milliards d'euros supplémentaires") et les créations de postes (enseignants et AESH) depuis 2017.
      • Elle a contesté les descriptions jugées "caricaturales" de l'école publique par certains députés d'opposition, en particulier sur la situation à Paris.
      • Elle a souligné la poursuite des politiques engagées précédemment, notamment sur le "choc des savoirs".
      • Elle s'est engagée à prendre en compte les remontées de terrain et à travailler en concertation.
      • Ce briefing offre un aperçu des principaux points discutés lors de l'audition, en mettant en lumière les thèmes centraux, les faits importants et les différentes perspectives exprimées.
    1. NOTE DE SYNTHÈSE : AUDITION DE LA DACG SUR LA POLITIQUE PÉNALE FACE AUX VIOLENCES SCOLAIRES

      Date :2025

      Source : Extraits de "🔴 Quelle politique pénale face aux violences scolaires ?

      Suivez l’audition de la DACG" Participants à l'audition (côté Ministère de la Justice) : Directrice adjointe des affaires criminelles et des grâces, accompagnée de Mme Anne Mao Mercier, adjointe à la chef du bureau de la politique pénale générale.

      Sujet : Examen de la politique pénale face aux violences commises par des adultes sur des élèves en milieu scolaire.

      1. Introduction et Cadre Général

      • L'audition porte sur un sujet d'une importance capitale pour le Ministère de la Justice : la lutte contre les violences commises sur les mineurs, en particulier en milieu scolaire.

      L'école est présentée comme un "sanctuaire" où de telles violences sont "inacceptables". Le Ministère de la Justice est fermement engagé dans ce combat, qui constitue une "priorité de politique pénale du gouvernement". Cette priorité est régulièrement rappelée dans les circulaires et dépêches adressées aux procureurs.

      • Citation clé : "l'école doit être un sanctuaire où on doit effectivement et bien euh développer la personnalité de de nos de nos jeunes jeunes enfants il est inacceptable effectivement que de telles violences puissent y être commises certains faits dramatiques nous obligent tout particulièrement"

      • Une circulaire importante du 28 mars 2023 est citée comme une circulaire cadre visant à élever la lutte contre les violences sur mineurs à un niveau d'engagement similaire à celui mis en œuvre pour les violences intrafamiliales (suite au Grenelle).

      Elle couvre l'ensemble du traitement des violences contre les mineurs, incluant les politiques de juridiction, les politiques partenariales et la politique pénale de fond.

      Toutes les violences sur mineurs sont concernées, avec une attention particulière pour celles commises en établissements scolaires.

      2. Renforcement de l'Arsenal Juridique

      Le Ministère de la Justice, via la Direction des affaires criminelles et des grâces (DACG), contribue activement à l'évolution normative en matière de lutte contre les violences sur mineurs.

      L'arsenal juridique est décrit comme "riche", bien que toujours susceptible d'amélioration.

      • Exemples d'enrichissements récents :Aggravation de la répression des violences sexuelles : Concerne toutes les victimes, mais particulièrement celles en milieu ou aux abords des établissements scolaires.
      • Loi du 21 avril 2021 : A supprimé la notion de consentement pour les relations sexuelles entre majeur et mineur.
      • Allongement de la durée de prescription : Travail législatif mené sur 35 ans pour favoriser la révélation des faits, même des années après. Le point de départ de la prescription a été retardé à la majorité et le délai considérablement allongé (jusqu'à 30 ans pour les crimes sexuels, 20 ans pour les agressions et atteintes sexuelles). La loi de 2021 a introduit la "prescription glissante".
      • Loi du 14 avril 2016 : Création d'un cadre plus étoffé (articles 11-2 et 706-47-4 du code de procédure pénale) pour régir la transmission d'informations entre l'autorité judiciaire et les administrations, en particulier l'Éducation Nationale. Ce dispositif d'information peut être facultatif ou obligatoire.

      3. Élaboration et Animation des Politiques Pénales

      La DACG est également en charge du développement et de l'animation des politiques pénales sur le sujet. De nombreuses circulaires et dépêches ont été diffusées pour guider l'action des procureurs.

      • Objectifs des circulaires :Favoriser la révélation des faits et la transmission des plaintes/dénonciations à l'autorité judiciaire.
      • Améliorer l'efficacité des enquêtes.
      • Garantir l'information, l'accompagnement et la protection des victimes tout au long du parcours judiciaire.
      • Donner des instructions claires sur les réponses pénales à apporter pour prévenir le renouvellement des faits et protéger les victimes.

      4. Renforcement des Partenariats, en Particulier avec l'Éducation Nationale

      Un axe fort de la politique pénale est le renforcement des partenariats entre l'autorité judiciaire et l'Éducation Nationale. L'article 40 du Code de procédure pénale, qui impose aux fonctionnaires de signaler tout crime ou délit dont ils ont connaissance, est un élément clé de ce partenariat.

      • Modalités du partenariat :Conventions : Conclusion de conventions entre les parquets et les établissements scolaires (publics, privés sous contrat). Une dépêche du 8 octobre 2021, faisant suite aux travaux de la CIASE, a invité les parquets à conclure des conventions avec les diocèses (avec un taux de signature de plus de 84% en octobre 2024). Une autre dépêche du 5 septembre 2023 (relative aux infractions en milieu scolaire) a invité à conclure des partenariats avec l'Éducation Nationale/établissements scolaires pour établir des circuits de signalement et des modalités d'échange (plus de 73% de conventions conclues en 2023, estimé à plus de 90% aujourd'hui).
      • Échanges et contacts directs : Notamment entre les référents éducation nationale (dans les parquets) et les référents justice (dans les académies). Cette pratique ancienne a été généralisée suite à une circulaire du 11 mars 2015. Les référents sont des points de contact essentiels pour faciliter l'obtention d'informations.
      • Bonnes pratiques : Attention portée aux réunions annuelles organisées par certains parquets avec les chefs d'établissement et leurs équipes. Ces rencontres permettent d'échanger sur les difficultés, de faire le point sur les circuits mis en place et d'améliorer les pratiques. La DACG souhaiterait généraliser ces réunions, bien qu'elles ne se tiennent pas dans tous les ressorts en raison de la charge de travail.

      5. Qualité du Traitement des Faits Dénoncés

      Au-delà de la transmission des signalements, la DACG met l'accent sur la qualité du traitement des faits dénoncés.

      • Gestion des faits prescrits : Malgré l'allongement des délais de prescription, de nombreuses plaintes portent sur des faits prescrits. Une dépêche du 26 février 2021 invite les procureurs à ouvrir systématiquement des enquêtes même si les faits semblent prescrits.
      • Objectifs de l'ouverture systématique d'enquêtes sur faits prescrits :S'assurer que les faits sont bien prescrits après étude approfondie.
      • Identifier d'autres victimes potentielles pour lesquelles les faits ne seraient pas prescrits.
      • Lutter contre le sentiment d'impunité.
      • Permettre à la victime d'être entendue et à l'auteur (même s'il ne peut être poursuivi) d'être entendu, ce qui peut être important pour la victime.
      • Effectuer des investigations pour s'assurer qu'il n'y a pas d'autres risques de commission d'faits sur d'autres personnes.
      • Délai de traitement et prise en charge des victimes : La DACG est attentive au délai de traitement dans les services d'enquête, à la qualité de l'accueil et de l'audition des victimes, ainsi qu'à la prise en charge pluridisciplinaire.
      • Traitement rapide des procédures : Idée de traiter rapidement les procédures à réception dans les juridictions, notamment dans le cadre de "filières de l'urgence", pour permettre des poursuites rapides (ex: comparution immédiate) et la mise en œuvre de mesures de sûreté (contrôle judiciaire, détention provisoire). L'action contre l'auteur est aussi une mesure de protection pour le mineur.
      • Classement sans suite : Le classement sans suite reste possible. La circulaire de mars 2023 insiste sur l'importance d'un examen attentif de la procédure avant classement et de la notification du classement aux victimes avec explication des motifs.

      6. Réponses Pénales et Suivi

      Les réponses pénales doivent être "rapides" et "fermes quand c'est nécessaire", en fonction de la nature des faits, de la gravité du préjudice et des antécédents de la personne mise en cause.

      • Peines spécifiques : Insistance sur certaines peines, comme l'interdiction d'exercer une activité professionnelle ou bénévole impliquant un contact habituel avec des mineurs (pouvant être obligatoire dans certains cas).
      • Inscription au FIGES : Essentielle pour le contrôle d'honorabilité des personnes recrutées par l'Éducation Nationale.
      • Retour d'information : Nécessité de veiller au retour d'informations à l'Éducation Nationale et à ceux ayant transmis les plaintes/dénonciations au parquet.

      7. Points de Discussion et Questions des Rapporteurs/Députés

      Plusieurs points spécifiques ont été abordés lors de l'échange :

      • Transmission de l'information à l'Éducation Nationale au moment du signalement : La loi de 2016 trouve un équilibre entre nécessité d'informer et présomption d'innocence/enquête. La transmission d'information intervient au moment de l'engagement des poursuites ou de la condamnation, pas nécessairement immédiatement au stade de l'enquête. Des échanges informels entre référents sont possibles mais pas systématiques. Le délai de transmission varie selon la complexité, le risque de réitération et la nécessité de protéger immédiatement la victime.
      • Protection de l'enfant en urgence : La préoccupation est essentielle pour les procureurs. En cas de danger, une saisine du conseil départemental (aide sociale à l'enfance) est possible pour envisager un placement. La diligence de l'enquête et le recours à des procédures rapides (filières d'urgence, comparution immédiate) permettent d'envisager la mise en place de mesures de sûreté pour protéger les victimes.
      • Uniformisation des référents : Présence de référents justice dans tous les parquets, mais la connaissance de leur existence dans toutes les académies n'est pas assurée.
      • Partage d'information : Le niveau de gravité/certitude pour le partage d'information n'est pas nécessairement uniformisé, bien que les textes (loi de 2016) fixent des critères obligatoires qui sont censés s'appliquer uniformément. L'article 11-2 laisse une souplesse (transmission facultative), mais les circulaires sensibilisent les procureurs à la nécessité de partager l'information dans le cadre des relations partenariales.
      • Réunions annuelles : Considérées comme une bonne pratique à généraliser (1er et 2nd degré, tous établissements). Elles ne se tiennent pas partout, mais les échanges existent (conventions, référents). La DACG n'a pas de vision consolidée de leur mise en place systématique ni des établissements invités (publics/privés). L'organisation est laissée à la main des parquets, avec des directives par circulaire. Des outils numériques sont en développement pour aider les magistrats et faciliter les échanges d'information, notamment pour ne pas "louper certaines obligations" de transmission.
      • Information du parquet sur les mesures disciplinaires/conservatoires de l'employeur : L'article 40 impose le signalement des crimes/délits, ce qui peut parfois inclure des informations sur les mesures disciplinaires. Cette information n'est pas toujours complète au départ mais peut être transmise ultérieurement via les échanges. Normalement, les conventions de partenariat peuvent inclure la transmission systématique de cette information. La DACG n'est pas certaine que cette information soit toujours effective dans la pratique mais la juge "toujours très intéressante d'avoir ce type d'information".
      • Recteurs/Inspecteurs sollicitant les parquets sur les procédures judiciaires : Normalement, si des poursuites pénales sont engagées contre du personnel en contact avec des mineurs (notamment pour les infractions visées par l'article 706-47-4), l'information doit être portée obligatoirement à la connaissance de l'Éducation Nationale par le parquet. Cette transmission est prévue par écrit, bien que des pratiques informelles (conversations téléphoniques) puissent exister.
      • Cas individuels (Notre Dame de Baram, Riomont, Lycée Bayen) : La directrice adjointe ne peut pas se prononcer sur des affaires individuelles, invoquant l'article 11 (seul le parquet peut communiquer) et son incompétence pour se prononcer sur des affaires spécifiques sans autorisation.
      • Avertissement avant poursuite (pratique ancienne) : Réponse concernant le cas de 1996 (Notre Dame de Baram) : Le procureur dispose de la liberté de classer ou d'engager des alternatives aux poursuites (comme l'avertissement solennel, qui n'existe plus sous cette forme). Les avertissements avant poursuite étaient réservés aux faits les moins graves. La circulaire de mars 2023 invite à relever le niveau de réponse pénale, en particulier face à la qualité de la personne mise en cause. L'avertissement avant poursuite est un classement qui peut être réexaminé en cas d'éléments nouveaux ou de récidive (principe de l'opportunité des poursuites). Il ne s'agit pas forcément d'une "mise en attente" mais d'un classement faute d'éléments suffisants à l'instant T pour poursuivre l'enquête ou engager des poursuites. En 1996, la loi de 2016 n'était pas adoptée, il est difficile de dire si l'information aurait dû être transmise. Dans le cadre actuel, s'il n'y a pas eu de poursuite, il n'y a pas de transmission obligatoire.
      • Décès de l'auteur (Lycée Bayen) : Le décès de l'auteur n'implique pas forcément un classement immédiat pour extinction de l'action publique ; la procédure peut être toujours en cours (auditions). La directrice adjointe ne peut toujours pas se prononcer.
      • Information des enseignants sur les procédures (différence CRIP/parquet, article 40, etc.) : Les conventions de partenariat devraient inclure des notices d'information à diffuser aux personnels de l'Éducation Nationale. La difficulté réside dans le fait de faire vivre ces conventions et de sensibiliser les magistrats. La bonne volonté ne suffit pas ; des "process, des moyens, des outils" sont nécessaires. Il est vrai que les magistrats (juges des enfants) sont saturés. L'information générale est normalement le but de ces conventions et de leurs annexes, à retravailler lors des réunions.
      • Signalement hors voie hiérarchique : Si un personnel signale des faits sans passer par la voie hiérarchique, c'est bien considéré comme un fait émanant potentiellement de l'Éducation Nationale.
      • Pourcentage de classements sans suite et de faits prescrits (violences sexuelles sur mineurs) : Les chiffres précis ne sont pas immédiatement disponibles mais peuvent être communiqués. Le taux de classement peut être "extrêmement décevant" mais est lié aux difficultés d'établir les faits (déperdition de preuves avec le temps, faits commis en huis clos).
      • Imprescriptibilité des crimes sexuels sur mineurs : La DACG n'y est pas favorable.
      • Arguments contre l'imprescriptibilité :Ne résoudrait pas forcément les problèmes liés à la déperdition de preuves et aux difficultés d'enquête dans des affaires anciennes.
      • Ne garantirait pas nécessairement que plus de victimes parlent (certaines pourraient être incitées à parler avant la date butoir actuelle).
      • Remettrait en cause la cohérence d'ensemble des délais de prescription du système répressif français.
      • Conseils pour déposer plainte rapidement : Il est préférable de s'adresser directement à un service d'enquête (commissariat/gendarmerie) plutôt qu'uniquement par courrier au procureur, pour gagner du temps. Il est conseillé de se faire accompagner par une association d'aide aux victimes pour être orienté et soutenu.
      • Plaintes sur faits prescrits et protection des jeunes : L'ouverture systématique d'enquêtes sur faits prescrits (dépêche de 2021) permet, outre les objectifs déjà cités (écoute de la victime, audition de l'auteur, lutte contre l'impunité), de rechercher d'autres victimes ou risques potentiels en investigant l'environnement de l'auteur. Le lien avec d'autres parquets est implicite via ces enquêtes qui peuvent révéler des faits commis ailleurs.
      • Association au plan "Brisons le silence, agissons ensemble" : Le Ministère de la Justice n'a pas été associé à l'élaboration de ce plan.

      8. Conclusion

      Beaucoup a été fait en matière de politique pénale et de renforcement de l'arsenal juridique pour lutter contre les violences sur mineurs, mais "beaucoup reste à faire".

      La DACG est attentive aux remontées du terrain et aux travaux de la commission d'enquête. Des efforts sont déployés pour renforcer les partenariats, améliorer le traitement des dossiers (même prescrits) et utiliser les outils numériques pour pallier les manques.

      Les défis persistent, notamment en termes de moyens pour les juridictions et d'uniformisation des pratiques à l'échelle nationale.

      La position sur l'imprescriptibilité est claire : elle n'est pas considérée comme une solution pertinente.

    1. Document de synthèse : Audition sur les violences scolaires de l'Inspection générale de l'éducation, du sport et de la recherche

      Source : Extraits de la vidéo "🔴Violences scolaires : audition de l’Inspection générale de l'éducation, du sport et de la recherche"

      Date de l'audition : 31 mars 2025 (selon l'extrait, bien que le député mentionne une audition de Caroline Pascal le 31 mars dernier)

      Participants de l'IGESR :

      • Dominique Marchand, Directrice de l'IGESR
      • Christelle Gillard, Chef du pôle des affaires juridiques et contrôle

      • Éric Roser, Responsable du collège expertise administrative et éducative

      Objet de l'audition : Examiner le rôle, le fonctionnement et l'activité de l'Inspection générale de l'éducation, du sport et de la recherche (IGESR) en matière de contrôle et d'enquête, particulièrement en lien avec les violences scolaires, et ce dans le cadre de travaux d'enquête parlementaire.

      Points clés et thèmes principaux :

      Mission et Périmètre de l'IGESR :

      • L'IGESR est issue de la fusion de quatre inspections générales en 2019 (IGUNR, Inspection des bibliothèques, Inspection du sport et de la jeunesse).

      • Son champ d'activité est large, couvrant l'éducation nationale, l'enseignement supérieur, la jeunesse, les sports, les bibliothèques et la lecture publique.

      • Ses missions se déclinent en trois axes : évaluation des politiques publiques (force de proposition), inspection et contrôle, appui et accompagnement des acteurs locaux et des établissements.

      • Citation : "...notre activité est multiple dans chacun des champs d'action de l'inspection générale qui si je devais les synthétiser en trois axes serait de l'ordre de l'évaluation des politiques publiques... inspection contrôle naturellement... et puis aussi des missions d'appui d'accompagnement des acteurs locaux des établissements."

      Conditions de Saisine et Autorité :

      • L'IGESR intervient exclusivement sous l'autorité et à la demande des ministres de tutelle (ministre de l'éducation, de l'enseignement supérieur, de la recherche, de la jeunesse, des sports ou du Premier ministre).

      • Il n'y a pas d'autosaisine de l'inspection générale.

      • Les rapports sont remis aux ministres commanditaires qui décident de leur usage, de la mise en œuvre des recommandations et de leur diffusion/publication.

      • Citation : "...nous intervenons sous l'autorité du ministre des ministres de tutelle sur leur saisine qu'il n'y a pas d'autosésine de l'inspection générale..." et "...il appartient au commanditaires de mettre en œuvre d'utiliser les recommandations... et enfin d'autoriser la diffusion ou la publication des rapports."

      • Les critères qui poussent un ministre à saisir l'IGESR sont généralement : la gravité de la situation, la nécessité d'un dépaysement (éviter un traitement local), le niveau de responsabilité des personnes impliquées (ex: chef d'établissement), la médiatisation de l'affaire, ou le manque de ressources ou de capacité d'expertise dans l'académie concernée.

      Distinction entre Enquête Administrative et Contrôle :

      • Enquête administrative : Déclenchée suite à des signalements précis ou des situations avérées.

      La méthodologie est très rigoureuse, incluant de nombreuses auditions individuelles avec procès-verbaux. L'objectif est d'établir la matérialité des faits et d'identifier les responsabilités individuelles.

      Elles sont susceptibles d'entraîner des suites disciplinaires. Elles obéissent à un Vademecum méthodologique public.

      • Contrôle : Vise plutôt la régularité des actes au regard des prescriptions législatives et réglementaires.

      La méthodologie est considérée comme moins rigoureuse que celle des enquêtes administratives, bien que des auditions puissent avoir lieu.

      Il s'agit plus de vérifier la conformité à un certain nombre de critères.

      • Citation (Christelle Gillard) : "en ce qui concerne effectivement ce qui distingue c'est aussi la méthodologie qui va être abordée... l'enquête administrative est susceptible de suite disciplinaires et d'identifier des responsabilités individuelles qui pourraient... établir la matérialité des faits..." et "...un contrôle ça sera peut-être un peu plus léger en terme de rigueur méthodologique puisqueeffectivement on est là pour avoir un certain nombre de critères..."

      • La limite entre les deux peut être fragile, et une mission de contrôle peut évoluer vers une enquête administrative si de nouveaux éléments graves apparaissent.

      Statistiques d'activité (partielles) :

      • En 2024 (jusqu'à la date de l'audition), l'IGESR a réalisé 28 enquêtes administratives tous périmètres confondus.
      • Parmi celles-ci, 17 concernent le périmètre éducation nationale.

      • Sur ces 17, 7 portent sur des établissements scolaires.

      Ces 7 établissements étaient publics. Aucun établissement privé (sous ou hors contrat) n'a fait l'objet d'une enquête administrative de l'IGESR en 2024 à la connaissance des personnes auditionnées.

      • Un seul contrôle d'établissement privé sous contrat a eu lieu en 2024.

      Gestion des missions et Indépendance des Inspecteurs :

      • La désignation des inspecteurs pour une mission relève de la responsabilité du chef de service (la Directrice).

      • Les missions sont attribuées sur la base d'un appel à candidature en interne, en fonction des compétences et de l'expertise.

      • Pour les enquêtes administratives, une formation obligatoire préalable est requise. L'équipe doit inclure un pilote expérimenté et des collègues formés.

      • Chaque mission est accompagnée d'un référent (appui conseil en cours de mission) et d'un relecteur (relecture approfondie du rapport final, "challenge" des conclusions). Ces rôles contribuent à la collégialité et à l'objectivité.

      • Les inspecteurs généraux missionnés signent le rapport sous leur responsabilité.

      • La chef de service transmet le rapport au ministre commanditaire, mais ne modifie pas le contenu du rapport lui-même. En cas de désaccord majeur, une procédure réglementaire spécifique est prévue (lettre séparée au ministre, recours à une commission).

      • L'indépendance des inspecteurs est soulignée, notamment par leur capacité à décider seuls de la nécessité de faire un signalement Article 40 du Code de procédure pénale (transmission au procureur des faits susceptibles de constituer une infraction pénale), information que les ministres découvrent lors de la réception du rapport.

      • Citation (Dominique Marchand) : "...c'est un élément important qui euh témoigne de l'indépendance des inspecteurs généraux qui sont missionnés qui signent sous leur responsabilité..." et "...si les inspecteurs généraux considèrent que il y a lieu de faire un article 40 c'est de leur décision et les ministres eux-mêmes n'en sont n'en sont informés mais que lorsqu'ils reçoivent le rapport dans lequel il est indiqué la nécessité d'une saisine de l'article 40."

      Signalements Article 40 :

      • Depuis 2022, l'IGESR a réalisé 43 signalements Article 40.

      • Répartition par année : 11 en 2022, 17 en 2023, 12 en 2024, 3 en 2025 (jusqu'à la date de l'audition).

      • Dans le périmètre éducation nationale (sur 17 enquêtes administratives en 2023-2024), 4 rapports ont fait l'objet d'un signalement Article 40.

      • Ces signalements concernent souvent des comportements inappropriés susceptibles de revêtir une qualification pénale, tels que l'outrage sexiste aggravé (surtout en cas de figure d'autorité) ou le harcèlement moral (lorsqu'il correspond à la définition du code pénal).

      • L'IGESR propose de transmettre une liste des rapports liés aux violences, VSS et harcèlement pour illustrer directement ces situations.

      Suivi des Recommandations :

      • Le suivi des recommandations formulées dans les rapports n'est pas systématisé ni formalisé réglementairement pour l'IGESR (contrairement à d'autres inspections comme l'IGAS).

      • Il appartient aux ministres commanditaires de donner suite aux recommandations.

      • Des échanges peuvent avoir lieu avec les administrations centrales ou les rectorats dans une logique d'appui et de conseil.

      • Le cas du collège Stanislas est mentionné : un premier point d'étape formalisé a eu lieu avec le rectorat de Paris, puis la mise en œuvre a été laissée à la responsabilité du rectorat.

      Diffusion des Rapports :

      • La publication des rapports est à la décision des ministres.

      • La chef de service de l'IGESR considère qu'il n'est pas "anormal" que tous les rapports ne soient pas diffusés, car l'IGESR a aussi un rôle de conseil confidentiel auprès des ministres.

      • Des circonstances conjoncturelles (changement de gouvernement/cabinet) peuvent influencer la publication.

      • Les rapports non rendus publics sont généralement transmis aux ministres et aux directions d'administration centrale concernées.

      • Des rapports peuvent être transformés en guides ou notes pour les usagers, même s'ils ne sont pas publiés intégralement.

      • La transmission à des tiers (élus, personnalités politiques) dépend du Code des relations du public avec l'administration (CRPA), et peut impliquer l'occultation de parties confidentielles (notamment pour les enquêtes administratives).

      Cas Particuliers et Interrogations :

      Collège Stanislas (Paris) :

      • L'auditeur soulève une contradiction perçue entre des PV d'audition d'élèves et de parents (faisant état d'insultes et propos homophobes, de climat raciste) et le rapport final qui, selon l'auditeur, minimiserait ces phénomènes (homophobie présentée comme révolue, racisme non mentionné, homophobie non caractérisée d'institutionnelle ou systémique).

      • Les représentants de l'IGESR défendent le rapport, estimant qu'il décrit précisément les constats (règlement intérieur, voyages scolaires, livrets genrés, difficultés entre élèves) et mentionne un "climat de rejet de l'homosexualité par de conséquences propistes au risque d'homophobie", même si le caractère institutionnel n'a pas été retenu par les inspecteurs.

      • L'absence de mention explicite du racisme dans le rapport est notée, sans explication immédiate des auditionnés.

      • La chef de service rappelle que la responsabilité des constats et conclusions revient aux inspecteurs missionnés qui ont signé le rapport. Elle s'engage à relayer ces interrogations auprès d'eux.

      • La question de la caractérisation "systémique" ou "institutionnelle" renvoie à l'analyse des inspecteurs basée sur l'ensemble des éléments recueillis, y compris ceux à "décharge" (éléments qui contredisent l'existence de problèmes).

      • Lycée Avros (Lille) :

      • Le rapport de l'IGESR de juin 2020 est jugé très favorable sur le fonctionnement pédagogique de l'établissement.

      • L'auditeur s'interroge sur la rupture exceptionnelle du contrat d'association de l'établissement 3 ans plus tard (2023), décision prise par le préfet sur la base de griefs (notamment pédagogiques) mais qui ne semblent pas fondés sur le rapport de l'IGESR.

      • L'IGESR confirme que son rapport de 2020 était positif et ne fondait pas une décision de rupture de contrat.

      • La question est soulevée de l'intervention d'une Chambre Régionale des Comptes sur des aspects pédagogiques, ce qui semble inhabituel pour l'IGESR ("objectivment que c'est à la chambre régionale ou à la cour des comptes que vous devriez poser la question").

      • Lycée Pierre Bayen (Châlons-en-Champagne) :

      • Le rapport analyse les dysfonctionnements dans la transmission de signalements de violences sexuelles d'un enseignant sur des élèves.

      • Une préconisation interpelle : proposer une sensibilisation/formation aux cadres supérieurs de l'académie de Reims sur les modalités d'exercice de l'Article 40.

      • Les représentants de l'IGESR confirment que le rapport met en évidence des défaillances à tous les niveaux de la hiérarchie (chef d'établissement, DSDEN, rectorat) dans le traitement de cette affaire.

      • Ils reconnaissent un besoin général de clarification concernant l'articulation entre les enquêtes administratives/disciplinaires et les procédures pénales (Article 40), car cette distinction n'est pas toujours bien intégrée par les acteurs, malgré les efforts d'information.

      • Questions supplémentaires et Réponses :

      • Existence de "CTIG" (Correspondants territoriaux de l'inspection générale) : Oui, ils existent au niveau académique et en région académique.

      • Programme de travail des inspections territoriales (IA-IPR, IEN) : Arrêté par le recteur. L'IGESR n'intervient pas dans l'appréciation de ce programme (contrairement à l'IGEN avant la fusion).

      • Inspection de l'enseignement privé par les corps territoriaux : L'auditeur s'interroge s'ils y portent suffisamment attention. L'IGESR ne se prononce pas directement sur ce point, mais note que la modification de l'évaluation des enseignants a réduit le nombre d'inspections individuelles dans les classes en général.

      • Remontée des "faits établissement" les plus graves : Ils remontent aux Hauts fonctionnaires de défense et de sécurité. L'IGESR n'est pas destinataire de ces signalements ("De fait établissement l'inspection générale n'est pas destinataire").

      • Création d'une unité dédiée aux signalements de violence scolaire dans les académies : Il existe déjà des conseillers techniques Établissement et Vie Scolaire et les services de défense et de sécurité académique qui ont cette mission. L'IGESR ne formule pas de préconisation sur une nouvelle unité.

      • Mémoire institutionnelle de l'IGESR : L'auditeur regrette que les nouveaux arrivants ne puissent pas répondre sur des dossiers passés (ex: Avros transmission rapport élus). L'IGESR répond qu'ils peuvent interroger les collègues qui ont mené les missions, la responsabilité incombant aux signataires du rapport.

      • Immaculée Conception de Pau : L'IGESR n'a pas été saisie de ce sujet.

      • Signalements directs reçus par l'IGESR : C'est rarissime. Lorsqu'ils en reçoivent (non VSS pour le cas mentionné), ils transmettent au rectorat concerné, car cela relève de sa responsabilité.

      • Recours pour les enseignants (harcèlement hiérarchique, etc.) : Voie hiérarchique ou recours aux corps d'inspection (ex: Inspection Santé Sécurité au Travail) qui existent dans chaque rectorat avec une garantie de confidentialité.

      • Publication des comptes des établissements privés (ex: Boramau/Betaram) : Le contrôle de la publication des comptes ne relève pas du ressort de l'IGESR. C'est une mission externe à leur périmètre.

      Principales lacunes ou points non éclaircis :

      • L'explication complète de l'absence de mention du racisme dans le rapport Stanislas, malgré des témoignages rapportés dans les PV d'audition.

      • Les raisons précises et les fondements de la rupture du contrat d'association du lycée Avros, qui semblent déconnectées du rapport favorable de l'IGESR.

      • Les modalités de coordination formelle entre l'IGESR et d'autres inspections (IGF, IGAS) ou organismes de contrôle (Chambres régionales des comptes) pour assurer une vision complète (le "360") lors d'une inspection d'établissement.

      L'audition mentionne des saisines conjointes, mais pas de mécanismes systématiques pour avoir connaissance des contrôles passés par d'autres instances.

      • Le détail des chiffres d'inspection des établissements privés (sous et hors contrat) par les corps d'inspection territoriaux.

      • La raison pour laquelle la liste des rapports réalisés n'apparaît plus dans le rapport d'activité annuel 2023-2024 (expliqué comme un possible oubli, mais la raison de cet oubli n'est pas totalement éclaircie).

      Synthèse générale :

      • L'audition a mis en lumière le rôle central mais strictement encadré de l'IGESR, qui intervient uniquement à la demande des ministres.

      La distinction entre enquête administrative (rigoureuse, identification des responsabilités individuelles, susceptibles de suites disciplinaires et d'Article 40) et contrôle (vérification de la conformité réglementaire) est cruciale, bien que la limite puisse être ténue sur le terrain. L'indépendance des inspecteurs dans la conduite des missions et la rédaction des rapports, ainsi que leur capacité à décider des Article 40, est un point fort souligné.

      • Cependant, des interrogations majeures subsistent concernant la cohérence entre les témoignages recueillis lors des enquêtes (notamment à Stanislas concernant l'homophobie et le racisme) et le contenu final des rapports publiés, soulevant des questions sur la manière dont l'analyse des faits est conduite et présentée.

      La complexité de la coordination entre les différentes instances de contrôle de l'État (IGESR, Chambres des Comptes, corps territoriaux) et le manque de mécanismes systématiques d'échange d'informations apparaissent comme un défi pour assurer un traitement exhaustif des situations complexes impliquant plusieurs domaines (pédagogique, financier, protection de l'enfance, etc.).

      Enfin, la difficulté à obtenir des réponses claires sur des dossiers passés (Avros, signalements directs) en raison des changements de direction de l'IGESR soulève des questions sur la mémoire institutionnelle de cette instance.

      L'audition sur le lycée Pierre Bayen met en évidence le besoin persistant de formation sur les procédures de signalement (Article 40) au sein même de l'administration.

    1. Document de synthèse : Audition du Président de la Haute Autorité de Santé sur les difficultés d’accès aux soins

      Date : [Date de l'audition - à déterminer à partir du contexte si nécessaire, sinon laisser indéterminé] Source : Excerpts from "🔴 Le président de la Haute autorité de santé sur les difficultés d’accès aux soins" Intervenants : Président de la Haute Autorité de Santé (HAS), Rapporteur, Députés (M. Maudé, M. Tivoli, Mme Cornelou, M. Bernard)

      Synthèse :

      Cette audition du Président de la Haute Autorité de Santé (HAS) devant une commission d'enquête parlementaire portait sur l'organisation du système de soins de santé et les difficultés d'accès aux soins en France.

      Le Président a détaillé les trois grandes missions de la HAS : l'évaluation des produits de santé à des fins de remboursement, l'élaboration de recommandations de bonnes pratiques, et l'évaluation (certification) des établissements de santé et médico-sociaux.

      L'accent a été mis sur la certification des établissements de santé, ses enjeux (qualité, insertion territoriale, parcours patient) et ses résultats (87% des établissements certifiés, dont 23% avec mention "haute qualité des soins").

      Des questions ont été soulevées concernant les différences de résultats entre établissements publics et privés, le rôle des indicateurs de qualité, les compétences partagées (délégations de tâches), la gestion des événements indésirables graves, et l'évaluation des Établissements d'Hébergement pour Personnes Âgées Dépendantes (ÉPAD).

      Principaux Thèmes et Idées :

      • Missions et Rôle de la Haute Autorité de Santé (HAS):
      • La HAS a été créée en 2004 avec deux missions fondamentales :
      • Évaluer les produits de santé (médicaments, dispositifs médicaux) à des fins de remboursement. "...identifier évaluer les produits de santé à des fins de remboursement..."
      • Promouvoir la qualité des pratiques dans le système de santé, notamment par l'élaboration de recommandations et la certification des établissements. "...promouvoir la qualité des pratiques dans le système de santé..."
      • Depuis 2017/2018, la HAS évalue également les établissements sociaux et médico-sociaux, en plus des établissements de santé.
      • La HAS a également un rôle consultatif sur les protocoles de coopération (compétences partagées).
      • Certification des Établissements de Santé :
      • Il s'agit d'une évaluation externe et obligatoire, réalisée tous les quatre ans, pour environ 2500 établissements de santé en France (publics, privés, HAD).
      • Le 6ème cycle de certification débutera en septembre prochain.
      • Les enjeux majeurs de la certification incluent la qualité des soins, l'insertion territoriale de l'établissement, et la construction du parcours de soins pour les patients. "...nous tenons à valoriser l'insertion territoriale de l'établissement de santé et également sa construction du parcours de soins pour les patients..."
      • Le référentiel d'évaluation distingue trois niveaux de mesure : le patient (information, conditions), les équipes de soins (maîtrise des risques), et l'établissement (gouvernance, politique RH, santé des professionnels).
      • L'insertion territoriale est un critère clé, évaluant la coopération et la coordination de l'établissement avec les acteurs de ville, les autres établissements, et la prise en charge à domicile. "...la prise en charge d'un patient le parcours d'un patient nécessite une interdépendance que nous n'avons jamais connu entre des acteurs très divers du système de santé sur un territoire et donc cela signifie qu'il faut une coordination très fine du parcours des patients depuis l'amont de l'établissement jusqu'à l'aval de l'établissement..."
      • Le nouveau référentiel intégrera l'évaluation de l'utilisation de la télésanté pour améliorer les parcours patients.
      • Résultats de la certification (fin 2024) :4% des établissements sont non certifiés.
      • 9% sont certifiés sous condition (doivent répondre à des critères dans l'année).
      • 87% sont certifiés.
      • 23% sont certifiés avec la mention "haute qualité des soins".
      • Comparaison Public/Privé : Le pourcentage d'établissements non certifiés ou certifiés sous condition est similaire dans le public et le privé. Cependant, les établissements privés (lucratifs ou non) ont deux fois plus d'établissements certifiés avec mention "haute qualité des soins" (29-30% vs 15% pour le public). Une hypothèse (non démontrée) est que les établissements privés ont une offre de soins moins large, ce qui pourrait faciliter l'atteinte de la haute qualité.
      • Conséquences de la certification : Une certification simple ou haute qualité a des conséquences sur la dotation de l'établissement. Un établissement non certifié ou certifié sous condition doit discuter avec l'ARS locale pour se remettre en situation. La fermeture n'est pas la logique française.
      • La certification est une décision de la HAS, susceptible de recours gracieux par l'établissement.
      • La certification HAS est une évaluation de pair à pair, réalisée par des professionnels de santé externes, et non un contrôle inopiné par des agents avec pouvoir de police. "...nous ne pouvons pas comme cela débarquer dans un établissement pour organiser la certification mais surtout la GS a été créée comme une autorité publique indépendante à caractère scientifique... pour nous la finalité c'est de mesurer la qualité afin de l'améliorer..."
      • Indicateurs de Qualité et de Sécurité des Soins :
      • La HAS élabore ces indicateurs depuis 2006.
      • Il existe un recueil national permettant un suivi sur près de 20 ans pour certains indicateurs.
      • Les indicateurs couvrent divers aspects (événements indésirables associés aux soins, infections nosocomiales, prise en charge de la douleur, suivi post-intervention).
      • On constate une amélioration dans certains domaines mesurables (infections nosocomiales, AVC).
      • Utilité des indicateurs :Pour l'établissement et les professionnels (évaluation de la qualité).
      • Pour les usagers (accessibles en ligne sur le site de la HAS via Caliscope).
      • Pour la régulation et potentiellement le financement basé sur la qualité.
      • Compétences Partagées / Délégations de Tâches :
      • La HAS est amenée à rendre des avis sur les protocoles de coopération entre professionnels de santé (ex: orthoptistes, kinésithérapeutes, infirmiers).
      • La HAS est extrêmement favorable aux compétences partagées, à condition qu'elles soient encadrées, notamment en termes de formation du professionnel qui reçoit ces compétences et de s'assurer du jusqu'où on peut aller.
      • L'idée de l'accès direct sans diagnostic médical préalable n'est pas exclue mais dépend de la pathologie et nécessite un grand encadrement.
      • La HAS émet des avis sur les protocoles mais ne dispose pas de données chiffrées à postériori sur les bénéfices concrets de ces expérimentations (réduction des délais, augmentation de la couverture vaccinale, etc.).
      • Gestion des Événements Indésirables Graves (EIG):
      • La HAS centralise les déclarations anonymisées des EIG liées aux soins.
      • Il existe une sous-déclaration des EIG, notamment en médecine de ville. L'augmentation des déclarations est vue positivement car elle révèle des problèmes qui n'étaient pas visibles auparavant.
      • Les causes des EIG sont souvent multiples, incluant la fatigue des soignants, la pression, la surcharge, et des facteurs organisationnels. "...souvent multicausal bien entendu et souvent l'événement indésiable est survenu parce qu'il y a eu plusieurs facteurs plusieurs causes profondes qui sont enchaînées et aucune barrière de sécurité n'a permis d'interrompre le processus..."
      • 63% des EIG seraient évitables, dont la moitié liée à des erreurs d'organisation ou des retards de prise en charge.
      • Ratio Soignant/Soigné et Conditions de Travail :
      • La HAS est sollicitée par le ministère pour travailler sur ce sujet suite à une loi votée.
      • Le ratio est un déterminant de la qualité des prises en charge, mais pas le seul. L'organisation et le travail d'équipe sont également cruciaux. "...le le ce n'est qu'un des un des déterminants de la qualité des prises en charge ce n'est pas le seul donc il ne faut pas dire que ce cette loi travailler sur les ratios ça va permettre à soi seul d'améliorer la qualité des prises en charge mais c'est un des déterminants incontestablement..."
      • La qualité peut parfois être maintenue malgré des sous-effectifs grâce à l'engagement, la coordination et l'organisation des équipes.
      • Formation des Professionnels de Santé :
      • Le Président de la HAS, en tant qu'ancien président d'université, s'interroge sur l'impact de Parcoursup sur la diversité des profils d'étudiants infirmiers, craignant une possible diminution de la promotion sociale.
      • Concernant la formation des infirmiers de bloc opératoire (IBOD) et la possibilité pour d'autres infirmiers (IDE) d'exercer certaines missions : il s'agit d'une position pragmatique face au manque d'IBOD, visant à assurer le fonctionnement des blocs, même si l'idéal est d'avoir des infirmiers très bien formés spécifiquement.
      • Une formation au management pour les chefs de service ou de pôle est jugée indispensable mais n'est pas systématiquement requise dans la formation initiale.
      • Évaluation des Établissements Sociaux et Médico-Sociaux (dont ÉPAD) :
      • La HAS a pour mission d'évaluer (non certifier) environ 41000 établissements et services sociaux et médico-sociaux tous les 5 ans (soit 8000 par an).
      • La HAS n'a pas les moyens d'évaluer directement tous ces établissements. Elle élabore le référentiel, et le COFRAC (Comité français d'accréditation) accrédite des organismes qui réalisent les évaluations sur le terrain. La HAS analyse ensuite les résultats.
      • L'évaluation des ÉPAD a réellement débuté en 2023-2024. Les résultats des évaluations seront publiés à partir de juillet, incluant une synthèse, un score qualité et un "radar" des notations sur les critères.
      • Une particularité notable est que l'établissement social ou médico-social choisit l'organisme évaluateur et le rémunère, ce qui "mérite une réflexion".
      • Maltraitance :La HAS a publié des recommandations pour identifier la maltraitance dans les établissements.
      • La bientraitance fait partie des critères d'évaluation, avec des critères impératifs sur l'existence d'une organisation et d'un plan de prévention, et sur la prise en compte des signalements et plaintes.
      • La HAS reçoit de nombreux courriers d'usagers, dont certains signalent des situations de maltraitance en ÉPAD. Si la situation est jugée inquiétante, la HAS peut faire un article 40 du code de procédure pénale (signalement au procureur).
      • La suppression potentielle de signalements internes en ÉPAD relève du domaine de l'inspection et du contrôle (missions des ARS ou conseils départementaux), et non de l'évaluation HAS.
      • Qualité et Moyens :
      • Il est reconnu que disposer de moyens suffisants (effectifs, personnel formé) participe à la qualité. "...les moyens de disposer des ressources à la fois en quantité et aussi en personnes formées ayant les bonnes formations les bonnes qualifications c'est sûr que ça participe à la possibilité d'avoir une bonne qualité..."
      • Cependant, la qualité ne dépend pas uniquement des moyens mais aussi d'autres facteurs comme l'organisation, la coopération d'équipe, la coordination interprofessionnelle. Des établissements moins bien dotés peuvent parfois avoir une meilleure qualité grâce à ces facteurs.
      • Certification Périodique des Professionnels de Santé et Accréditation des Médecins :
      • La certification périodique, prévue pour être opérationnelle depuis janvier 2023, sera un critère de qualité important, complétant le développement professionnel continu.
      • L'accréditation des médecins des spécialités à risque (chirurgiens, anesthésistes) est volontaire et concerne environ 10 000 médecins accrédités (principalement dans le privé). C'est un critère de qualité quantifiable, soutenu par l'assurance maladie. Le Président souhaiterait voir le nombre de médecins accrédités augmenter.
      • Différences entre Instances de Santé Publique :
      • La HAS (évaluation produits/pratiques/établissements) a des missions distinctes du Haut Conseil de la Santé Publique (HCSP - avis sur santé publique, stratégies vaccinales), de la Conférence Nationale de Santé (CNS - démocratie sanitaire), et du Comité Stratégique de l'Innovation en Santé (CSIS). L'articulation avec le HCSP se fait principalement sur certaines recommandations vaccinales.

      Points à Retenir :

      • La HAS joue un rôle central dans l'évaluation et la promotion de la qualité dans le système de santé français, mais ses pouvoirs (notamment en ÉPAD) sont parfois indirects (élaboration de référentiels, analyse des résultats d'organismes tiers) ou se limitent à l'alerte (article 40).
      • Les différences de performance (notamment en haute qualité de soins) entre établissements publics et privés méritent une réflexion approfondie.
      • L'accès aux soins et la qualité ne dépendent pas uniquement des effectifs mais aussi fortement de l'organisation et du management. La formation au management des professionnels de santé est un point crucial.
      • La gestion des événements indésirables graves et la lutte contre la maltraitance sont des sujets prioritaires pour la HAS, mais la sous-déclaration et le manque d'organisation dans certains établissements constituent des défis majeurs.
      • Les compétences partagées sont vues positivement par la HAS, mais l'évaluation de leur impact réel nécessite des données qui ne sont pas toujours disponibles au niveau de la HAS elle-même.
      • La transparence des résultats d'évaluation (ÉPAD, certification) est en progrès (publication prochaine des rapports d'évaluation ÉPAD).

      Prochaines Étapes :

      Publication des résultats d'évaluation des ÉPAD à partir de juillet.

      Démarrage du 6ème cycle de certification des établissements de santé avec un nouveau référentiel intégrant notamment l'évaluation de la télésanté et la formation au management.

      Travail sur les recommandations concernant le ratio soignant/soigné.

      Développement d'outils et de guides pour les établissements sur la prévention et la lutte contre la maltraitance.

      Possible envoi du référentiel de certification détaillé par la HAS à la commission.

    1. Document de synthèse : Les nouveaux enjeux de la paternité et l'égalité parentale

      Ce document de synthèse examine les principaux thèmes et idées issus d'échanges entre journalistes, créateurs de podcasts et législateurs concernant les enjeux contemporains de la paternité en France, notamment à travers le prisme de l'égalité parentale et des droits des femmes.

      La discussion met en lumière les défis persistants en matière de partage des tâches, le rôle crucial des congés parentaux et l'importance d'une éducation qui valorise le "prendre soin" et les émotions.

      1. Le Constat de l'Inégalité et l'"Arnaque des Nouveaux Pères"

      Un thème central qui émerge est le décalage entre la perception que les pères ont de leur implication et la réalité statistique du partage des tâches parentales et domestiques.

      Stéphane Jourdin et Guillaume Daudin, co-auteurs de la bande dessinée "L'arnaque des nouveaux pères", soulignent cette dichotomie :

      "Pour nous il y a arnaque parce qu'il y a un delta entre c'est ce que vous venez de dire entre la perception qu'ils ont et l'affichage qu'ils en font… et qu'ils sont pas les seuls à faire… il y a un décalage entre cette perception cet affichage et la réalité des des statistiques euh froides presque métallique de l'INC..."

      Les chiffres de l'INSEE révèlent que malgré des avancées, la répartition des tâches reste très inégalitaire : les femmes effectuaient environ 80% des tâches domestiques et parentales en 1986, un chiffre qui n'est descendu qu'à 70% au début des années 2010.

      Les hommes tendent à prendre en charge les tâches plus gratifiantes, laissant les tâches ingrates aux femmes.

      Plusieurs "stratégies de défausse" sont identifiées chez les pères, notamment le père qui "vient en aide" sans organiser, le "feignant", l'"élogieux" ("tu fais ça si bien ma chérie"), ou celui qui a des "standards différents".

      L'arrivée de l'enfant exacerbe souvent cette inégalité en augmentant le volume global des tâches et en renforçant la répartition traditionnelle.

      2. Le Rôle Clé des Congés Parentaux : Paternité et Parental

      La discussion met en évidence l'importance cruciale des congés, notamment le congé paternité, pour favoriser un investissement égal des parents dès la naissance.

      Le système actuel (7 jours obligatoires, 28 jours possibles, peu rémunérés) est considéré comme un message clair qui relègue le père au rôle de "parent auxiliaire" ou "assistant".

      "Le message est très clair c'est-à-dire que le boulot du papa ça va être de retrouver le travail de retourner au boulot et que tout ce qui est avant bon bah il peut traîner dans le coin avoir une oreille qui traîne mais l'essentiel des de la transmission sur comment on s'occupe d'un d'un nouveau né va être adressé essentiellement à la mer..."

      Un congé égal pour les deux parents est fortement préconisé, avec une durée plus longue et une partie obligatoire significative.

      L'exemple de la Suède, où le père est encouragé à se retrouver seul avec l'enfant pendant plusieurs semaines ou mois, est cité comme un modèle incitatif qui change fondamentalement l'implication paternelle.

      Il existe un problème de transmission pour les pères, qui ne bénéficient pas de la même socialisation au "prendre soin" que les mères.

      Concernant le congé parental, sa faible utilisation par les hommes (0,8% contre 12% pour les femmes) est attribuée à sa très faible rémunération. Un projet de réforme envisagé (raccourcir sa durée à 4 mois par an mais mieux le rémunérer) est accueilli avec prudence.

      Il est souligné que toute évolution du congé parental sans prérequis paritaire sur les congés maternité et paternité pourrait renvoyer les femmes à la maison, particulièrement les femmes précaires.

      3. L'Adaptation du Monde du Travail et les Freins Structurels

      Le monde de l'entreprise est identifié comme un frein majeur à l'égalité parentale.

      L'organisation du travail reste pensée pour des hommes sans enfants ou qui ne s'en occupent pas.

      La conciliation vie professionnelle-vie familiale est rendue difficile par des horaires rigides, des réunions à des heures clés et une culture d'entreprise qui valorise la disponibilité totale.

      "Aujourd'hui quand vous avez un enfant de 3 mois l'entreprise attend autant de vous que quand vous en avez un de 20 les demandes sont pas du tout les mêmes quand vous revenez de congé paternel bah du jour au lendemain vous reprenez complètement le travail et du jour au lendemain vous avez les mêmes horaires les mêmes tâches et on attend les mêmes choses de vous..."

      Plusieurs leviers sont proposés pour favoriser un meilleur investissement des pères, notamment :

      Rendre la flexibilité obligatoire dans les entreprises : plages horaires aménageables, télétravail accessible, jours d'absence familiaux prévus dans les accords, sans dépendre du bon vouloir du manager.

      Former les managers : les aider à reconnaître la parentalité comme un enjeu de performance durable. Changer le regard social : combattre la pression sociale et le sentiment d'être moins fiable lorsqu'un père s'occupe de ses enfants.

      Agir au niveau des pouvoirs publics : montrer l'exemple dans la fonction publique et conditionner certains avantages fiscaux ou aides aux entreprises à des politiques familiales claires.

      La question de l'utilisation des congés enfants malades (seulement 20% pris par les hommes) illustre parfaitement cette pression sociale et structurelle.

      Il est suggéré d'inscrire dans le code du travail le "droit au respect de la vie de famille" sur le modèle du droit à la déconnexion, afin de sortir de la négociation individuelle.

      4. L'Accompagnement de l'Adolescence et la Santé Mentale

      La période de l'adolescence présente des enjeux spécifiques pour les parents et les enfants, notamment en matière de santé mentale.

      Les dispositifs d'accompagnement pour les parents d'adolescents sont jugés insuffisants. La question de la santé mentale des adolescents, particulièrement touchés par l'anxiété et la dépression, est une préoccupation majeure.

      Il est souligné l'urgence d'un parcours de soins en santé mentale mieux remboursé et accessible.

      Au-delà des soins directs aux adolescents, il est essentiel d'aider les parents à prendre conscience de l'importance de la santé mentale et à gérer les difficultés au sein de la cellule familiale.

      Un aspect novateur soulevé concerne l'impact de l'espace public et virtuel sur les adolescents.

      Le manque d'autonomie dans le monde réel (réduction drastique du périmètre de déplacement autonome) et l'exposition non régulée au monde virtuel (réseaux sociaux, masculinisme) sont identifiés comme des facteurs contribuant aux troubles de santé mentale et à la perpétuation de stéréotypes sexistes.

      "... on a complètement on disait on a trop légiféré par rapport à la responsabilité pénale des maires et et oui permet juste de dire je trouve que ce que vous là vous nous ouvrez un champ qu'on avait pas encore beaucoup exploré là mais la question de l'espace de des enfants et des et des adolescents dans l'espace public enfin comment l'accompagnement des parents c'est aussi ça c'est comment on a une géographie un espace public accueillant et qui comprend les besoins des parents et des des enfants donc je trouve ça vraiment passionnant..."

      Des pistes de politiques publiques sont évoquées, comme se positionner sur la régulation des réseaux sociaux (exemple australien d'un âge minimum), repenser l'aménagement urbain pour favoriser l'autonomie des enfants et des adolescents, et même interroger le droit de vote local pour redonner du pouvoir aux adolescents.

      5. L'Éducation pour l'Égalité : Famille, École et Représentations Culturelles

      L'éducation joue un rôle fondamental dans la construction d'une société plus égalitaire.

      L'exemple des cours d'"économie domestique" en Suède est cité comme une illustration d'une éducation concrète qui apprend aux garçons comme aux filles à maîtriser les tâches du quotidien et qui remet en question les inégalités persistantes.

      "... on a assisté à un cours qui s'appelle de mémoire MKCAP c'est en gros euh économie domestique donc en fait on était dans une salle avec ces collégiens qui pendant 2h ont commencé par préparer des lasagnes... et puis ils sont posés à une table et là ils ont regardé des vidéos éducatives où on leur expliquait que tout ça c'était très bien mais que leur pays restait quand même assez inégalitaire que les femmes en faisaient encore beaucoup plus à la maison que les hommes euh et que et que ça n'allait pas..."

      Il est souligné que l'éducation ne se limite pas à l'école ; l'exemple donné par les parents à la maison est essentiel. Si les enfants voient un partage inégal des tâches, cela renforce les stéréotypes.

      D'autres pistes éducatives incluent :

      Éduquer aux émotions dès le plus jeune âge : apprendre aux enfants à reconnaître et gérer leurs émotions (peur, angoisse), notamment à travers des approches comme la méditation.

      Revaloriser le "prendre soin" : encourager dès la petite enfance une culture qui valorise le soin, chez les garçons comme chez les filles, en combattant l'infériorisation associée au féminin.

      Repenser les représentations culturelles : promouvoir dans les livres, dessins animés et autres supports des modèles de familles et de rôles plus égalitaires.

      Questionner la charge mentale des enfants : alléger les emplois du temps scolaires, revoir les attentes en matière d'orientation professionnelle dès le collège.

      Réintroduire des compétences du quotidien dans l'enseignement : à l'instar des anciens cours de "techno" qui incluaient cuisine et couture, réintégrer ces savoirs et compétences pratiques, peut-être dès l'école maternelle, en les intégrant dans des apprentissages plus larges.

      Conclusion

      La discussion révèle la complexité des enjeux liés à la paternité et à l'égalité parentale.

      Les défis sont à la fois individuels (perceptions et comportements des pères) et structurels (lois, politiques publiques, organisation du travail, éducation).

      Si des avancées sont observées, le chemin vers une égalité réelle est encore long. Les intervenants soulignent l'importance d'une volonté politique forte, de politiques publiques ambitieuses (notamment en matière de congés et de modes d'accueil), d'une transformation du monde du travail, et d'une éducation qui remet en cause les stéréotypes de genre et valorise le "prendre soin" et les émotions.

      Le rôle des médias (BD, podcasts) pour sensibiliser et ouvrir des espaces de dialogue sur ces sujets intimes est également mis en avant.

    1. Ultimately, the legislation acknowledges that “gender identity is a feeling,” and that “the Civil Rights Code is normally about immutable characteristics based on truth.”

      Starkly different stance the majority of media outlets who aim for neutrality and aim to not offend.

    1. Note de synthèse : Audition de Philippe Baptiste sur sa feuille de route pour l’enseignement supérieur et la recherche

      Date : [Date de l'audition - non précisée dans le texte] Sujet : Examen de la feuille de route du Ministre de l'Enseignement supérieur et de la Recherche, Philippe Baptiste. Source : Extraits de l'audition parlementaire de Philippe Baptiste.

      Vue d'ensemble : L'audition de Philippe Baptiste devant la représentation nationale aborde les défis et les orientations stratégiques du ministère de l'Enseignement supérieur et de la Recherche (MESR).

      Le Ministre présente sa feuille de route centrée sur deux axes majeurs : la souveraineté et la jeunesse, tout en reconnaissant les préoccupations des parlementaires concernant le financement, la vie étudiante, la politisation de l'université, l'adaptation des études, la précarité et l'impact de l'intelligence artificielle.

      Thèmes Principaux et Idées Clés :

      Financement et Budget :

      • Préoccupations des Parlementaires : Plusieurs intervenants soulignent les difficultés financières des universités, dont une large majorité est déficitaire. Ils s'inquiètent du non-respect des paliers budgétaires fixés par la Loi de Programmation de la Recherche (LPR) de 2020 et s'interrogent sur l'efficacité du Crédit d'Impôt Recherche (CIR), jugé coûteux (plus de 7 milliards d'euros par an) et bénéficiant principalement aux grandes entreprises sans impact notable sur l'investissement privé en R&D.

      • Position du Ministre : Philippe Baptiste défend le budget obtenu pour son ministère dans un contexte difficile. Il tient à clarifier la distinction entre le périmètre de la mission "Recherche et Enseignement Supérieur" (MIRES) et le budget direct du MESR. Il affirme que la baisse de crédits évoquée (environ 930 millions d'euros) concerne principalement des programmes budgétaires d'autres ministères et non ceux de la recherche universitaire, de base ou de la vie étudiante.

      • Chiffres Clés Avancés par le Ministre :Progression de 300 millions d'euros sur le budget des universités en 2025 par rapport à 2024, compensant le gel de crédits et dédiant 100 millions aux mesures RH de la LPR.

      • Un prélèvement de trésorerie d'environ 100 millions d'euros sur le CNRS, justifié par des réserves de 1,4 milliards d'euros.

      • Trésorerie cumulée des universités dépassant 5 milliards d'euros, soit près de 150 jours de fonctionnement (chiffre contesté par une parlementaire).

      • La LPR a apporté 2,2 milliards d'euros de financement supplémentaire aux universités.

      • Le budget de la vie étudiante reste stable.

      • Au total, le budget du MESR est en progression de 137 millions d'euros par rapport à 2024.

      • Réflexion sur l'allocation des moyens : Le Ministre reconnaît les limites du système actuel d'allocation des moyens (Sympa) et la nécessité de réfléchir à un modèle plus dynamique, prenant en compte l'activité des universités et les contrats d'objectif et de performance. Un rapport de l'IGF et de l'IGESR sur le modèle économique des universités est attendu.

      • Articulation recherche publique/privée : La France est en retard sur ses objectifs en R&D, en grande partie due à un manque d'investissement privé. Le ministère vise à accompagner la hausse de cet investissement et interrogera l'efficacité des dispositifs existants, notamment le CIR.

      • Axes Stratégiques de la Feuille de Route : Souveraineté et Jeunesse :

      • Conviction du Ministre : "l’enseignement supérieur et la recherche sont les clés de notre avenir".

      • Souveraineté : La recherche contribue à garantir la souveraineté de la France en lui permettant de se projeter dans l'avenir, notamment en matière de technologies de pointe, d'industrie, mais aussi de démocratie et de valeurs. Cela assure l'indépendance stratégique du pays. La recherche structure les agences de programme pour identifier les priorités stratégiques dans des domaines comme le spatial, l'IA, la santé, les composants, mais aussi les sciences humaines et sociales.

      • Jeunesse : L'enseignement supérieur forme la jeunesse, qui aura la responsabilité de l'avenir. Le Ministre est conscient des difficultés matérielles et de santé mentale rencontrées par les jeunes.

      • Ancrage Territorial : Ces deux axes doivent s'ancrer dans les territoires.

      • Vie Étudiante et Conditions de Vie :

      • Précarité Étudiante : La précarité est une préoccupation majeure. Environ un étudiant sur quatre serait en situation de précarité. Le système de bourses actuel prend mal en compte la dé-cohabitation.

      • Réforme du Système de Bourses : C'est un "gros chantier", un enjeu d'organisation, de justice sociale et budgétaire. Le Ministre souhaite avancer sur ce sujet en particulier avec les organisations étudiantes.

      • Logement Étudiant : Malgré des efforts passés (30 000 logements sociaux créés entre 2018-2022, dont 12 000 par les Crous), les besoins sont importants. L'objectif est de rénover 12 000 logements vétustes et de créer 35 000 nouveaux logements d'ici 2027. Le Premier Ministre a annoncé 15 000 logements par an sur 3 ans, privilégiant le social et l'intermédiaire.

      • Restauration Universitaire : 43 millions de repas servis l'an dernier, 10 millions de plus qu'en 2022. Plus de 50% des repas sont à 1€.

      • Aménagement des Études : Nécessité d'adapter les études aux autres activités des étudiants, notamment les sportifs de haut niveau qui ne disposent pas d'un statut clair.

      Valeurs et Libertés Académiques :

      • Défense des Valeurs : Le Ministre met au cœur de son action la défense des valeurs fondamentales :
      • Libertés académiques : "aucun compromis n'est possible".
      • Refus absolu de toute forme de discrimination et de violence : lutte contre les violences sexistes et sexuelles, et combat résolu contre le racisme, "et en particulier contre l'antisémitisme".
      • Lutte contre l'Antisémitisme : Un travail est mené avec le garde des Sceaux. Les procureurs ont reçu des consignes claires pour traiter en priorité les signalements (article 40). Une proposition de loi sur les commissions disciplinaires plus efficaces est attendue.
      • Égalité Homme-Femme : Promotion par un effort renouvelé, notamment pour accroître l'attractivité de la science pour les femmes, où elles sont sous-représentées.
      • Politisation de l'Université : Un parlementaire s'inquiète de l'"instrumentalisation croissante" de l'université à des fins idéologiques, notamment via l'utilisation des fonds de la CVEC et les prises de position partisanes de certains universitaires et présidents d'université.
      • Réponse du Ministre : La CVEC est gérée par les établissements avec la participation des étudiants. L'utilisation des fonds doit être conforme aux valeurs républicaines. L'université est un lieu de savoir, de pluralité, de débat libre. Les présidents d'université ont des pouvoirs de police pour garantir l'ordre public et la continuité du service. Une coopération plus étroite avec les préfets et les forces de l'ordre est mise en place. L'organisation de conférences devrait idéalement assurer une expression pluraliste (charte envisagée par France Universités).

      • Organisation et Simplification :

      • Clause de Revoyure de la LPR : Le Ministre compte la mettre en œuvre pour faire le bilan de ce qui a été fait et ce qui reste à faire.

      • Simplification : Nécessité de réduire la bureaucratie qui touche laboratoires et amphithéâtres. Le mode de financement par appels à projets, devenu majoritaire, doit être "profondément repensé", sans pour autant les supprimer entièrement. D'autres moyens de financement doivent être développés.

      Orientation et Accès à l'Enseignement Supérieur :

      • Ancrage Territorial et Besoins Locaux : Les besoins des territoires doivent guider la définition des cartes de formation pour offrir perspectives d'études et d'emploi aux jeunes. L'objectif est de garantir l'accès à une formation de qualité à proximité. La continuité Bac-3 / Bac+3 est une priorité, en coopération avec le Ministère de l'Éducation Nationale.

      • Orientation : Dialogue constant avec le MEN pour améliorer l'orientation et apporter de la clarté aux jeunes et familles. Parcoursup, bien qu'utile, suscite angoisses et questions. Une approche territorialisée de l'orientation est nécessaire pour préserver les chances d'études du plus grand nombre.

      • Régulation du Privé Post-Bac : Le Ministre s'engage à réguler le secteur privé post-bac. Il existe des formations privées lucratives profitant de la "candeur" des étudiants et dont certaines relèvent de la répression des fraudes. Le Ministère travaille à les réguler, notamment en les retirant de Parcoursup et en collaborant avec le Ministère du Travail pour un label conjoint basé sur Caliopi. Un travail sur le code de l'éducation est envisagé.

      • Bacheliers Professionnels : Ils sont souvent confrontés à l'échec dans le supérieur (taux de réussite très faible en licence). Ils doivent être au cœur du projet. Les formations propédeutiques (type "Wisi") peuvent apporter des réponses pour leur donner les moyens de réussir.

      Intelligence Artificielle :

      • Impact Majeur : L'IA bouleverse déjà la recherche (rédaction, recherche expérimentale, traitement de données). C'est un "défi vertigineux" pour l'enseignement supérieur.

      • Intégration dans la Formation : Il est essentiel d'intégrer l'IA dans les maquettes pédagogiques et d'expliquer aux étudiants son fonctionnement, ses limites et ses biais.

      • Fiabilité des Données : L'IA soulève la question de la fiabilité des données et de l'intégrité des publications scientifiques. Le Ministère soutient l'Open Source et l'Open Data. La régulation doit être menée par les communautés scientifiques, souvent au niveau international.

      Questions Spécifiques :

      • Violences sur les Campus : Préoccupation face aux violences, notamment celles liées à l'extrême gauche ("Antifa"). Le Ministre réaffirme la responsabilité des présidents d'université pour garantir l'ordre public et la sécurité, avec le soutien des forces de l'ordre. Des poursuites judiciaires (article 40) sont prioritaires pour les actes graves, en particulier l'antisémitisme et le racisme.

      • Études de Santé : Réforme nécessaire des études de santé (Pass/LAS). Le système actuel, bien qu'imparfait, a réduit le "gâchis" (moins d'étudiants quittant le système après échec). Cependant, la grande diversité des organisations universitaires rend le système complexe. Un travail de simplification est prévu avec le Ministère de la Santé. Les universités sont le lieu historique de formation en santé et la confrontation à la recherche y est essentielle.

      • "Kilabs" au CNRS : Le projet du président du CNRS a suscité interrogations et questions. Le Ministre a décrété un moratoire. Le CNRS a une politique scientifique propre, différente de la somme des politiques universitaires, mais elle doit être articulée avec celle des établissements et discutée en amont avec le ministère. L'excellence n'est pas uniforme, mais il faut donner à chaque laboratoire les moyens de développer son excellence, y compris les futurs laboratoires d'excellence. La concertation en cours sera déterminante.

      • Égalité des Chances en Classes Préparatoires : Les prépas publiques sont performantes, notamment en sciences, mais des questions se posent sur les filières économiques où les prépas privées payantes sont mieux classées. Le Ministre reconnaît le sujet et souhaite l'étudier.

      • Formation des Enseignants (INSPE) : Bien que la formation soit universitaire, les ESPE travaillent en lien étroit avec les enseignants du premier et second degrés, confrontant les universitaires à la pratique sur le terrain.

      • Rénovation Énergétique du Parc Universitaire : Les besoins sont considérables (estimés à 15 milliards d'euros pour l'ESR, dont 40% de passoires énergétiques). Les budgets actuels sont insuffisants malgré les efforts. Le Ministre mentionne l'impossibilité pour les universités d'emprunter facilement et le rôle du budget de l'État dans cette décision. C'est un sujet important.

      Citations Pertinentes :

      • "le niveau des moyens budgétaires consacrés à la recherche est tout aussi préoccupant contrairement à C nombre de voisins européens la France reste loin de l'objectif de Lisbonne qui est fixé qui a fixé l'effort en matière de recherche à 3 % du PIB" (Parlementaire)

      • "le très coûteux crédit d'impôt recherche du moins de sa version actuelle représentant un coût fiscal de plus de 7 milliards d'euros par an il bénéficie essentiellement aux grandes entreprises sans produire des faits notable sur la part d'investissement privé de recherche dans le PIB" (Parlementaire)

      • "l'enseignement supérieur doit être le lieu qui prépare notre jeunesse à se saisir du monde dans toute sa complexité" (Ministre)

      • "l’enseignement supérieur et la recherche sont les clés de notre avenir" (Ministre)

      • "par la recherche nous pouvons contribuer à garantir la souveraineté de notre pays" (Ministre)

      • "pour l'enseignement nous formons la jeunesse de notre pays qui aura en assumé la responsabilité souveraineté et jeunesse voici les deux axes de ma feuille de route" (Ministre)

      • "les libertés académiques sur lesquelles aucun compromis n'est possible" (Ministre)

      • "le refus absolu le refus absolu de toute forme de discrimination et de toute violence" (Ministre)

      • "la promotion des passe par un effort renouvelé en faveur de l'égalité homme-femme dans l'enseignement supérieur et la recherche" (Ministre)

      • "le système d'enseignement supérieur et de la recherche doit avoir les moyens ces moyens il faut les utiliser au mieux au service des chercheurs des enseignants chercheurs des enseignants des étudiants ou encore des personnels non ignants" (Ministre)

      • "faciliter le travail de tous des laborato aux enfi c'est aussi travailler sur un chantier de simplification" (Ministre)

      • "l'université doit rester un lieu de savoir et de débat libre mais elle ne peut pas devenir un instrument de militantisme financé par des contributions obligatoire d'étudiants" (Parlementaire)

      • "la recherche et l'élévation du niveau général d'éducation sont nos rempart contre l'obscurantisme et nos leviers face au bouversement climatique" (Parlementaire)

      • "un plan social de masse se prépare dans les universités et les laboratoires français" (Parlementaire - LFI)

      • "l'objectif n'est pas l'efficacité budgétaire mais la destruction programmée de l'Université de la recherche publique pouroser un système élitiste payant et soumis aux intérêts privés" (Parlementaire - LFI)

      • "le défaut de choix d'orientation n'est certainement pas la seule explication le changement de cadre de pédagogie et de méthode d'apprentissage joue aussi un rôle important" (Parlementaire)

      • "les étudiants ne choisissent pas toujours les études qui leur correspondent le mieux ce qui se traduit en pratique par un taux d'échec parfois élevé" (Parlementaire)

      • "l'université est le lieu de la transmission du savoir et des humanités pas le laboratoire du wisme ni celui de la violence d'extrême gauche" (Parlementaire)

      • "nous assistons à une dérive idéologique préoccupante [au CNRS] l'initiative ho kitics [...] vise à inciter les utilisateurs du réseau social X à migrer vers des plateformes alternatives jugé plus conformes au stand de modération de l'extrême gauche" (Parlementaire)

      • "Quand des étudiants dénoncent une injustice ils sont immédiatement réprimé et quand l'extrême droite droite propage la haine au cœur de l'université le gouvernement détourne le regard" (Parlementaire - LFI)

      • Enjeux et Perspectives : Le Ministre s'engage à travailler sur des sujets clés comme la réforme des bourses, l'amélioration du logement étudiant, la simplification administrative, la régulation de l'enseignement supérieur privé, l'amélioration de l'orientation, le soutien aux bacheliers professionnels et l'adaptation à l'IA.

      Il réaffirme la nécessité de défendre les valeurs républicaines et académiques face aux tensions et aux violences.

      La question du financement reste centrale, malgré la défense du budget par le Ministre.

      L'articulation entre recherche publique et privée et la modernisation de l'allocation des ressources universitaires sont également des chantiers importants.

    1. Briefing Document : Audition de Rachida Dati sur les Violences dans le Secteur de la Culture

      Ce document résume les principaux points soulevés par la Ministre de la Culture, Rachida Dati, lors de son audition concernant le rôle de son ministère face aux violences sexuelles et sexistes dans le secteur de la culture. Il met en évidence les enjeux, les mesures en cours et les pistes de réflexion pour l'avenir.

      Thèmes Principaux :

      La Rupture avec la Loi du Silence : La Ministre insiste sur la nécessité collective de briser la loi du silence qui a longtemps perduré dans le secteur culturel et au-delà. Elle reconnaît que cette loi du silence a permis aux violences de prospérer. "être ici aujourd'hui avec vous marque notre engagement collectif a brisé enfin la loi du silence cette loi du silence qui a longtemps duré" "ce contexte a a toujours existé mais ça n'a jamais été dénoncé avec cette force puisque pendant des années on a détourné le regard" La Culture n'est pas un Cas Particulier (mais présente des Facteurs de Risque) : Bien que les violences sexuelles et sexistes concernent l'ensemble de la société, le milieu de la culture présente des spécificités (porosité entre vie personnelle et professionnelle, précarité, importance du réseau) qui peuvent favoriser les abus de pouvoir et banaliser les situations d'emprise. "le milieu de la culture ne peut pas s'exonérer de regarder évidemment en face ces questions" "il peut y avoir une porosité parfois même une confusion entre la vie personnelle la vie professionnelle entre l'intimité et le travail oui le corps et la sensibilité sont beaucoup plus exposés que dans n'importe n'importe quel autre secteur la précarité la nécessité d'être reconnu et de faire partie de cette famille du cinéma... sont aussi des contextes qui favorisent les abus de pouvoir qui banalisent des situations d'emprise" L'Ancienne Époque comme Prétexte Inacceptable : La Ministre réfute l'argument selon lequel les violences anciennes seraient excusables car relevant d'une "autre époque" ou d'un "autre contexte". Elle rappelle que le viol et les agressions sexuelles étaient déjà pénalement réprimés par le Code pénal dans les années 70 et 80. La différence réside dans la complaisance et le manque de dénonciation. "pour moi on ne peut pas se contenter de dire évidemment c'était une autre époque" "est-ce queà cette époque la le viol les agressions sexuelles était pénalement réprimé oui c'était dans le code pénal mais on a laissé les tribunes prospérer on a laissé les faits prospérer on les a non seulement expliqué on les a parfois excusé pour ne pas dire encouragé" Le Mouvement #MeToo et le Point de Bascule : Le mouvement #MeToo, initié par l'affaire Weinstein, a marqué un point de bascule, notamment dans le milieu du cinéma et plus largement dans le secteur culturel. Il a permis une libération de la parole et une prise de conscience, bien que la Ministre suggère que son succès ait pu être influencé par la catégorie sociale des personnes impliquées. "l'affaire Weinstein porté par le mouvement Mitou a marqué un point de bascule dans le milieu du cinéma et très vite dans tous les secteurs du spectacle de la musique et de la création dans son ensemble" "ce mouvement Mitou il a été dénoncé j'allais dire par par une certaine catégorie sociale je pense que si ça avait été une autre classe sociale je pense que ce mouvement n'aurait pas eu le succès" La Nécessité d'une Action Collective et Vigile : Malgré les progrès réalisés, la lutte contre les violences sexistes et sexuelles est loin d'être terminée. La Ministre appelle à une vigilance constante, à l'amélioration des dispositifs existants et à une mobilisation collective et responsable. "on pense queon n jamais c'est jamais terminé et donc il faut rester vigilant et évidemment faut améliorer les dispositifs pour ne pas dire les reprendre les et encore une fois les amplifier" "la lutte contre les violences sexistes et sexuelles reste un combat mené collectivement en responsabilité et nous sommes au début d'un processus car je pense que beaucoup de femmes et de mineurs se taisent encore" La Libération de la Parole et l'Augmentation des Chiffres : L'augmentation du nombre de signalements et de cas accompagnés est perçue comme une preuve que les actions menées portent leurs fruits et que la parole se libère. Cette augmentation est une étape nécessaire pour regarder la réalité en face. "Les chiffres vous les avez et je ne vais pas revenir dessus vous me direz que les chiffres augmentent c'est vrai mais c'est aussi la preuve que les actions menées aussi portent leurs fruit la parole se libère ces chiffres vont continuer d'augmenter forcément en acceptant de regarder en face la réalité d'entendre les victimes parce que de plus en plus de cas vont sortir" Le Plan de Lutte du Ministère : Un nouveau plan de lutte contre les violences et le harcèlement sexuel et sexiste sera présenté avant le 8 mars. Ce plan s'adresse au Ministère et à ses établissements, aux structures professionnelles, aux politiques culturelles et aux conditions de travail des mineurs. "j'ai demandé à l'administration de travailler sur un nouveau plan de lutte contre les violences et le sement sexuel et sexiste qui va être représenté avant le 8 mars prochain" Les Levier d'Action du Ministère : La Ministre identifie plusieurs leviers pour agir : La Subvention et les Labels : Possibilité de retirer les aides financières si les mesures préconisées ne sont pas mises en place. Les Textes Réglementaires : Intégration de la lutte contre les VSS dans un maximum de textes, comme l'arrêté sur la santé et la sécurité dans la production cinématographique. Les Licences : Conditionner l'obtention ou le renouvellement des licences d'entrepreneur de spectacle vivant à la formation aux VSS. L'Inspection du Travail : Améliorer l'accès des inspecteurs aux lieux de tournage et de spectacle pour des contrôles inopinés. Le CNC collabore à la rédaction d'un guide de contrôle spécifique. La Formation : Rendre obligatoire la formation aux VSS pour les équipes de tournage, les dirigeants de festivals, les équipes du spectacle vivant et de la musique, et renforcer la connaissance du droit du travail dans les écoles supérieures du ministère. Sensibilisation des présidents d'association. "il y a le levier de la subvention et des labels VO ce qui est attribué peut être retiré et peut être retiré immédiatement si les mesures préconisées ne sont pas évidemment mises en place" "La formation au violence sexiste et sexuelle doit conditionner l'obtention ou le renouvellement de ces licences" "il faudrait qu'on ait en amont le maximum d'information pour pouvoir avoir des contrôles inopinés" "il faut absolument renforcer la connaissance du droit du travail avec des modules obligatoires comme on a sur la santé la sécurité au travail il faudrait rendre ces modules obligatoires dans le cadre et le cas des écoles supérieures qui dépendent du ministère de la Culture" L'Amélioration de l'Accompagnement des Victimes et du Suivi des Signalements : La cellule d'écoute Audience, soutenue par le Ministère, est un dispositif central qui offre un accompagnement juridique et psychologique. Le Ministère souhaite renforcer ce dispositif en élargissant ses horaires et ses attributions (aide à la rédaction de plainte, extension à d'autres secteurs comme la presse). Un meilleur suivi des cas signalés au sein du Ministère est également prévu. "c'est pour répondre à la nécessité d'un espace d' sécure lespè que la fésac et les cinq confédérations syndicales du secteur ont porté cette initiative de cellule d'écoute" "l'élargissement de ces horaires d'ouverture ça sera 2 he 2 heures supplémentaires par jour et de ses attributions notamment à l'accompagnement à la rédaction d'une plainte auprès du Procureur de la République par des avocats mais aussi son extension à de nouveaux secteurs comme celui de la presse dès début 2025" "je veux que le sui des cas signalés soit mieux tracé encore" La Lutte Contre l'Arbitraire et la Nécessité d'Enquêtes Internes Certifiées : La Ministre souligne le problème des enquêtes internes qui peuvent parfois être menées sans processus clair ou objectif, voire pour dédouaner les employeurs. Elle souhaite la certification des organismes réalisant ces enquêtes pour garantir leur impartialité. "on a un problème d'enquête interne ou sur les conclusions de ces enquêtes internes parfois il des enquêtes internes et on a un cas très précis où la victime n'a jamais été entendu" "il faudra évidemment qu'on puisse certifier ces organismes qui font ces enquêtes internes surtout quand elles sont externalisées" La Tolérance Zéro et le Rôle de l'État : La Ministre prône une tolérance zéro face aux délits et crimes, refusant l'idée que certaines affaires puissent "s'arranger". Elle affirme sa conviction en un État fort ayant une responsabilité dans la lutte contre ces violences. Elle suggère d'explorer l'élargissement des critères d'intérêt à agir pour permettre à l'État de se constituer partie civile même si la victime n'ose pas porter plainte. "il faudrait une tolérance zéro sur ces sujets parce que sur les articles 40 ce qu'il faut savoir c'est que souvent on dit euh ça peut s'arranger mais il y a des affaires qui enfin il faut que la justice s'en empare on n'arrange pas des choses qui relèvent du délit ou du crime à mon sens surtout sur les atteintes aux personnes" "moi je crois à un État fort voilà et notamment dans dans ce type de sujet moi je seraai assez favorable de revoir les critères d'intérêt à agir" La Protection des Mineurs, une Priorité Absolue : La Ministre met un accent particulier sur la protection des mineurs, souhaitant étendre la réglementation du travail des mineurs jusqu'à 18 ans et non plus 16. Elle souhaite également que le Ministère ait accès au fichier national judiciaire automatisé des auteurs d'infractions sexuelles (FIJAIS). "et plus encore à l'égard des mineurs cette responsabilité elle est globale et donc chacun doit prendre ses responsabilités" "concernant le sujet particulier des dérogations au travail des mineurs de moins de 16 ans moi je suis je suis pour qu'on on aille jusqu'à 18 ans voilà" "j'ai demandé qu'il fasse partie des autorités administratives ayant la possibilité d'accéder aux données du fichier national judiciaire automatisé des auteurs d'infraction sexuelles" La Surveillance des Structures de Formation (Conservatoires, Maîtrises) : Des mesures spécifiques sont envisagées pour les conservatoires (en particulier les non classés) et les maîtrises afin d'imposer des protocoles de signalement, des formations obligatoires et l'élaboration de guides de bonnes pratiques. L'idée d'un label pour les maîtrises est évoquée. "concernant le sujet sur moi j'ai un sujet sur les conservatoires alors sur les conservatoires notamment euh territoriaux non classés" "un cadre strict doit s'appliquer même si la Commission des enfants du spectacle veille aux conditions de travail des jeunes artistes" "je proposais de travailler sur un label pour concernant ces maîtrises qui reposerait sur la présence de personnes formées et qui s'appuie sur un projet pédagogique clair" La Lutte Contre l'Entre-Soi et la Reproduction Sociale : La Ministre dénonce l'entre-soi, la reproduction sociale et le manque de diversité dans les instances de pouvoir et les processus de nomination au sein du ministère et des structures culturelles. "je pense que ce mouvement n'aurait pas eu le succès j' dire malheureuse malheureusement qu'il a eu aujourd'hui il était temps que cette prise de conscience est lieu car je l'ai dit le contraste entre les valeurs défendues par la profession et certains comportements que l'on pouvait y observer devenit intolérable pour la société dans son ensemble" "moi j'ai été très surprise du peu de renouvellement on reprend les mêmes" "c'est aussi ça c'est-à-dire que cette ce ce cette reproduction sociale et sociale d'une certaine catégorie il faudrait ça serait bien qu'elle qu'elle s'arrête un peu mais moi je serai vous je regarderai les process de nomination dans un ministère comme la culture et à tous les niveaux et les conseils d'administration" La Question de l'Homme et de l'Œuvre : La Ministre aborde la difficulté de séparer l'homme de l'artiste, constatant que ce débat semble s'appliquer différemment selon les catégories sociales. Elle estime que l'art ne doit pas servir de bouclier ou d'excuse aux crimes et délits. "tout le sujet c'est aujourd'hui de pouvoir décorréler j'allais dire le statut euh on dit on décorelle l'homme de l'œuvre ou je sais pas quoi de l'artiste je sais pas quoi là" "quand ce sont des artistes ou des milieux très favorisé de pouvoir notamment dans les arts on dit mais ce sont des artistes donc voilà... ce qui est crime et déit devient finalement pas une circonstance atténuante ça devient un art voilà de pouvoir agresser quelqu'un c'est presque artistique" Points Clés et Faits Importants :

      Augmentation de l'accompagnement : Entre 2023 et 2024, la cellule d'écoute Audience a enregistré une augmentation de plus de 50% des accompagnements psychologiques et plus de 77% des conseils juridiques. Manque de Centralisation des Données : La Ministre souligne le manque de centralisation des signalements et des "articles 40" (signalements au procureur) au sein du ministère, rendant difficile le suivi. Articles 40 Déposés : Quatre "articles 40" ont été déposés par le ministère depuis l'arrivée de la Ministre, ce qu'elle juge insuffisant. Accès au FIJAIS : Le Ministère a obtenu l'accès au FIJAIS depuis le 4 décembre 2024. Extension du travail des mineurs : Volonté d'étendre la réglementation du travail des mineurs jusqu'à 18 ans. Création d'une Direction Enseignement et Formation : Une nouvelle direction sera créée au ministère pour avoir des interlocuteurs uniques pour les écoles et y intégrer des modules sur le droit du travail et les VSS. Classes Préparatoires Publiques : Projet de doubler le nombre de classes préparatoires publiques aux écoles d'art pour favoriser la démocratisation. Coordinateurs d'Intimité : Bien que généralisés dans le cinéma, leur obligation sera étudiée pour le spectacle vivant. La contractualisation et la validation des scènes intimes par les comédiens sont jugées favorables. Clause d'Assurance : Existence d'une clause d'assurance interruption de tournage en cas de dénonciation (depuis 2021, non utilisée), avec la volonté de travailler avec les assureurs pour l'améliorer ou la rendre obligatoire. Points Soulevés par le Rapporteur et la Présidente de la Commission (en réaction à la Ministre) :

      Décalage entre Dispositifs et Réalité : Malgré les dispositifs mis en place, les témoignages reçus par la commission montrent un décalage avec la réalité des violences qui semblent systémiques. Manque d'Étude de Victimation : Proposition de lancer une grande étude de victimation pour avoir un état des lieux précis. Suivi des Témoignages d'Audience : Interrogation sur le devenir des témoignages recueillis par Audience et la possibilité d'une assistance juridique. Détection par Signaux Faibles : Suggestion d'enquêtes diligentées en cas de nombreux signalements dans un même lieu. Travail des Enfants : Proposition d'interdire le travail des enfants très jeunes sauf exception, de rendre obligatoire un rapport de fin de tournage par le responsable enfant, de renforcer la relation Tali-Santé-équipes de tournage, et de certifier les accompagnateurs d'enfants. Casting des Enfants : Soulèvement du danger des castings non couverts par la relation de travail et la responsabilité du producteur. Honorabilité des Encadrants : Difficulté pour les associations de contrôler l'honorabilité des intervenants auprès des enfants et suggestion de permettre la consultation du B2 ou l'intervention des services préfectoraux. Méconnaissance du Droit du Travail : Constat d'une méconnaissance importante du droit du travail dans le secteur culturel, et proposition de renforcer sa formation dans les écoles et formations. CHSCT : Constat de l'absence de CHSCT dans le spectacle vivant et la musique, et question de leur mise en place. Registres de Tournage : Proposition d'un registre des lieux de tournage accessible à l'inspection du travail. Quotas de Diversité : Question sur la pertinence des quotas dans les programmations pour favoriser la diversité. Affaire de Fonctionnaire du Ministère : Interrogation sur la reconnaissance par le ministère de défaillances dans l'affaire du fonctionnaire ayant administré des produits éthiques. Ce briefing document offre un aperçu complet des enjeux et des actions envisagées par le Ministère de la Culture pour faire face aux violences sexuelles et sexistes, tout en soulignant les défis persistants et les pistes de réflexion pour l'avenir.

    1. synthèse détaillée des principales thématiques et idées soulevées lors de cette audition, incluant des citations des intervenants.

      Synthèse détaillée de l'audition sur le traitement des signalements de maltraitances sur des enfants placés

      Introduction

      L'audition a réuni des acteurs majeurs du système judiciaire et associatif français pour aborder la problématique cruciale du traitement des signalements de mauvais traitements infligés aux enfants, en s'appuyant notamment sur des affaires dramatiques médiatisées (le petit Tony, le petit Bastien, et le cas de Karine).

      Les intervenants principaux étaient

      • Monsieur Mathieu Bourret (Avocat général près la cour d'appel de Paris, ancien procureur de la République),

      • Madame Magalie La Fourcade (Secrétaire générale de la Commission nationale consultative des droits de l'homme - CNCDH, ancienne juge d'instruction), et

      • Mesdames Laurence Brunet Jambu et Karine (respectivement tante et victime dans une affaire de maltraitance).

      L'objectif n'était pas de revenir en détail sur chaque affaire en raison du secret de l'instruction, mais d'en tirer des leçons pour proposer des réformes structurelles.

      Principales Thématiques et Idées

      • La problématique du traitement des signalements :
      • Les affaires présentées (Tony, Bastien, Karine) démontrent des défaillances dans le système de traitement des signalements, malgré la présence de multiples informations préoccupantes.
      • Dans l'affaire Bastien, il est apparu que la famille était suivie par les services sociaux départementaux, mais qu'un message d'alerte crucial du père n'avait pas été écouté. Mme La Fourcade note que cela "montre à quel point déjà entrer dans l'identification d'une mort violente n'est pas toujours aisé".
      • Dans le cas de Karine, malgré un "nombre inconsidérable de signalements" et un suivi, elle n'a pas été protégée. Mme Brunet Jambu souligne que "tout le monde savait ce que vivait Karine".
      • Les dysfonctionnements systémiques :
      • Les intervenants s'accordent à dire que le système présente des dysfonctionnements structurels, pas seulement des cas isolés. Mme Brunet Jambu affirme que "ça dysfonctionne ça dysfonctionne beaucoup et on sait qu'il faudrait peu de chososees pour que ça fonctionne en fait c'est pas une question de moyen c'est pas une question de tout mettre enfin de de tout casser c'est juste d'améliorer nos pratiques".
      • Plusieurs aspects du système sont remis en question :
      • Les mesures d'assistance éducative en milieu ouvert (AEMO) : La rapporteure de la commission d'enquête s'interroge sur l'efficacité des AEMO, affirmant par expérience que "une majorité une majorité des mesures avec un budget colossal en millions d'euros finissait par des opp parquet". Les visites ponctuelles et annoncées sont pointées du doigt comme inefficaces ("quand le jour où la personne venait il y avait ce qu'il fallait").
      • La circulation de l'information : Des informations cruciales (rapports de TISF, messages téléphoniques) ne parviennent pas toujours aux bonnes personnes ou ne sont pas intégrées aux dossiers des juges pour enfants. Mme La Fourcade insiste sur l'importance de "veiller à ce que les informations puissent être puissent circuler au sein d'un service de façon à ce que un message soit toujours entendu".
      • Le rôle et la charge des travailleurs sociaux : Les travailleurs sociaux sont souvent en surcharge de travail ("ils ont un nombre d'enfants un nombre de dossiers ils peuvent pas gérer tout c'est juste pas possible pour eux"). La rapporteure mentionne un système "en saturation totale" avec des "sureffectifs partout".
      • La complexité des auditions et le manque de coordination : Trop d'intervenants peuvent être présents aux audiences sans que la personne qui connaît le mieux l'enfant (comme la TISF) ne soit entendue. Mme Brunet Jambu demande : "est-ce qu'il faut pas à un moment donné se poser la bonne question qui doit être à l'audience en fait bah pour moi c'est la personne qui accompagne l'enfant qui le voit tous les jours".
      • Les obstacles au signalement :
      • Monsieur Bourret met en lumière les difficultés et les peurs liées au signalement : "il était en réalité extrêmement difficile de procéder à un signalement".
      • Les raisons invoquées incluent l'indifférence, la peur des représailles ("ce qui peut arriver quand on va ou qu'on doit témoigner contre quelqu'un qui est par ailleurs violent"), la rupture du lien avec la famille ou le voisinage, et même une certaine forme d'optimisme dangereux ("faire un Paris sur l'intelligence ou un Paris sur l'avenir").
      • La pénalisation de la non-dénonciation a été discutée, avec des associations de protection de l'enfance arguant que cela pourrait décourager la parole à l'avenir ("vous ne libérerez jamais aucune espèce de parole").
      • Le statut juridique de l'enfant et le regard de la société :
      • Monsieur Bourret soulève la question du statut juridique du mineur dans le Code civil, le considérant encore comme étant "sous la tutelle et l'autorité et la responsabilité de ses parents" malgré l'évolution sociale. Il suggère une réflexion sur un "véritable statut dans le Code civil".
      • Madame La Fourcade insiste sur la nécessité de "changer nous notre regard sur les les enfants en arrêtant de les voir comme des à mi-chemin entre une personne et autre choses et donc avec des droits en mode dégradé". Elle appelle à "avoir une approche risque par les vulnérabilités" en s'inspirant des sciences humaines et sociales et des approches utilisées pour les violences faites aux femmes.
      • Karine témoigne du manque d'écoute des enfants : "on m'a pas écouté et j'ai pas eu la possibilité d'être écouté". Madame La Fourcade rappelle le droit à la participation de l'enfant selon la Convention internationale des droits de l'enfant : "les enfants doivent être associés aux décisions qui les concernent".
      • La distinction faite par certains entre la protection des jeunes enfants (moins de 5 ans) et celle des enfants plus âgés capables de s'exprimer est jugée "honteux" et "condamnable" par Karine et Mme La Fourcade, qui soulignent que "on le protège c'est tout il y a pas d'autres questions à se poser on protège on le protège".
      • Propositions de réformes et pistes de réflexion :
      • Ouvrir le champ des possibles pour les juges des enfants : Dépasser le choix binaire entre placement et AEMO pour proposer des mesures intermédiaires renforcées.
      • Visites inopinées : Permettre des visites non annoncées par les travailleurs sociaux dans les familles suivies, sur le modèle des inspections de lieux de privation de liberté. Mme La Fourcade estime que "il faut pouvoir aller faire des visites inopiné".
      • Améliorer la circulation de l'information : Garantir que les messages d'alerte soient entendus et que les rapports des intervenants au quotidien (TISF) soient systématiquement intégrés aux dossiers judiciaires.
      • Repenser le lieu de protection : Mme La Fourcade propose l'idée audacieuse que, dans certains cas, ce soient les parents violents qui quittent le domicile plutôt que l'enfant, ou qu'un tiers digne de confiance vienne vivre au domicile. Elle s'inspire de la possibilité de l'éviction du conjoint violent pour les victimes de violences conjugales.
      • Avocat pour l'enfant : Rendre obligatoire la désignation d'un avocat pour l'enfant dès la suspicion de violence ou de maltraitance, au civil comme au pénal, pour garantir une écoute et une représentation effective de ses intérêts.
      • Ordonnance de protection pour enfants : Créer un mécanisme similaire à l'ordonnance de protection pour les victimes de violences familiales, permettant une saisine rapide du juge en cas de danger imminent.
      • Soutien aux tiers dignes de confiance : Mettre en place une allocation spécifique et simplifier les démarches administratives pour les tiers dignes de confiance accueillant un enfant, afin de lever les freins financiers et logistiques. Mme Brunet Jambu demande : "est-ce qu'il faut pas changer juste ça dire bah ok le tir digne de confiance on va lui attribuer une allocation spécifique".
      • Formation des professionnels : Renforcer et systématiser la formation de tous les professionnels en contact avec les enfants (magistrats, travailleurs sociaux, enseignants, personnel médical) pour améliorer la détection et la compréhension des signaux de maltraitance (ex: syndrome du bébé secoué, programmes importés du Québec comme Caliop et Lanterne). Mme Brunet Jambu insiste : "c'est la formation des professionnels du matin au soir la formation La formation".
      • Prévention : Remettre la prévention au cœur des politiques publiques, en soutenant la parentalité et en aidant les familles en difficulté. Mme Brunet Jambu estime qu'"il faut remettre de la prévention qu'il faut aussi aider certains parents".
      • Sensibilisation de la population : Inclure la sensibilisation aux gestes de signalement (appeler le numéro vert, saisir le CRIP) dans les formations aux premiers secours, pour faire du signalement un réflexe citoyen. Monsieur Bourret propose d'intégrer cela aux formations dispensées par des organismes comme la Croix-Rouge.
      • Collecte de données fiables : Améliorer le comptage et l'évaluation des morts violentes d'enfants, en envisageant notamment des autopsies plus systématiques, même en cas de morts apparemment accidentelles.
      • Responsabilité (y compris politique) : Interroger la responsabilité collective de la société en cas de défaillance et envisager des mécanismes de responsabilité sans intentionnalité pour les élus ou responsables dont les choix ont des conséquences graves sur la protection de l'enfance (bien que Monsieur Bourret souligne l'absence de responsabilité pénale collective en droit français).
      • Changer le récit : Cesser de considérer les morts violentes d'enfants comme de simples "faits divers" et reconnaître leur caractère structurel, sur le modèle du traitement des féminicides.
      • Les réussites et les espoirs :
      • Malgré les difficultés, des succès existent, comme la condamnation dans les affaires citées. Mme Brunet Jambu souligne que "quand la justice passe elle est belle".
      • Des initiatives positives existent, comme les programmes importés du Québec par l'association Le Partage, qui montrent des résultats concrets (zéro récidive pour les auteurs mineurs pris en charge).
      • L'optimisme reste nécessaire. Monsieur Bourret conclut sur la capacité d'adaptation et l'imagination des acteurs de la justice pour mettre en œuvre des décisions même avec peu de moyens, partageant un "sentiment positif et d'optimisme".

      Conclusion

      • L'audition a mis en évidence des lacunes importantes dans le système français de protection de l'enfance, notamment dans le traitement des signalements et le suivi des familles.

      Les témoignages poignants ont souligné la souffrance des enfants et la nécessité d'un changement de paradigme, passant d'une approche administrative et parfois trop verticale à une approche centrée sur l'enfant, ses besoins primaires, son écoute et la reconnaissance pleine et entière de ses droits.

      Des pistes de réformes concrètes ont été proposées, allant de l'amélioration de la circulation de l'information à la refonte des mesures de protection et au renforcement de la formation et de la sensibilisation de tous les acteurs impliqués, y compris le grand public.

      La question de la responsabilité de la société et des décideurs face à ces drames a également été fortement soulevée.

    1. Briefing Document : Examen des actions de l'Agence Nationale du Sport

      Ce briefing document résume les points clés abordés lors de l'audition de Madame Marie-Amélie Le Fur, nouvelle Présidente de l'Agence Nationale du Sport, et de Monsieur Frédéric Sanaur, Directeur Général, par la commission. Cette audition, inscrite dans le Code du Sport (article L 112-1), a permis d'évaluer les actions et les moyens de l'agence.

      Thèmes Principaux :

      • Double Mission de l'ANS : L'ANS a deux missions principales : le soutien au sport de haut niveau (notamment olympique et paralympique) et le développement de l'accès à la pratique sportive pour tous.
      • Gouvernance Partagée : L'agence est fondée sur une gouvernance partagée impliquant quatre collèges : le mouvement sportif, les collectivités territoriales, le monde économique, et l'État. Un changement notable en novembre 2024 a vu la présidence confiée pour la première fois à une personne issue du mouvement sportif.
      • Soutien au Sport de Haut Niveau : L'ANS alloue d'importants moyens financiers (126 millions d'euros en 2023, dont 110 millions hors équipement) pour soutenir les athlètes, l'encadrement technique, la data et la recherche, et la préparation paralympique. Les Jeux Olympiques et Paralympiques de Paris 2024 ont été un focus majeur, avec des initiatives comme les maisons régionales de performance et le programme "Gagner en France". L'objectif est de maintenir cette ambition pour les futurs Jeux.
      • Développement des Pratiques Sportives : L'ANS investit également massivement dans le développement de l'accès au sport pour tous (284 millions d'euros en 2023, dont 165 millions pour les équipements). Les actions incluent le soutien aux projets sportifs fédéraux, à l'emploi associatif (plus de 5000 emplois soutenus en 2023), et au plan 5000 équipements (plus de 3000 projets subventionnés en 2023). L'enjeu majeur est la réponse à l'augmentation des demandes de pratique post-JO, limitée par le manque d'équipements et de créneaux.
      • Situation Budgétaire et Financements : La question des moyens de l'ANS est une préoccupation majeure. Une diminution significative des crédits est prévue dans le projet de loi de finances pour 2025, s'ajoutant à un déficit de 71,5 millions d'euros en 2023. Le recours aux taxes affectées (comme la taxe Buffet sur les droits de diffusion) et le développement des financements privés (12 millions d'euros en 2023) sont discutés comme moyens de compenser cette baisse.
      • Déclinaison Territoriale : L'ANS s'appuie sur des conférences régionales du sport (19) et des conférences régionales des financeurs (14) pour diagnostiquer les besoins et cofinancer les actions au niveau local. Le fonctionnement inégal de ces structures est reconnu et fait l'objet d'une évaluation pour améliorer leur efficacité.
      • Enjeux Sociaux et Sociétaux du Sport : L'ANS s'engage de plus en plus en faveur de l'innovation sociale par le sport, avec des actions financées via des appels à projets comme "Impact 2024", abordant l'inclusion, l'égalité, la santé, l'éducation, la citoyenneté et le développement durable. La lutte contre la délinquance et la promotion de l'éthique dans le sport sont également prioritaires.
      • Points de Débat et Critiques : Des questions ont été soulevées concernant la visibilité et la lisibilité de l'action de l'ANS, notamment au niveau local, et la potentielle redondance avec les missions du Ministère des Sports. La baisse des moyens pour des publics spécifiques (personnes en situation de handicap, femmes, jeunes filles, sport santé) et l'impact de la baisse budgétaire sur l'héritage des Jeux sont des préoccupations. La gestion de la taxe Buffet et le financement des équipements structurants sont également des sujets de discussion.

      Idées et Faits Importants, avec Citations :

      • Mission double de l'ANS : "le soutien aux sports de haut niveau en particulier dans les disciplines olympique et paralympique et d'autre part le développement de l'accès à la pratique sportive pour tous"
      • Budget 2023 : L'ANS a "dépensé 280 millions d'euros" pour le développement de l'accès à la pratique sportive pour tous en 2023.
      • Déficit 2023 : L'ANS a "enregistré en 2023 un déficit de 71,5 million d'euros".
      • Baisse de crédits prévue en 2025 : La loi de finance pour 2025 devrait "acter un fort recul des crédits au profit de votre établissement". Initialement, le PLF envisageait un financement de 320 millions d'euros, "en recul de plus de 100 millions d'euros par rapport à 2024". Les discussions récentes pourraient amener une "demande de contribution du budget des sport complémentaire à hauteur de 34 millions d'euros supplémentaires ce qui amène une projection à date de diminution du budget de sport de 33 %".
      • Taxe Buffet : En 2023, elle a représenté "60 millions d’euros de sources pour l’ANS". Des inquiétudes sont exprimées quant à d'éventuels contournements via des contrats de sponsoring. Le rendement a été "que de 48 millions d’euros en 2023 contre 60 espérés".
      • Financements privés : Ils représentaient "12 millions d’euros soit moins de 3 % de vos ressources" en 2023, mais 18 millions en 2024. L'objectif est de les développer.
      • Nouvelle gouvernance : Pour la première fois, la présidence a été attribuée à une "candidature du mouvement sportif".
      • Ambition post-Paris 2024 : "nous sommes déjà à date puisque la page des jeux de 2024 a été tournée été tournée projeté vers une ambition de 2030 2032".
      • Objectif d'augmentation des pratiquants : Continuer à "aller d’ici la l’horizon 2027 chercher de nouveau 3 millions de pratiquants supplémentaires" (objectif de 3 millions pour Paris 2024 atteint dès 2023).
      • Budget haute performance 2023 : "126 millions d’euros ont été attribués sur ce volet haute performance".
      • Soutien direct aux athlètes : Sur le budget haute performance, "14 millions qui étaient rétribués directement aux athlètes" en 2023. Objectif de garantir à chaque athlète du cercle haute performance "40000 € de revenus".
      • Plan 5000 équipements : "en 2023 au titre de de ce plan 5000 équipements ce sont plus de 3000 projets qui ont été subventionné à hauteur de 108000 [108] millions d’euros".
      • Augmentation des demandes de licence post-JO : "près de 20 % de hausse de demande de licence qui malheureusement pour des raisons de disponibilité des équipements... ne peuvent pas être totalement honorés".
      • Engagement sur l'éthique : "nous avions également à l’échelle de l’agence décidé de consacrer plus de 12 millions d’euros aux actions en faveur de l’éthique et de la lutte de la contre la délinquance dans le sport" en 2023.
      • Rationalisation des appels à projets : Réduction du nombre d'appels à projets "de 16 à 7".
      • Efficacité de l'investissement sportif : "quand vous investissez 1 € dans le sport c’est 13 € qui sont économisé par ailleurs".
      • Évolution du budget de l'ANS : Le budget est passé de "2610 millions d’euros" en 2019 à "445 millions d’euros" en 2024.
      • Financement des équipements par l'ANS vs CNDS : L'investissement de l'ANS depuis 2019 représente "à peu près 20 années de CNDS en 6 ans".

      Enjeux Futurs :

      • Naviguer dans un contexte budgétaire contraint tout en maintenant les ambitions de l'agence.
      • Améliorer la gouvernance partagée et la déclinaison territoriale pour une action plus efficace et lisible.
      • Renforcer les partenariats privés et publics pour diversifier les sources de financement.
      • Accompagner la demande croissante de pratique sportive post-JO en résolvant notamment la question de l'accès aux équipements.
      • Poursuivre et intensifier l'engagement en faveur de l'innovation sociale par le sport et des publics prioritaires.
      • Clarifier et optimiser l'articulation avec le Ministère des Sports.
      • Anticiper l'organisation des Jeux Olympiques et Paralympiques d'hiver de 2030 et maintenir l'ambition de haute performance.

      En conclusion, l'Agence Nationale du Sport se trouve à un tournant, cherchant à consolider son modèle récent et à répondre aux défis budgétaires et organisationnels, tout en capitalisant sur la dynamique post-Jeux de Paris 2024 pour continuer à développer le sport en France, tant au niveau de la haute performance que de l'accès pour tous.

      La situation budgétaire actuelle représente une menace significative pour la réalisation de ces ambitions.

    1. Note de synthèse : Examen d’un rapport d’information sur la définition pénale du viol

      Introduction :

      • Le rapport d'information examine la définition pénale actuelle du viol en France et propose des réformes pour répondre aux lacunes constatées et à l'impunité persistante du crime de viol.

      Il met en lumière l'ampleur des violences sexuelles, les obstacles rencontrés par les victimes dans leur parcours judiciaire et la nécessité d'adapter la loi aux réalités des situations vécues.

      Le rapport s'inscrit dans le contexte du mouvement #MeToo, de la reconnaissance de la culture du viol et des engagements internationaux de la France, notamment la Convention d'Istanbul.

      Principaux thèmes et idées clés :

      • L'impunité généralisée du viol : Le rapport souligne le taux extrêmement faible de condamnations pour viol en France, le qualifiant de "sommet de l'iceberg".

      Seulement 6% des victimes de violences sexuelles signalent les faits aux forces de l'ordre. Parmi celles qui se présentent, 4 cas sur 10 ne déposent finalement pas plainte.

      Près de 80% des plaintes déposées sont classées sans suite. En fin de compte, "c'est 1 % des viols qui seront sanctionnés par une condamnation des auteurs aux assises.

      Le crime de viol est donc dans 99 % des cas impuni."

      Aucun autre crime n'a un taux de sanction aussi faible.

      • Les obstacles au signalement et à la poursuite : Plusieurs facteurs expliquent cette impunité. Outre les difficultés matérielles (crime intime, souvent sans témoins, temps jouant contre les victimes), le rapport pointe du doigt les freins culturels, notamment la "culture du viol qui tend à tolérer finalement ou à justifier des violences sexuelles et partout dans la société".

      La définition légale actuelle est également identifiée comme une cause majeure. * La définition légale actuelle et ses limites : Le viol est actuellement défini par trois critères : * L'élément légal (l'infraction définie par la loi). * L'élément matériel : "l'acte de pénétration sexuel [...] en exploitant tel que c'est décrit aujourd'hui une contrainte une menace ou une surprise". * L'élément intentionnel ou moral : l'intention consciente de l'auteur d'exploiter ces conditions. Le rapport critique principalement la dépendance excessive aux critères de "violence, contrainte, menace ou surprise". Lorsque les juges ne parviennent pas à démontrer l'usage de ces éléments et l'intentionnalité, ils "privent ainsi sa décision de base légale", entraînant des classements sans suite, notamment dans les cas de "sidération traumatique ou de situation de vulnérabilité des cas qui sont majoritaires où il n'y a ni violence ni contrainte". Dans ces situations (où la victime dit "non" mais ne se débat pas, est paralysée par la peur, est sous l'influence de substances, ou dépend de l'agresseur), "il n'y aura pas de poursuite et pas de condamnation". * Le droit ne remplit pas sa fonction protectrice et expressive : * Fonction protectrice : Le faible taux de condamnation "dissuade les victimes de porter plainte". Les victimes qui ont le courage de porter plainte subissent souvent une "forme de traumatisation secondaire" par le système judiciaire. Les interrogatoires et investigations se concentrent sur la victime pour prouver la coercition, remettant en question son passé sexuel et retournant sa vulnérabilité contre elle. Les enquêteurs et experts sont peu formés au psychotraumatisme. Un magistrat, François Lavallière, a témoigné qu'il lui arrivait de dire aux victimes : "Madame, je vous crois, cependant je ne pouvais au regard des éléments recueillis dans la procédure et des exigences imposées par la loi condamner l'auteur." Il a même ajouté qu'il hésiterait à inviter une victime à porter plainte dans l'état actuel du droit. La journaliste Julia Féres a également exprimé son incertitude quant à conseiller à une victime de porter plainte, soulignant les coûts (financiers, émotionnels) et les faibles chances de succès. * Fonction expressive : La rédaction actuelle de la loi ne communique pas clairement l'importance du consentement. Elle contribue au maintien d'un "malentendu sociétal sur ce qu'est une bonne victime, un vrai viol et sur la présomption de consentement ou de disponibilité sexuelle des victimes". Le principe selon lequel "toute personne initiant un contact sexuel devrait s'assurer du consentement libre et éclairé de l'autre" et que le consentement garantit la liberté de choix à tout moment n'est pas clairement énoncé. Le manque d'éducation à la vie affective, relationnelle et sexuelle et à une culture du consentement en France alimente la culture du viol, comme l'indiquent des chiffres alarmants sur les croyances sexistes chez les jeunes et l'exposition précoce à la pornographie violente. * La nécessité de réformer la définition pénale du viol et l'introduction du consentement : Le rapport soutient fortement la nécessité de modifier la définition pénale du viol pour intégrer la notion de non-consentement. Cela contribuerait à "changer de paradigme et remettre en question les préjugés sexistes". Bien que la modification de la loi ne résolve pas à elle seule tous les problèmes, elle est considérée comme une "première réponse" et une "avancée" pour faciliter les poursuites pénales et dans l'effort d'éducation collective. * Alignement sur les normes internationales et les exemples européens : La réforme est jugée nécessaire pour aligner le droit français sur ses engagements internationaux, en particulier la Convention d'Istanbul. Le GREVIO (instance de contrôle de l'application de la Convention d'Istanbul) a souligné dès 2019 que la définition juridique des infractions sexuelles en France n'est "pas fondée de manière explicite sur le consentement libre et non équivoque de la victime". Quinze pays européens ont déjà adopté une définition du viol basée sur l'absence de consentement, avec des exemples récents en Espagne, en Belgique et en Suède, où les premiers retours suggèrent une augmentation du nombre de plaintes suite aux changements législatifs. * Proposition de modification en quatre axes : Les rapporteurs proposent une modification de la définition pénale du viol selon quatre axes principaux : * Introduire la notion de "non consentement" comme critère essentiel. * Maintenir les quatre critères actuels (violence, contrainte, menace, surprise) car ils mettent en valeur certaines stratégies des auteurs et ont créé une jurisprudence précieuse. * Préciser la manière dont le non-consentement sera apprécié "au regard des circonstances environnantes", offrant un outil supplémentaire aux juges et enquêteurs pour interroger la personne mise en cause sur les "mesures raisonnables" prises pour recueillir le consentement. * Préciser les cas où le consentement "ne saurait être déduit", prenant en compte notamment les situations de sidération (qui concerneraient 77% des cas selon certaines auditions) et de vulnérabilités temporaires (sous emprise de substance, emprise psychologique). * Garanties et perspectives : La rédaction proposée garantirait le respect des grands principes du droit français, tels que la présomption d'innocence et la non-rétroactivité de la loi. Les rapporteurs soulignent que cette réforme est une "première pierre que nous jetons face au mur de l'impunité". Ils réaffirment que d'autres mesures sont indispensables : plus de formation pour les forces de police et de justice, plus de moyens pour les institutions et les associations, et une prévention majeure par l'éducation à la vie sexuelle et affective pour combattre la culture du viol. L'inscription de la proposition de loi est attendue en semaine transpartisane fin mars/début avril, marquant le début du parcours législatif.

      Citations clés :

      • "c'est 1 % des viols qui seront sanctionnés par une condamnation des auteurs aux assises. Le crime de viol est donc dans 99 % des cas impuni."
      • "la culture du viol qui tend à tolérer finalement ou à justifier des violences sexuelles et partout dans la société"
      • "l'acte de pénétration sexuel [...] en exploitant tel que c'est décrit aujourd'hui une contrainte une menace ou une surprise"
      • "Si le juge se borne à caractériser un viol ou une agression sexuelle sans arriver à démontrer l'usage de la violence contrainte menace ou la surprise et l'intentionnalité, il prive ainsi sa décision de base légale."
      • "dans les cas de sidération traumatique ou de situation de vulnérabilité des cas qui sont majoritaires où il n'y a ni violence ni contrainte [...] il n'y aura pas de poursuite et pas de condamnation."
      • "Le taux de condamnation il est en réalité si faible qu'il en dissuide les victimes de porter plainte"
      • François Lavallière : "Madame, je vous crois, cependant je ne pouvais au regard des éléments recueillis dans la procédure et des exigences imposées par la loi condamner l'auteur."
      • Julia Féres : "Au début lorsqu'on me demandait si une victime devait porter plainte je répondais oui aujourd'hui je réponds que je ne suis pas sûre"
      • "Les victimes ne sont pas suffisamment protégées elles peuvent subir une forme de traumatisation secondaire lorsqu'elles ont le courage de porter plainte"
      • "La rédaction actuelle contribue au maintien d'un d'un malentendu sociétal sur ce qu'est une bonne victime un vrai viol et sur la présomption de consentement ou de disponibilité sexuelle des victimes"
      • "toute personne initiant un contact sexuel devrait s'assurer du consentement libre et éclairé de l'autre"
      • "L'inscription du consentement dans la définition pénale du viol doit contribuer à changer de paradigme et remettre en question les préjugés sexistes dans les relations affectives et sexuelles"
      • GREVIO : "en France la définition juridique des infractions sexuelles n'est pas fondée de manière explicite sur le consentement libre et non équivoque de la victime"
      • "nous sommes convaincus qu'il est nécessaire de réformer la définition pénale du viol et que c'est une première étape dans un changement de paradigme pour passer de la culture du viol à la culture du consentement"
      • "introduire la notion de non consentement tout simplement"
      • "nous maintenons les quatre critères actuels c'est quelque chose dont nous avons acquis la conviction très rapidement lors de nos auditions il ne s'agit pas de supprimer les critères de violence contrainte menace et surprise"
      • "Préciser la façon dont le non consentement va être apprécié au regard des circonstances environnantes"
      • "nous voulons préciser le cas les cas où le consentement ne serait être déduit et là on revient à l'exemple de la sidération"
      • "modifier la loi modifier la définition du viol dans le code pénal c'est une première pierre que nous jetons face au mur de l'impunité dans ce pays"

      Conclusion :

      Le rapport dépeint un tableau alarmant de l'impunité du viol en France, largement due à une définition légale obsolète qui ne prend pas en compte la réalité de nombreuses victimes, notamment en cas de sidération.

      L'intégration du consentement dans la définition pénale est présentée comme une étape cruciale et nécessaire, alignée sur les standards internationaux et les évolutions législatives d'autres pays européens.

      Cette réforme, couplée à des efforts accrus en matière de moyens judiciaires, de formation des professionnels et d'éducation au consentement, est jugée indispensable pour mieux protéger les victimes, combattre la culture du viol et assurer une justice plus équitable.

    1. Compte rendu d'audition: Violences dans la culture - Me Anne Bouillon et Me Violaine de Filippis-Abate

      Date: [Date de l'audition - non précisée dans les extraits]

      Intervenantes: Me Anne Bouillon, Me Violaine de Filippis-Abate (Avocates spécialisées dans la lutte contre les violences faites aux femmes)

      Commission d'enquête: Violences commises contre les mineurs et les majeurs dans le secteur du cinéma, de l'audiovisuel et du spectacle vivant.

      Résumé:

      Cette audition a mis en lumière la persistance des violences sexistes et sexuelles dans le milieu culturel, notamment dans le cinéma et l'audiovisuel, tout en soulignant leur caractère systémique qui s'étend à d'autres sphères de la société.

      Les avocates ont insisté sur la nécessité de reconnaître l'existence de ces violences, de comprendre les mécanismes de domination à l'œuvre, d'améliorer la prise en charge de la parole des victimes et de repenser les procédures judiciaires et les mesures de protection.

      Thèmes Principaux et Idées Clés:

      Le caractère systémique et massif des violences: * Les intervenantes affirment que les violences faites aux femmes et aux enfants sont "non seulement massives, mais [qu'elles] structurent profondément qui nous sommes". * Bien que le milieu culturel présente des spécificités ("emprise, chantage, omerta, verticalité des hiérarchies", prétexte de la création artistique), les mécanismes de domination sont universels et se retrouvent dans tous les milieux (politique, entreprises, syndicats, famille, couple). * Ces violences sont "inhérentes à notre organisation sociale" et "une condition de la perpétuation de notre système".

      Les mécanismes de domination et l'omerta:

      • Un point central est la "profonde conviction", voire la "profonde croyance" des auteurs de violences quant à leur "légitimité au passage à l'acte violent".
      • Cette conviction est renforcée par une "construction sociale hiérarchisée" où l'appartenance à la catégorie "homme" (perçue comme supérieure) autorise à "ne pas s'empêcher" (d'abuser, de violenter, de violer, de tuer).
      • Dans le milieu culturel, ce phénomène est accentué par le statut "d'artiste, de créateur", élevé au rang de "monstres sacrés", qui semble autoriser de tels comportements.
      • L'omerta est particulièrement forte dans ce milieu, rendant la "libération de la parole" des victimes d'autant plus difficile.

      La légitimité de la parole des victimes et le déni:

      • Il est crucial de "faire contrepoids" au "déni" des auteurs en faisant émerger la parole des victimes et en la faisant "regagner en légitimité".
      • Dire ("Je te crois") à une victime est une "œuvre de réparation" et une parole "performative" qui permet de lutter contre la "victimisation secondaire".
      • Le déni des auteurs est manifeste lorsqu'ils reconnaissent les faits ("ont contraint leurs femmes à des rapports sexuels") mais refusent de se reconnaître comme "violeur" ou "hommes violents", se tenant "à distance [...] des figures d'altérité du monstre, du pervers".

      Les obstacles à la procédure judiciaire et les pistes d'amélioration:

      • Le "dépôt de plainte" est un "saut dans le vide", une "prise du risque", "vertigineux" et une "brèche dans l'omerta". Seulement "6% des victimes déposent plainte".
      • L'"accueil par l'institution judiciaire et policière est toujours un frein énorme". Les "portes pèsent toujours aussi lourd". La "qualité d'accueil" varie considérablement.
      • Les "classements sans suite" sont massifs ("96% des plaintes déposées entre 2012 et 2021 ont été classées sans suite").
      • Pistes d'amélioration:Lutter contre les "classements sans suite" et "poser le principe d'acte d'enquête obligatoire".

      Il faut "chercher", "faire les enquêtes". * "Mieux accueillir et traiter la parole des victimes" en pratiquant le "Je te crois". * Lutter contre les "processus de victimisation secondaire". * Sortir du "sempiternel parole contre parole" et ne pas demander à la parole des victimes d'être la "preuve cardinale", mais de ne pas être traitées comme des "menteuses". * Réfléchir à la place de la partie civile dans le procès. * Travailler sur la question des procédures "bâillon" et de la "menace de ces procédures bâillon systématiquement".

      La question de la protection des victimes:

      • La "protection" est un "angle mort de nos réflexions".
      • Les progrès en la matière sont "toujours pas suffisants". De nombreuses femmes "vivent la peur au ventre".
      • Il faut "mieux informer les plaignantes de ce qu'il advient de leur plainte".
      • Il faut "réformer l'ordonnance de protection judiciaire qui lie toujours pour être obtenu la vraisemblance des violences et l'existence d'un danger". Il faut "faire sauter cette double condition cumulative".
      • Les féminicides sont un "scandale absolu" et il faut "investiguer ce scandale absolu".

      La prévention et l'éducation:

      • La figure du "monstre" (comme dans le cas de Depardieu) est une construction sociale qui masque la réalité : dans "9 cas sur 10", l'agresseur est "monsieur tout le monde".
      • Il y a un "vrai problème sociétal de d'éducation".
      • Pistes d'amélioration:Renforcer l'éducation dès l'école ("séances la vie affective et sexuelle") et sur l'égalité ("matière obligatoire" sur l'égalité). Ces séances devraient être tenues, car c'est dans la loi depuis 2001.
      • "Sensibiliser" les forces de l'ordre et les instituteurs.
      • Empêcher certains hommes de "devenir des violeurs" en travaillant sur le consentement.
      • L'éducation est une solution à "long terme" qui "prend des décennies".

      Le rôle de l'expertise judiciaire:

      • Le "trouble de stress post-traumatique (TSPT)" est "très mal diagnostiqué" bien qu'il soit un "témoin de l'agression", un "marqueur de l'agression" et donc une "preuve au dossier".
      • Les expertises psy sont souvent "misogynes".
      • Il faut "former les expert psis à ça".
      • Explorer les avancées de l'"épigénétique" et la possibilité de "chercher ces marqueurs-là" (sur le gène nr3c1) pour diagnostiquer le TSPT. Le TSPT est réversible, ce qui est une "bonne nouvelle" mais nécessite des soins.
      • La "présence de l'avocate ou de l'avocat est hyper importante pour éviter les phénomènes de victimisation secondaire". Cette prise en charge devrait être "prise en charge par l'État sans conditions de ressources".
      • L'accompagnement des victimes et l'accès à la justice:
      • L'accompagnement des victimes "avant le dépôt de plainte" est essentiel. Les avocats peuvent déjà le faire, mais il faudrait "prévoir une aide juridictionnelle pour ça" et "systématiser" le travail préparatoire des travailleurs sociaux dans les commissariats.
      • Il est "désespérant" d'avoir à "prendre rendez-vous avec un ou une policière dont [on] sait qu'il va lui assurer une écoute bienveillante" pour éviter la victimisation secondaire.
      • La justice devrait être "friendly aux victimes", perçue comme une "passerelle de sortie possible".
      • Le "classement sans suite" peut entraîner des "suicides".
      • Le discours de la victime d'agression sexuelle peut être "incohérent", ce qui est "au contraire une preuve qu'elle a été agressée".

      La question de la "folie des femmes" et la protection de la vie privée:

      • On "essaie de [les] faire passer pour folle" pour "discréditer, délégitimer leurs paroles". Cela renvoie à des imaginaires de "sorcières" et d'"hystérie".
      • Les expertises psychologiques cherchent souvent une "forme de folie ou d'instabilité psychologique" chez les femmes qui dénoncent.
      • Les personnes ayant parlé sont considérées comme "folles, fragiles et délicates", se voyant proposer des "soins" plutôt que de la "réparation".
      • Il faut interdire l'"investigation sur la sexualité des femmes", car leur "passé leurs propres aventures amoureuses, sexuelles soit investiguer et soit porter finalement comme un argument à leur rencontre". Cela permettrait de ne pas les faire passer pour "légère ou pour nymphomane".

      La question de l'emprise et du contrôle coercitif:

      • L'emprise est un "phénomène protéiforme complexe" et est "très peu" dans le code pénal.
      • Il faudrait une "vraie réflexion sur le contrôle coercitif qui est le pendant en prise", c'est-à-dire les "actes posés et voulus par l'auteur avec un dessin concerté qui est celui de maintenir la victime sous emprise".
      • La jurisprudence a déjà commencé à définir le contrôle coercitif, ce qui est utile.
      • La codification de l'emprise doit être abordée avec "précaution".

      La nécessaire spécialisation et circulation de l'information:

      • La "circulation de l'information" au sein des tribunaux est essentielle pour traiter les dossiers de manière globale.
      • Le "décloisonnement des services" et la "mise en œuvre de processus qui sont spécifiques" (comme les juridictions dédiées aux violences intrafamiliales à Nantes) permettent de "gagner en efficacité" et de mieux appréhender les situations.
      • Il faut rendre la justice "friendly aux victimes".

      Citations Clés:

      • "nous sommes là actuellement en plein vote de la motion de censure"
      • "notre commission d'enquête cherche à faire la lumière sur les violences commises contre les mineurs et les majeurs dans le secteur du cinéma de l'audio visuel et du spectacle vivant"
      • "tous les mécanismes de domination que nous connaissons que sont l'emprise le chantage l'omerta la verticalité des hiérarchies sans compter le prétexte fallacieux de la création artistique au nom de laquelle tout serait toujours permis il trouve un terrain d'expression particulièrement favorable"
      • "la violence faite aux femmes et aux enfants existe et cette violence est non seulement massive mais cette violence structure profondément qui nous sommes"
      • "le dénis est nécessaire nécessaire et indispensable comme garant de la perpétuation du système"
      • "tous ceux qui le sont ont en commun la conviction de la légitimité de leur légitimité à ne pas s'empêcher"
      • "sortir du sampéternel parole contre parole"
      • "mieux accueillir et traiter la parole des victime ça veut dire pratiquer en tout cas c'est ce que nous faisons le je te crois"
      • "lutter contre les classements sans suite"
      • "La protection est souvent un angle mort"
      • "vivre la peur au ventre c'est quelque une expérience à nul autre pareil"
      • "il faut faire sauter cette double condition cumulative [pour l'ordonnance de protection]"
      • "je n'en peux plus nous n'en pouvons plus de compter nos mortes"
      • "ça répond au même système que toutes les autres violences faites aux femmes avec effectivement cette particularité peut-être et même sûrement que on y retrouve des lieux de pouvoir des lieux d'influence"
      • "en terme de prévention d'éducation des l'école justement on dit souvent aux enfants mais non t'inquiète pas les monstres n'existent pas mais en réalité on leur dit pas tu peux enfin évidemment on va pas leur dire à 5 ans bien sûr on va pas leur dire tu peux être agressé par ton copain ou voilà mais ce que je dire par là c'est que on a un vrai problème sociétal de d'éducation"
      • "dans 9 cas sur 10 ce monstre là c'est pas un monstre c'est monsieur tout le monde"
      • "sur les enquêtes on n'a pas d'enquête aujourd'hui on n'a pas assez d'actes d'investigation"
      • "former les expert psis à ça parce qu'en fait déjà ils sont pas formés puis souvent ils sont misogynes c'est un problème"
      • "on sait que le TSPT a des conséquences sur l'épigénétique"
      • "une justice de d'abord de classe une justice de dossier"
      • "un réseau d'entraide masculin qui permet d'échapper à toute poursuite ou à toute dénonciation publique"
      • "on les femmes payent le fruit de siècles de euh croyance articulé autour de l'idée que leur parole est euh déraisonnable"
      • "les tribunaux se saisissent de ça ils sont au travail hein déjà sur cette questionl et sur la circulation de l'information et le décloisonnement"
      • "La justice n'est pas un soin"
      • "elle me disent toutes J'ai besoin d'être reconnu"
      • "on demande aux femme d'avoir un discours clair"
      • "un discours incohérent chez une femme victime d'agression sexuelle c'est au contraire une preuve qu'elle a été agressée"
      • "elles se taisent là-dessus [sur leur vie privée] parce que évidemment les policiers adorent des poser ces questions là"
      • "l'attention du législateur devrait peut-être davantage se tourner sur cette notion là [le contrôle coercitif] plutôt que sur celle de l'emprise"

      Points d'attention et Questions en Suspens:

      • La possibilité d'une "matière obligatoire" sur l'égalité à l'école.
      • Les moyens concrets pour obliger les enquêtes à être menées après un dépôt de plainte (décret, loi, liste d'actes minimums).
      • Comment assurer une prise en charge psychologique (spécialisée en TSPT) pour les victimes, idéalement prise en charge par l'État.
      • Comment réformer les expertises psy pour garantir leur impartialité et leur compétence en matière de TSPT.
      • Comment mettre en place une "plateforme de suivi" des plaintes pour les victimes.
      • Les moyens de protéger réellement les témoins.
      • La définition et la codification éventuelle du contrôle coercitif dans le code pénal.

      Conclusion:

      L'audition a souligné la complexité et la profondeur des violences sexistes et sexuelles, appelant à une approche globale qui combine prévention, éducation, réforme du système judiciaire, amélioration de l'accueil des victimes et renforcement des mesures de protection.

      Les intervenantes ont insisté sur la nécessité d'une volonté politique forte pour mettre en œuvre les changements nécessaires.

    1. Note de Briefing : Analyse de la réponse judiciaire aux violences sexistes et sexuelles

      Objet : Synthèse des principales thématiques, idées et faits saillants ressortant des témoignages d'associations, militantes et professionnelles concernant la réponse judiciaire aux violences sexistes et sexuelles.

      Date : 27 novembre 2024 (basé sur les références temporelles dans les sources)

      Sources consultées : Excerpts from "🔴Associations et militantes: leur avis sur la réponse judiciaire aux violences sexistes et sexuelles" (vidéo ou enregistrement d'une audition).

      Résumé exécutif :

      • Les témoignages convergent pour dépeindre un système judiciaire français largement inefficace et souvent traumatisant pour les victimes de violences sexuelles et sexistes.

      Malgré quelques progrès, les statistiques de dépôt de plainte et de condamnation sont alarmantes, témoignant d'une impunité quasi-totale.

      Les principales critiques portent sur l'accueil et la prise en charge des plaintes, la qualité des enquêtes, le déroulement des procès (source de victimisation secondaire) et l'inadéquation de la définition pénale du viol, qui ne se fonde pas explicitement sur l'absence de consentement.

      Un manque criant de formation des professionnels (police, gendarmerie, justice, santé) au psychotraumatisme et aux spécificités des violences sexuelles est également souligné.

      Les intervenantes insistent sur la nécessité d'une approche globale incluant prévention, éducation, moyens conséquents et une réforme législative plaçant le consentement au cœur de la définition du viol, en phase avec les normes internationales.

      Thèmes principaux et idées clés :

      • Des statistiques alarmantes et une impunité généralisée :
      • Les chiffres des violences sexuelles ont presque doublé depuis 2006.
      • Sur 230 000 victimes (18-75 ans) par an, seulement 6.6% portent plainte.
      • Près de 70% des plaintes pour viol sont classées sans suite.
      • Le taux de condamnation pour des faits ressentis comme des viols est inférieur à 1% (environ 0.6%).
      • "Nous avons un taux qui est inférieur à moins de 1 % euh ce serait plutôt du 0,6 % de fait ressentis comme étant des viols qui donnerait lieu à condamnation" (Magalie la Fourcade).
      • Il y a eu une baisse de 40% du nombre de condamnations en matière de viol entre 2018 et 2021 selon une infostat (même si celle-ci date un peu).
      • "Aujourd'hui on a un problème de législation qui ne s'arrête pas qu'un problème de législation évidemment mais la législation étant d' interprétation stricte en droit pénal ça irrigue aussi la façon dont on mène les enquêtes" (Magalie la Fourcade).
      • Un parcours judiciaire traumatisant pour les victimes (Victimisation secondaire) :
      • Le dépôt de plainte est souvent une épreuve. "comment faire en sorte que cette épreuve du dépôt de plein ne soit pas un enfer pour les victimes" (Introduction).
      • Les enquêtes ne sont pas toujours menées correctement ; l'investigué n'est pas toujours auditionné.
      • Le procès est "trop souvent une épreuve traumatisante". "pour celles qui ont à vivre un procès l'expérience est si éprouvante que l'on parle de victimisation secondaire d au fonctionnement de l'appareil judiciaire" (Introduction).
      • La victime est disproportionnellement ciblée par l'examen de crédibilité, contrairement à la personne mise en cause.
      • Le coût financier et la longueur des procédures sont dénoncés.
      • Plusieurs affaires sont en cours devant la Cour Européenne des Droits de l'Homme (CEDH) pour victimisation secondaire due au fonctionnement de l'appareil judiciaire. "quand j'ai appris qu'il y avait h plaintes contre la France auprès de la CEDH pour victimisation secondaire je mais en fait c'est ça que j'ai vécu" (Julia feus).
      • L'expérience judiciaire peut aggraver le risque de passage à l'acte suicidaire. "les procédures judiciaires on augmenté de façon importante le risque de passage à l'acte suicidaire" (Muriel Salmona).
      • 83% des victimes déclarent n'avoir jamais été protégées ou reconnues en tant que victime.
      • Une ancienne victime témoigne : "j'y croyais beaucoup et j'ai fait tout ce qu'on m'a demandé il a été acquitté et moi j'ai été broyé une deuxième fois" (Julia feus). Elle n'est aujourd'hui "pas sûre" de recommander aux victimes de porter plainte.
      • La victime doit être bien entourée, avoir de l'argent, et s'attendre à perdre.
      • Elle risque que sa plainte ne soit pas reçue (1 chance sur 2) et que 9 plaintes sur 10 soient classées sans suite.
      • Les maltraitances subies pendant la procédure peuvent avoir des conséquences à long terme, même des décennies après. "mon corps se souvient de combien l'institution judiciaire m'a broyé et je pèse mes mots" (Julia feus).
      • Les victimes sont confrontées à des stéréotypes (sainte, fabulatrice, provocatrice). "elle se retrouve en tant que plignante à faire l'objet d'un examen de crédibilité qui est extrêmement virulent autour de l'archétype soit de la sainte... soit de la fabulatrice soit de la provocatrice" (Magalie la Fourcade).
      • Le système actuel favorise l'optimisation sexuelle pour éviter la qualification de viol. "aujourd'hui si vous êtes pas pris sur les quatre critères ce n'est pas un viol donc on est dans un système d'optimisation sexuelle" (Magalie la Fourcade).
      • La nécessité d'une réforme de la définition pénale du viol basée sur le consentement :
      • Les engagements internationaux de la France (Convention du CEDO, Convention d'Istanbul) exigent une définition du viol fondée sur l'absence de consentement.
      • "la définition pénale du viol était totalement insuffisante et a demandé à la France de modifier le code pénal pardon de manière à ce que la définition du viol soit fondée sur l'absence de consentement qu' couvre tout les actes sexuels non consentis ce qui n'est pas permis aujourd'hui et tiennent compte de toutes les circonstances coercitives conformément aux normes internationales relatives au droit de l'homme" (Magalie la Fourcade, citant le CDO).
      • La définition actuelle, basée sur violence, menace, contrainte ou surprise, est interprétée de manière variable par les magistrats et est insuffisante pour couvrir tous les actes sexuels non consentis.
      • Un grand mouvement mondial et européen (Canada, Allemagne, Espagne, Belgique, etc.) a déjà adopté une définition basée sur le consentement.
      • La France doit aligner sa législation sur sa "diplomatie féministe" pour rester crédible.
      • Il ne suffit pas d'ajouter "non consenti" ou "non librement consenti" ; il faut définir le consentement (libre, explicite, ne peut être donné à l'avance, peut être retiré à tout moment) et élargir les critères pour inclure les états de sidération, la ruse et les circonstances coercitives (vulnérabilité, rapports d'autorité, dépendance économique, violences conjugales, etc.).
      • "il faut absolument ouvrir les critères pour pouvoir faire entrer les états de sidération la ruse" (Magalie la Fourcade).
      • La définition belge est donnée en exemple, incluant l'absence de résistance, la vulnérabilité, l'influence de substances, la ruse, etc. (Magalie la Fourcade).
      • Le mot "consentement" dans la loi aurait aussi une fonction éducative.
      • Il y a une divergence sur l'introduction du mot "consentement" : certaines associations craignent qu'il ne se focalise sur la victime et qu'il soit flou ou utilisé contre elle, arguant que le viol est un acte de prédation (Marie France Casalis).
      • Un manque de moyens et de formation des professionnels :
      • Les enquêtes ne sont pas toujours menées par des professionnels formés.
      • Malgré des efforts de formation, celle-ci n'est pas toujours suffisante ou adaptée.
      • La connaissance et la prise en charge du psychotraumatisme ne sont pas généralisées chez tous les intervenants (police, justice, santé).
      • "la connaissance du psychotraumatisme par l'ensemble des intervenants vous paraît-elle acquise et sa prise en charge vous paraît-elle suffisante" (Introduction).
      • Le système est submergé, rendant difficile la prise en charge des victimes qui nécessitent du temps. "le problème de nos forces de l'ordre comme de nos médecins pas comme de nos juristeson c'est qu'ils sont submergés et qu'au milieu de tout ça c'est qu'est-ce qui vient pour viol ça permettra pas de parler" (Marie France Casalis).
      • Le manque de personnel dans les services spécialisés (BLPF à Paris). "en moyenne j'ai vu trois fonctionnaires maximum 7 par blpf sachant que les blpf sont toutes les violences intrafamiliales pas uniquement les viols conjugaux" (Anaïse de fausse).
      • Les outils d'audition spécialisés (protocole Mélanie, Nietzche) ne sont pas systématiquement utilisés.
      • L'aide juridictionnelle n'existe pas au stade du dépôt de plainte.
      • Les délais pour les évaluations psychologiques en unités médico-judiciaires sont très longs (1 an à Paris), conduisant à des expertises alternatives de qualité insuffisante.
      • Le budget alloué aux experts judiciaires est très faible.
      • Les magistrats sont parfois contraints par le cadre juridique inadapté.
      • L'impact du psychotraumatisme sur la procédure et sa prise en charge insuffisante :
      • Le psychotraumatisme est un fil rouge présent à toutes les étapes de la procédure judiciaire.
      • Les conséquences psychotraumatiques sont graves, à long terme, et peuvent entraîner des risques de mort précoce.
      • Le psychotraumatisme n'est pas suffisamment connu, dépisté ou pris en charge par les professionnels de santé, y compris les psychiatres.
      • Les symptômes du psychotraumatisme (sidération, dissociation, mémoire traumatique) sont souvent retournés contre les victimes pour mettre en cause leur crédibilité. "les symptômes qui sont des symptômes universels du trauma... vont être retournés contre les victimes pour mettre à mal leur crédibilité" (Muriel Salmona).
      • Le psychotraumatisme peut être une preuve médico-légale si les experts sont formés.
      • Le dépistage systématique des violences est recommandé mais n'est pas appliqué.
      • L'état de choc traumatique est une urgence médico-psychologique et médico-légale, mais moins de 5% des victimes sont vues en urgence. "personne leur dit qui leur dit quelle campagne dit que c'est une urgence médico-psychologique et médicolégale qui le dit personne" (Muriel Salmona).
      • La prise en charge rapide du psychotraumatisme peut éviter des conséquences très graves. "si ils sont pris en charge rapidement dans les 12 heures qui suivent le trauma on peut éviter toutes les conséquences" (Muriel Salmona).
      • Les victimes de violence sexuelle sont particulièrement vulnérables et ciblées (mineurs, personnes en situation de handicap - 5 à 6 fois plus de risques).
      • Il n'y a pas de centres spécialisés dans la prise en charge du trauma sexuel comme préconisé par la Convention d'Istanbul (un centre par bassin de 200 000 habitants). La Belgique a mis en place 7 centres.
      • Comment porter plainte quand on est en état de choc, qu'on oublie des éléments importants pour la procédure, ou qu'il y a une déconnexion (dissociation) affectant la mémoire et la capacité à se défendre ?
      • Les expertises psychiatriques manquent souvent de pertinence faute de formation des experts au psychotraumatisme.
      • La dissociation chronique due à des violences anciennes (inceste) rend les personnes extrêmement vulnérables et ciblées par les agresseurs.
      • Une prise en charge gratuite et accessible du psychotraumatisme est essentielle.
      • Une culture du viol persistante et le rôle de l'éducation :
      • La justice est rendue par des personnes "bibronnés au Pat et à la culture du viol et à cette idée selon laquelle finalement le corps des femmes est à disposition" (Julia feus).
      • Il faut "faire la peau" à ce "système destructeur oppressif dominateur patriarcal".
      • Il est nécessaire de faire un "examen de conscience" individuel et collectif sur sa place dans ce système (qu'est-ce que j'ai subi, couvert, su sans le dire, fait).
      • La culture du viol se manifeste dans les représentations collectives des victimes (la "bonne victime" est celle qui est morte) et des agresseurs ("bons pères de famille").
      • "on m'a dit en cours d'assise qu'il pouvait pas m'avoir violé parce que c'était un bon père de famille" (Julia feus).
      • "c'est ça qu'il faut changer c'est nos représentations sur le viol et donc sur ce qui est une victime et donc sur ce qui est un violeur aussi" (Julia feus).
      • Ces représentations sont "hyper pratiques" pour le déni et pour ne pas s'attaquer au problème de fond.
      • L'éducation est cruciale pour lutter contre la culture du viol et promouvoir le consentement.
      • Le manque d'éducation sexuelle formelle et l'accès à la pornographie violente dès le plus jeune âge ont des conséquences désastreuses sur la compréhension de la sexualité et du consentement.
      • "50 % des jeunes de 18 à à 25 ans disent qu'ils ont fait leur AUC aation sexuelle avec la pornographie" et ont une culture du viol plus marquée (Muriel Salmona).
      • La tolérance d'une sexualité violente est gravissime.
      • Les spécificités de l'agresseur et la nécessité de s'intéresser à sa stratégie :
      • Les violeurs sont souvent des "crimes sériels" qui affinent leurs techniques et ciblent les victimes vulnérables. "ce sont souvent des CRIM sériel les hommes raffinent leur techniqu et donc on sont de plus performants pour pouvoir sélectionner les victimes prendre celles qui sont trop fragiles pour aller déposer plainte" (Magalie la Fourcade).
      • Un agresseur efficace cherche l'impunité et met en place une stratégie : isoler la victime, la traiter comme un objet, l'humilier, l'affaiblir, transférer la responsabilité sur elle, instaurer un climat de peur et d'insécurité, recruter des alliés, organiser l'atmosphère autour d'elle pour verrouiller le secret.
      • "nous semblerait plus intéressant de nous refocaliser sur la stratégie d'un bon agresseur" (Marie France Casalis).
      • Quelques progrès et pistes d'amélioration :
      • Amélioration de la conservation des prélèvements médicaux-légaux jusqu'au dépôt de plainte dans certains services.
      • Progrès dans l'accueil en police/gendarmerie, notamment via le chat police et la mobilisation de volontaires formés.
      • Évolution positive dans l'exploitation de la téléphonie et de l'informatique.
      • Les Cours Criminelles Départementales sont jugées efficaces par une avocate, avec des magistrats professionnels potentiellement mieux formés et une meilleure prise en charge de la victime pendant le procès.
      • La possibilité de refuser la correctionnalisation existe et doit être utilisée.
      • Le Comité Interministériel du Handicap a mis en place des modules de formation sur les violences sexuelles faites aux personnes handicapées, qui pourraient bénéficier à toutes les personnes vulnérables.
      • Des décisions de justice récentes reconnaissent la sidération et son lien avec l'absence de consentement.
      • Une génération de jeunes magistrats motivés et sensibilisés aux questions de genre émerge.
      • L'information des victimes et de leurs proches sur les démarches à suivre est cruciale et actuellement insuffisante.
      • La prise en charge des auteurs est également essentielle pour lutter contre la récidive, en s'intéressant à leur propre parcours traumatique. "le fait d'avoir subi des violences physiques et sexuelles pour les garçons multiplié par 14 le risque de commettre des violences sexuelles et sexistes" (Muriel Salmona).

      Recommandations implicites et explicites :

      • Réformer la définition pénale du viol pour la fonder sur l'absence de consentement, en définissant clairement le consentement et en élargissant les critères pour inclure les circonstances coercitives, la sidération, la ruse, etc., conformément aux normes internationales.
      • Allouer des moyens financiers et humains conséquents aux services de police, gendarmerie et justice pour garantir des enquêtes de qualité et des délais de procédure raisonnables.
      • Améliorer significativement la formation initiale et continue de tous les professionnels intervenant auprès des victimes (police, gendarmerie, magistrats, avocats, experts judiciaires, personnels de santé) au psychotraumatisme, aux spécificités des violences sexuelles et à l'accueil des personnes vulnérables.
      • Garantir un accueil et une prise en charge du dépôt de plainte respectueux et adapté au trauma vécu.
      • Réduire la focalisation sur la crédibilité de la victime et s'intéresser davantage à la stratégie et au parcours de l'agresseur (faisceau d'indices).
      • Mettre en place des mesures pour lutter contre la victimisation secondaire pendant la procédure judiciaire (ex: interdire l'examen du passé sexuel, permettre l'assistance par un avocat dès le dépôt de plainte, repenser les confrontations).
      • Développer une prise en charge pluridisciplinaire, gratuite et accessible du psychotraumatisme pour toutes les victimes, y compris des centres spécialisés.
      • Renforcer l'éducation à la sexualité et au consentement dès le plus jeune âge et lutter contre la diffusion de contenus violents en ligne.
      • Améliorer l'information des victimes et de leurs proches sur les démarches à suivre après des violences sexuelles.
      • Explorer des pistes pour la prise en charge des auteurs, potentiellement en lien avec leur propre parcours traumatique, afin de lutter contre la récidive.
      • Ne pas barémiser l'indemnisation des victimes et garantir une réparation intégrale et individualisée du préjudice.
      • Maintenir et renforcer les Cours Criminelles Départementales.

      Points de convergence et de divergence :

      • Convergence forte : Constat partagé de l'inefficacité du système judiciaire, de l'impunité, de la victimisation secondaire, du manque de moyens et de formation, de l'importance du psychotraumatisme et de la nécessité d'une réponse globale.
      • Convergence sur la réforme : Accord sur la nécessité de réformer la définition du viol pour inclure l'absence de consentement comme principe central et d'élargir les critères pour inclure les circonstances coercitives.
      • Divergence (principalement sur le terme) : Certaines associations (comme le Collectif féministe contre le viol) s'opposent à l'introduction du mot "consentement" dans la loi, craignant qu'il soit détourné ou qu'il replace la responsabilité sur la victime, préférant se concentrer sur la stratégie de l'agresseur. D'autres estiment que l'intégration explicite du consentement, correctement défini, est indispensable pour aligner la loi sur les normes internationales et changer le paradigme des enquêtes et jugements.

      Prochaines étapes annoncées :

      • La mission sur la définition pénale du viol rendra ses conclusions le 17 décembre prochain.
      • La délégation aux droits des femmes recevra le Ministre de l'Europe et des Affaires étrangères le 10 décembre.
      • Cette note de briefing vise à fournir un aperçu synthétique et fidèle des points clés abordés dans les sources fournies.

      Il est recommandé de se référer aux sources originales pour une compréhension complète des arguments et des nuances exprimées par chaque intervenante.

    1. synthèse détaillée des sources fournies, organisée sous forme de note de synthèse ou "briefing doc", reprenant les thèmes principaux, les idées les plus importantes et incluant des citations pertinentes.

      Note de Synthèse : L'avis du CESE sur la protection de l'enfance

      Date : [Date de la synthèse, ex: 18 novembre 2023] Sujet : Examen de l'avis du Conseil Économique, Social et Environnemental (CESE) sur la protection de l'enfance, présenté à la commission d'enquête de l'Assemblée Nationale par Mme Josianne Bigot, co-rapporteure de l'avis. Source : Excerpts from "🔴 L’avis du CESE sur la protection de l’enfance est présenté à la commission d’enquête" (Vidéo / Transcription)

      Introduction :

      Cette note de synthèse présente les principaux éléments de l'avis du CESE sur la protection de l'enfance, tel qu'exposé par Mme Josianne Bigot devant la commission d'enquête parlementaire. Cet avis fait suite à une saisine du Sénat visant à évaluer l'application des lois de 2016 et 2022 relatives à la protection de l'enfance, dans le contexte d'une crise systémique alarmante. Mme Bigot, forte de son expérience de juge des enfants et de son engagement associatif, souligne les dysfonctionnements persistants et l'aggravation de la situation, ainsi que le décalage entre les textes législatifs et leur application concrète. L'avis du CESE, adopté à la quasi-unanimité, propose un regard "holistique" sur le système et formule des préconisations fortes pour replacer l'enfant au cœur du dispositif.

      Thèmes Principaux et Idées Clés :

      • Une crise systémique de la protection de l'enfance :
      • Le CESE, la commission d'enquête parlementaire et divers acteurs constatent une crise profonde et systémique du système de protection de l'enfance en France.
      • Malgré un arsenal législatif considéré comme "excellent" (lois de 2007, 2016, 2022), les textes ne sont pas pleinement respectés et appliqués sur le territoire.
      • Citation : "la protection de l’enfance est en danger votre expérience et votre témoignage sont précieux pour notre commission d’enquête... le cise avec son rapport pointe la crise systémique de la protection de l’enfance vous alerterz sur les dysfonctionnements qui persistent et même qui s’aggravent ainsi que sur le décalage entre les lois existantes et et leur application concrète sur le territoire"
      • Cette crise se manifeste par : un déficit d'ambition collective, une désorganisation institutionnelle, et l'épuisement/découragement des acteurs.
      • Citation : "on s’est mis très vite d’accord sur les mots clés ils sont apparus lors des discussions qu’on a pu avoir dans le cadre de la commission ces mots clés c’était le déficit d’une ambition collective la dés désorganisation institutionnelle l’épuisement et le découragement des acteurs ce qui ensemble et bien fait que aujourd’hui la protection de l’enfant est en danger"
      • L'invisibilité statistique et le besoin d'un État stratège :
      • Un constat majeur partagé est l'absence criante de statistiques fiables et complètes en matière de protection de l'enfance. Cette "invisibilité statistique" empêche de connaître précisément les besoins, les capacités d'accueil, et les défaillances du système.
      • L'État doit retrouver un rôle de "stratège" et de garant de l'égalité de traitement sur tout le territoire national, ce qui n'est plus le cas aujourd'hui en raison de fortes disparités départementales.
      • Citation : "on est dans une invisibilité totale dans une absence de statistique totale on ne connaît pas les besoins on ne connaît pas les défaillances et il faut absolument que on remédie à ça"
      • Citation : "l’État est le garant de l’égalité de traitement des citoyens et aujourd’hui ça n’est absolument plus le cas sur le territoire puisque ce qui est frappant sur le territoire c’est que il y a une il y a une inégalité très cruelle pour les enfants entre les les territoires"

      Préconisations majeures du CESE (organisées selon les quatre piliers de l'avis) :

      Pilier 1 : Mettre fin à la désorganisation institutionnelle.

      • Donner les moyens au GIP France enfance protégée : Le GIP doit pouvoir collecter des données précises et actualisées des départements (besoins, capacités, mesures non exécutées).
      • Adopter une stratégie interministérielle : Un pilotage renforcé, rattaché idéalement au Premier Ministre, est jugé indispensable pour coordonner l'action de tous les ministères concernés et assurer une vision globale.
      • Renforcer la contractualisation entre l'État et les départements : Conditionner les financements supplémentaires au respect des obligations départementales et à la mise en œuvre de la stratégie nationale.
      • Évaluer rapidement les Conseils Départementaux de Protection de l'Enfance (CDPE) : S'ils se montrent efficaces pour contraindre la coordination des acteurs (administratifs, judiciaires, etc.), il faudrait envisager leur généralisation et leur caractère obligatoire.
      • Ne pas remettre en cause la décentralisation : Le CESE ne préconise pas de revenir sur la décentralisation mais de renforcer le rôle de garant de l'État.

      Pilier 2 : Mieux protéger les enfants.

      • Rendre effectif le Projet Pour l'Enfant (PPE) : Obliger sa mise en œuvre dans tous les départements pour assurer un suivi personnalisé et impliquer l'enfant et sa famille.
      • Développer la prévention primaire et le soutien à la parentalité : Investir davantage en amont pour éviter d'en arriver à des mesures de protection.
      • Assurer la formation de tous les intervenants auprès des enfants : Permettre le repérage des situations de mal-être ou de maltraitance, conformément à la convention de Lanzarote (2010), qui n'est pas appliquée.
      • Diversifier les prises en charge : Privilégier les petites unités d'accueil et les familles d'accueil (dont le nombre est en baisse). Développer un milieu ouvert plus structuré avec un nombre réduit de situations par travailleur social.
      • Focus sur les enfants à double vulnérabilité (handicap) : Systématiser les conventions entre l'ASE (Aide Sociale à l'Enfance), les MDPH (Maisons Départementales des Personnes Handicapées) et les ARS (Agences Régionales de Santé) et développer des lieux de prise en charge pluriprofessionnelle.

      Pilier 3 : Garantir le respect des droits des enfants.

      • Avocat systématique pour l'enfant : Préconiser la présence systématique d'un avocat auprès de l'enfant, notamment dans les procédures judiciaires (proposition qui n'avait pas abouti en 2007 pour raisons financières).
      • Statut pour les administrateurs ad hoc : Leur donner un statut, une formation obligatoire, un contrôle et une rémunération digne pour garantir la fiabilité de leur mission de représentation des intérêts de l'enfant (actuellement rémunérés très faiblement par le Ministère de la Justice).
      • Effectivité de l'accompagnement des jeunes majeurs : Confier à l'IGAS (Inspection Générale des Affaires Sociales) une mission de contrôle systématique pour assurer que les dispositifs prévus par la loi Taquet soient réellement opérationnels.
      • Créer une autorité indépendante de contrôle : Instaurer une autorité indépendante qui aurait le pouvoir de se rendre à tout moment dans les structures d'accueil et dont les enfants auraient connaissance.

      Pilier 4 : Soutenir les professionnels.

      • Améliorer l'attractivité des métiers : Revaloriser les professionnels de la protection de l'enfance (assistants familiaux, TISF, travailleurs sociaux) et reconnaître leur travail (primes, salaires).
      • Garantir les conditions de travail : Adopter et faire appliquer un décret sur les normes d'encadrement dans les établissements (actuellement inexistant, contrairement à la petite enfance).
      • Renforcer la formation des travailleurs sociaux : Adapter les formations aux spécificités du terrain (violences intrafamiliales, grande pauvreté, relations familiales complexes).
      • Accompagnement à long terme des parcours professionnels : Réfléchir à des évolutions ou passerelles pour les travailleurs sociaux compte tenu de la difficulté du métier sur la durée.
      • Lutter contre la précarité de l'emploi : Éviter le recours aux travailleurs sociaux intérimaires pour garantir la stabilité des figures de référence pour les enfants.
      • Replacer l'enfant au cœur du dispositif et l'intérêt supérieur de l'enfant :
      • L'avis et les lois récentes ont cherché à mettre l'enfant au centre du système.
      • L'intérêt supérieur de l'enfant est le fil conducteur de l'avis, défini comme une "dialectique entre les besoins et les droits".
      • Le CESE souhaite que la protection de l'enfance ne soit pas seulement l'affaire des politiques, mais de la société tout entière.
      • Citation : "nous souhaitons que ce soit pas seulement les politiques qui s’engagent sur la protection de l’enfance par que ça ce soit la société tout entière que chacun chacune se sent concerné par la protection de l’enfance et par un enfant en danger ou potentiellement en danger"
      • Proposition d'un Code de l'Enfance :
      • Pour assoir la volonté de reconnaître les droits de l'enfant, le CESE propose la création d'un Code de l'Enfance qui regrouperait tous les textes éparpillés dans différents codes (civil, pénal, famille, éducation). Cela symboliserait le statut particulier de l'enfant.
      • Échanges avec les parlementaires :
      • Les questions des députés ont notamment porté sur :
      • La diffusion de l'avis (présentation à la Ministre, au Sénat, à l'ADF). L'avis a été présenté à la Ministre (qui s'est montrée intéressée) et la directrice de la PJJ a lu le rapport, mais les présentations formelles sont encore en cours.
      • L'absence historique de normes d'encadrement et le rôle des associations gestionnaires. Mme Bigot explique cette situation en partie par l'apport historique du secteur caritatif/religieux (personnel gratuit) et une certaine réserve initiale des associations face aux exigences de formation et de recrutement. Elle note cependant une évolution positive de certaines associations qui soutiennent aujourd'hui l'adoption de normes minimales.
      • L'intérêt potentiel des comités de pilotage mis en place par les procureurs sur les violences intrafamiliales comme modèle pour la coordination en protection de l'enfance.
      • La difficulté de la "graduation" des interventions (prévention, aides administratives, mesures judiciaires, placements) et la saturation des structures. Le manque de moyens et de cohérence entre les systèmes administratif et judiciaire sont pointés comme obstacles à cette graduation souhaitée par les professionnels.
      • Les avantages et inconvénients de la décentralisation avec du recul. Mme Bigot reconnaît le "bazar" initial lors du transfert de compétences mais estime qu'il ne faut pas y toucher à nouveau étant donné l'état actuel du système. Elle insiste sur l'inadmissibilité des inégalités de traitement selon les départements, qu'il faut combattre par d'autres leviers.
      • L'emploi des termes "enfants confiés" versus "enfants placés". Mme Bigot préfère "confiés" comme un terme qui montre une évolution du vocabulaire, tout en reconnaissant que la réalité des "placements" (instabilité, multiples lieux d'accueil) justifie encore l'usage du terme par les anciens enfants placés eux-mêmes pour dénoncer la situation.
      • L'origine de l'absence d'ambition collective. Mme Bigot estime qu'il y a une progression de l'intérêt des élus. Elle suggère que l'émotion ponctuelle ne suffit pas et qu'il faut un engagement personnel de chacun. Elle rejoint l'idée qu'il faut replacer la protection de l'enfance dans une vision plus globale de l'enfance (soutenant l'idée d'un Code de l'Enfance ou d'un ministre de l'enfance, à condition qu'il ait la compétence sur la protection).
      • Le besoin fondamental de stabilité et de figures adultes de référence stables pour les enfants. Mme Bigot est tout à fait d'accord et dénonce le recours aux travailleurs sociaux intérimaires qui nuit à cette stabilité essentielle.

      Conclusion :

      L'avis du CESE et la présentation de Mme Bigot dressent un tableau sombre mais lucide de la situation de la protection de l'enfance en France, marquée par une crise systémique et un décalage flagrant entre la loi et sa mise en œuvre.

      Les préconisations formulées sont ambitieuses et couvrent l'ensemble du système, insistant particulièrement sur le rôle de l'État en tant que garant, la nécessité d'une meilleure coordination des acteurs (notamment via les CDPE), la protection effective des droits de l'enfant (représentation légale, autorité indépendante), le soutien aux professionnels en première ligne, et une vision plus globale de l'enfance dans la société.

      La mise en œuvre de ces recommandations nécessite un engagement fort de tous les acteurs et de la société dans son ensemble pour passer de l'émotion ponctuelle à une réelle ambition collective et un soutien concret pour les enfants en danger.

    1. But the most interesting part to me was that I didn't even spend all that much work in the code editor itself. I spent most of it in the browser, moving between tabs and settings and configuring and gluing a monster.

      "gluing a monster"

    1. Andrej’s original tweet in full (with my emphasis added): There’s a new kind of coding I call “vibe coding”, where you fully give in to the vibes, embrace exponentials, and forget that the code even exists. It’s possible because the LLMs (e.g. Cursor Composer w Sonnet) are getting too good. Also I just talk to Composer with SuperWhisper so I barely even touch the keyboard. I ask for the dumbest things like “decrease the padding on the sidebar by half” because I’m too lazy to find it. I “Accept All” always, I don’t read the diffs anymore. When I get error messages I just copy paste them in with no comment, usually that fixes it. The code grows beyond my usual comprehension, I’d have to really read through it for a while. Sometimes the LLMs can’t fix a bug so I just work around it or ask for random changes until it goes away. It’s not too bad for throwaway weekend projects, but still quite amusing. I’m building a project or webapp, but it’s not really coding—I just see stuff, say stuff, run stuff, and copy paste stuff, and it mostly works.

      The intent was never serious software dev. Instead, quick weekend projects.

    1. Reviewer #1 (Public review):

      Summary:

      In this manuscript, Jin et. al., describe SMARTTR, an image analysis strategy optimized for analysis of dual-activity ensemble tagging mouse reporter lines. The pipeline performs cell segmentation, then registers the location of these cells into an anatomical atlas, and finally, calculates the degree of co-expression of the reporters in cells across brain regions. The authors demonstrate the utility of the method by labeling two ensemble populations during two related experiences: inescapable shock and subsequent escapable shock as part of learned helplessness.

      Strengths:

      - We appreciated that the authors provided all documentation necessary to use their method, and that the scripts in their publicly available repository are well commented. Submission of the package to CRAN will, as the other reviewer pointed out, ensure that the package and its dependencies can be easily installed using few lines of code in the future. Additionally, we particularly appreciate the recently added documentation website and vignettes, which provide guidance on package installation and use cases.<br /> - The manuscript was well-written and very clear, and the methods were generally highly detailed.<br /> - The authors have addressed our previous concerns, and we appreciate their revised manuscript.

    2. Reviewer #2 (Public review):

      Summary:

      This manuscript describes a workflow and software package, SMARTR, for mapping and analyzing neuronal ensembles tagged using activity-dependent methods. They showcase this pipeline by analyzing ensembles tagged during the learned helplessness paradigm. This is an impressive effort, and I commend the authors for developing open-source software to make whole-brain analyses more feasible for the community. After peer-review, the authors addressed reviewer suggestions and concerns regarding the usability and maintainability of the SMARTTR package, ensuring that the package will be published on CRAN, improving documentation, and including unit tests to ensure code stability. Overall, this software package will prove to have a broad impact on the field.

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      Sammons, Masserini et al. examine the connectivity of different types of CA3 pyramidal cells ("thorny" and "athorny"), and how their connectivity putatively contributes to their relative timing in sharp-wave-like activity. First, using patch-clamp recordings, they characterize the degree of connectivity within and between athorny and thorny cells. Based upon these experimental results, they compute a synaptic product matrix, and use this to inform a computational model of CA3 activity. This model finds that this differential connectivity between these populations, augmented by two different types of inhibitory neurons, can account for the relative timing of activity observed in sharp waves in vivo.

      We thank the reviewer for reading our manuscript, as well as for their nice summary and constructive comments

      Strengths:

      The patch-clamp experiments are exceptionally thorough and well done. These are very challenging experiments and the authors should be commended for their in-depth characterization of CA3 connectivity.

      Thank you for the recognition of our efforts.

      Weaknesses:

      (1) The computational elements of this study feel underdeveloped. Whereas the authors do a thorough job experimentally characterizing connections between excitatory neurons, the inhibitory neurons used in the model seem to be effectivity "fit neurons" and appear to have been tuned to produce the emergent properties of CA3 sharp wave-like activity. Although I appreciate the goal was to implicate CA3 connectivity contributions to activity timing, a stronger relationship seems like it could be examined. For example, did the authors try to "break" their model? It would be informative if they attempted different synaptic product matrices (say, the juxtaposition of their experimental product matrix) and see whether experimentally-derived sequential activity could not be elicited. It seems as though this spirit of analysis was examined in Figure 4C, but only insofar as individual connectivity parameters were changed in isolation.

      Including the two interneuron types (B and C) in the model is, on the one hand, necessary to align our modeling framework to the state-of-the-art model by Evangelista et al. (2020), which assumes that these populations act as switchers between an SPW and a non-SPW state, and on the other hand, less straightforward because the connectivity involving these interneurons is largely unknown.

      For B cells, the primary criterion to set their connections to and from excitatory cells was to balance the effect of the strong recurrent excitation and to achieve a mid-range firing rate for each population during sharp wave events. Our new simulations (Figure 5B) show that the initial suppression of population T (resulting in the long delay) indeed depends in equal proportions on the outlined excitatory connections and on how strongly each excitatory population is targeted by the B interneurons. However, these simulations demonstrate that there is a broad, clearly distinct, region of the parameter space that supports a long delay between the peaks, rather than a marginal set of finetuned parameters. In addition, the simulations show that B interneurons optimally contribute to the suppression of T when they primarily target T (Fig. 5B, panels 3,7,11,12,13) rather than A (panels 4,8,9,10,11). On the contrary, as reported in the parameter table, and now also displayed graphically in the new Figure 4A (included above, with arrow sizes proportional to the synaptic product between the parameters determining the total strength of each connection), we assume B to target A less weakly than T (to make up for the higher excitability of population A). Therefore, the long delay between the peaks in our model emerges in spite of the interneuron connectivity, rather than because of it, and it is an effect of the asymmetric connectivity between the two excitatory populations, in particular the extremely low connection from A to T.

      (2) Additional explanations of how parameters for interneurons were incorporated in the model would be very helpful. As it stands, it is difficult to understand the degree to which the parameters of these neurons are biologically constrained versus used as fit parameters to produce different time windows of activity in types of CA3 pyramidal cells.

      Response included in point (1).

      Reviewer #2 (Public Review):

      Sharp wave ripples are transient oscillations occurring in the hippocampus that are thought to play an important role in organising temporal sequences during the reactivation of neuronal activity. This study addresses the mechanism by which these temporal sequences are generated in the CA3 region focusing on two different subtypes of pyramidal neurons, thorny and athorny. Using high-quality electrophysiological recordings from up to 8 pyramidal neurons at a time the authors measure the connectivity rates between these pyramidal cell subtypes in a large dataset of 348 cells. This is a significant achievement and provides important data. The most striking finding is how similar connection characteristics are between cell types. There are no differences in synaptic strength or failure rates and some small differences in connectivity rates and short-term plasticity. Using model simulations, the authors explore the implications of the differences in connectivity rates for the temporal specificity of pyramidal cell firing within sharp-wave ripple events. The simulations show that the experimentally observed connectivity rates may contribute to the previously observed temporal sequence of pyramidal cell firing during sharp wave ripples.

      Thank you very much for your careful review of our manuscript and the overall positive assessment.

      The conclusions drawn from the simulations are not experimentally tested so remain theoretical. In the simple network model, the authors include basket cell and anti-SWR interneurons but the connectivity of these cell types is not measured experimentally and variations in interneuron parameters may also influence temporal specificity of firing.

      As variations in some of these parameters can indeed influence the temporal specificity of firing, we have now performed additional simulations, the results of which are in the new Figures 5 and S5. Please also see response to Reviewer 1, point 1.

      In addition, the influence of short-term plasticity measured in their experiments is not tested in the model.

      We have now included short-term synaptic depression in all the excitatory-to-excitatory synapses and compensated for the weakened recurrent excitation by scaling some of the other parameters. The results of re-running our simulations in this alternative version of the model are reported in Figure S3 and are qualitatively analogous to those in Figure 4.

      Interestingly, the experimental data reveal a large variability in many of the measured parameters. This may strongly influence the firing of pyramidal cells during SWRs but it is not represented within the model which uses the averaged data.

      We have now incorporated variability in the following simulation parameters: the strength and latency of the four excitatory-to-excitatory connections as well as the reversal potential and leak conductance of both types of pyramidal cells, assuming variabilities similar to those observed experimentally (see Materials and Methods for details). Upon a slight re-balancing of some inhibitory connection strengths, in order to achieve comparable firing rates, we found that this version of the model also supports the generation of sharp waves with two pyramidal components (Figure S4B), and is, thus, fully analogous to our basic model. Varying the excitatory connectivities as in the original simulations (cf. Figure 4C and Figure S4C) reveals that increasing the athorny-toathorny or decreasing the athorny-to-thorny connectivity still increases the delay between the peaks, although for some connectivity values the peak of the athorny population appears more spread out in time.

      Reviewer #3 (Public Review):

      Summary:

      The hippocampal CA3 region is generally considered to be the primary site of initiation of sharp wave ripples-highly synchronous population events involved in learning and memory although the precise mechanism remains elusive. A recent study revealed that CA3 comprises two distinct pyramidal cell populations: thorny cells that receive mossy fiber input from the dentate gyrus, and athorny cells that do not. That study also showed that it is athorny cells in particular that play a key role in sharp wave initiation. In the present work, Sammons, Masserini, and colleagues expand on this by examining the connectivity probabilities among and between thorny and athorny cells. First, using whole-cell patch clamp recordings, they find an asymmetrical connectivity pattern, with athorny cells receiving the most synaptic connections from both athorny and thorny cells, and thorny cells receiving fewer. They then demonstrate in spiking neural network simulations how this asymmetrical connectivity may underlie the preferential role of athorny cells in sharp wave initiation.

      Strengths:

      The authors provide independent validation of some of the findings by Hunt et al. (2018) concerning the distinction between thorny and athorny pyramidal cells in CA3 and advance our understanding of their differential integration in CA3 microcircuits. The properties of excitatory connections among and between thorny and athorny cells described by the authors will be key in understanding CA3 functions including, but not limited to, sharp wave initiation.

      As stated in the paper, the modeling results lend support to the idea that the increased excitatory connectivity towards athorny cells plays a key role in causing them to fire before thorny cells in sharp waves. More generally, the model adds to an expanding pool of models of sharp wave ripples which should prove useful in guiding and interpreting experimental research.

      Thank you very much for your careful review of our manuscript and this positive assessment.

      Weaknesses:

      The mechanism by which athorny cells initiate sharp waves in the model is somewhat confusingly described. As far as I understood, random fluctuations in the activities of A and B neurons provide windows of opportunity for pyramidal cells to fire if they have additionally recovered from adaptive currents. Thorny and athorny pyramidal cells are then set in a winner-takes-all competition which is quickly won by the athorny cells. The main thesis of the paper seems to be that athorny cells win this competition because they receive more inputs both from themselves and from thorny cells, hence, the connectivity "underlies the sequential activation". However, it is also stated that athorny cells activate first due to their lower rheobase and steeper f-I curve, and it is also indicated in the methods that athorny (but not thorny) cells fire in bursts. It seems that it is primarily these features that make them fire first, something which apparently happens even when the A to A connectivity is set to 0albeit with a very small lag. Perhaps the authors could further clarify the differential role of single cell and network parameters in determining the sequential activation of athorny and thorny cells. Is the role of asymmetric excitatory connectivity only to enhance the initial intrinsic advantage of athorny cells? If so, could this advantage also be enhanced in other ways?

      Thank you for the time invested in the review of our manuscript. We especially thank you for pointing out that the description of these dynamics was unclear: we have now improved it in the main text and we provide here an additional summary. As correctly highlighted by Reviewer 3, athorny neurons (A) are more excitable than thorny (T) ones due to single-neuron parameters: therefore, if there is a winner-takes-all competition, they are going to win it. Whether there is a competition in the first place, however, depends on the excitatory (and inhibitory) connections. In particular, we should distinguish two questions: does the activity of populations A and B (PV baskets), without adaptation (so at the beginning of the sharp wave) suppress T? And does the activity of populations T and B suppress A?

      The four possible combinations can be appreciated, for example, in the new Figure 5A5. If A can suppress T, but T cannot suppress A (low A-to-T, high T-to-A, bottom right corner, like in the data), A “wins” and T fires later, after a long delay. If both A and T can suppress each other (both cross-connections are low, bottom left corner), we still get the same outcome: A wins because of its earlier and sharper onset (due to single-neuron parameters). If neither population can suppress the other (high cross-connections, top right corner), then there is no competition and the populations reach the peak approximately at the same time. Only in the case in which T can suppress A, but A cannot suppress T (low T-to-A, high A-to-T, top left corner, opposite to the data), then A “loses” the competition. However, since A neurons nevertheless display some early activity (again, due to the single neuron parameters), this scenario is not as clean as the reversed one: rather, A cells have an initial, small peak, then T neurons quickly take over and grow to their own peak, and then, depending on how strongly T neurons suppress A neurons, there may or may not be a second peak for the A neurons. This is the reason why, in the top left corner of Figure 5B, the statistics show either a long positive or long negative delay, depending on whether the first (small) or second (absent, for some parameters) peak of A is taken into account. In summary, the experimentally measured connectivity does not only enhance the initial intrinsic advantage of A cells, but sets up the competitive dynamics in the first place, which are crucial for the emergence of two distinct peaks, rather than a single peak involving both populations.

      Although a clear effort has been made to constrain the model with biological data, too many degrees of freedom remain that allow the modeler to make arbitrary decisions. This is not a problem in itself, but perhaps the authors could explain more of their reasoning and expand upon the differences between their modeling choices and those of others. For example, what are the conceptual or practical advantages of using adaptation in pyramidal neurons as opposed to short-term synaptic plasticity as in the model by Hunt et al.?

      It should be pointed out that the model by Hunt et al. features adaptation in pyramidal neurons as well, as the neuronal units employed are also adaptive-exponential integrate-and-fire. In an early stage of this project, we obtained from Hunt et al. the code for their model, and ascertained that adaptation is the main mechanism governing the alternations between the sharp-wave and the non-sharp-wave states, to the extent that fully removing short-term plasticity from their model does not have any significant impact on the network dynamics. Therefore, our choices are, in this regard, fully consistent with theirs. In order to confirm that synaptic depression does not significantly impact the dynamics also in our model, we now performed additional simulations (Figure S3), addressed in the main text (lines 149-151) and in the response to Reviewer 1, who expressed similar concerns.

      Relatedly, what experimental observations could validate or falsify the proposed mechanisms?

      As sharp wave generation in this model relies on disinhibitory dynamics (suppression of the anti-sharp-wave interneurons C), the model could be validated/falsified by proving/disproving that a class of interneurons with anti-sharp-wave features exists. In addition, the mechanism we proposed for the long delay between the peaks of the athorny and thorny activity requires at least some connectivity from athorny to basket and from basket to thorny neurons.

      In the data by Hunt et al., thorny cells have a higher baseline (non-SPW) firing rate, and it is claimed that it is actually stochastic correlations in their firing that are amplified by athorny cells to initiate sharp waves. However, in the current model, the firing of both types of pyramidal cells outside of ripples appears to be essentially zero. Can the model handle more realistic firing rates as described by Hunt et al., or as produced by e.g., walking around an environment tiled with place cells, or would that trigger SPWs continuously?

      When building this model, we aimed at having two clearly distinct states the network could alternate between, so we picked a rather polarized connectivity to and from the anti-sharp wave cells (C), resulting in polarized states. As a result, we obtain a low, although non-zero, activity of pyramidal neurons in non-SPW states (0.4 spikes/s for athorny and 0.2 spikes/s for thorny). These assumptions can be partially relaxed, for example in the original model by Evangelista et al. (2020), where the background firing rate of pyramidal cells is ~2 spikes/s. It should also be noted that, when walking in an environment tiled with place cells, the hippocampus is subject to additional extra-hippocampal inputs (e.g. from the medial septum, resulting in theta oscillations) and to neuromodulation, which can alter the network in various ways that we have not included in our model. However, our results are not in contradiction to transient SPW-like activity states initiated at a certain phase of the theta oscillation, when the inhibition is weakest.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      (1) The manuscript reads like it was intended as a short-form manuscript for another journal. The introduction and discussion in particular are very brief and would benefit from being expanded and providing a bigger picture for the reader.

      We had originally aimed to submit in the eLife “short report” format. However, also thanks to the suggestion of Reviewer 1, we realized that our text would be better supported by extended introduction and discussion sections, as well as additional figures.

      (2) Graphs would benefit from including all datapoints, where appropriate.

      All datapoints have now been added to boxplots in the main figures and supplement.

      (3) The panels of Figure 4 are laid out strangely, it may be worthwhile to adjust.

      We thank the reviewer for this suggestion. We have now adjusted the layout of Figure 4 and believe it is now easier to follow.

      Reviewer #2 (Recommendations For The Authors):

      Useful points to address include:

      (1) Explore within the model the effect of altering interneuron connectivity. Are there other factors that can influence temporal specificity within SWRs?

      The effects of varying the connectivity to and from B interneurons (the ones which are SPWactive and therefore relevant for temporal specificity) have now been investigated in the new Figure 5B, in which such parameters were varied in pairs or combined with the two most relevant excitatoryto-excitatory connections.

      (2) Implement the experimentally observed short-term plasticity in the model to determine how this influences temporal specificity.

      All the findings in Figure 4 have now been replicated in the new Figure S3, in which excitatory-to-excitatory synapses feature synaptic depression.

      (3) Consider if it is possible to incorporate observed experimental variability in the model and explore the implications.

      All the findings in Figure 4 have now been replicated in the new Figure S4, in which heterogeneity has been introduced in multiple neuronal and synaptic parameters of thorny and athorny neurons.

      (4) Include the co-connectivity rates in the data. Ie how many of the recorded neurons are reciprocally connected? Does this change the model simulations?

      We have now added the rates of reciprocal connections that we observed into the main text (lines 8688). We found 2 pairs of reciprocally connected athorny neurons and 2 pairs of reciprocally connected thorny neurons. These rates of reciprocity were not statistically significant. We did not observe reciprocal connections in other paired neuron combinations (i.e. athorny-thorny or vice-versa). Coconnectivity does not have any effect on the model simulations, as the model includes thousands of neurons grouped in populations without specific sub-structures. It might, however, be more relevant if the excitatory populations were further subdivided in assemblies.

      Reviewer #3 (Recommendations For The Authors):

      (1) Specify which part of CA3 you are recording from.

      We have added this information into our results section - we recorded from 20 cells in CA3a, 274 cells in CA3b and 54 cells in CA3c. This information can now be found in the text on lines 68-69.

      (2) Comment on why you might observe a larger fraction of athorny cells than Hunt et al.

      Hunt et al. cite a broad range for the fraction of athorny cells in their discussion (10-20%). It is unclear where these estimates originate from. In their study, Hunt et al. use the bursting and nonbursting phenotypes as proxies for athorny and thorny cells respectively, and report here numbers of 32 and 70 equating to 31% athorny and 69% thorny. This fraction of athorny cells is more or less in line with our own findings, albeit slightly lower (34% and 66%). However, we believe this difference falls within the range of experimental variability. One caveat is that our electrophysiological recordings likely represent a biased sample of cells. In particular, with multipatch recordings, placement of later electrodes is often restricted to the borders of the pyramidal layer so as not to disturb already patched cells. Thus, our recorded cells do not represent a fully random sample of CA3 pyramidal cells. We believe that, only once a reliable genetic marker for athorny cells has been established can the size of this cell population be properly estimated. Furthermore, the ratio of thorny and athorny cells varies along the proximal distal axis of the CA3 so differences in ratios seen between our study and Hunt et al. may arise from sampling differences along this axis.

      (3) In Figure 3, Aiii (the cell fractions) could also be represented as a vector of two squares stacked one on top of the other, then you could add multiplication signs between Ai, Aii and Aiii, and an equal sign between Aiii and Aiv.

      Thank you! We have implemented this very nice suggestion.

      (4) In Figure 4A, it would be helpful to display the strength of the connections similar to how it is done in Figure 3B.

      We thank the reviewer for this suggestion. We have now updated Fig 4A to include connection strengths.

    1. Reviewer #2 (Public review):

      Summary:

      The authors developed a computational pipeline named CHROMAS to track and analyze chromatophore dynamics, which provides a wide range of biological analysis tools without requiring the user to write code.

      Strengths:

      (1) CHROMAS is an integrated toolbox that provides tools for different biological tasks such as: segment, classify, track and measure individual chromatophores, cluster small groups of chromatophores, analyze full-body patterns, etc.

      (2) It could be used to investigate different species. The authors have already applied it to analyze the skin of the bobtail squid Euprymna berryi and the European cuttlefish Sepia officinalis.

      (3) The tool is open-source and easy to install. The paper describes in detail the command format to complete each task and provides relevant sample figures.

      Weaknesses:

      (1) The generality and robustness of the proposed pipeline need to be verified through more experimental evaluations. For example, the implementation algorithm depends on relatively specific or obvious image features, clean backgrounds, and objects that do not move too fast.

      (2) The pipeline lacks some kind of self-correction mechanism. If at one moment there is a conflicting match with the previous frames, how does the system automatically handle it to ensure that the tracking results are accurate over a long period of time?

    1. Code shape 也就是 z

      你问的这个 “code shape in DeepSDF”,就是指 DeepSDF 里那个传奇角色——形状编码(latent code, shape code),它是 DeepSDF 的灵魂存档点,是一个压缩后的“形状本体”,简称:

      形状 = 一个向量 code

      我们马上来深度拆解,搞清楚这个“code shape”究竟是啥、它怎么玩、怎么训练、怎么使用。


      🧠 一句话解释:

      Code shape 就是 DeepSDF 中用于表示一个三维形状的潜在向量(latent vector)z,整个形状的几何信息被压缩进一个固定维度的数字向量中。


      🧩 用函数结构来描述:

      DeepSDF 的核心函数是:

      [ f_\theta(x, z) \rightarrow \text{Signed Distance} ]

      • \( x \):是一个空间中的三维点坐标
      • \( z \):是当前形状的 latent code(你说的 code shape)
      • \( \theta \):是神经网络的参数(共享)

      对每一个不同的形状,就对应一个不同的 latent code \( z \)<br /> 整个网络就可以通过同一个函数 \( f_\theta \) + 不同的 \( z \),重建成千上万种不同形状。


      🧬 举例说明:

      想象你要用 DeepSDF 表示以下三种物体:

      1. 🐶 一只狗
      2. 🐱 一只猫
      3. 🛸 一个UFO

      你训练的时候会为每个物体学习一个 code:

      text z_狗 = [0.2, -0.8, 1.3, ...] z_猫 = [1.1, 0.5, -0.6, ...] z_UFO = [-0.3, 1.2, 0.0, ...]

      当你把一个点 \( x \) 和 \( z_狗 \) 一起输入网络时,网络就输出这个点在“狗的几何空间”里到表面的距离。


      🧠 所以 code shape 的作用是:

      | 功能 | 描述 | |------|------| | 🧬 编码几何 | 用向量表示复杂的形状本身 | | 🎨 形状生成 | 给一个 code,就能生成完整形状 | | 🔄 形状插值 | 两个 code 插值,可以生成中间形态(如猫狗合体) | | 🧠 作为控制参数 | 可以用作形状的风格控制、编辑、变换条件 |


      🧪 Code shape 是怎么训练出来的?

      DeepSDF 训练时采用如下策略:

      • 每个训练形状都分配一个独立的 latent vector \( z_i \),当作“需要学习的参数”
      • 所有 \( z_i \) 和网络参数 \( \theta \) 一起通过反向传播训练
      • 损失函数是预测的 SDF 和真实 SDF 的差异 + 对 \( z_i \) 的正则化项(避免无限膨胀)

      损失函数: [ \mathcal{L} = \sum_{(x_i, s_i)} |f_\theta(x_i, z_j) - s_i| + \lambda \|z_j\|^2 ]

      这样,每个形状都学到了一个自己的 “身份code” —— 这就是你说的 code shape


      🔮 训练 vs 测试区别

      | 阶段 | code z 从哪来? | 网络怎么用 z? | |------|------------------|----------------| | 🏋️‍♂️ 训练阶段 | 每个形状都有自己的可训练的 z | 网络同时学 z 和参数 θ | | 🧪 测试阶段 | 如果没有 encoder,可以用优化方法反推出 z(Gradient Descent) | 给网络 z,它就能生成对应形状 |


      🧙‍♂️ 神比喻:

      Code shape 就像是每个形状的“DNA向量”<br /> 网络就像一个生物工厂,只要你提供 DNA,它就能“打印”出这个物体的几何结构


      🔧 延伸玩法

      如果你用一个编码器网络 \( E(p) \to z \)(比如从点云中提取),你就可以把整个 DeepSDF 变成一个完整的“输入点云 → 生成 SDF → 重建模型”的自动建模流程!这就是很多 Generalizable SDF 网络(如 OccupancyNet, MetaSDF) 的升级版本。


      要不要我来画一张 “Code Shape 解码形状”的流程图,或者一个“多个 latent code 插值成新形状”的演示例子?😏

  4. Apr 2025
    1. Allow you to save files from apps to any folder in drive

      That is quite something

      How would the code look like

      I am using "STORE_APP_DATA" permission so I can pass any path?

      Will try it straight away

    1. opam switch create cs3110-2025sp ocaml-base-compiler.5.2.0

      I'm on MacOS x86_64 and the following error occurs:

      ```

      === ERROR while compiling ocaml-base-compiler.5.2.0 ==========================

      context 2.3.0 | macos/x86_64 | | https://opam.ocaml.org#8106bcf57fc33218275bbf0d02e725f4e9fcd0f6

      path ~/.opam/cs3110-2025sp/.opam-switch/build/ocaml-base-compiler.5.2.0

      command ~/.opam/opam-init/hooks/sandbox.sh build make -j7

      exit-code 2

      env-file ~/.opam/log/ocaml-base-compiler-68106-48fec6.env

      output-file ~/.opam/log/ocaml-base-compiler-68106-48fec6.out

      output

      /Library/Developer/CommandLineTools/usr/bin/make coldstart

      [...]

      CC runtime/fail_byt.b.o

      CC runtime/fix_code.b.o

      CC runtime/interp.b.o

      CC runtime/startup_byt.b.o

      CC runtime/zstd.b.o

      MKLIB runtime/libcamlrun.a

      MKEXE runtime/ocamlrun

      ld: archive member '/' not a mach-o file in 'runtime/libcamlrun.a'

      clang: error: linker command failed with exit code 1 (use -v to see invocation)

      make[1]: *** [runtime/ocamlrun] Error 1

      make: *** [world.opt] Error 2

      ```

      I solved this by uninstalling GNU binutils with brew following this stackoverflow question.

    1. AbstractBackground Current microbial sequencing relies on short-read platforms like Illumina and DNBSEQ, favored for their low cost and high accuracy. However, these methods often produce fragmented draft genomes, hindering comprehensive bacterial function analysis. CycloneSEQ, a novel long-read sequencing platform developed by BGI-Research, its sequencing performance and assembly improvements has been evaluated.Findings Using CycloneSEQ long-read sequencing, the type strain produced long reads with an average length of 11.6 kbp and an average quality score of 14.4. After hybrid assembly with short reads data, the assembled genome exhibited an error rate of only 0.04 mismatches and 0.08 indels per 100 kbp compared to the reference genome. This method was validated across 9 diverse species, successfully assembling complete circular genomes. Hybrid assembly significantly enhances genome completeness by using long reads to fill gaps and accurately assemble multi-copy rRNA genes, which unable be achieved by short reads solely. Through data subsampling, we found that over 500 Mbp of short-read data combined with 100 Mbp of long-read data can result in a high-quality circular assembly. Additionally, using CycloneSEQ long reads effectively improves the assembly of circular complete genomes from mixed microbial communities.Conclusions CycloneSEQ’s read length is sufficient for circular bacterial genomes, but its base quality needs improvement. Integrating DNBSEQ short reads improved accuracy, resulting in complete and accurate assemblies. This efficient approach can be widely applied in microbial sequencing.

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.154), and has published the reviews under the same license.

      Reviewer 1. Ryan Wick

      This manuscript introduces CycloneSEQ data as a means for producing complete bacterial genome assemblies, with a focus on hybrid assemblies made using a combination of CycloneSEQ data and DNBSEQ data. It also publicly provides deep CycloneSEQ+DNBSEQ read sets for a range of bacterial species.

      Major comments

      The reads for the project were made publicly available via CNGBdb (https://db.cngb.org/search/project/CNP0006129), but I found it to be unusably slow (both the HTTP website and the FTP data downloads). To ensure the data is accessible to a wide audience, I request that it also be hosted in another location to make it available to readers. For example, SRA, ENA or GigaDB.

      The paper makes no mention of the other major long-read platforms: Oxford Nanopore Technologies and Pacific Biosciences. Given the widespread use of these platforms (especially ONT) in bacterial genome assembly, some discussion on CycloneSEQ’s relative advantages or limitations would be beneficial.

      Minor comments

      Lines 100-103: this sentence (‘The GC content was sensitively affected…’) is not clear to me. How are the completeness and accuracy of the assembly affecting GC content?

      Figure S2 unnecessarily includes reference-vs-reference difference counts, which are by definition zero.

      Figure S2 could mention the genome (Akkermansia muciniphila ATCC BAA-835) in the caption – I did not immediately understand what 'for type strain' meant.

      I found Figure 5 difficult to read, with its use of colour to indicate accuracy. This data would be better shown using another visualisation (e.g. bar plot) that more clearly shows quantitative values.

      For the mixed microbial community analysis, it should be stated that Unicycler is exclusively designed for bacterial isolates (its documentation explicitly says to not use it on metagenomes).

      Some of the supplementary figures are erroneously labelled 'Supplementary Table'.

      Some stats on the metagenomic reads would be helpful: e.g. total bp for short and long reads, N50 for long reads, etc.

      The methods describe using seqtk, but the reference for this (#25) is SeqKit (a different tool), so either the tool in the methods or the reference is wrong. Re-review: Thank you for the revisions to the manuscript. While many of my minor comments have been addressed, I still have concerns regarding my major comments, which have not been fully resolved.

      First, I appreciate that the data has now been made available on NCBI. However, the long-read datasets are labelled as Oxford Nanopore MinION data, which is misleading (example: SRR31850034). I understand this may be because SRA does not yet provide CycloneSEQ as a platform option, but this can be clarified through additional metadata. Specifically, the ‘design’ field for each SRA entry simply says ‘genome’, but it could have more detail, including that these are CycloneSEQ reads. The BioProject (PRJNA1194773) description could also include a clear statement that the long-read data is generated using CycloneSEQ.

      Second, I had requested a brief discussion of existing long-read platforms (ONT and PacBio) to provide context on where CycloneSEQ fits into the broader sequencing landscape. The authors have chosen not to include this, stating that they do not have direct comparison data. While I understand that such a comparison is not the purpose of this paper, I still believe that some mention of these platforms is necessary in the Background and/or Discussion sections. This paper introduces a new long-read technology for bacterial genome assembly, and readers will naturally want to understand how it relates to widely used alternatives.

      Finally, regarding my comment about supplementary figure labels, I still see the issue in the revised version provided for review. For example, the caption for Supplementary Figure S3 begins with ‘Supplementary Table S3.’ The authors stated that there were no errors, but this mislabelling remains in the PDF I received.

      As these concerns remain unresolved, I do not consider the manuscript acceptable in its current form.

      Reviewer 2. Keith Robison

      As Open Source Software are there guidelines on how to contribute, report issues or seek support on the code?

      N/A - no software presented (relates to other software questions)

      Additional comments: This is a useful presentation of an emerging sequencing platform.

      Given the complex nature of nanopore signals and the difficulty of decoding them, it has been a pattern with the prior nanopore platform that improvements in basecalling software have yielded significant changes in basecalling performance. Therefore, it would be highly advantageous if the manuscript listed which specific versions / revision numbers of the basecalling software were used so that these results are properly contextualized for comparison to future results which may use newer basecalling software.

      Ideally, the publication would include a link to git (or similar) repository with the complete pipeline used to generate the results

    1. Reviewer #3 (Public review):

      Summary:

      The authors follow up on prior studies where they have argued for the existence of cold nociception in Drosophila larvae. In the proposed pathway, mechanosensitive Class III multidendritic neurons are the noxious cold responding sensory cells. The current study attempts to explore the potential roles of second and third order neurons, based on information of the Class III neuron synaptic outputs that has been obtained from the larval connectome.

      Strengths:

      The major strength of the manuscript is the detailed discussion of the second and third order neurons that are downstream of the mechanosensory Class III multidendritic neurons. These will be useful in further studies of gentle touch mechanosensation and mechanonociception both of which rely on sensory input from these cells. Calcium imaging experiments on Class III activation with optogenetics support the wiring diagram.

      Weaknesses:

      The scientific premise is that a full body contraction in larvae that are exposed to noxious cold is a sensorimotor behavioral pathway. This premise is, to start with, questionable. A common definition of behavior is a set of "orderly movements with recognizable and repeatable patterns of activity produced by members of a species (Baker et al., 2001)." In the case of nociception behaviors, the patterns of movement are typically thought to play a protective role and to protect from potential tissue damage.

      Does noxious cold elicit a set of orderly movements with a recognizable and repeatable pattern in larvae? Can the patterns of movement that are stimulated by noxious cold allow the larvae to escape harm? Based on the available evidence, the answer to both questions is seemingly no. In response to noxious cold stimulation many, if not all, of the muscles in the larva, simultaneously contract (Turner et al., 2016) and as a result the larva becomes stationary. In response to cold, the larva is literally "frozen" in place and it is incapable of moving away. This incapacitation by cold is the antithesis of what one might expect from a behavior that protects the animals from harm.

      An extensive literature has investigated the physiological responses of insects to cold (reviewed in Overgaard and MacMillan, 2017). In numerous studies of insects across many genera (excluding cold adapted insects such as snow flies), exposure to very cold temperatures quickly incapacitates the animal and induces a state that is known as a chill coma. During a chill coma the insect becomes immobilized by the cold exposure, but if the exposure to cold is very brief the insect can often be revived without apparent damage. Indeed, it is common practice for many laboratories that use adult Drosophila for studies of behavior to use a brief chilling on ice as a form of anesthesia because chilling is less disruptive to subsequent behaviors than the more commonly used carbon dioxide anesthesia. If flies were to perceive cold as a noxious nociceptive stimulus, then this "chill coma" procedure would likely be disruptive to behavioral studies, but is not. Furthermore, there is no evidence to suggest that larval sensation of "noxious cold" is aversive.

      The insect chill coma literature has investigated the effects of extreme cold on the physiology of nerves and muscle and the consensus view of the field is that the paralysis that results from cold is due to complex and combined action of direct effects of cold on muscle and on nerves (Overgaard and MacMillan, 2017). Electrophysiological measurements of muscles and neurons find that they are initially depolarized by cold, and after prolonged cold exposure they are unable to maintain potassium homeostasis and this eventually inhibits the firing of action potentials (Overgaard and MacMillan, 2017). The very small thermal capacitance of a Drosophila larva means that its entire neuromuscular system will be quickly exposed to the effect of cold in the behavioral assays under consideration here. It would seem impossible to disentangle the emergent properties of a complex combination of effects on physiology (including neuronal, glial, and muscle homeostasis) on any proposed sensorimotor transformation pathway.

      Nevertheless, the manuscript before us makes a courageous attempt at attempting this. A number of GAL4 drivers tested in the paper are found to affect parameters of contraction behavior (CT) in cold exposed larvae in silencing experiments. However, notably absent from all of the silencing experiments are measurements of larval mobility following cold exposure. Thus, it is not known from the study if these manipulations are truly protecting the larvae from paralysis following cold exposure, or if they are simply reducing the magnitude of the initial muscle contraction that occurs immediately following cold (ie reducing CT). The strongest effect of silencing occurs with the 19-12-GAL4 driver which targets Class III neurons (but is not completely specific to these cells).

      Optogenetic experiments for Class III neurons relying on the 19-12-GAL4 driver combined with a very strong optogenetic acuator (ChETA) show the CT behavior that was reported in prior studies. It should be noted that this actuator drives very strong activation, and other studies with milder optogenetic stimulation of Class III neurons have shown that these cells produce behavioral responses that resemble gentle touch responses (Tsubouchi et al 2012 and Yan et al 2013). As well, these neurons express mechanoreceptor ion channels such as NompC and Rpk that are required for gentle touch responses.

      A major weakness of the study is that none of the second or third order neurons (that are downstream of CIII neurons) are found to trigger the CT behavioral responses even when strongly activated with the ChETA actuator (Figure 2 Supplement 2). These findings raise major concerns for this and prior studies and it does not support the hypothesis that the CIII neurons drive the CT behaviors.

      Later experiments in the paper that investigate strong CIII activation (with ChETA) in combination with other second and third order neurons does support the idea activating those neurons can facilitate the body-wide muscle contractions. But many of the co-activated cells in question are either repeated in each abdominal neuromere or they project to cells that are found all along the ventral nerve cord, so it is therefore unsurprising that their activation would contribute to what appears to be a non-specific body-wide activation of muscles along the AP axis. As well, if these neurons are already downstream of the CIII neurons the logic of this co-activation approach is not particular clear. A more convincing experiment would be to silence the different classes of cells in the context of the optogenetic activation of CIII neurons to test for a block of the effects, a set of experiments that is notably absent from the study.

      The authors argument that the co-activation studies support "a population code" for cold nociception is a very optimistic interpretation of a brute force optogenetics approach that ultimately results in an enhancement of a relatively non-specific body-wide muscle convulsion.

      Comments on revisions:

      The resubmitted version of this manuscript suffers from the same weaknesses that were raised in the prior round of review. The authors claim that muscles have been removed from the electrophysiological preparations of prior studies is overstated. A small subset of muscles are removed during their recording procedures and this does not rule out the possibility that mechanical forces that are generated by the remaining muscles are being sensed by the mechanosensory neurons.

    1. A notable aspect of gambling-related influencer mar-keting is the role of “tipsters,” a distinct category of influ-encers who provide betting advice and predictions. Theseindividuals often establish themselves as authoritative fig-ures, perceived as both credible and trustworthy by CYP

      Tipsters are all questionable. I can input these same stats into an AI service and have the AI write a code for me to give me an edge. This is what happens, and it is easy enough for anyone to replicate. I push back against the idea of a "tipster", but this is just my opinion. Arbitrage, however...

    Annotators

  5. social-media-ethics-automation.github.io social-media-ethics-automation.github.io
    1. a special individual encryption process [i6] for each individual password. This way the database can only con

      I remember learning about the encryption process, and the difficulty levels of ciphers and decrypting code into plain text. I think it is interesting how people used to use Cesar ciphers and have moved to more difficult ciphers such as RSA's, which require two separate keys for encryption and decryption of the plain text.

    1. Dans la continuité des Critical Code Studies3, les ECIA ont l’ambition de lire le code des algorithmes dits d’intelligence artificielle avec les outils qui nous sont propres, soit les outils herméneutiques (au sens large) des sciences humaines, à partir d’un point de vue épistémologique qui s’ancre dans les savoirs de nos disciplines littéraires4. Cela nous permet d’analyser le sens de ce code pour comprendre quelles visions du monde, quelles valeurs et quelles implications éthiques il véhicule.

      Très prometteur !! ++

      • Definition and Nature of Design:

        • Design involves preparing executable plans and structuring form.

          "Prepare the plans for a work to be executed... plan the form and the structure of that thing." - Effective design separates elements to enable later composition.

          "Designing is fundamentally about taking things apart... in such a way that they can be put back together."

      • Misconceptions About Design:

        • Good design isn't mere documentation generated post-implementation.

          "Can we just write our program and then generate some documentation?... No, that's not a plan." - Bad past experiences with monolithic designs don't invalidate design itself.

          "Those were plans but those are not good plans... doesn't mean planning is bad."

      • Key Elements of Good Design:

        • Identify underlying problems rather than stated needs or symptoms.

          "Decompose those wants and needs into problems." - Differentiate known from unknown requirements, causes from symptoms.

          "Take apart causes and root causes from symptoms." - Address unstated, implicit requirements (e.g., maintainability, performance).

          "Unstated requirements... problems nobody wants in the future." - Separate concerns clearly: time/order, places/participants, information/mechanisms.

          "Separating apart when things happen... places and participants... information our systems manipulate."

      • Iterative and Decompositional Nature of Design:

        • Design involves continuous decomposition and recomposition, supporting iterative changes.

          "Good design process is iterative... breaking things down until they're nearly atomic."

      • Why Good Design Matters:

        • Design aids understanding, coordination, extension, reuse, and efficient testing.

          "Design helps you understand a system... coordinate... extension... reuse... design-driven testing." - Design leads to greater efficiency by simplifying iterative changes.

          "Easier to iterate a design than... an implementation."

      • Composition and Performance Analogies (Bartók and Coltrane):

        • Composers set constraints deliberately, solving self-created problems through design.

          "First thing composers do... they create their own problems to solve." - Bartók (composer) specifies details precisely; Coltrane (performer) dynamically applies studied compositional knowledge.

          "Bartók completely specified every note... Coltrane had melody and changes... providing constraints for the performer."

      • Improvisation vs. Planning in Software:

        • Improvisation isn’t random; it relies heavily on preparation and deep understanding.

          "Improvisation is not spontaneous emoting... application of knowledge and vocabulary." - Good developers, like good improvisers, combine studied preparation and dynamic execution.

          "Coltrane was working out this composition dynamically... resources behind it were things he had prepared."

      • Harmony as Critical Design Skill:

        • Harmony is about congruity and simultaneous fitting together.

          "Harmonic sensibility is of critical design skill... systems that fit together." - Bartók and Coltrane exemplify mastery and innovation in harmonic principles.

          "Masters of harmony... retained focus on what fits together."

      • Programming Languages as Instruments:

        • Programming languages/tools should resemble instruments: simple, focused, and designed for skilled users.

          "Languages and libraries like instruments... minimal yet sufficient." - Instruments are intentionally limited, enabling deep expertise through practice.

          "Piano can’t play in-between notes... saxophone one note at a time... instruments are minimal yet sufficient."

      • Problems of Excessive Complexity and Choice:

        • Too many options or excessive configurability leads to poor design, overwhelming users.

          "Having a lot of choices... opposite of enabling us to accomplish things... constraint is a driver of creativity." - Simplicity and constraint enhance usability and creativity.

          "Every module is a good idea... adding them together you end up with something you can't play."

      • Tools Should Empower Skilled Users, Not Beginners:

        • Software tools/instruments should not overly cater to beginners, sacrificing depth.

          "Instruments are made for people who can play them... beginners can't play yet." - Effort and practice are necessary for mastery; attempts to eliminate all effort weaken tool quality.

          "Effort is not a bad thing... neither learning nor teaching is effort-free."

      • Design as Decision-Making:

        • Effective design clearly communicates deliberate, beneficial decisions.

          "Design is about making decisions... conveying those decisions to the next person." - Failure to decide clearly (excess configurability) is a design failure.

          "If you make everything configurable you're failing to design."

      • Summary Recommendations:

        • Embrace decomposition and iterative refinement in design.
        • Develop harmonic sensibility for building coherent, maintainable systems.
        • Create simple, constrained, and purposeful software tools/instruments.
        • Prioritize deliberate decision-making to convey clarity and utility.
      • Final Takeaways:

        • "Design like Bartók" by applying meticulous compositional care across scales.
        • "Code like Coltrane" by dynamically applying studied design knowledge.
        • Seek simplicity and harmony, avoiding unnecessary complexity and choice.

          "Take things apart... design like Bartók... code like Coltrane... pursue harmony."

    1. Reviewer #2 (Public review):

      Summary:

      The study investigates the brain's functional connectivity (FC) dynamics across different timescales using simultaneous recordings of intracranial EEG/source-localized EEG and fMRI. The primary research goal was to determine which of three convergence/divergence scenarios is the most likely to occur.

      The results indicate that despite similar FC patterns found in different data modalities, the timepoints were not aligned, indicating spatial convergence but temporal divergence.

      The researchers also found that FC patterns in different frequencies do not overlap significantly, emphasizing the multi-frequency nature of brain connectivity. Such asynchronous activity across frequency bands supports the idea of multiple connectivity states that operate independently and are organized into a multiplex system.

      Strengths:

      The data supporting the authors' claims are convincing and come from simultaneous recordings of fMRI and iEEG/EEG, which has been recently developed and adapted.

      The analysis methods are solid and involved a novel approach to analyzing the co-occurrence of FC patterns across modalities (cross-modal recurrence plot, CRP) and robust statistics, including replication of the main results using multiple operationalizations of the functional connectome (e.g., amplitude, orthogonalized, and phase-based coupling).

      In addition, the authors provided a detailed interpretation of the results, placing them in the context of recent advances and understanding of the relationships between functional connectivity and cognitive states.

      The authors also did a control analysis and verified the effect of temporal window size or different functional connecvitity operationalizations. I also applaud their effort to make the analysis code open-sourced.

      Comments on revisions:

      The authors addressed all my concerns in the previous round of review.

    2. Author response:

      The following is the authors’ response to the previous reviews

      Public Reviews:

      Reviewer #1 (Public review):

      The paper proposes an interesting perspective on the spatio-temporal relationship between FC in fMRI and electrophysiology. The study found that while similar networks configurations are found in both modalities, there is a tendency for the networks to spatially converge more commonly at synchronous than asynchronous timepoints. However, my confidence in the findings and their interpretation is undermined by an incomplete justification for the expected outcomes for each of the proposed scenarios.

      As detailed below, the reviewer’s comment motivated us to conduct simulations to establish the relationship between the scenarios that we seek to adjudicate and the empirical outcomes.

      Main Concern

      Fig 1 makes sense to me conceptually, including the schematics of the trajectories, i.e.:

      - Scenario1. Temporally convergent, same trajectories through connectome state space

      - Scenario2. Temporally divergent, different trajectories through connectome state space

      However, based on my understanding (and apologies if I am mistaken), I am concerned that these scenarios do not necessarily translate into the schematic CRP plots shown in fig 2C, or the statements in the main text, i.e.:

      - For scenario1, "epochs of cross-modal spatial similarity should occur more frequently at on-diagonal (synchronous) than off-diagonal (asynchronous) entries, resulting in an on-/off-diagonal ratio larger than unity"

      - For scenario2, "epochs of spatial similarity could occur equally likely at on-diagonal and off-diagonal entries (ratio≈1)"

      Where do the authors get these statements and the schematics in fig2C from? They do not seem to be fully justified via previous literature, theory, or simulations?

      In particular, I am not convinced based on the evidence currently in the paper, that the ratio of off- to on-diagonal entries (and under what assumptions) is a definitive way to discriminate between scenarios 1 and 2.

      For example, what about the case where the same network configuration reoccurs in both modalities at multiple time points. It seems to me that you would get a CRP with entries occurring equally on the on-diagonal as on the off-diagonal, regardless of whether the dynamics are matched between the two modalities or not (i.e. regardless of scenario 1 or 2 being true).

      This thought experiment example might have a flaw in it, and the authors might ultimately be correct, but nonetheless a systematic justification needs to be provided for using the ratio of off- to on-diagonal entries to discriminate between scenario 1 and 2 (and under what assumptions it is valid).

      Thank you for raising this important point. In response, we have now included simulation results to complement our earlier authors’ response, which provided literature references and a theoretical explanation of the on-/off-diagonal ratio metric.

      In the absence of theory, the authors could use surrogate data for scenario 1 and 2. For example:

      a. For scenario 1, run the CRP using a single modality. E.g. feed in the EEG into the analysis as both modality 1 AND modality 2. This should provide at least one example of CRP under scenario 1 (although it does not ensure that all CRPs under this scenario will look like this, it is at least a useful sanity check).

      Note: This simulation was included in the previous round of author’s responses.

      b. For scenario 2, run the CRP using a single modality plus a shuffled version. E.g. feed in the EEG into the analysis as both modality 1 AND a temporally shuffled version of the EEG as modality 2. The temporal shuffling of the EEG could be done by simple splitting the data into blocks of say ~10s and then shuffling them into a new order. This should provide a version of the CRP under scenario 2 (although it does not ensure that all CRPs under this scenario will look like this, it is at least a useful sanity check)

      The authors have provided CRP plots for option a. It shows a CRP, as expected, consistent with scenario 1. This is a useful sanity check. However, as mentioned above, it does not ensure that all CRPs under this scenario will look like this.

      However, the authors have not shown a CRP as per option b. As such, there is an incomplete justification for the expected outcomes of the scenarios.

      Note that another option, which has not been carried out, is to use full simulations, with clearly specified assumptions, for scenario1 and 2. One way of doing this is to use a simplified (state-space) setup where you randomly simulate N spatially fixed networks that are independently switching on and off over time (i.e. "activation" is 0 or 1). Note that this would result in a N-dimensional connectome state space.

      Using this, you can simulate and compute the CRPs for the two scenarios:

      a. Scenario 1: where the simulated activation timecourses are set to be the same between both modalities

      b. Scenario 2: where the simulated activation timecourses are simulated separately for each of the modalities

      We followed the reviewer’s suggestion and have now included full simulations to address the concerns regarding the theory of the on-/off-diagonal ratio metric. As recommended, we defined a random quantized signal with N levels to represent the recurrent manifestation of N fixed connectome states. This setup was used to demonstrate the relationship between the two scenarios and the CRP observations used to adjudicate between the scenarios in our paper.

      The CRP matrices in Fig. S10 provide an example illustration of this simulation. In the case where the two state timeseries are identical, there are more co-occurrences of the same state (white entries) on the diagonal than off the diagonal (left subplot). This is in line with Scenario 1, where both spatial and temporal convergence are present. Conversely, in Scenario 2, where state time courses are shuffled, co-occurrences of the same states are more dispersed, and the diagonal prominence vanishes (right subplot). This difference illustrates how the CRP reflects the presence or absence of temporal alignment, dissociating scenarios 1 and 2.

      To quantitively validate this observation, we calculated the on-/off-diagonal ratio across simulations with varying N values. For Scenario 2 (shuffled version), the ratio consistently remained close to 1, indicating the absence of temporal synchronization. In contrast, Scenario 1 (non-shuffled version) produced significantly higher ratios, exceeding 1, confirming the metric's ability to capture meaningful synchrony. These results demonstrate that the simulations successfully replicate the expected relationship between the two scenarios and the CRPs, and validate the theoretical foundation of the ratio metric under the defined assumptions.

      Minor Concern

      Leakage correction. The paper states: "To mitigate this issue, we provide results from source-localized data both with and without leakage correction (supplementary and main text, respectively)." It is great that the authors provide both. However, given that FC in EEG is almost totally dominated by spatial leakage (see Hipp paper), the main results/figures for the scalp EEG should be done using spatial leakage corrected EEG data.

      Thank you. We agree that source leakage is an important consideration, which is why the current work investigated the intracranial EEG-fMRI data as a primary approach and subsequently added the scalp EEG-fMRI approach. While source leakage correction is essential for addressing spurious connectivity, it can also risk removing genuine functional connectivity that includes zero-lag relationships. We are reassured by the observation that the scalp data both without and with leakage correction confirmed the findings of the intracranial data, i.e., the presence of spatial and a lack of temporal cross-modal convergence. As such we do not believe that source leakage had a considerable impact on the specific question at hand.

      Reviewer #2 (Public review):

      Summary:

      The study investigates the brain's functional connectivity (FC) dynamics across different timescales using simultaneous recordings of intracranial EEG/source-localized EEG and fMRI. The primary research goal was to determine which of three convergence/divergence scenarios is the most likely to occur.

      The results indicate that despite similar FC patterns found in different data modalities, the timepoints were not aligned, indicating spatial convergence but temporal divergence.

      The researchers also found that FC patterns in different frequencies do not overlap significantly, emphasizing the multi-frequency nature of brain connectivity. Such asynchronous activity across frequency bands supports the idea of multiple connectivity states that operate independently and are organized into a multiplex system.

      Strengths:

      The data supporting the authors' claims are convincing and come from simultaneous recordings of fMRI and iEEG/EEG, which has been recently developed and adapted.

      The analysis methods are solid and involved a novel approach to analyzing the co-occurrence of FC patterns across modalities (cross-modal recurrence plot, CRP) and robust statistics, including replication of the main results using multiple operationalizations of the functional connectome (e.g., amplitude, orthogonalized, and phase-based coupling).

      In addition, the authors provided a detailed interpretation of the results, placing them in the context of recent advances and understanding of the relationships between functional connectivity and cognitive states.

      The authors also did a control analysis and verified the effect of temporal window size or different functional connecvitity operationalizations. I also applaud their effort to make the analysis code open-sourced.

      Recommendations for the authors:

      Reviewer #2 (Recommendations for the authors):

      The authors answer my concerns and they are resolved.

    1. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      This valuable study investigates how hearing impairment affects neural encoding of speech, in particular the encoding of hierarchical linguistic information. The current analysis provides incomplete evidence that hearing impairment affects speech processing at multiple levels, since the novel analysis based on HM-LSTM needs further justification. The advantage of this method should also be further explained. The study can also benefit from building a stronger link between neural and behavioral data.

      We sincerely thank the editors and reviewers for their detailed and constructive feedback.

      We have revised the manuscript to address all of the reviewers’ comments and suggestions. The primary strength of our methods lies in the use of the HM-LSTM model, which simultaneously captures linguistic information at multiple levels, ranging from phonemes to sentences. As such, this model can be applied to other questions regarding hierarchical linguistic processing. We acknowledge that our current behavioral results from the intelligibility test may not fully differentiate between the perception of lower-level acoustic/phonetic information and higher-level meaning comprehension. However, it remains unclear what type of behavioral test would effectively address this distinction. We aim to xplore this connection further in future studies.

      Public Reviews:

      Reviewer #1 (Public Review):

      The authors are attempting to use the internal workings of a language hierarchy model, comprising phonemes, syllables, words, phrases, and sentences, as regressors to predict EEG recorded during listening to speech. They also use standard acoustic features as regressors, such as the overall envelope and the envelopes in log-spaced frequency bands. This is valuable and timely research, including the attempt to show differences between normal-hearing and hearing-impaired people in these regards. I will start with a couple of broader questions/points, and then focus my comments on three aspects of this study: The HM-LSTM language model and its usage, the time windows of relevant EEG analysis, and the usage of ridge regression.

      Firstly, as far as I can tell, the OSF repository of code, data, and stimuli is not accessible without requesting access. This needs to be changed so that reviewers and anybody who wants or needs to can access these materials. 

      It is my understanding that keeping the repository private during the review process and making them public after acceptance is standard practice. As far as I understand, although the OSF repository was private, anyone with the link should be able to access it. I have now made the repository public.

      What is the quantification of model fit? Does it mean that you generate predicted EEG time series from deconvolved TRFs, and then give the R2 coefficient of determination between the actual EEG and predicted EEG constructed from the convolution of TRFs and regressors? Whether or not this is exactly right, it should be made more explicit.

      Model fit was measured by spatiotemporal cluster permutation tests (Maris & Oostenveld, 2007) on the contrasts of the timecourses of the z-transformed coefficient of determination (R<sup>2</sup>). For instance, to assess whether words from the attended stimuli better predict EEG signals during the mixed speech compared to words from the unattended stimuli, we used the 150dimensional vectors corresponding to the word layer from our LSTM model for the attended and unattended stimuli as regressors. We then fit these regressors to the EEG signals at 9 time points (spanning -100 ms to 300 ms around the sentence offsets, with 50 ms intervals). We then conducted one-tailed two-sample t-tests to determine whether the differences in the contrasts of the R<sup>2</sup> timecourses were statistically significant. Note that we did not perform TRF analyses. We have clarified this description in the “Spatiotemporal clustering analysis” section of the “Methods and Materials” on p.10 of the manuscript.

      About the HM-LSTM:

      • In the Methods paragraph about the HM-LSTM, a lot more detail is necessary to understand how you are using this model. Firstly, what do you mean that you "extended" it, and what was that procedure? 

      The original HM-LSTM model developed by Chung et al. (2017) consists of only two levels: the word level and the phrase level (Figure 1b from their paper). By “extending” the model, we mean that we expanded its architecture to include five levels: phoneme, syllable, word, phrase, and sentence. Since our input consists of phoneme embeddings, we cannot directly apply their model, so we trained our model on the WenetSpeech corpus (Zhang et al., 2021), which provides phoneme-level transcripts. We have added this clarification on p.4 of the manuscript.

      • And generally, this is the model that produces most of the "features", or regressors, whichever word we like, for the TRF deconvolution and EEG prediction, correct? 

      Yes, we extracted the 2048-dimensional hidden layer activity from the model to represent features for each sentence in our speech stimuli at the phoneme, syllable, word, phrase and sentence levels. But we did not perform any TRF deconvolution, we fit these features (downsampled to 150-dimension using PCA) to the EEG signals at 9 timepoints around the offset of each sentence using ridge regression. We have now added a multivariate TRF (mTRF) analysis following Reviewer 3’s suggestions, and the results showed similar patterns to the current results (see Figure S2). We have added the clarification in the “Ridge regression at different time latencies” section of the “Methods and Materials” on p.10 of the manuscript.

      Resutls from the mTRF analyses were added on p.7 of the manuscript.

      • A lot more detail is necessary then, about what form these regressors take, and some example plots of the regressors alongside the sentences.

      The linguistic regressors are just 5 150-dimensional vectors, each corresponding to one linguistic level, as shown in Figure 1B.

      • Generally, it is necessary to know what these regressors look like compared to other similar language-related TRF and EEG/MEG prediction studies. Usually, in the case of e.g. Lalor lab papers or Simon lab papers, these regressors take the form of single-sample event markers, surrounded by zeros elsewhere. For example, a phoneme regressor might have a sample up at the onset of each phoneme, and a word onset regressor might have a sample up at the onset of each word, with zeros elsewhere in the regressor. A phoneme surprisal regressor might have a sample up at each phoneme onset, with the value of that sample corresponding to the rarity of that phoneme in common speech. Etc. Are these regressors like that? Or do they code for these 5 linguistic levels in some other way? Either way, much more description and plotting is necessary in order to compare the results here to others in the literature.

      No, these regressors were not like that. They were 150-dimensional vectors (after PCA dimension reduction) extracted from the hidden layers of the HM-LSTM model. After training the model on the WenetSpeech corpus, we ran it on our speech stimuli and extracted representations from the five hidden layers to correspond to the five linguistic levels. As mentioned earlier, we did not perform TRF analyses; instead, we used ridge regression to predict EEG signals around the offset of each sentence, a method commonly employed in the literature (e.g., Caucheteux & King, 2022; Goldstein et al., 2022; Schmitt et al., 2021; Schrimpf et al., 2021). For instance, Goldstein et al. (2022) used word embeddings from GPT-2 to predict ECoG activity surrounding the onset of each word during naturalistic listening. We have included these literatures on p.3 in the manuscript, and the method is illustrated in Figure 1B.

      • You say that the 5 regressors that are taken from the trained model's hidden layers do not have much correlation with each other. However, the highest correlations are between syllable and sentence (0.22), and syllable and word (0.17). It is necessary to give some reason and interpretation of these numbers. One would think the highest correlation might be between syllable and phoneme, but this one is almost zero. Why would the syllable and sentence regressors have such a relatively high correlation with each other, and what form do those regressors take such that this is the case?

      All the regressors are represented as 2048-dimensional vectors derived from the hidden layers of the trained HM-LSTM model. We applied the trained model to all 284 sentences in our stimulus text, generating a set of 284 × 2048-dimensional vectors. Next, we performed Principal Component Analysis (PCA) on the 2048 dimensions and extracted the first 100 principal components (PCs), resulting in 284 × 100-dimensional vectors for each regressor. These 284 × 100 matrices were then flattened into 28,400-dimensional vectors. Subsequently, we computed the correlation matrix for the z-transformed 28,400-dimensional vectors of our five linguistic regressors. The code for this analysis, lstm_corr.py, can be found in our OSF repository. We have added a section “Correlation among linguistic features” in “Materials and Methods” on p.10 of the manuscript.

      We consider the observed coefficients of 0.17 and 0.22 to be relatively low compared to prior model-brain alignment studies which report correlation coefficients above 0.5 for linguistic regressors (e.g., Gao et al., 2024; Sugimoto et al., 2024). In Chinese, a single syllable can also function as a word, potentially leading to higher correlations between regressors for syllables and words. However, we refrained from overinterpreting the results to suggest a higher correlation between syllable and sentence compared to syllable and word. A paired ttest of the syllable-word coefficients versus syllable-sentence coefficients across the 284 sentences revealed no significant difference (t(28399)=-3.96, p=1). We have incorporated this information into p.5 of the manuscript.

      • If these regressors are something like the time series of zeros along with single sample event markers as described above, with the event marker samples indicating the onset of the relevant thing, then one would think e.g. the syllable regressor would be a subset of the phoneme regressor because the onset of every syllable is a phoneme. And the onset of every word is a syllable, etc.

      All the regressors are aligned to 9 time points surrounding sentence offsets (-100 ms to 300 ms with a 50 ms interval). This is because all our regressors are taken from the HM-LSTM model, where the input is the phoneme representation of a sentence (e.g., “zh ə_4 y ie_3 j iəu_4 x iaŋ_4 sh uei_3 y ii_2 y aŋ_4”). For each unit in the sentence, the model generates five 2048dimensional vectors, each corresponding to the five linguistic levels of the entire sentence. We have added the clarification on p.11 of the manuscript.

      For the time windows of analysis:

      • I am very confused, because sometimes the times are relative to "sentence onset", which would mean the beginning of sentences, and sometimes they are relative to "sentence offset", which would mean the end of sentences. It seems to vary which is mentioned. Did you use sentence onsets, offsets, or both, and what is the motivation?

      • If you used onsets, then the results at negative times would not seem to mean anything, because that would be during silence unless the stimulus sentences were all back to back with no gaps, which would also make that difficult to interpret.

      • If you used offsets, then the results at positive times would not seem to mean anything, because that would be during silence after the sentence is done. Unless you want to interpret those as important brain activity after the stimuli are done, in which case a detailed discussion of this is warranted.

      Thank you very much for pointing this out. All instances of “sentence onset” were typos and should be corrected to “sentence offset.” We chose offset because the regressors are derived from the hidden layer activity of our HM-LSTM model, which processes the entire sentence before generating outputs. We have now corrected all the typos. In continuous speech, there is no distinct silence period following sentence offsets. Additionally, lexical or phrasal processing typically occurs 200 ms after stimulus offsets (Bemis & Pylkkanen, 2011; Goldstein et al., 2022; Li et al., 2024; Li & Pylkkänen, 2021). Therefore, we included a 300 ms interval after sentence offsets in our analysis, as our regressors encompass linguistic levels up to the sentence level. We have added this motivation on p.11 of the manuscript.

      • For the plots in the figures where the time windows and their regression outcomes are shown, it needs to be explicitly stated every time whether those time windows are relative to sentence onset, offset, or something else.

      Completely agree and thank you very much for the suggestion. We have now added this information on Figure 4-6.

      • Whether the running correlations are relative to sentence onset or offset, the fact that you can have numbers outside of the time of the sentence (negative times for onset, or positive times for offset) is highly confusing. Why would the regressors have values outside of the sentence, meaning before or after the sentence/utterance? In order to get the running correlations, you presumably had the regressor convolved with the TRF/impulse response to get the predicted EEG first. In order to get running correlation values outside the sentence to correlate with the EEG, you would have to have regressor values at those time points, correct? How does this work?

      As mentioned earlier, we did not perform TRF analyses or convolve the regressors. Instead, we conducted regression analyses at each of the 9 time points surrounding the sentence offsets, following standard methods commonly used in model-brain alignment studies (e.g., Gao et al., 2024; Goldstein et al., 2022). The time window of -100 to 300 ms was selected based on prior findings that lexical and phrasal processing typically occurs 200–300 ms after word offsets (Bemis & Pylkkanen, 2011; Goldstein et al., 2022; Li et al., 2024; Li & Pylkkänen, 2021). Additionally, we included the -100 to 200 ms time period in our analysis to examine phoneme and syllable level processing (cf. Gwilliams et al., 2022). We have added the clarification on p. of the manuscript.

      • In general, it seems arbitrary to choose sentence onset or offset, especially if the comparison is the correlation between predicted and actual EEG over the course of a sentence, with each regressor. What is going on with these correlations during the middle of the sentences, for example? In ridge regression TRF techniques for EEG/MEG, the relevant measure is often the overall correlation between the predicted and actual, calculated over a longer period of time, maybe the entire experiment. Here, you have calculated a running comparison between predicted and actual, and thus the time windows you choose to actually analyze can seem highly cherry-picked, because this means that most of the data is not actually analyzed.

      The rationale for choosing sentence offsets instead of onsets is that we are aligning the HM-LSTM model’s activity with EEG responses, and the input to the model consists of phoneme representations of the entire sentence at one time. In other words, the model needs to process the whole sentence before generating representations at each linguistic level. Therefore, the corresponding EEG responses should also align with the sentence offsets, occurring after participants have seen the complete sentence. The ridge regression followed the common practice in model-brain alignment studies (e.g., Gao et al., 2024; Goldstein et al., 2022; Huth et al., 2016; Schmitt et al., 2021; Schrimpf et al., 2021), and the time window is not cherrypicked but based on prior literature reporting lexical and sublexical processing at these time period (e.g., Bemis & Pylkkanen, 2011; Goldstein et al., 2022; Gwilliams et al., 2022; Li et al., 2024; Li & Pylkkänen, 2021).

      • In figures 5 and 6, some of the time window portions that are highlighted as significant between the two lines have the lines intersecting. This looks like, even though you have found that the two lines are significantly different during that period of time, the difference between those lines is not of a constant sign, even during that short period. For instance, in figure 5, for the syllable feature, the period of 0 - 200 ms is significantly different between the two populations, correct? But between 0 and 50, normal-hearing are higher, between 50 and 150, hearing-impaired are higher, and between 150 and 200, normal-hearing are higher again, correct? But somehow they still end up significantly different overall between 0 and 200 ms. More explanation of occurrences like these is needed.

      The intersecting lines in Figures 5 and represent the significant time windows for withingroup comparisons (i.e., significant model fit compared to 0). They do not depict betweengroup comparisons, as no significant contrasts were found between the groups. For example, in Figure 1, the significant time windows for the acoustic models are shown separately for the hearing-impaired and normal-hearing groups. No significant differences were observed, as indicated by the sensor topography. We have now clarified this point in the captions for Figures 5 and 6.

      Using ridge regression:

      • What software package(s) and procedure(s) were specifically done to accomplish this? If this is ridge regression and not just ordinary least squares, then there was at least one non-zero regularization parameter in the process. What was it, how did it figure in the modeling and analysis, etc.?

      The ridge regression was performed using customary python codes, making heavy use of the sklearn (v1.12.0) package. We used ridge regression instead of ordinary least squares regression because all our linguistic regressors are 150-dimensional dense vectors, and our acoustic regressors are 130-dimension vectors (see “Acoustic features of the speech stimuli” in “Materials and Methods”). We kept the default regularization parameter (i.e., 1). This ridge regression methods is commonly used in model-brain alignment studies, where the regressors are high-dimensional vectors taken from language models (e.g., Gao et al., 2024; Goldstein et al., 2022; Huth et al., 2016; Schmitt et al., 2021; Schrimpf et al., 2021). The code ridge_lstm.py can be found in our OSF repository, and we have added the more detailed description on p.11 of the manuscript.

      • It sounds like the regressors are the hidden layer activations, which you reduced from 2,048 to 150 non-acoustic, or linguistic, regressors, per linguistic level, correct? So you have 150 regressors, for each of 5 linguistic levels. These regressors collectively contribute to the deconvolution and EEG prediction from the resulting TRFs, correct? This sounds like a lot of overfitting. How much correlation is there from one of these 150 regressors to the next? Elsewhere, it sounds like you end up with only one regressor for each of the 5 linguistic levels. So these aspects need to be clarified.

      • For these regressors, you are comparing the "regression outcomes" for different conditions; "regression outcomes" are the R2 between predicted and actual EEG, which is the coefficient of determination, correct? If this is R2, how is it that you have some negative numbers in some of the plots? R2 should be only positive, between 0 and 1.

      Yes we reduced 2048-dimensional vectors for each of the 5 linguistic levels to 150 using PCA, mainly for saving computational resources. We used ridge regression, following the standard practice in the field (e.g., Gao et al., 2024; Goldstein et al., 2022; Huth et al., 2016; Schmitt et al., 2021; Schrimpf et al., 2021). 

      Yes, the regression outcomes are the R<sup>2</sup> values representing the fit between the predicted and actual EEG data. However, we reported normalized R<sup>2</sup> values which are ztransformed in the plots. All our spatiotemporal cluster permutation analyses were conducted using the z-transformed R<sup>2</sup> values. We have added this clarification both in the figure captions and on p.11 of the manuscript. As a side note, R<sup>2</sup> values can be negative because they are not the square of a correlation coefficient. Rather, R<sup>2</sup> compares the fit of the chosen model to that of a horizontal straight line (the null hypothesis). If the chosen model fits the data worse than the horizontal line, then R<sup>2</sup> value becomes negative: https://www.graphpad.com/support/faq/how-can-rsup2sup-be-negative 

      Reviewer #2 (Public Review):

      This study compares neural responses to speech in normal-hearing and hearing-impaired listeners, investigating how different levels of the linguistic hierarchy are impacted across the two cohorts, both in a single-talker and multi-talker listening scenario. It finds that, while normal-hearing listeners have a comparable cortical encoding of speech-in-quiet and attended speech from a multi-talker mixture, participants with hearing impairment instead show a reduced cortical encoding of speech when it is presented in a competing listening scenario. When looking across the different levels of the speech processing hierarchy in the multi-talker condition, normal-hearing participants show a greater cortical encoding of the attended compared to the unattended stream in all speech processing layers - from acoustics to sentencelevel information. Hearing-impaired listeners, on the other hand, only have increased cortical responses to the attended stream for the word and phrase levels, while all other levels do not differ between attended and unattended streams.

      The methods for modelling the hierarchy of speech features (HM-LSTM) and the relationship between brain responses and specific speech features (ridge-regression) are appropriate for the research question, with some caveats on the experimental procedure. This work offers an interesting insight into the neural encoding of multi-talker speech in listeners with hearing impairment, and it represents a useful contribution towards understanding speech perception in cocktail-party scenarios across different hearing abilities. While the conclusions are overall supported by the data, there are limitations and certain aspects that require further clarification.

      (1) In the multi-talker section of the experiment, participants were instructed to selectively attend to the male or the female talker, and to rate the intelligibility, but they did not have to perform any behavioural task (e.g., comprehension questions, word detection or repetition), which could have demonstrated at least an attempt to comply with the task instructions. As such, it is difficult to determine whether the lack of increased cortical encoding of Attended vs. Unattended speech across many speech features in hearing-impaired listeners is due to a different attentional strategy, which might be more oriented at "getting the gist" of the story (as the increased tracking of only word and phrase levels might suggest), or instead it is due to hearing-impaired listeners completely disengaging from the task and tuning back in for selected key-words or word combinations. Especially the lack of Attended vs. Unattended cortical benefit at the level of acoustics is puzzling and might indicate difficulties in performing the task. I think this caveat is important and should be highlighted in the Discussion section. RE: Thank you very much for the suggestion. We admit that the hearing-impaired listeners might adopt different attentional strategies or potentially disengage from the task due to comprehension difficulties. However, we would like to emphasize that our hearing-impaired participants have extended high-frequency (EHF) hearing loss, with impairment only at frequencies above 8 kHz. Their condition is likely not severe enough to cause them to adopt a markedly different attentional strategy for this task. Moreover, it is possible that our normalhearing listeners may also adopt varying attentional strategies, yet the comparison still revealed notable differences.We have added the caveat in the Discussion section on p.8 of the manuscript.

      (2) In the EEG recording and preprocessing section, you state that the EEG was filtered between 0.1Hz and 45Hz. Why did you choose this very broadband frequency range? In the literature, speech responses are robustly identified between 0.5Hz/1Hz and 8Hz. Would these results emerge using a narrower and lower frequency band? Considering the goal of your study, it might also be interesting to run your analysis pipeline on conventional frequency bands, such as Delta and Theta, since you are looking into the processing of information at different temporal scales.

      Indeed, we have decomposed the epoched EEG time series for each section into six classic frequency bands components (delta 1–3 Hz, theta 4–7 Hz, alpha 8–12 Hz, beta 12–20 Hz, gamma 30–45 Hz) by convolving the data with complex Morlet wavelets as implemented in MNE-Python (version 0.24.0). The number of cycles in the Morlet wavelets was set to frequency/4 for each frequency bin. The power values for each time point and frequency bin were obtained by taking the square root of the resulting time-frequency coefficients. These power values were normalized to reflect relative changes (expressed in dB) with respect to the 500 ms pre-stimulus baseline. This yielded a power value for each time point and frequency bin for each section. We specifically examined the delta and theta bands, and computed the correlation between the regression outcome (R<sup>2</sup> in the shape of number of subject * sensor * time were flattened for computing correlation) for the five linguistic predictors from these bands and those obtained using data from all frequency bands. The results showed high correlation coefficients (see the correlation matrix in Supplementary Figures S2 for the attended and unattended speech). Therefore, we opted to use the epoched EEG data from all frequency bands for our analyses. We have added this clarification in the Results section on p.5 and the “EEG recording and preprocessing” section in “Materials and Methods” on p.11 of the manuscript.

      (3) A paragraph with more information on the HM-LSTM would be useful to understand the model used without relying on the Chung et al. (2017) paper. In particular, I think the updating mechanism of the model should be clarified. It would also be interesting to modify the updating factor of the model, along the lines of Schmitt et al. (2021), to assess whether a HM-LSTM with faster or slower updates can better describe the neural activity of hearing-impaired listeners. That is, perhaps the difference between hearing-impaired and normal-hearing participants lies in the temporal dynamics, and not necessarily in a completely different attentional strategy (or disengagement from the stimuli, as I mentioned above).

      Thank you for the suggestion. We have added more details on our HM-LSTM model on p.10 “Hierarchical multiscale LSTM model” in “Materials and Methods”: Our HM-LSTM model consists of 4 layers, at each layer, the model implements a COPY or UPDATE operation at each time step t. The COPY operation maintains the current cell state of without any changes until it receives a summarized input from the lower layer. The UPDATE operation occurs when a linguistic boundary is detected in the layer below, but no boundary was detected at the previous time step t-1. In this case, the cell updates its summary representation, similar to standard RNNs. We agree that exploring modifications to the model’s updating factor would be an interesting direction. However, since we have already observed contrasts between normal-hearing and hearing-impaired listeners using the current model’s update parameters, we believe discussing additional hypotheses would overextend the scope of this paper.

      (4) When explaining how you extracted phoneme information, you mention that "the inputs to the model were the vector representations of the phonemes". It is not clear to me whether you extracted specific phonetic features (e.g., "p" sound vs. "b" sound), or simply the phoneme onsets. Could you clarify this point in the text, please?

      The model inputs were individual phonemes from two sentences, each transformed into a 1024-dimensional vector using a simple lookup table. This lookup table stores embeddings for a fixed dictionary of all unique phonemes in Chinese. This approach is a foundational technique in many advanced NLP models, enabling the representation of discrete input symbols in a continuous vector space. We have added this clarification on p.10 of the manuscript.

      Reviewer #3 (Public Review):

      Summary:

      The authors aimed to investigate how the brain processes different linguistic units (from phonemes to sentences) in challenging listening conditions, such as multi-talker environments, and how this processing differs between individuals with normal hearing and those with hearing impairments. Using a hierarchical language model and EEG data, they sought to understand the neural underpinnings of speech comprehension at various temporal scales and identify specific challenges that hearing-impaired listeners face in noisy settings.

      Strengths:

      Overall, the combination of computational modeling, detailed EEG analysis, and comprehensive experimental design thoroughly investigates the neural mechanisms underlying speech comprehension in complex auditory environments.

      The use of a hierarchical language model (HM-LSTM) offers a data-driven approach to dissect and analyze linguistic information at multiple temporal scales (phoneme, syllable, word, phrase, and sentence). This model allows for a comprehensive neural encoding examination of how different levels of linguistic processing are represented in the brain.

      The study includes both single-talker and multi-talker conditions, as well as participants with normal hearing and those with hearing impairments. This design provides a robust framework for comparing neural processing across different listening scenarios and groups.

      Weaknesses:

      The analyses heavily rely on one specific computational model, which limits the robustness of the findings. The use of a single DNN-based hierarchical model to represent linguistic information, while innovative, may not capture the full range of neural coding present in different populations. A low-accuracy regression model-fit does not necessarily indicate the absence of neural coding for a specific type of information. The DNN model represents information in a manner constrained by its architecture and training objectives, which might fit one population better than another without proving the non-existence of such information in the other group. To address this limitation, the authors should consider evaluating alternative models and methods. For example, directly using spectrograms, discrete phoneme/syllable/word coding as features, and performing feature-based temporal response function (TRF) analysis could serve as valuable baseline models. This approach would provide a more comprehensive evaluation of the neural encoding of linguistic information.

      Our acoustic features are indeed direct the broadband envelopes and the log-mel spectrograms of the speech streams. The amplitude envelope of the speech signal was extracted using the Hilbert transform. The 129-dimension spectrogram and 1-dimension envelope were concatenated to form a 130-dimension acoustic feature at every 10 ms of the speech stimuli. Given the duration of our EEG recordings, which span over 10 minutes, conducting multivariate TRF (mTRF) analysis with such high-dimensional predictors was not feasible. Instead, we used ridge regression to predict EEG responses across 9 temporal latencies, ranging from -100 ms to +300 ms, with additional 50 ms latencies surrounding sentence offsets. To evaluate the model's performance, we extracted the R<sup>2</sup> values at each latency, providing a temporal profile of regression performance over the analyzed time period. This approach is conceptually similar to TRF analysis.

      We agree that including baseline models for the linguistic features is important, and we have now added results from mTRF analysis using phoneme, syllable, word, phrase, and sentence rates as discrete predictors (i.e., marking a value of 1 at each unit boundary offset). Our EEG data spans the entire 10-minute duration for each condition, sampled at 10-ms intervals. The TRF results for our main comparison—attended versus unattended conditions— showed similar patterns to those observed using features from our HM-LSTM model. At the phoneme and syllable levels, normal-hearing listeners showed marginally significantly higher TRF weights for attended speech compared to unattended speech at approximately -80 to 150 ms after phoneme offsets (t=2.75, Cohen’s d=0.87, p=0.057), and 120 to 210 ms after syllable offsets (t=3.96, Cohen’s d=0.73d = 0.73, p=0.083). At the word and phrase levels, normalhearing listeners exhibited significantly higher TRF weights for attended speech compared to unattended speech at 190 to 290 ms after word offsets (t=4, Cohen’s d=1.13, p=0.049), and around 120 to 290 ms after phrase offsets (t=5.27, Cohen’s d=1.09, p=0.045). For hearing-impaired listeners, marginally significant effects were observed at 190 to 290 ms after word offsets (t=1.54, Cohen’s d=0.6, p=0.059), and 180 to 290 ms after phrase offsets (t=3.63, Cohen’s d=0.89, p=0.09). These results have been added on p.7 of the manuscript, and the corresponding figure is included as Supplementary F2.

      It is not entirely clear if the DNN model used in this study effectively serves the authors' goal of capturing different linguistic information at various layers. Specifically, the results presented in Figure 3C are somewhat confusing. While the phonemes are labeled, the syllables, words, phrases, and sentences are not, making it difficult to interpret how the model distinguishes between these levels of linguistic information. The claim that "Hidden-layer activity for samevowel sentences exhibited much more similar distributions at the phoneme and syllable levels compared to those at the word, phrase and sentence levels" is not convincingly supported by the provided visualizations. To strengthen their argument, the authors should use more quantified metrics to demonstrate that the model indeed captures phrase, word, syllable, and phoneme information at different layers. This is a crucial prerequisite for the subsequent analyses and claims about the hierarchical processing of linguistic information in the brain.

      Quantitative measures such as mutual information, clustering metrics, or decoding accuracy for each linguistic level could provide clearer evidence of the model's effectiveness in this regard.

      In Figure 3C, we used color-coding to represent the activity of five hidden layers after dimensionality reduction. Each dot on the plot corresponds to one test sentence. Only phonemes are labeled because each syllable in our test sentences contains the same vowels (see Table S1). The results demonstrate that the phoneme layer effectively distinguishes different phonemes, while the higher linguistic layers do not. We believe these findings provide evidence that different layers capture distinct linguistic information. Additionally, we computed the correlation coefficients between each pair of linguistic predictors, as shown in Figure 3B. We think this analysis serves a similar purpose to computing the mutual information between pairs of hidden-layer activities for our constructed sentences. Furthermore, the mTRF results based on rate models of the linguistic features we presented earlier align closely with the regression results using the hidden-layer activity from our HM-LSTM model. This further supports the conclusion that our model successfully captures relevant information across these linguistic levels. We have added the clarification on p.5 of the manuscript.

      The formulation of the regression analysis is somewhat unclear. The choice of sentence offsets as the anchor point for the temporal analysis, and the focus on the [-100ms, +300ms] interval, needs further justification. Since EEG measures underlying neural activity in near real-time, it is expected that lower-level acoustic information, which is relatively transient, such as phonemes and syllables, would be distributed throughout the time course of the entire sentence. It is not evident if this limited time window effectively captures the neural responses to the entire sentence, especially for lower-level linguistic features. A more comprehensive analysis covering the entire time course of the sentence, or at least a longer temporal window, would provide a clearer understanding of how different linguistic units are processed over time. Additionally, explaining the rationale behind choosing this specific time window and how it aligns with the temporal dynamics of speech processing would enhance the clarity and validity of the regression analysis.

      Thank you for pointing this out. We chose this time window as lexical or phrasal processing typically occurs 200 ms after stimulus offsets (Bemis & Pylkkanen, 2011; Goldstein et al., 2022; Li et al., 2024; Li & Pylkkänen, 2021). Additionally, we included the -100 to 200 ms time period in our analysis to examine phoneme and syllable level processing (e.g., Gwilliams et al., 2022). Using the entire sentence duration was not feasible, as the sentences in the stimuli vary in length, making statistical analysis challenging. Additionally, since the stimuli consist of continuous speech, extending the time window would risk including linguistic units from subsequent sentences. This would introduce ambiguity as to whether the EEG responses correspond to the current or the following sentence. We have added this clarification on p.12 of the manuscript.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      As I mentioned, I think the OSF repo needs to be changed to give anyone access. I would recommend pursuing the lines of thought I mentioned in the public review to make this study complete and to allow it to fit into the already existing literature to facilitate comparisons.

      Yes the OSF folder is now public. We have made revisions following all reviewers’ suggestions.

      There are some typos in figure labels, e.g. 2B.

      Thank you for pointing it out! We have now revised the typo in Figure 2B.

      Reviewer #2 (Recommendations For The Authors):

      (1) I was able to access all of the audio files and code for the study, but no EEG data was shared in the OSF repository. Unless there is some ethical and/or legal constraint, my understanding of eLife's policy is that the neural data should be made publicly available as well.

      The preprocessed EEG data in .npy format in the OSF repository. 

      (2) The line-plots in Figures 4B,5B, and 6B have very similar colours. They would be easier to interpret if you changed the line appearance as well as the colours. E.g., dotted line for hearingimpaired listeners, thick line for normal-hearing.

      Thank you for the suggestion! We have now used thicker lines for normal-impaired listeners in all our line plots.

      Reviewer #3 (Recommendations For The Authors):

      (1) The authors may consider presenting raw event-related potentials (ERPs) or spatiotemporal response profiles before delving into the more complex regression encoding analysis. This would provide a clearer foundational understanding of the neural activity patterns. For example, it is not clear if the main claims, such as the neural activity in the normal-hearing group encoding phonetic information in attended speech better than in unattended speech, are directly observable. Showing ERP differences or spatiotemporal response pattern differences could support these claims more straightforwardly. Additionally, training pattern classifiers to test if different levels of information can be decoded from EEG activity in specific groups could provide further validation of the findings.

      We have now included results from more traditional mTRF analyses using phoneme, syllable, word, phrase, and sentence rates as baseline models (see p.7 of the manuscript and Figure S3). The results show similar patterns to those observed in our current analyses. While we agree that classification analyses would be very interesting, our regression analyses have already demonstrated distinct EEG patterns for each linguistic level. Consequently, classification analyses would likely yield similar results unless a different method for representing linguistic information at these levels is employed. To the best of our knowledge, no other computational model currently exists that can simultaneously represent these linguistic levels.

      (2) Is there any behavioral metric suggesting that these hearing-impaired participants do have deficits in comprehending long sentences? The self-rated intelligibility is useful, but cannot fully distinguish between perceiving lower-level phonetic information vs longer sentence comprehension.

      In the current study, we included only self-rated intelligibility tests. We acknowledge that this approach might not fully distinguish between the perception of lower-level phonetic information and higher-level sentence comprehension. However, it remains unclear what type of behavioral test would effectively address this distinction. Furthermore, our primary aim was to use the behavioral results to demonstrate that our hearing-impaired listeners experienced speech comprehension difficulties in multi-talker environments, while relying on the EEG data to investigate comprehension challenges at various linguistic levels.

      Minor:

      (1) Page 2, second line in Introduction, "Phonemes occur over ..." should be lowercase.

      According to APA format, the first word after the colon is capitalized if it begins a complete sentence (https://blog.apastyle.org/apastyle/2011/06/capitalization-after-colons.html). Here

      the sentence is a complete sentence so we used uppercase for “phonemes”.

      (2) Page 8, second paragraph "...-100ms to 100ms relative to sentence onsets", should it be onsets or offsets?

      This is typo and it should be offsets. We have now revised it.

      References

      Bemis, D. K., & Pylkkanen, L. (2011). Simple composition: An MEG investigation into the comprehension of minimal linguistic phrases. Journal of Neuroscience, 31(8), 2801– 2814.

      Gao, C., Li, J., Chen, J., & Huang, S. (2024). Measuring meaning composition in the human brain with composition scores from large language models. In L.-W. Ku, A. Martins, & V. Srikumar (Eds.), Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (pp. 11295–11308). Association for Computational Linguistics.

      Goldstein, A., Zada, Z., Buchnik, E., Schain, M., Price, A., Aubrey, B., Nastase, S. A., Feder, A., Emanuel, D., Cohen, A., Jansen, A., Gazula, H., Choe, G., Rao, A., Kim, C., Casto, C., Fanda, L., Doyle, W., Friedman, D., … Hasson, U. (2022). Shared computational principles for language processing in humans and deep language models. Nature Neuroscience, 25(3), Article 3.

      Gwilliams, L., King, J.-R., Marantz, A., & Poeppel, D. (2022). Neural dynamics of phoneme sequences reveal position-invariant code for content and order. Nature Communications, 13(1), Article 1.

      Huth, A. G., de Heer, W. A., Griffiths, T. L., Theunissen, F. E., & Gallant, J. L. (2016). Natural speech reveals the semantic maps that tile human cerebral cortex. Nature, 532(7600), 453–458.

      Li, J., Lai, M., & Pylkkänen, L. (2024). Semantic composition in experimental and naturalistic paradigms. Imaging Neuroscience, 2, 1–17.

      Li, J., & Pylkkänen, L. (2021). Disentangling semantic composition and semantic association in the left temporal lobe. Journal of Neuroscience, 41(30), 6526–6538.

      Maris, E., & Oostenveld, R. (2007). Nonparametric statistical testing of EEG- and MEG-data. Journal of Neuroscience Methods, 164(1), 177–190.

      Schmitt, L.-M., Erb, J., Tune, S., Rysop, A. U., Hartwigsen, G., & Obleser, J. (2021). Predicting speech from a cortical hierarchy of event-based time scales. Science Advances, 7(49), eabi6070.

      Schrimpf, M., Blank, I. A., Tuckute, G., Kauf, C., Hosseini, E. A., Kanwisher, N., Tenenbaum, J. B., & Fedorenko, E. (2021). The neural architecture of language: Integrative modeling converges on predictive processing. Proceedings of the National Academy of Sciences, 118(45), e2105646118.

      Sugimoto, Y., Yoshida, R., Jeong, H., Koizumi, M., Brennan, J. R., & Oseki, Y. (2024). Localizing Syntactic Composition with Left-Corner Recurrent Neural Network Grammars. Neurobiology of Language, 5(1), 201–224.

    1. Reviewer #2 (Public review):

      Summary:

      The manuscript by Maurice and Katarzyna describes a self-supervised, annotation-free deep-learning approach capable of quantitatively representing complex poses and behaviors of C. elegans directly from video pixel values. Their method overcomes limitations inherent to traditional methods relying on skeletonization or keypoint tracking, which often fail with highly coiled or self-intersecting worms. By applying self-supervised contrastive learning and a Transformer-based network architecture, the authors successfully capture diverse behavioral patterns and depict the aging trajectory of behavioral repertoire. This provides a useful new tool for behavioral research in C. elegans and other flexible-bodied organisms.

      Strengths:

      Reliable tracking and segmentation of complex poses remain significant bottlenecks in C. elegans behavioral research, and the authors made valuable attempts to address these challenges. The presented method offers several advantages over existing tools, including freedom from manual labeling, independence from explicit skeletonization or keypoint tracking, and the capability to capture highly coiled or overlapping poses. Thus, the proposed method would be useful to the C. elegans research community.

      The research question is clearly defined. Methods and results are engagingly presented, and the manuscript is concise and well-organized.

      Weaknesses:

      (1) In the abstract, the claim of an 'unbiased' approach is not well-supported. The method is still affected by dataset biases, as mentioned in the aging results (section 4.3).<br /> (2) In section 3.2, the rationale behind rotating worm images to a vertical orientation is unclear.<br /> (3) The methods section is clearly written but uses overly technical language, making it less accessible to the audience of eLife, the majority of whom are biologists. Clearer explanations of key methods and the rationale behind their selection are needed. For example, in section 3.3, the authors should briefly explain in simple language what contrastive learning is, why they chose it, and why this method potentially achieves their goal.<br /> (4) The reason why the gray data points could not be resolved by Tierpsy is not quantitatively described. Are they all due to heavily coiled or overlapping poses?<br /> (5) In section 4.1, generating pose representations grouped by genetic strains would provide insights into strain-specific differences resolved by the proposed method.<br /> (6) Fig. 3a requires clarification. Highly bent poses (red points) intuitively should be close to highly coiled poses (gray points). The authors should explain the observed greenish/blueish points interfacing with the gray points.<br /> (7) In Fig. 3a, some colored points overlap with the gray point cloud. Why can Tierpsy resolve these overlapping points representing highly coiled poses? A more systematic quantitative comparison between Tierpsy and the proposed method is required.<br /> (8) The claim in section 4.2 regarding strain separation in pose embedding spaces is unsupported by Fig. 3a, which lacks strain-based distinctions. As mentioned in point #5, showing pose representations grouped by different strains is required.<br /> (9) In section 4.2, how the authors could verify the statement, "This likely occurs since most strains share common behaviors such as simple forward locomotion"?<br /> (10) An important weakness of the proposed method is its low direct interpretability, as it is not based on handcrafted features. To better interpret the pose/behavior embedding space, it would be helpful to compare it against more basic Tierpsy features in Fig. 3 and 4. This comparison could reveal what understandable features were learned by the neural network, thereby increasing human interpretability.<br /> (11) The main conclusion of section 4.3 is not sufficiently tested. Is Fig. 5a generated only from data of N2 animals? To quantitatively verify the statement, "Young individuals appear to display a wide range of behaviors, while as they age their behavior repertoire reduces," the authors should perform a formal analysis of behavioral variability throughout aging.<br /> (12) In Fig. 5a, better visualization of aging trajectories could include plotting the center of mass along with variance of the point cloud over time.<br /> (13) To better reveal aging trajectories of behavioral changes for different genetic backgrounds, it would be meaningful to generate behavior representations for different strains as they age.<br /> (14) As a methods paper, the ease of use for other researchers should be explicitly addressed, and source code and datasets should be provided.

    2. Author response:

      We thank the editors and the reviewers for their valuable comments and for taking the time to evaluate our manuscript.

      Answers to Reviewer 1:

      (1) The core contribution of our method is that it learns meaningful spatiotemporal embeddings directly from image data without requiring pose estimation or eigenworm-based features as input. The learned embedding space can serve as a foundation for downstream tasks such as behavioral classification, clustering, or anomaly detection, further supporting its utility beyond visualization through eigenworm-derived features. Here we use the Tierpsy-derived features for latent space interpretation and for validation that our approach does indeed encode meaningful postural information. Additionally, without any Tierpsy-calculated features users can still color embeddings by known metadata like mutation or age and compare different strains to each other. 

      (2) The numbers shown in Fig. 2.3 are illustrative placeholders intended to conceptually represent a vector of behavioral features. They do not correspond to any specific measurements or carry intrinsic meaning. We agree that this may lead to confusion, and we will clarify this in the revised manuscript.

      (3) The visualizations in Figs. 4 (b) and (c) show the embeddings of sequences of behavior, rather than individual poses. Therefore, motion-related features such as speed are related to temporal patterns in those sequences rather than static postures. The color overlays reflect average motion characteristics (e.g., speed) of short behavior clips projected into the embedding space, rather than being directly linked to any single frame or pose.

      Answers to Reviewer 2:

      (1) In the abstract, our use of the term "unbiased" refers specifically to the avoidance of human-generated bias through feature engineering—i.e., the model does not rely on handcrafted features or predefined pose representations – the representations are based on data only. However, we agree that the model is still subject to dataset biases and will rectify this in the revised manuscript.

      (2) The worm images are rotated to a common vertical orientation to remove orientation as a source of variability in the input. This ensures that the model focuses on learning pose and behavioral dynamics rather than arbitrary head-tail or angular positioning. While data augmentation could in theory account for this variability, we found in our preliminary experiments that applying this preprocessing step led to more stable and interpretable embeddings.

      (3) We agree that simplifying the technical explanations would enhance the manuscript’s accessibility. In the revised version, we will briefly introduce contrastive learning in a less technical language.

      (4) The gray points in Fig. 3a represent frames that Tierpsy could not resolve, primarily due to coiled, self-intersecting, or overlapping worm postures as Tierpsy uses skeletonization to estimate the centerline. This approach can fail if kind of challenging elements are part of the image.

      (5) We appreciate this suggestion and consider it for a revised version of the manuscript.

      (6) Although it may seem intuitive for highly bent (red) poses to lie near coiled (gray) ones in the embedding space, the clustering pattern observed reflects how the network organizes pose information. The red/orange cluster consists of distinguishable bent poses that are visually distinct and consistently separable from other postures. In contrast, the greenish and blueish poses are less strongly bent and may share more visual overlap with the unresolved (gray) images.

      (7) The overlap occurs because some highly bent or coiled worms can still be (partially) resolved by Tierpsy, depending on specific pose conditions (e.g., head and tail not touching, not self-overlapping). However, Tierpsy fails to consistently resolve such frames. We will describe these cases in more detail in the revised manuscript.

      (8) Thank you, we agree this claim needs to be better supported and will develop it in the revision.

      (9) To support this statement we mainly visualized the respective sequences embedded in this area of the embedding space and found that it mostly consists of common behaviors such as forward locomotion. 

      (10) We agree that interpretability is important and plan to include additional figures quantifications of the embedding space using more basic Tierpsy features.

      (11) Fig. 5a is indeed based solely on N2 animals. In the revised manuscript we will include quantitative measures of behavioral variability and its change with age.

      (12) We appreciate this suggestion and consider it for a revised version

      (13) We agree this would be a valuable analysis. However, our current dataset primarily includes aging data for N2 animals. We acknowledge this limitation and consider adding more strains for future work.

      (14) We will include links to our source code in the revised manuscript

      Answers to Reviewer 3:

      (1-2) Our current method is agnostic to head-tail orientation, which indeed restricts the ability to distinguish behaviors that rely on directional cues. We made this design choice as we believe that correctly identifying head/tail orientation can be a challenging task that may introduce additional biases or fail in difficult imaging conditions. However, we fully agree that integrating directional information would improve behavioral resolution, and this is a natural extension of our current framework. In future work, we aim to incorporate head-tail disambiguation.

      (3) We explicitly designed our preprocessing and training pipeline to encourage size invariance, for example by resizing individuals to a consistent scale, as the focus of our work is to encode posture and movement only. However, we acknowledge that absolute size information is lost in this process, which can be informative for distinguishing genotypes or age-related changes.

      (4) We agree that a direct quantitative comparison between our embedding-based representations and skeleton-based feature sets would strengthen the paper. Our current focus was to assess whether meaningful behavioral features could be learned from a skeleton-free representation.

    1. Reviewer #1 (Public review):

      Summary:

      Some years ago, Brookshire proposed a method to identify oscillations in behavioural data that controls for effects of aperiodic trends. Such trends can produce false positive results if not controlled for. Although this method successfully controlled for this issue, it was also relatively insensitive to true effects, and it remained unclear whether it was unable to replicate published evidence for behavioural oscillations because they were false positives or the method could not detect them. In simulated data, Harris & Beale show that their revised version of the method proposed by Brookshire is more sensitive to effects and equally unsusceptible to false positives. When applied to available data, this new version indeed revealed evidence for behavioural oscillations. This paper is therefore an important piece in the puzzle of the ongoing debate on behavioural oscillations.

      Strengths:

      (1) The paper is well written and compact.

      (2) The new method proposed is tested thoroughly, and its application in simulated data shows its properties.

      (3) It is very important that the code is made publicly available.

      (4) The fact that this new version identifies behavioural oscillations in available datasets can resolve the current debate on the existence of such oscillations.

      Weaknesses:

      I see the following weaknesses as minor.

      (1) I wonder whether the frequency-dependent results (e.g., Figures 7 and 8) need to be seen in light of the sampling rate used in the simulations. For example, a lower sampling rate might be sufficient if only low frequencies are of interest in the data and lead to higher sensitivity as the number of trials (per time point) can be increased. Conversely, a higher sampling rate might lead to a higher sensitivity for the detection of effects at higher frequencies.

      (2) The behavioural oscillations from individual participants do not need to have common phases for this analysis to reveal an effect. However, this also means that in a scenario where they do have common phases, this similarity remains "unused" by the analysis (e.g., due to similar phases, the oscillation could be easier to identify on the group level as signals that are not phase locked are averaged out). In such a scenario, it remains unclear whether the analysis proposed is the most sensitive one.

    1. e blocked trust account, however, a court mayorder more to be set aside.26 Additionally, all earnings of anunemancipated minor related to a contract defined in Section6750 of California’s Family Code “shall remain the sole legalproperty of the minor child,” not just the fifteen percent depositedin Coogan Trust Accounts.27

      Whole section explains exactly how laws work with money kids earn and how it works to give reader background information to refer to as they read on

    1. camouflaging brutalities behind a series of euphemisms and pseudolegal terms. Ultimately, the army reverted to the crudest moral code of war, according to which everything which ensured one's survival was permitted (and thus considered moral), and everything even remotely suspect of threatening it must be destroyed (and was by definition immoral).

      brutality working its way into the system itself

    1. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      This computational modeling study builds on multiple previous lines of experimental and theoretical research to investigate how a single neuron can solve a nonlinear pattern classification task. The authors construct a detailed biophysical and morphological model of a single striatal medium spiny neuron, and endow excitatory and inhibitory synapses with dynamic synaptic plasticity mechanisms that are sensitive to (1) the presence or absence of a dopamine reward signal, and (2) spatiotemporal coincidence of synaptic activity in single dendritic branches. The latter coincidence is detected by voltage-dependent NMDA-type glutamate receptors, which can generate a type of dendritic spike referred to as a "plateau potential." The proposed mechanisms result in moderate performance on a nonlinear classification task when specific input features are segregated and clustered onto individual branches, but reduced performance when input features are randomly distributed across branches. Given the high level of complexity of all components of the model, it is not clear which features of which components are most important for its performance. There is also room for improvement in the narrative structure of the manuscript and the organization of concepts and data.

      Strengths:

      The integrative aspect of this study is its major strength. It is challenging to relate low-level details such as electrical spine compartmentalization, extrasynaptic neurotransmitter concentrations, dendritic nonlinearities, spatial clustering of correlated inputs, and plasticity of excitatory and inhibitory synapses to high-level computations such as nonlinear feature classification. Due to high simulation costs, it is rare to see highly biophysical and morphological models used for learning studies that require repeated stimulus presentations over the course of a training procedure. The study aspires to prove the principle that experimentally-supported biological mechanisms can explain complex learning.

      Weaknesses:

      The high level of complexity of each component of the model makes it difficult to gain an intuition for which aspects of the model are essential for its performance, or responsible for its poor performance under certain conditions. Stripping down some of the biophysical detail and comparing it to a simpler model may help better understand each component in isolation. That said, the fundamental concepts behind nonlinear feature binding in neurons with compartmentalized dendrites have been explored in previous work, so it is not clear how this study represents a significant conceptual advance. Finally, the presentation of the model, the motivation and justification of each design choice, and the interpretation of each result could be restructured for clarity to be better received by a wider audience.

      Thank you for the feedback! We agree that the complexity of our model can make it challenging to intuitively understand the underlying mechanisms. To address this, we have revised the manuscript to include additional simulations and clearer explanations of the mechanisms at play.

      In the revised introduction, we now explicitly state our primary aim: to assess to what extent a biophysically detailed neuron model can support the theory proposed by Tran-Van-Minh et al. and explore whether such computations can be learned by a single neuron, specifically a projection neuron in the striatum. To achieve this, we focus on several key mechanisms:

      (1) A local learning rule: We develop a learning rule driven by local calcium dynamics in the synapse and by reward signals from the neuromodulator dopamine. This plasticity rule is based on the known synaptic machinery for triggering LTP or LTD in the corticostriatal synapse onto dSPNs (Shen et al., 2008). Importantly, the rule does not rely on supervised learning paradigms and neither is a separate training and testing phase needed.

      (2) Robust dendritic nonlinearities: According to Tran-Van-Minh et al., (2015) sufficient supralinear integration is needed to ensure that e.g. two inputs (i.e. one feature combination in the NFBP, Figure 1A) on the same dendrite generate greater somatic depolarization than if those inputs were distributed across different dendrites. To accomplish this we generate sufficiently robust dendritic plateau potentials using the approach in Trpevski et al., (2023). 

      (3) Metaplasticity: Although not discussed much in more theoretical work, our study demonstrates the necessity of metaplasticity for achieving stable and physiologically realistic synaptic weights. This mechanism ensures that synaptic strengths remain within biologically plausible ranges during training, regardless of initial synaptic weights.

      We have also clarified our design choices and the rationale behind them, as well as restructured the interpretation of our results for greater accessibility. We hope these revisions make our approach and findings more transparent and easier to engage with for a broader audience.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      This study extends three previous lines of work:  

      (1) Prior computational/phenomenological work has shown that the presence of dendritic nonlinearities can enable single neurons to perform linearly non-separable tasks like XOR and feature binding (e.g. Tran-Van-Minh et al., Front. Cell. Neurosci., 2015).

      Prior computational and phenomenological work, such as Tran-Van-Minh et al. (Front. Cell. Neurosci., 2015), directly inspired our study, as we now explicitly state in the introduction (page 4, lines 19-22). While Tran-Van-Minh theoretically demonstrated that these principles could solve the NFBP, it remains untested to what extent this can be achieved quantitatively in biophysically detailed neuron models using biologically plausible learning rules - which is what we test here.

      (2) This study and a previous biophysical modeling study (Trpevski et al., Front. Cell. Neurosci., 2023) rely heavily on the finding from Chalifoux & Carter, J. Neurosci., 2011 that blocking glutamate transporters with TBOA increases dendritic calcium signals. The proposed model thus depends on a specific biophysical mechanism for dendritic plateau potential generation, where spatiotemporally clustered inputs must be co-activated on a single branch, and the voltage compartmentalization of the branch and the voltage-dependence of NMDARs is not enough, but additionally glutamate spillover from neighboring synapses must activate extrasynaptic NMDARs. If this specific biophysical implementation of dendritic plateau potentials is essential to the findings in this study, the authors have not made that connection clear. If it is a simple threshold nonlinearity in dendrites that is important for the model, and not the specific underlying biophysical mechanisms, then the study does not appear to provide a conceptual advance over previous studies demonstrating nonlinear feature binding with simpler implementations of dendritic nonlinearities.

      We appreciate the feedback on the hypothesized role of glutamate spillover in our model. While the current manuscript and Trpevski et al. (2023) emphasize glutamate spillover as a plausible biophysical mechanism to provide sufficiently robust and supralinear plateau potentials, we acknowledge, however, that the mechanisms of supralinearity of dendritic integration, might not depend solely on this specific mechanism in other types of neurons. In Trpevski et al (2023) we, however, realized that if we allow too ‘graded’ dendritic plateaus, using the quite shallow Mg-block reported in experiments, it was difficult to solve the NFBP. The conceptual advance of our study lies in demonstrating that sufficiently nonlinear dendritic integration is needed and that this can be accounted for by assuming spillover in SPNs—but regardless of its biophysical source (e.g. NMDA spillover, steeper NMDA Mg block activation curves or other voltage dependent conductances that cause supralinear dendritic integration)—it enables biophysically detailed neurons to solve the nonlinear feature binding problem. To address this point and clarify the generality of our conclusions, we have revised the relevant sections in the manuscript to state this explicitly.

      (3) Prior work has utilized "sliding-threshold," BCM-like plasticity rules to achieve neuronal selectivity and stability in synaptic weights. Other work has shown coordinated excitatory and inhibitory plasticity. The current manuscript combines "metaplasticity" at excitatory synapses with suppression of inhibitory strength onto strongly activated branches. This resembles the lateral inhibition scheme proposed by Olshausen (Christopher J. Rozell, Don H. Johnson, Richard G. Baraniuk, Bruno A. Olshausen; Sparse Coding via Thresholding and Local Competition in Neural Circuits. Neural Comput 2008; 20 (10): 2526-2563. doi: https://doi.org/10.1162/neco.2008.03-07-486). However, the complexity of the biophysical model makes it difficult to evaluate the relative importance of the additional complexity of the learning scheme.

      We initially tried solving the NFBP with only excitatory plasticity, which worked reasonably well, especially if we assume a small population of neurons collaborates under physiological conditions. However, we observed that plateau potentials from distally located inputs were less effective, and we now explain this limitation in the revised manuscript (page 14, lines 23-37).

      To address this, we added inhibitory plasticity inspired by mechanisms discussed in Castillo et al. (2011) , Ravasenga et al., and Chapman et al. (2022) , as now explicitly stated in the text (page 32, lines 23-26). While our GABA plasticity rule is speculative, it demonstrates that distal GABAergic plasticity can enhance nonlinear computations. These results are particularly encouraging, as it shows that implementing these mechanisms at the single-neuron level produces behavior consistent with network-level models like BCM-like plasticity rules and those proposed by Rozell et al. We hope this will inspire further experimental work on inhibitory plasticity mechanisms.

      P2, paragraph 2: Grammar: "multiple dendritic regions, preferentially responsive to different input values or features, are known to form with close dendritic proximity." The meaning is not clear. "Dendritic regions" do not "form with close dendritic proximity."

      Rewritten (current page 2, line 35)

      P5, paragraph 3: Grammar: I think you mean "strengthened synapses" not "synapses strengthened".

      Rewritten (current page 14, line 36)

      P8, paragraph 1: Grammar: "equally often" not "equally much".

      Updated (current page 10, line 2)

      P8, paragraph 2: "This is because of the learning rule that successively slides the LTP NMDA Ca-dependent plasticity kernel over training." It is not clear what is meant by "sliding," either here or in the Methods. Please clarify.

      We have updated the text and removed the word “sliding” throughout the manuscript to clarify that the calcium dependence of the kernels are in fact updated

      P10, Figure 3C (left): After reading the accompanying text on P8, para 2, I am left not understanding what makes the difference between the two groups of synapses that both encode "yellow," on the same dendritic branch (d1) (so both see the same plateau potentials and dopamine) but one potentiates and one depresses. Please clarify.

      Some "yellow" and "banana" synapses are initialized with weak conductances, limiting their ability to learn due to the relatively slow dynamics of the LTP kernel. These weak synapses fail to reach the calcium thresholds necessary for potentiation during a dopamine peak, yet they remain susceptible to depression under LTD conditions. Initially, the dynamics of the LTP kernel does not allow significant potentiation, even in the presence of appropriate signals such as plateau potentials and dopamine (page 10, lines 22–26). We have added a more detailed explanation of how the learning rule operates in the section “Characterization of the Synaptic Plasticity Rule” on page 9 and have clarified the specific reason why the weaker yellow synapses undergo LTD (page 11, lines 1–7).

      As shown in Supplementary Figure 6, during subthreshold learning, the initial conductance is also low, which similarly hinders the synapses' ability to potentiate. However, with sufficient dopamine, the LTP kernel adapts by shifting closer to the observed calcium levels, allowing these synapses to eventually strengthen. This dynamic highlights how the model enables initially weak synapses to "catch up" under consistent activation and favorable dopaminergic conditions.

      P9, paragraph 1: The phrase "the metaplasticity kernel" is introduced here without prior explanation or motivation for including this level of complexity in the model. Please set it up before you use it.

      A sentence introducing metaplasticity has been added to the introduction (page 3, lines 36-42) as well as on page 9, where the kernel is introduced (page 9, lines 26-35)

      P10, Figure 3D: "kernel midline" is not explained.

      We have replotted fig 3 to make it easier to understand what is shown. Also, an explanation of the Kernel midpoint is added to the legend (current page 12, line 19)

      P11, paragraph 1; P13, Fig. 4C: My interpretation of these data is that clustered connectivity with specific branches is essential for the performance of the model. Randomly distributing input features onto branches (allowing all 4 features to innervate single branches) results in poor performance. This is bad, right? The model can't learn unless a specific pre-wiring is assumed. There is not much interpretation provided at this stage of the manuscript, just a flat description of the result. Tell the reader what you think the implications of this are here.

      Thanks for the suggestion - we have updated this section of the manuscript, adding an interpretation of the results that the model often fails to learn both relevant stimuli if all four features are clustered onto the same dendrite (page 13, lines 31-42). 

      In summary, when multiple feature combinations are encoded in the same dendrite with similar conductances, the ability to determine which combination to store depends on the dynamics of the other dendrite. Small variations in conductance, training order, or other stochastic factors can influence the outcome. This challenge, known as the symmetry-breaking problem, has been previously acknowledged in abstract neuron models (Legenstein and Maass, 2011). To address this, additional mechanisms such as branch plasticity—amplifying or attenuating the plateau potential as it propagates from the dendrite to the soma—can be employed (Legenstein and Maass, 2011). 

      P12, paragraph 2; P13, Figure 4E: This result seems suboptimal, that only synapses at a very specific distance from the soma can be used to effectively learn to solve a NFBP. It is not clear to what extent details of the biophysical and morphological model are contributing to this narrow distance-dependence, or whether it matches physiological data.

      We have added Figure 5—figure supplement 1A to clarify why distal synapses may not optimally contribute to learning. This figure illustrates how inhibitory plasticity improves performance by reducing excessive LTD at distal dendrites, thereby enhancing stimulus discrimination. Relevant explanations have been integrated into Page 18, Lines 25-39 in the revised manuscript.

      P14, paragraph 2: Now the authors are assuming that inhibitory synapses are highly tuned to stimulus features. The tuning of inhibitory cells in the hippocampus and cortex is controversial but seems generally weaker than excitatory cells, commensurate with their reduced number relative to excitatory cells. The model has accumulated a lot of assumptions at this point, many without strong experimental support, which again might make more sense when proposing a new theory, but this stitching together of complex mechanisms does not provide a strong intuition for whether the scheme is either biologically plausible or performant for a general class of problem.

      We acknowledge that it is not currently known whether inhibitory synapses in the striatum are tuned to stimulus features. However, given that the striatum is a purely inhibitory structure, it is plausible that lateral inhibition from other projection neurons could be tuned to features, even if feedforward inhibition from interneurons is not. Therefore, we believe this assumption is reasonable in the context of our model. As noted earlier, the GABA plasticity rule in our study is speculative. However, we hope that our work will encourage further experimental investigations, as we demonstrate that if GABAergic inputs are sufficiently specific, they can significantly enhance computations (This is discussed on page 17, lines 8-15.).

      P16, Figure 5E legend: The explanation of the meaning of T_max and T_min in the legend and text needs clarification.

      The abbreviations  T<sub>min</sub> and  T<sub>max</sub> have been updated to CTL and CTH to better reflect their role in calcium threshold tracking. The Figure 5E legend and relevant text have been revised for clarity. Additionally, the Methods section has been reorganized for better readability.

      P16, Figure 5B, C: When the reader reaches this paper, the conundrums presented in Figure 4 are resolved. The "winner-takes-all" inhibitory plasticity both increases the performance when all features are presented to a single branch and increases the range of somatodendritic distances where synapses can effectively be used for stimulus discrimination. The problem, then, is in the narrative. A lot more setup needs to be provided for the question related to whether or not dendritic nonlinearity and synaptic inhibition can be used to perform the NFBP. The authors may consider consolidating the results of Fig. 4 and 5 so that the comparison is made directly, rather than presenting them serially without much foreshadowing.

      In order to facilitate readability, we have updated the following sections of the manuscript to clarify how inhibitory plasticity resolves challenges from Figure 4:

      Figure 5B and Figure 5–figure supplement 1B: Two new panels illustrate the role of inhibitory plasticity in addressing symmetry problems.

      Figure 5–figure supplement 1A: Shows how inhibitory plasticity extends the effective range of somatodendritic distances.

      P18, Figure 6: This should be the most important figure, finally tying in all the previous complexity to show that NFBP can be partially solved with E and I plasticity even when features are distributed randomly across branches without clustering. However, now bringing in the comparison across spillover models is distracting and not necessary. Just show us the same plateau generation model used throughout the paper, with and without inhibition.

      Figure updated. Accumulative spillover and no-spillover conditions have been removed.

      P18, paragraph 2: "In Fig. 6C, we report that a subset of neurons (5 out of 31) successfully solved the NFBP." This study could be significantly strengthened if this phenomenon could (perhaps in parallel) be shown to occur in a simpler model with a simpler plateau generation mechanism. Furthermore, it could be significantly strengthened if the authors could show that, even if features are randomly distributed at initialization, a pruning mechanism could gradually transition the neuron into the state where fewer features are present on each branch, and the performance could approach the results presented in Figure 5 through dynamic connectivity.

      To model structural plasticity is a good suggestion that should be investigated in later work, however, we feel that it goes beyond what we can do in the current manuscript.  We now acknowledge that structural plasticity might play a role. For example we show that if we can assume ‘branch-specific’ spillover, that leads to sufficiently development of local dendritic non-linearities, also one can learn with distributed inputs. In reality, structural plasticity is likely important here, as we now state (current page 22, line 35-42). 

      P17, paragraph 2: "As shown in Fig. 6B, adding the hypothetical nonlinearities to the model increases the performance towards solving part of the NFBP, i.e. learning to respond to one relevant feature combination only. The performance increases with the amount of nonlinearity." This is not shown in Figure 6B.

      Sentence removed. We have added a Figure 6 - figure supplement 1 to better explain the limitations.

      P22, paragraph 1: The "w" parameter here is used to determine whether spatially localized synapses are co-active enough to generate a plateau potential. However, this is the same w learned through synaptic plasticity. Typically LTP and LTD are thought of as changing the number of postsynaptic AMPARs. Does this "w" also change the AMPAR weight in the model? Do the authors envision this as a presynaptic release probability quantity? If so, please state that and provide experimental justification. If not, please justify modifying the activation of postsynaptic NMDARs through plasticity.

      This is an important remark. Our plasticity model differs from classical LTP models as it depends on the link between LTP and increased spillover as described by Henneberger et al., (2020).

      We have updated the method section (page 27, lines 6-11), and we acknowledge, however, that in a real cell, learning might first strengthen the AMPA component, but after learning the ratio of NMDA/AMPA is unchanged ( Watt et al., 2004). This re-balancing between NMDA and AMPA might perhaps be a slower process.

      Reviewer #2 (Public Review):

      Summary:

      The study explores how single striatal projection neurons (SPNs) utilize dendritic nonlinearities to solve complex integration tasks. It introduces a calcium-based synaptic learning rule that incorporates local calcium dynamics and dopaminergic signals, along with metaplasticity to ensure stability for synaptic weights. Results show SPNs can solve the nonlinear feature binding problem and enhance computational efficiency through inhibitory plasticity in dendrites, emphasizing the significant computational potential of individual neurons. In summary, the study provides a more biologically plausible solution to single-neuron learning and gives further mechanical insights into complex computations at the single-neuron level.

      Strengths:

      The paper introduces a novel learning rule for training a single multicompartmental neuron model to perform nonlinear feature binding tasks (NFBP), highlighting two main strengths: the learning rule is local, calcium-based, and requires only sparse reward signals, making it highly biologically plausible, and it applies to detailed neuron models that effectively preserve dendritic nonlinearities, contrasting with many previous studies that use simplified models.

      Weaknesses:

      I am concerned that the manuscript was submitted too hastily, as evidenced by the quality and logic of the writing and the presentation of the figures. These issues may compromise the integrity of the work. I would recommend a substantial revision of the manuscript to improve the clarity of the writing, incorporate more experiments, and better define the goals of the study.

      Thanks for the valuable feedback. We have now gone through the whole manuscript updating the text, and also improved figures and added some supplementary figures to better explain model mechanisms. In particular, we state more clearly our goal already in the introduction.

      Major Points:

      (1) Quality of Scientific Writing: The current draft does not meet the expected standards. Key issues include:

      i. Mathematical and Implementation Details: The manuscript lacks comprehensive mathematical descriptions and implementation details for the plasticity models (LTP/LTD/Meta) and the SPN model. Given the complexity of the biophysically detailed multicompartment model and the associated learning rules, the inclusion of only nine abstract equations (Eq. 1-9) in the Methods section is insufficient. I was surprised to find no supplementary material providing these crucial details. What parameters were used for the SPN model? What are the mathematical specifics for the extra-synaptic NMDA receptors utilized in this study? For instance, Eq. 3 references [Ca2+]-does this refer to calcium ions influenced by extra-synaptic NMDARs, or does it apply to other standard NMDARs? I also suggest the authors provide pseudocodes for the entire learning process to further clarify the learning rules.

      The model is quite detailed but builds on previous work. For this reason, for model components used in earlier published work (and where models are already available via model repositories, such as ModelDB), we refer the reader to these resources in order to improve readability and to highlight what is novel in this paper - the learning rules itself. The learning rule is now explained in detail. For modelers that want to run the model, we have also provided a GitHub link to the simulation code. We hope this is a reasonable compromise to all readers, i.e, those that only want to understand what is new here (learning rule) and those that also want to test the model code. We explain this to the readers at the beginning of the Methods section.

      ii. Figure quality. The authors seem not to carefully typeset the images, resulting in overcrowding and varying font sizes in the figures. Some of the fonts are too small and hard to read. The text in many of the diagrams is confusing. For example, in Panel A of Figure 3, two flattened images are combined, leading to small, distorted font sizes. In Panels C and D of Figure 7, the inconsistent use of terminology such as "kernels" further complicates the clarity of the presentation. I recommend that the authors thoroughly review all figures and accompanying text to ensure they meet the expected standards of clarity and quality.

      Thanks for directing our attention to these oversights. We have gone through the entire manuscript, updating the figures where needed, and we are making sure that the text and the figure descriptions are clear and adequate and use consistent terminology for all quantities.

      iii. Writing clarity. The manuscript often includes excessive and irrelevant details, particularly in the mathematical discussions. On page 24, within the "Metaplasticity" section, the authors introduce the biological background to support the proposed metaplasticity equation (Eq. 5). However, much of this biological detail is hypothesized rather than experimentally verified. For instance, the claim that "a pause in dopamine triggers a shift towards higher calcium concentrations while a peak in dopamine pushes the LTP kernel in the opposite direction" lacks cited experimental evidence. If evidence exists, it should be clearly referenced; otherwise, these assertions should be presented as theoretical hypotheses. Generally, Eq. 5 and related discussions should be described more concisely, with only a loose connection to dopamine effects until more experimental findings are available.

      The “Metaplasticity” section (pages 30-32) has been updated to be more concise, and the abundant references to dopamine have been removed.

      (2) Goals of the Study: The authors need to clearly define the primary objective of their research. Is it to showcase the computational advantages of the local learning rule, or to elucidate biological functions?

      We have explicitly stated our goal in the introduction (page 4, lines 19-22). Please also see the response to reviewer 1.

      i. Computational Advantage: If the intent is to demonstrate computational advantages, the current experimental results appear inadequate. The learning rule introduced in this work can only solve for four features, whereas previous research (e.g., Bicknell and Hausser, 2021) has shown capability with over 100 features. It is crucial for the authors to extend their demonstrations to prove that their learning rule can handle more than just three features. Furthermore, the requirement to fine-tune the midpoint of the synapse function indicates that the rule modifies the "activation function" of the synapses, as opposed to merely adjusting synaptic weights. In machine learning, modifying weights directly is typically more efficient than altering activation functions during learning tasks. This might account for why the current learning rule is restricted to a limited number of tasks. The authors should critically evaluate whether the proposed local learning rule, including meta-plasticity, actually offers any computational advantage. This evaluation is essential to understand the practical implications and effectiveness of the proposed learning rule.

      Thank you for your feedback. To address the concern regarding feature complexity, we extended our simulations to include learning with 9 and 25 features, achieving accuracies of 80% and 75%, respectively (Figure 6—figure supplement 1A). While our results demonstrate effective performance, the absence of external stabilizers—such as error-modulated functions used in prior studies like Bicknell and Hausser (2021)—means that the model's performance can be more sensitive to occasional incorrect outcomes. For instance, while accuracy might reach 90%, a few errors can significantly affect overall performance due to the lack of mechanisms to stabilize learning.

      In order to clarify the setup of the rule, we have added pseudocode in the revised manuscript (Pages 31-32) detailing how the learning rule and metaplasticity update synaptic weights based on calcium and dopamine signals. Additionally, we have included pseudocode for the inhibitory learning rule on Pages 34-35. In future work, we also aim to incorporate biologically plausible mechanisms, such as dopamine desensitization, to enhance stability.

      ii. Biological Significance: If the goal is to interpret biological functions, the authors should dig deeper into the model behaviors to uncover their biological significance. This exploration should aim to link the observed computational features of the model more directly with biological mechanisms and outcomes.

      As now clearly stated in the introduction, the goal of the study is to see whether and to what quantitative extent the theoretical solution of the NFBP proposed in Tran-Van-Minh et al. (2015) can be achieved with biophysically detailed neuron models and with a biologically inspired learning rule. The problem has so far been solved with abstract and phenomenological neuron models (Schiess et al., 2014; Legenstein and Maass, 2011) and also with a detailed neuron model but with a precalculated voltage-dependent learning rule (Bicknell and Häusser, 2021).

      We have also tried to better explain the model mechanisms by adding supplementary figures.

      Reviewer #2 (Recommendations For The Authors):

      Minor:

      (1) The [Ca]NMDA in Figure 2A and 2C can have large values even when very few synapses are activated. Why is that? Is this setting biologically realistic?

      The elevated [Ca²⁺]NMDA with minimal synaptic activation arises from high spine input resistance, small spine volume, and NMDA receptor conductance, which scales calcium influx with synaptic strength. Physiological studies report spine calcium transients typically up to ~1 μM (Franks and Sejnowski 2002, DOI: 10.1002/bies.10193), while our model shows ~7 μM for 0.625 nS and around ~3 μM for 0.5 nS, exceeding this range. The calcium levels of the model might therefore be somewhat high compared to biologically measured levels - however, this does not impact the learning rule, as the functional dynamics of the rule remain robust across calcium variations.

      (2) In the distributed synapses session, the study introduces two new mechanisms "Threshold spillover" and "Accumulative spillover". Both mechanisms are not basic concepts but quantitative descriptions of them are missing.

      Thank you for your feedback. Based on the recommendations from Reviewer 1, we have simplified the paper by removing the "Accumulative spillover" and focusing solely on the "Thresholded spillover" mechanism. In the updated version of the paper, we refer to it only as glutamate spillover. However, we acknowledge (page 22, lines 40-42) that to create sufficient non-linearities, other mechanisms, like structural plasticity, might also be involved (although testing this in the model will have to be postponed to future work).

      (3) The learning rule achieves moderate performance when feature-relevant synapses are organized in pre-designed clusters, but for more general distributed synaptic inputs, the model fails to faithfully solve the simple task (with its performance of ~ 75%). Performance results indicate the learning rule proposed, despite its delicate design, is still inefficient when the spatial distribution of synapses grows complex, which is often the case on biological neurons. Moreover, this inefficiency is not carefully analyzed in this paper (e.g. why the performance drops significantly and the possible computation mechanism underlying it).

      The drop in performance when using distributed inputs (to a mean performance of 80%) is similar to the mean performance in the same situation in Bicknell and Hausser (2021), see their Fig. 3C. The drop in performance is due to that: i) the relevant feature combinations are not often colocalized on the same dendrite so that they can be strengthened together, and ii) even if they are, there may not be enough synapses to trigger the supralinear response from the branch spillover mechanism, i.e. the inputs are not summated in a supralinear way (Fig. 6B, most input configurations only reach 75%).

      Because of this, at most one relevant feature combination can be learned. In the several cases when the random distribution of synapses is favorable for both relevant feature combinations to be learned, the NFBP is solved (Figs. 6B, some performance lines reach 100 % and 6C, example of such a case). We have extended the relevant sections of the paper trying to highlight the above mentioned mechanisms.

      Further, the theoretical results in Tran-Van-Minh et al. 2015 already show that to solve the NFBP with supralinear dendrites requires features to be pre-clustered in order to evoke the supralinear dendritic response, which would activate the soma. The same number of synapses distributed across the dendrites i) would not excite the soma as strongly, and ii) would summate in the soma as in a point neuron, i.e. no supralinear events can be activated, which are necessary to solve the NFBP. Hence, one doesn’t expect distributed synaptic inputs to solve the NFBP with any kind of learning rule. 

      (4) Figure 5B demonstrates that on average adding inhibitory synapses can enhance the learning capabilities to solve the NFBP for different pattern configurations (2, 3, or 4 features), but since the performance for excitatory-only setup varies greatly between different configurations (Figure 4B, using 2 or 3 features can solve while 4 cannot), can the results be more precise about whether adding inhibitory synapses can help improve the learning with 4 features?

      In response to the question, we added a panel to Figure 5B showing that without inhibitory synapses, 5 out of 13 configurations with four features successfully learn, while with inhibitory synapses, this improves to 7 out of 13. Figure 5—figure supplement 1B provides an explanation for this improvement: page 18 line 10-24

      (5) Also, in terms of the possible role of inhibitory plasticity in learning, as only on-site inhibition is studied here, can other types of inhibition be considered, like on-path or off-path? Do they have similar or different effects?

      This is an interesting suggestion for future work. We observed relevant dynamics in Figure 6A, where inhibitory synapses increased their weights on-site when randomly distributed. Previous work by Gidon and Segev (2012) examined the effects of different inhibitory types on NMDA clusters, highlighting the role of on-site and off-path inhibition in shunting. In our context, on-site inhibition in the same branch, appears more relevant for maintaining compartmentalized dendritic processing.

      (6) Figure 6A is mentioned in the context of excitatory-only setup, but it depicts the setup when both excitatory and inhibitory synapses are included, which is discussed later in the paper. A correction should be made to ensure consistency.

      We have updated the figure and the text in order to make it more clear that simulations are run both with and without inhibition in this context (page 21 line 4-13)

      (7) In the "Ca and kernel dynamics" plots (Fig 3,5), some of the kernel midlines (solid line) are overlapped by dots, e.g. the yellow line in Fig 3D, and some kernel midlines look like dots, which leads to confusion. Suggest to separate plots of Ca and kernel dynamics for clarity. 

      The design of the figures has been updated to improve the visibility of the calcium and kernel dynamics during training.

      (8) The formulations of the learning rule are not well-organized, and the naming of parameters is kind of confusing, e.g. T_min, T_max, which by default represent time, means "Ca concentration threshold" here.

      The abbreviations of the thresholds  ( T<sub>min</sub>,  T<sub>max</sub> in the initial version) have been updated to CTL and CTH, respectively, to better reflect their role in tracking calcium levels. The mathematical formulations have further been reorganized for better readability. The revised Methods section now follows a more structured flow, first explaining the learning mechanisms, followed by the equations and their dependencies.

    1. Reviewer #1 (Public review):

      Summary:

      This manuscript aims to explain the emergence of grid-like spatial firing patterns. Rather than taking the existence of grid cells as a given and asking what does their properties say about their function, the authors reverse the approach: they begin with a proposed computational function that the brain may need to perform-coding of 2D spatial trajectories using sequences of neural activity-and ask what type of neural code would optimally support this function. They show that, under a set of formal assumptions, such a code leads to the emergence of spatial periodicity and a hexagonal grid pattern. The aim is to provide a normative explanation for the existence of grid cells grounded in functional constraints.

      Strengths:

      The manuscript presents a mathematically well-defined framework that is internally consistent. The derivation is structured and leads to a hexagonal lattice as the most efficient solution for representing directional trajectories. The authors provide comparisons to experimental observations and extend the model to explain several findings in the grid cell literature. In the revised version, the discussion of foundational assumptions is expanded, and the manuscript better situates itself in relation to prior theoretical work. Overall, this work adds a very interesting view to the broader conversation about the role and origin of grid cells by offering a theoretical alternative grounded in trajectory coding.

      Weaknesses:

      The model depends on assumptions that, while plausible, should be treated as chosen assumptions. These include the premise that (1) grid function is trajectory coding, (2) that trajectory coding is implemented through sequences of neural activity, and (3) that such sequences are largely independent of spatial position. In the revised manuscript, the authors provide more literature to support these assumptions.

  6. social-media-ethics-automation.github.io social-media-ethics-automation.github.io
    1. Then Sean Black, a programmer on TikTok saw this and decided to contribute by creating a bot that would automatically log in and fill out applications with random user info, increasing the rate at which he (and others who used his code) could spam the Kellogg’s job applications:

      This shows how automation can rapidly scale data poisoning. The use of a bot to submit fake data highlights vulnerability in systems that lack strong validation systems. It also how low barriers to entry for automation can turn small scale protests into large scale disruptions

    1. I would argue that "whole tree" thinking is enhanced by --follow being the default. What I mean is when I want to see the history of the code within a file, I really don't usually care whether the file was renamed or not, I just want to see the history of the code, regardless of renames. So in my opinion it makes sense for --follow to be the default because I don't care about individual files; --follow helps me to ignore individual file renames, which are usually pretty inconsequential.
    1. Author response:

      The following is the authors’ response to the previous reviews

      We would like to thank you for your valuable comments and suggestions, which have greatly contributed to improving our manuscript.

      We have carefully addressed all the reviewers' suggestions, and detailed responses for each Reviewer are provided at the end of this letter. In summary:

      • The Introduction has been revised to provide a more focused discussion on results, toning down the speculative discussion on seasonal host shifts.

      • The methodology section has been clarified, particularly the power analysis, which now includes a clearer explanation. The random effects in the models have been better described to ensure transparency.

      • The Results section was reorganized to highlight the key findings more effectively.

      • The Discussion has been restructured for clarity and conciseness, ensuring the interpretation of the results is clearer and better aligned with the study objectives.

      • Minor edits throughout the manuscript were made to improve readability and accuracy.

      We hope you find this revised version of the manuscript satisfactory.

      Reviewer #1 (Public review):

      Summary:

      This study examines the role of host blood meal source, temperature, and photoperiod on the reproductive traits of Cx. quinquefasciatus, an important vector of numerous pathogens of medical importance. The host use pattern of Cx. quinquefasciatus is interesting in that it feeds on birds during spring and shifts to feeding on mammals towards fall. Various hypotheses have been proposed to explain the seasonal shift in host use in this species but have provided limited evidence. This study examines whether the shifting of host classes from birds to mammals towards autumn offers any reproductive advantages to Cx.

      quinquefasciatus in terms of enhanced fecundity, fertility, and hatchability of the offspring. The authors found no evidence of this, suggesting that alternate mechanisms may drive the seasonal shift in host use in Cx. quinquefasciatus.

      Strengths:

      Host blood meal source, temperature, and photoperiod were all examined together.

      Weaknesses:

      The study was conducted in laboratory conditions with a local population of Cx. quinquefasciatus from Argentina. I'm not sure if there is any evidence for a seasonal shift in the host use pattern in Cx. quinquefasciatus populations from the southern latitudes.

      Comments on the revision:

      Overall, the manuscript is much improved. However, the introduction and parts of the discussion that talk about addressing the question of seasonal shift in host use pattern of Cx. quin are still way too strong and must be toned down. There is no strong evidence to show this host shift in Argentinian mosquito populations. Therefore, it is just misleading. I suggest removing all this and sticking to discussing only the effects of blood meal source and seasonality on the reproductive outcomes of Cx. quin.

      Introduction and discussion have been modified, toned down and sticked to discuss the results as suggested.

      Reviewer #1 (Recommendations for the authors):

      Some more minor comments are mentioned below.

      Line 51: Because 'of' this,

      Changed as suggested.

      Line 56: specialists 'or' generalists

      Changed as suggested.

      Line 56: primarily

      Changed as suggested.

      Line 98: Because 'of' this,

      Changed as suggested.

      Reviewer #2 (Public review):

      Summary:

      Conceptually, this study is interesting and is the first attempt to account for the potentially interactive effects of seasonality and blood source on mosquito fitness, which the authors frame as a possible explanation for previously observed hostswitching of Culex quinquefasciatus from birds to mammals in the fall. The authors hypothesize that if changes in fitness by blood source change between seasons, higher fitness on birds in the summer and on mammals in the autumn could drive observed host switching. To test this, the authors fed individuals from a colony of Cx. quinquefasciatus on chickens (bird model) and mice (mammal model) and subjected each of these two groups to two different environmental conditions reflecting the high and low temperatures and photoperiod experienced in summer and autumn in Córdoba, Argentina (aka seasonality). They measured fecundity, fertility, and hatchability over two gonotrophic cycles. The authors then used generalized linear mixed models to evaluate the impact of host species, seasonality, and gonotrophic cycle on fecundity, fertility, and hatchability. The authors were trying to test their hypothesis by determining whether there was an interactive effect of season and host species on mosquito fitness. This is an interesting hypothesis; if it had been supported, it would provide support for a new mechanism driving host switching. While the authors did report an interactive impact of seasonality and host species, the directionality of the effect was the opposite from that hypothesized. The authors have done a very good job of addressing many of the reviewer's concerns, especially by adding two additional replicates. Several minor concerns remain, especially regarding unclear statements in the discussion.

      Strengths:

      (1) Using a combination of laboratory feedings and incubators to simulate seasonal environmental conditions is a good, controlled way to assess the potentially interactive impact of host species and seasonality on the fitness of Culex quinquefasciatus in the lab.

      (2) The driving hypothesis is an interesting and creative way to think about a potential driver of host switching observed in the field.

      Weaknesses:

      (1) The methods would be improved by some additional details. For example, clarifying the number of generations for which mosquitoes were maintained in colony (which was changed from 20 to several) and whether replicates were conducted at different time points.

      Changed as suggested.

      (2) The statistical analysis requires some additional explanation. For example, you suggest that the power analysis was conducted a priori, but this was not mentioned in your first two drafts, so I wonder if it was actually conducted after the first replicate. It would be helpful to include further detail, such as how the parameters were estimated. Also, it would be helpful to clarify why replicate was included as a random effect for fecundity and fertility but as a fixed effect for hatchability. This might explain why there were no significant differences for hatchability given that you were estimating for more parameters.

      The power analysis was conducted a posteriori, as you correctly inferred. While I did not indicate that it was performed a priori, you are right in noting that this was not explicitly mentioned. As you suggested, the methodology for the power analysis has been revised to clarify any potential doubts.

      Regarding the model for hatchability, a model without a random effect variable was used, as all attempts to fit models with random effects resulted in poor validation. These points have now been clarified and explained in the corresponding section.

      (3) A number of statements in the discussion are not clear. For example, what do you mean by a mixed perspective in the first paragraph? Also, why is the expectation mentioned in the second paragraph different from the hypothesis you described in your introduction?

      Changed as suggested.

      (4) According to eLife policy, data must be made freely available (not just upon request).

      Data and code will be publicly available. The corresponding section was modified.

      Reviewer #2 (Recommendations for the authors):

      Your manuscript is much improved by the inclusion of two additional replicates! The results are much more robust when we can see that the trends that you report are replicable across 3 iterations of the experiment. Congratulations on a greatly improved study and paper! I have several minor concerns and suggestions, listed below:

      38-39: I think it is clearer to say "no statistically significant effect of season on hatchability of eggs" ... or specify if you are referring to blood or the interaction of blood and season. It isn't clear which treatment you are referring to here.

      Changed as suggested.

      54-57: This could be stated more succinctly. Instead of citing papers that deal with specific examples of patterns, I would suggest citing a review paper that defines these terms.

      Changed as suggested.

      83-84: What if another migratory bird is the preferred host in Argentina? I would state this more cautiously (e.g. "may not be applicable...").

      Changed as suggested.

      95-96: I don't understand what you mean by this. These hypotheses are specifically meant to understand mosquitoes that DO have a distinct seasonal phenology, so I'm not sure why this caveat is relevant. And naturally this hypothesis is host dependent, since it is based on specific host reproductive investments. I think that the strongest caveat to this hypothesis is simply that it hasn't been proven.

      Changed as suggested.

      97-115: This is a great paragraph! Very clear and compelling.

      Thanks for your words!

      118: Do you have an exact or estimated number of rafts collected?

      Sorry, I have not the exact number of rafts, but it was at leas more than 20-30.

      135: "over twenty" was changed to "several"; several would imply about 3 generations, so this is misleading. If the colony was actually maintained for over twenty generations, then you should keep that wording.

      Changed as suggested.

      163-164: Can you please clarify whether the replicates were conducted a separate time points?

      Changed as suggested.

      Note: the track changes did not capture all of the changes made; e.g. 163-164 should show as new text but does not.

      You are absolutely right; when I uploaded the last version, I unfortunately deleted all tracked changes and cannot recover them. In this new version, I will ensure that all minimal changes are included as tracked changes.

      186 - 189: the terms should be "fixed effect" and "random effect"

      Changed as suggested.

      191: Edit: linear

      Changed as suggested.

      194: why was replicate not included as a random effect here when it was above? Also, can you please clarify "interaction effects"? Which interactions did you include?

      Changed as suggested. Explained above and in methodology. Hatchability models with random effect variable were poor fitted and validated. The interactions for hatchability were a four-way (season, blood source, cycle and replicate)

      207-208: I'm not sure what you mean by "aimed to achieve"? Weren't you doing this after you conducted the experiments, so wouldn't this be determining the power of your model (post-hoc power analysis)? Also, I think you should provide the parameter estimates that were used (e.g. effect size - did you use the effect size you estimated across the 3 replicates?).

      Changed as suggested.

      214-215: this should be reworded to acknowledge that this is estimated for the given effect size; for example, something like "This sample size was sufficient to detect the observed effect with a statistical power of 0.8" or something along those lines (unless I am misunderstanding how you conducted this test).

      Changed as suggested.

      246. Abbreviate Culex

      Changed as suggested.

      253-255: This sentence isn't clear. What do you mean by mixed? Also, the season really seemed to mainly impact the fitness of mosquitoes fed on mouse blood and here the way it is phrased seems to indicate that season has an impact on the fitness of those fed with chicken blood.

      Changed as suggested.

      258-260: You stated your hypothesis as the relative fitness shifting between seasons, but this statement about the expectation is different from your hypothesis stated earlier. Please clarify.

      You are right. Thank you for noting this. It was changed as suggested.  

      263-266: I also don't understand this sentence; what does the first half of the sentence have to do with the second?

      Changed as suggested.

      269-270: This doesn't align with your observation exactly; you say first AND second are generally most productive, but you observed a drop in the second. Please clarify this.

      Changed as suggested.

      280: I suggest removing "as same as other studies"; your caveats are distinct because your experimental design was unique

      Changed as suggested.

      287: you shouldn't be looking for a "desired" effect; I suggest removing this word

      Changed as suggested.

      288: It wasn't really a priori though, since you conducted it after your first replicate (unless you didn't use the results from the first replicate you reported in the original drafts?)

      It was a posteriori. Changed as suggested.

      290: Why is 290 written here?

      It was a mistype. Deleted as suggested.

      291-298: The meaning of this section of your paragraph is not clear.

      Improve as suggested.

      304-313: This list of 3 explanations are directed at different underlying questions. Explanations 1 and 2 are alternative explanations for why host switching occurs if not due to differences in fitness. This isn't really an explanation of your results so much as alternative explanations for a previously reported phenomenon. And the third is an explanation for why you may not have observed the expected effect. I suggest restructuring this to include the fact that Argentinian quinqs may not host switch as part of your previous list of caveats. Then you can include your two alternative explanations for host switching as a possible future direction (although I would say that it is really just one explanation because "vector biology" is too broad of a statement to be testable). Also, you haven't discussed possible explanations for your actual result, which showed that mosquito fitness decreased when feeding on mouse blood in autumn conditions and in the second gonotrophic, while those that fed on chicken did not experience these changes. Why might that be?

      The discussion was restructured to include all these suggested changes. Additionally, it was also discussed some possible explanations of our results.

      315-317: This statement is vague without a direct explanation of how this will provide insight. I suggest removing or providing an explanation of how this provides insight to transmission and forecasting.

      Changed as suggested.

      319-320: According to eLife policy, all data should be publicly available. From guidelines: "Media Policy FAQs Data Availability Purpose and General Principles To maintain high standards of research reproducibility, and to promote the reuse of new findings, eLife requires all data associated with an article to be made freely and widely available. These must be in the most useful formats and according to the relevant reporting standards, unless there are compelling legal or ethical reasons to restrict access. The provision of data should comply with FAIR principles (Findable, Accessible, Interoperable, Reusable). Specifically, authors must make all original data used to support the claims of the paper, or that is required to reproduce them, available in the manuscript text, tables, figures or supplementary materials, or at a trusted digital repository (the latter is recommended). This must include all variables, treatment conditions, and observations described in the manuscript. The authors must also provide a full account of the materials and procedures used to collect, pre-process, clean, generate and analyze the data that would enable it to be independently reproduced by other researchers."

      - so you need to make your data available online; I also understand the last sentence to indicate that code should be made available.  

      Data and code will be publicly available.

      Table 1: it is notable that in replicate 2, the autumn:mouse:gonotrophic cycle II fecundity and fertility are actually higher than in the summer, which is the opposite of reps 1 and 3 and the overall effect you reported from the model. This might be worth mentioning in the discussion.

      Mentioned in the discussion as suggested.

      Tables 1 and 2: shouldn't this just be 8 treatments? You included replicate as a random effect, so it isn't really a separate set of treatments.

      This table reflects the output of the whole experiment, that is why it is present the 24 expetiments.

      Figure 3: Can you please clarify if this is showing raw data?

      Changed as suggested.

      Note: grammatical copy editing would be beneficial throughout

      Grammar was improved as suggested.

    1. Welcome back. In this video, I want to cover another part of the AWS global network, specifically the Edge Network, and that's AWS Local Zones. Now, this is a key architectural concept that you'll need to understand for all of the AWS exams, and especially so for the real world. So, let's jump in and get started.

      Now, before we talk about local zones, let's just refresh our memory on what the typical region and availability zone architecture looks like without local zones. So, we have a region, and let's say that this is US West 2, and within this, we have three availability zones: US West 2A, US West 2B, and US West 2C. And then running in this region across those availability zones is a VPC.

      Now, an AWS region has high performance and resilient internet connections, and sitting between these and the AWS private zone is the AWS public zone. So, this is the zone where all of the AWS public services for that region run within. And then lastly, on our right, we have our business premises. What we know about this architecture so far is that it scales. It can grow with your requirements, and that's really important because this is fully managed within the region. We also know that it's resilient to failure. The failure of one availability zone won't impact other availability zones, assuming a solutions architect has designed a solution which has infrastructure duplicated across all of the availability zones and things in one availability zone consume from that availability zone only, often regionally resilient services.

      Now, what I haven't talked about until now is the effects of geographic distance. The availability zones in this region might be hundreds of kilometers away from the business premises. Now, this distance, even assuming that we're using fiber, can cause latency. And this latency causes a reduction in performance, and this performance impact is noticeable at this distance. To many use cases, a few milliseconds of latency might not sound like much, but for applications which are sensitive to latency, this can really matter. An example might be a financial trading application. Even if we use Direct Connect, physics and the speed of data transfer from point A to point B matters. So, how can we fix this? Well, we can use AWS local zones, and let's see how this changes the architecture.

      Let's adjust the diagram a little and make it easier to see. And we're going to add some subnets in availability zones 2A, 2B, and 2C. And we'll also have some EC2 instances running in these subnets. When we're discussing local zones, we can refer to this region as the parent region. So, this region is the parent region to any local zones which operate in the same geographic area. So, we're also going to add some local zones to this architecture. Now, these are identified starting with the region name and then a unique identifier for the local zone. In this example, we have US West 2 and then LAS-1, which is a local zone in Las Vegas. And we have US West 2 as its parent region. So, you can see the link between the local zone and the parent region because you can read the parent region at the start of the local zone name.

      Now, it's possible to have multiple local zones in a given city. For instance, in this example, we have US West 2-LAS-1A and 1B. And both of these are in Los Angeles. Notice how they use the international city code to identify them. Now, think of these as related to the parent region, but they operate as their own independent infrastructure points. So, they have their own independent connections to the internet. And additionally, generally, they also support Direct Connect, which means you can achieve high performance, private connectivity between your business locations and these local zones.

      Now, different services support local zones in different ways. And over the course of your studies, you're going to learn how. With EC2 and VPCs, the VPC is simply extended by creating subnets within the local zones. And then within these subnets, you can create resources as normal, utilizing the proximity of the local zone. So, these resources benefit from super low latencies. The performance between the business premises and the local zone is at the extreme end of what's possible because of the smaller geographic separation between the local zone and your business premises.

      Now, an important thing to keep in mind is that some things within the local zones still utilize the parent region. So, in this example, the subnets created in the local zones behave just like those in the parent region, and they have private connectivity just like any other subnets would. Local zones have private networking with the parent region. So, remember that. However, if we create EBS snapshots, then these use S3 in the parent region. It means they still benefit from the AZ replication across all availability zones within that region that snapshots would normally benefit from. So, certain things occur within the local zone, but certain things rely on the parent region. And one common example is EBS snapshots.

      Now, let's finish up this video with some key summary points because for most of the AWS certifications, you only need to have this high-level architectural overview. So, think about local zones as one additional zone or one additional availability zone, so they don't have built-in resilience. Conceptually, one zone runs in one specific facility. So, you can think of them like a single availability zone but near your location. So, they're closer to you, so they have lower latency, and lower latency means better performance. So, just imagine taking one of the availability zones within a region and duplicating it but putting it in a building next to your business premises. Now, it won't always be that close, but there are some businesses which are built very close to these AWS local zones by design. So, you're able to get really close to the AWS infrastructure.

      Now, not all AWS products support using local zones, and for the ones that do, many of them are opt-in and many of them have limitations. So, if you're ever going to utilize local zones, you need to make sure that you check the AWS documentation for an up-to-date overview of what's supported within the local zones in your specific geographic area. And I've made sure to include a link attached to this video which gives you up-to-the-minute overviews for all of the AWS local zones.

      Now, Direct Connect to local zones is generally supported, and this allows local zones to be used to support any extreme performance needs or performance requirements. And once again, local zones do utilize the parent region for various things, and one example is EBS snapshots are taken to the parent region and replicated over S3 in that parent region.

      Now, just to summarize this, you should use local zones as an architect when you need the absolute highest level of performance. Local zones, much like CloudFront edge locations, are much more likely to be positioned closer to your business than the parent region and any of the normal availability zones. But if you do utilize local zones, you need to make sure that they do offer the functionality that you require. So, essentially, this is just another tool that you can use to build architectures as a solutions architect.

      Now, this is everything I wanted to cover in this video. I just wanted to give you a high-level overview of the architecture of local zones. So, go ahead and complete the video, and when you're ready, I'll look forward to you joining me in the next.

    1. Welcome back, and in this lesson, I want to cover the high-level architecture of Amazon Lex. Amazon Lex is a product that allows you to create interactive chatbots. For most areas of study and for solutions architects working in the real world, you only need a basic level of understanding, and that's exactly what this video will provide. If you need to know anything beyond this, the course you're studying will likely include follow-up videos to this one. If not, don't worry—this video will cover everything that you need. Now let's jump in and get started.

      Amazon Lex is a back-end service. It's not something you're likely to use from a user perspective. Instead, you'll use it to add capabilities to your application. Lex provides text or voice conversational interfaces. For the exam, remember “Lex for voice” or “Lex for Alexa.” If you're familiar with Amazon voice products, just know that Lex powers those products—it provides the conversational capability. It's what lets the lady in the tube answer your questions.

      Lex provides two main bits of functionality. First is automatic speech recognition (ASR), which is simply speech-to-text. Now, I say “simple,” but doing this well is exceptionally difficult. If any of you have tried using Siri, Apple’s voice assistant, you may have noticed how often it gets things wrong compared to the Alexa product. That’s because Siri doesn’t do ASR as well as Lex. And for any lawyers listening—this is just my opinion.

      Lex also provides natural language understanding (NLU) services, which allow it to discover your intent and even perform intent chaining. Imagine the act of ordering a pizza. You might start the conversation by saying, “Can I order a pizza please?” or “I want to order a pizza,” or even “A large pepperoni pizza, please.” The intent—the thing you want to do—is ordering pizza, and it's Lex's job to determine that. But what about your next sentence? “Make that an extra large, please.” Lex needs to understand that this second statement relates to the first. As humans, this is easy—we're good at natural language processing. Computers historically haven't been, but Lex enables voice and text understanding in your applications without needing to code that functionality yourself. You simply integrate Lex, and it does the hard work for you.

      As a service, Lex scales well and integrates with other AWS products such as Amazon Connect. It’s quick to deploy and uses a pay-as-you-go pricing model, meaning it only costs when you’re actively using it. This makes it ideal for event-driven or serverless architectures. In terms of use cases, Lex can help you build chatbots—the kind that pop up on websites asking if you need help—or automated support chats for logging tickets. You can also build voice assistants that respond when you ask for something, just like the lady in the tube. Use cases also include Q&A bots or enterprise productivity bots—basically, any interactive bot that accepts text or voice and performs a service.

      Let’s now review some of the key Lex concepts. Lex provides bots that are designed to interactively converse in one or more languages. I previously mentioned the term "intent." This represents an action the user wants to perform—things like ordering a pizza, ordering a milkshake, or getting a side of fries. In addition to intents, we have the concept of utterances. When creating an intent, you can provide sample utterances—these are ways an intent might be expressed. So to order a pizza, milkshake, or fries, a user might say “Can I order,” “I want to order,” or “Give me a.” These are all different ways of expressing or uttering an intent.

      Along with configuring utterances, you also need to tell Lex how to fulfill the intent, and this is often done using Lambda integration. If Lex understands that the user wants to order a pizza, it needs a way to initiate that process—Lambda functions are typically used for this purpose. Lambda works especially well in event-driven architectures, making it a natural complement to Lex. Additionally, Lex includes the concept of a slot, which you can think of as a parameter for an intent. These might include the size of the pizza (small, medium, or large), the type of crust (normal or cheesy), and other similar details. You can configure slots as required parameters that Lex must gather from the user during the interaction.

      Just to reiterate, Lex is a product you won’t usually interact with directly through the console. It’s something you’ll architect into your applications. If you want to provide interactive voice assistance via a chat or voice-capable bot, you’ll use Amazon Lex. So remember this for the exam.

      With that being said, that is everything I wanted to cover in this video. Go ahead and complete the video, and when you're ready, I’ll look forward to you joining me in the next.

    1. Welcome back and in this lesson I want to talk about Amazon Athena. This product is one of those hidden gems available within AWS which are really valuable as long as you understand the features that it provides. So let's quickly jump in and explore the architecture.

      So what is Athena? Well, it's a serverless interactive querying service. Put simply, it means that you can take data stored within S3 and perform ad hoc queries on that data, paying for only the amount of data consumed while running the query and the storage used within S3 to store the original data. It has no base monthly cost, no per minute or per hour charges, you just pay for the data consumed.

      Now what's really special about Athena is how it handles the structured, semi-structured and even unstructured data that it uses. Athena uses a process called schema on read. And the way that I want you to imagine this is like a window or a lens through which you see the data in a certain way but where the original data is unchanged. Your original data stored on S3 is never changed. It remains on S3 in its original form. The schema which you define in advance modifies data in flight as it's read through the schema. So it translates the original unmodified source data into a table-like structure as it's read through the schema. As you query the data, the original data is read, left unmodified and the translation only happens within the product during the querying process. I can't stress this enough, the original data is maintained in its unmodified state within S3.

      Normally with databases you create tables and you have to load data into those tables. The data needs to be in the format of the tables or you need to perform ETL processors which stands for extract, transform and load. With Athena this isn't required. You define how you want the data to look in the form of a schema and in a non-modifying way data is loaded through this on the fly. And then any output can be sent to other AWS services.

      Now let's look at this visually because it's going to be easier to understand if you see the architecture. So Athena starts with the source data which is stored on S3 and conceptually this is read only. It's never modified. Now the product supports a wide range of data formats and this is growing all of the time. Some examples include XML, JSON, comma and tab separated values, Avro, Parquet, Ork and even custom application log formats such as Apache and AWS services such as CloudTrail, VPC Flow Logs and more. So this data on S3 is fixed. It doesn't get changed and that's probably one of the services most fundamental concepts that you need to understand. And it's why I've repeated it probably 10 times already in this lesson.

      So inside the product you create a schema and in this schema you're essentially defining tables. These tables define how to get from the format of the original source data to a table like structure. So unlike a traditional database where a table is the final structure, in Athena you're defining a way to take the original data and present it in a way that you want which allows you to run queries against these tables. It's almost like a recipe. You're defining how to convert from ingredients to a final meal. It's a method to get from the source data to the structure that you want to be able to query.

      So these tables within Athena don't actually contain data like a traditional database product. They contain information, directives on how to convert the source data to be able to query on it. So this schema is used at the time of querying when data is read and this is why it's called schema on read. The data is conceptually streamed through the schema while being queried so it can be queried in a relational style way using normal SQL like queries. And the output can be displayed on the console, saved or output to other AWS tools.

      And all the time for this whole process there's no base or constant cost. You just pay for the amount of data consumed by the query and you can even optimize the original data set to reduce the amount of data that has to be used for individual queries. The key thing to understand about Athena going into the exam is that it has no infrastructure. You don't need to think about setting up any database infrastructure in advance. You don't need to think about data manipulation in advance and you don't need to load data in advance. Keep those things in mind when going into the exam. They will help inform you when Athena is the right choice and when it's not.

      So Athena is great in situations where loading or transforming of data isn't desired. Where you have data already on S3 in a source or raw format and you need to query it without doing any loading or transformation. In the demo lesson which is coming up next you'll see an example of using a large data set, the open street map data. If you needed to load and transform that prior to use it would massively reduce its utility. A benefit of Athena is how you don't need to do any loading or transformation of data in advance. And this makes it ideal for ad hoc or occasional queries of data in S3. Why? Well because you don't need any servers running in advance and you don't need to think in advance about loading or transformation. You have a business need and immediately run a query.

      Athena is also useful if you're a cost-conscious business. It's great because it's servilous. You pay for any data read as part of a query. There are no base costs and no upfront costs. Again, think ad hoc, sporadic and cost effective. Athena is also the preferred solution especially in the exam for any queries which involve AWS service logs because it has native support of VPC flow logs, cloud trail logs, elastic load balancer logs, cost reports and much more. And it can also query data from the Glue Data Catalog and supports web server logs. And again, these are keywords to look for in the exam.

      A newer feature of Athena is called Athena Federated Query. Now be really careful with this one because I don't want you being confused. For most situations if you see SQL mentioned or no SQL mentioned or any specific database product then the answer to that question is likely not to be Athena. But Athena now has the capability to query non-S3 data sources. Athena uses data source connectors that run on AWS Lambda to perform federated queries. So a data source connector is basically a piece of code that can translate between a target data source which isn't S3 and Athena. So you can think of a connector as almost like an extension to Athena's querying engine.

      So you've got pre-built connectors which exist for data sources like cloud watch logs, DynamoDB, DocumentDB, Amazon RDS and even other JDBC compliant relational data sources such as MySQL, Postgres and many more. So Athena Federated Query really is a feature which is going to massively improve the utility of the product. Now that's all of the theory that you need to be aware of for the product as well as some of the key use cases that you might see in the exam. So at this point go ahead, finish this lesson and then when you're ready I look forward to you joining me in the next.

    1. Welcome back! In this lesson, I want to talk about a powerful feature of CloudFormation called custom resources. Now, we've got a lot to cover, so let's jump in and get started straight away.

      The way that CloudFormation is architected isn't complicated. You define logical resources within a template, and these define what you want CloudFormation to do, so what infrastructure you want it to create. CloudFormation uses these logical resources in a template to create a stack, and this stack creates physical resources. If you update the logical resources by updating and reapplying a template, then the physical resources are updated. If you remove a logical resource from the template and then reapply that template to a stack, then the physical resources are affected in the same way.

      CloudFormation doesn't support everything within AWS. It can lag behind in terms of products or features of those products, and there are some things which it just doesn't support or things it never will support. CloudFormation custom resources are the answer to anything that you want to do in CloudFormation that it doesn't support natively. Custom resources are a type of logical resource that allows CloudFormation to do things it doesn't yet support or doesn't natively support, or they allow integration with external systems.

      Examples of things that you can do with custom resources might include populating an S3 bucket with objects when you create it or deleting objects from a bucket when that bucket is being deleted. This is something that would normally error. If you try to delete a CloudFormation stack that contains a bucket with objects inside it, by default, it won't allow you to do that, and it will error. Another example is that you might want to request configuration information from an external system as part of setting up an EC2 instance. You can even use custom resources to provision non-AWS resources. So, using custom resources, the functionality of CloudFormation can be extended much beyond what it can support natively.

      Now, the architecture of custom resources is simple. CloudFormation begins the process of creating the custom resource, and in doing so, it sends data to an endpoint that you define within that custom resource. This might be a Lambda function, or it could be an SNS topic. Whichever one you pick, it sends an event to this resource. Whenever a custom resource is created, updated, or deleted, CloudFormation sends data to that custom resource. It sends event data, which contains the operation that's happening, as well as any property information. So, the custom resource, for example, a Lambda function, is invoked and provided with that information.

      Now, the compute that's backing that custom resource—let's use the example of a Lambda function—can respond to CloudFormation, letting it know of the success or failure, and it can pass back any data. Assuming a Lambda function which backs a custom resource responds with a success code, then everything is assumed to be good. The custom resource is created, and any data generated by that Lambda function is passed back into CloudFormation and made available to anything else within the CloudFormation template. So again, two options for how you can back custom resources are Lambda or an SNS topic.

      Now, let's look at this visually because it will help you understand the architecture. Then I'll show you a practical example from the AWS console. So let's consider a scenario where a CloudFormation template is used to create a stack that creates an S3 bucket. Let's look at this without using a custom resource. We start with a CloudFormation template, which is a simple one containing a simple S3 bucket logical resource. Using this template, we create a stack, and this creates a logical resource inside the stack, which in turn creates the corresponding physical resource, the S3 bucket.

      At this point, if you deleted the stack, it would delete the logical resource, which would delete the physical resource. If the bucket is empty, everything would work as expected. But let's say that a human gets involved, Gabby, and Gabby makes a manual change by adding additional objects into the bucket. Now we have a problem because the physical resource is out of sync with CloudFormation, and we have an even bigger problem because we have a bucket with objects inside it. If we tried to delete the stack at this point, CloudFormation would attempt to delete the logical resource, which would attempt to delete the physical one. But because the bucket contains objects, the delete operation would fail. This is just a limitation of S3 and CloudFormation. It's the type of situation you might hit when you're dealing with any complex architecture or when you need to do something that CloudFormation just doesn't support. This is one of the situations that custom resources aim to help with.

      Let's look at what capabilities custom resources provide us that might help in this situation. So we start off with the same basic components. We've got a CloudFormation template, which creates a stack. This time, the template has more resources than just the S3 bucket. But first, it creates an empty bucket, just like the previous example. However, in addition, it has a custom resource, and that custom resource is supported by a Lambda function. Because this custom resource is backed by a Lambda function, it means that when the custom resource is being created by the stack, the Lambda function is invoked or executed, and it's passed some data.

      This data is event data, and this data block contains anything given to the resource as properties. In this example, let's assume that the custom resource is provided with the bucket name of the bucket created by the CloudFormation stack, so the empty bucket. Now, for the sake of example, let's say we've designed this custom resource, this Lambda function, to download some new objects into this empty S3 bucket. This means that we now have a bucket with objects inside it, which is the same problem we had with the previous example.

      Now, in addition to the bucket name being provided to this custom resource, the event data also contains details of how the Lambda function—or how the thing that's backing up the custom resource—can respond back to CloudFormation. This is called the response URL. So, the Lambda function, because it completed successfully, sends a response back, a success response, to this response URL. This means the logical resource will be created successfully, and the stack itself will move into a "Create Complete" status, so all is good.

      So, we've used a custom resource at this point to download some additional objects into that S3 bucket. Now we have a CloudFormation stack in the "Create Complete" status and an S3 bucket with some objects in it. At this point, let's say that a human comes along, say Gabby again, and Gabby uploads some additional objects to the S3 bucket, such as additional cat images. We still have a bucket with objects, but now it's more objects than the custom resource added earlier. We have the objects added by the custom resource and the three additional cat pictures that Gabby has just manually uploaded.

      Now, let's say we're going to do a stack delete operation. We select the stack, right-click on it, and select "Delete Stack." This starts the process of deleting the stack. Now, the stack has two logical resources and two corresponding physical resources: the bucket with objects and the custom resource backed by a Lambda function. In the above example, CloudFormation immediately tried to delete the bucket with objects, which is why it failed. But in this example, because when the custom resource was created, it needed the bucket to already have been created, CloudFormation knows that the custom resource depends on the bucket. There's a dependency. So, when you're deleting a stack, CloudFormation will follow the reverse of that dependency. This means the custom resource will be deleted before the S3 bucket.

      What happens now is that CloudFormation starts the deletion of all the resources contained in the stack, but it starts with the custom resource. The way it does this is by sending the message to the Lambda function. The message has a similar structure to when the stack was created: an event data block that contains the fact that the stack is being deleted and still contains the name of the S3 bucket. The Lambda function performs whatever actions are configured for a delete operation, which in this case is to remove all the objects from the S3 bucket. Once this has been completed, once the Lambda function has finished all its operations, it will signal back to CloudFormation that this was successful. Again, this will happen by using the response URL that's contained in the event data.

      Once the success response has been completed, the stack will delete the custom resource. Once the custom resource has been deleted, there will be no further dependencies inside the stack, and the stack will go ahead and delete the S3 bucket. This time, this will succeed because the S3 bucket is empty, and because the deletion of the S3 bucket completes successfully, the stack itself can be deleted, and the process completes successfully. By using a custom resource, we can add additional capability to CloudFormation. We can make it download additional objects into an S3 bucket and also have it clean up any objects, including those added by a human being outside of CloudFormation, before the S3 bucket is deleted. By doing it this way, we can avoid any stack deletion issues caused by buckets that contain objects.

      This is a simple example of how we can extend the functionality of CloudFormation by using custom resources. With that being said, that's everything I wanted to cover. Go ahead and complete this video, and when you're ready, I look forward to you joining me in the next.

    1. Welcome back and in this lesson I want to briefly cover CloudFormation change sets. Now this is a feature which makes it safer to use CloudFormation within a full infrastructure as code environment or when CI/CD processes are being used within your organization. So let's jump in, step through what change sets are and what benefits they provide.

      The usual flow that you engage with with CloudFormation goes something like this. You take a template, use it to create a stack which creates physical resources based on the logical resources in the template. That's a create stack operation. Or you delete a stack which deletes the physical resources created by the stack. Or you can take a newer version of a template, maybe it has additional resources or maybe it's a bug fix. In either case you take that new template, apply it to an existing stack and this changes existing physical resources and this is known as an update stack operation.

      When a stack update occurs, when logical resources are changed which results in changes to physical resources, that change has one of three effects. We have no interruption and this is where certain changes made to a stack might not impact the operation of the physical resource. The change is just made and that's it. Next is some interruption which might mean something like an EC2 instance rebooting. It's not a damaging event but it can impact service. And finally certain changes might cause a replacement which creates a new copy of that physical resource and the old one is removed. This is disruptive and can result in data loss so it's critical to keep this in mind when making changes to existing CloudFormation stacks.

      Now change sets let you apply a new template to a stack but instead of applying the change immediately it creates a change set which is an overview of the changes to be applied to the stack. What makes change sets powerful is that you can create many different change sets for a stack so you can preview different changes with different new versions of the template and when you've reviewed the change set or change sets then you can choose to discard them or you can pick one to apply by executing it on the stack which creates the stack update operation and updates the logical and physical resources managed by that stack.

      Now visually the architecture looks like this. Let's use a simple example and don't worry we're going to be stepping through this in the console very shortly. So this example is a CloudFormation template which creates three buckets: catpicks, dogpicks and memes. So we use this to create a stack which creates the three S3 buckets. So far so good. But now let's say that we didn't actually mean to create the memes bucket. We only like animal pictures. Memes are just no good. So we create a new template and we use this to update the stack and because we're not using change sets immediately it deletes one of the S3 buckets: the memes bucket. The memes bucket has been removed from the template, its logical resource. Because of that, it is also removed and that means the physical resource that the stack manages is deleted from your AWS account.

      Now using change sets we can improve this. The starting point is the same: we create a stack with the version 1 template. But instead of using the version 2 template to update the stack, instead we create a change set. Now this is a distinct thing, an object which represents the change between the original stack and the new version of the template. We can create one or more of these but when we're satisfied we can execute that change set against the stack and this has the same effect as the top method but we have more control and visibility over the changes, especially with larger and more complex templates.

      Okay so I'm going to move across now to my console and demo how this works in practice. If you really want you can also do this in your own environment. I have included the CloudFormation templates within this lesson's folder on the course GitHub repository. But for this one I would suggest just watching. It's probably not something that's worth the effort of implementing yourself. I just want you to be aware of how this looks from the console UI.

      Okay so I've switched over to my console and to do this demo lesson I need to be logged in as the IAM admin user of the general AWS account, so that's the management account of the organization. And as always I've got the Northern Virginia region selected. So I'm going to go ahead and move across to the CloudFormation console. So I'll type "cloud formation" in the search box at the top and then click to move to the CloudFormation console. And just before I apply anything, these are the templates which I'm going to be using in this really brief demo lesson. So we've got template one and this is a really simple CloudFormation template which creates three S3 buckets, and then template two which is going to be the one that I update the stack with. This only has two S3 buckets. So the first template creates cat pics, dog pics and memes, and then the second template only has cat pics and dog pics.

      So let's move back to the console: create stack, upload a template file, choose file and then depending on the course that you're doing inside the infrastructure-as-code folder or the CloudFormation folder or the deployment folder, the name will vary depending on the course. You should see a change sets folder and in there we've got these two templates: template one and template two. Select template one and click on open. Then I'm going to scroll down and click on next and I'm going to call this stack "change sets", click on next, scroll to the bottom, next again, scroll to the bottom and click on create stack. So this is going to create our three S3 buckets. We can see that it's doing them in parallel: cat pics, dog pics and memes. That should only take a few seconds. There we go. One more refresh and it's moved into create complete.

      Now that that's in create complete, if I click on resources we'll be able to see the three logical resources: cat pics, dog pics and memes, and their corresponding physical resources, so the three S3 buckets. Now there's a change sets tab, which if we click on you'll see that there are no current change sets for this stack and we can create a change set from here or we can go to stack actions and then create change set for current stack. So that's what we're going to do. We're going to create a change set. So I'm going to click on create change set. This dialog looks much like the one that you would get if you're just updating a stack without using change sets. So I'm going to replace the current template. I'm going to upload a template file and I'm going to choose template 2. And then once that's loaded click on next, click on next again, scroll down, next again, scroll all the way down and then create the change set.

      And we're going to name the change set. So I'm going to call it "change sets - version 2". You can call this anything you want but I always find that it's useful to have the stack name at the start and then some kind of version indication. And if you wanted to type a description you could do that. Let's say "removing memes" and then create the change set. Initially it will show as create pending. I'll hit refresh. It will show as create complete. And now this is a separate entity in its own right. We've created a change set. We've not actually updated the original stack. So if I go back to stacks, we'll still see the "change set" stack. If I click on it and go to resources, all three of the logical and physical resources still exist.

      So whilst we've uploaded this new template version and created a change set, we haven't actually done anything with this change set. So to use a change set, let's go and click on the change sets tab and then open up this change set. Again we'll see a visual list of all the changes that CloudFormation has detected between the version of the template the stack's using and this change set. And in this particular case it's telling us that an action of remove is occurring against the logical ID of "memes" and this physical ID. So because we've removed this logical resource from this template, it's telling us that the logical resource will be removed along with the corresponding physical resource.

      We can click on this template tab and see an overview of the template that's part of this change set. In this case it's the template without the "memes" logical resource. If I click on the JSON changes button, you'll see a JSON formatted overview of exactly what's happened between the version of the template the stack's using and the version in this change set. It's a list of JSON objects and it's one JSON object per change. So in this case there's only one change which is logical resource ID "memes" and the action is to remove. And this is how you can get a really accurate overview of the changes between the version of the template that the stack's using and the version of the template that's contained within this change set. So it can form part of a really rigorous change management process within your business.

      Now just keep in mind at this point I could have the single change set for this stack or I could have 10 change sets. And I can list them all individually. I can delete them all individually. But if I wanted this change set to be applied against the stack, I could do so by clicking on execute. So I'm going to do that. As soon as I click on execute, then it's going to run the update stack operation. And at this point, the changes would be exactly the same as if you just updated the stack, applied a new template and executed that immediately. The template that the stack is using will be changed to the one contained in the change set. The logical resource in this case will be removed and the corresponding physical resource will also be removed. And the end effect of this will be an updated stack. It has an update complete status. And if we go ahead and click on resources, we can see that this memes bucket has been completely removed from this stack. And that's a really simple example of how you can use change sets within CloudFormation.

      Now at this point that's everything which I wanted to demonstrate in this lesson. I'm going to go ahead and click on delete and then delete stack. And if you've been following along in your own environment, you need to do the same to return the account into the same state as it was at the start of this lesson. With that being said though, that is everything I wanted to cover. So go ahead, complete this video, and when you're ready I look forward to you joining me in the next.

    1. Trolling can be used, in principle, for good or bad ends.

      I've seen a lot examples of both bad and good aplication of trolling. It was used to challenge discriminations and deliver encourages, like the ice bucket challenge and the "we are all XXX" protests, but what we have deeper impressions are the bad and chaotic trollings that doesn't make sense or is offensive, such as the reddit and forum trolling that targets less vigilant individuals and even tries to control them in real life. Also, there are weird and funny trollings that are so well known and classic that they become a cultural icon, like the Rick Astely trolling that appears everywhere with a superlink, QR code or any kinds of id or website.

    1. Welcome back and in this lesson I want to continue on from the last lesson where I stepped through nested stacks only this time I want to talk about cross stack references which are similar but used in a very different set of architectural scenarios. So let's jump in and get started.

      In the last lesson I talked about this architecture. I talked about how nested stacks could be used to get past the cloud formation resources per stack limit. I talked about how if you wanted to reuse templates for modular parts of architecture, for example creating a standard VPC template once and then reusing it, then nested stacks were ideal. And I talked about how if you wanted to simplify the process of creating large infrastructure using cloud formation you could do it by using nested stacks because a single root stack could create any nested stacks which it needed.

      Now the limitation of nested stacks was that if you reused a particular template, say the VPC template, you would only reuse the code not the actual VPC that it creates. If you implemented 10 root stacks each of which was identical then you'd have 10 application stacks, 10 active directory stacks and 10 VPC stacks which included 10 VPCs. Now in some cases we want to consume a shared component, for example the VPC, for lots of different implementations. The problem is the isolation which is a design feature of cloud formation.

      Let's say that you have a stack which is a well-structured and secured VPC and you want this to be a shared VPC usable by other application stacks in the same region and the same account. The issue is that stacks are by design self-contained and isolated. There's a logical boundary around each stack which means that things in one stack can't be by default referenced in another. In this example if we were deploying EC2 instances into AppStack 1 and AppStack 2 they couldn't natively reference the subnets created by the shared VPC stack. Now you could manually add the VPC ID and the subnet IDs into AppStack 1 and AppStack 2 but that means they're static parameters, they're not references and this is where cross stack references come in handy if you want one stack to be able to reference the resources created in another in order to reuse those actual resources.

      Now to understand the benefit of cross stack references first understand that because of the isolation of stacks normally the outputs of stacks are only visible from the user interface or the command line. You can't use the built-in ref function of cloud formation to reference anything from one stack in another. The exception to this as I detailed in the previous lesson is that root stacks can reference the outputs of nested stacks but that architecture as you also learned means that the stacks are linked in terms of their lifecycle. Sometimes like when you want to create a shared VPC architecture you actually want a situation where a VPC might have a long running lifecycle and applications which use that specific VPC they might have a short lifecycle so you don't want to define all of those as part of the same nested stack.

      An example of this imagine you work for a software development company each time a new version of your application is committed to a github repository you want to use cloud formation to create a VPC run the application in the VPC and operate a set of tests before tearing it all down. Now you could create an isolated VPC each time using a nested stack architecture but if you wanted to save costs you could use the same shared VPC the same set of NAT gateways and the same set of subnets to do this outputs of a template can be exported. An export is defined within an output of a stack it takes that output and adds it under an exported name to a list of exports in one region of your account so the export name has to be unique.

      So for a shared VPC design some examples of what you might choose to export might be VPC ID, subnet IDs, side arrangers, security group IDs anything which you could expect to use elsewhere so external to that shared VPC stack. To repeat though the export name needs to be unique inside one region of your account. To use the export inside another stack instead of using the ref function which is how you reference other resources in the same stack you use the import value function you provide import value with the export name and it returns the value exported in that other stack so that's how you can use exports from one stack in another.

      So let's have a look visually at how this works. Architecturally we start with a single AWS account running in one particular region in this example US East 1. Inside this region we have a VPC stack and we want this to be a shared services VPC which can be used by other stacks. Step one is inside that stack make sure that anything we want to use is added as an output for example the VPC ID. So this is an example of the output section for this particular stack we've got shared VPC ID as an output and then we use the reference function to reference the actual VPC logical resource that's created inside this stack.

      Now this means this output will be visible from the command line or the console UI but to use this value in any other stack in this region of the account we need to use the export directive to export that value to a list within one region of your account and this is the exports list and this operates per region per account so this is only visible inside your account in one specific region. Every region inside your account has its own list of exports everyone else's accounts and all of the regions in those accounts each of those has their own dedicated list of exports so within the exports list any of the exports need to be unique so we can only have one export called shared VPC ID within that region of that account.

      Now once a value is in the exports list it can be referenced in other stacks using the import value function this function replaces the ref function and remember the ref function is what you can use to reference other logical resources inside a single stack the import value function when used in a stack allows you to reference values which are exported from other stacks and added to the exports list. Now this only works in the same region as a stack is being applied in cross region or cross account isn't supported for cross stack references so essentially the process is that you need to create an output in one stack export the value for that output into the exports list and then use the import value function to import that exported value into each stack that you want to use it in and that's how you can create shared services by using cross stack references.

      Now there are a number of situations where you would choose to use cross stack references for example when you're implementing service oriented architectures i.e. when you need to provide services from one stack to another another example is if you have a churn of short lived applications which all consume from a shared services VPC then you don't want them to be in the same stack or as part of a nested stack if you have things which have different life cycles long versus short then you want to separate them into different stacks and use cross stack references if you want to reuse a stack so reuse the resources created by a stack rather than reusing a template then cross stack references are ideal for the exam I want you to be clear that a template is not a stack or vice versa a template is used to create one or more stacks each stack is unique if you want to reuse a template then you can choose to use nested stacks which allow you to use the same template that you've created once in many distinct architectures a VPC template for example might be used as part of an email system a financial system or for the implementation of hundreds of different isolated client environments that's if you want to reuse a template and each time you reuse that template it creates its own distinct infrastructure but if you want to reuse an actual stack so the resources inside a stack as with this example of a shared VPC then you should use cross stack references rather than nested stacks so cross stack references allow you to reuse actual resources nested stacks allow you to reuse templates they're very different things I hope by this point it makes sense understanding the differences for the exam is essential and if you pick the wrong one in a real-world situation the results can be less than ideal at this point though that's everything that I wanted to cover so go ahead and complete this lesson and when you're ready I look forward to you joining me in the next.

    1. Welcome back. In the next two lessons I want to cover two features of Cloud Formation. In this lesson I'm going to be covering Cloud Formation nested stacks and in the lesson following I'll cover cross stack references. Now we've got a lot to cover so let's jump in and get started.

      Most simple projects and deployments which use Cloud Formation will generally utilize a single Cloud Formation stack and a Cloud Formation stack is isolated meaning it contains all of the AWS resources that the project needs. These might be things such as a VPC, DynamoDB, S3, maybe EC2 and Lambda, SNS, SQS and maybe even a directory service. Now there's nothing wrong with having a Cloud Formation stack built in this way. It's isolated, the resources inside it are created together, they're updated together and eventually they're deleted together. The idea is that all of the resources within a Cloud Formation stack share a life cycle. Stacks make it simple to package everything up into one collection of resources.

      Now designing Cloud Formation in this way where everything's contained in one single stack is fine as long as you don't hit any of the limits that might impact your project. There are a few things that you need to be aware of. The first is that there is a limit of 500 resources per stack and for larger deployments this could be a problem. Another issue with isolated stacks is that you can't easily reuse resources. If you had a stack like this one which created a VPC it's not practical to reference that VPC in other stacks which might also want to use it. Stacks are by design isolated by default. You can use the ref function to reference resources from other resources in the same stack but you can't use this to reference resources in other stacks.

      So stacks are isolated. You have to treat them as self-contained groupings of infrastructure which share the same life cycle. So you create a stack that creates all of the resources, you update a stack that updates all of the resources and eventually you delete the stack and that deletes all of the resources. Everything shares the same life cycle.

      At the professional level or just for any projects which are complex you'll tend to use a multi-stack architecture. So you'll implement your project using multiple stacks and there are two ways to architect a multi-stack project. Nested stacks and cross stack references and choosing between them is what I want you to be fully comfortable with for the exam and in this lesson we're going to be starting by looking at nested stacks. So let's look at that architecture next.

      Nested stacks technically are pretty simple to understand. You start with one stack which is referred to as the root stack. In this example this stack is both the root and the parent stack. A root stack is the stack which gets created first. So this is the thing that you create either manually through the console UI or the command line or using some form of automation. So the root stack is the only component of a nested stack which gets created manually by an entity either a human or a software process. Now a parent stack is the parent of any stacks which it immediately creates. So complex nested stack structures can actually have multiple levels. A root stack can create several nested stacks and each of those can in turn create additional nested stacks. So a parent stack is just a way that we can refer to anything which has its own nested stacks. So in this case this stack is going to be a root stack and a parent stack.

      Now a root stack can have parameters just like a normal stack and also have outputs also just like a normal stack and that's because a root stack is just a normal stack. There's nothing special about nested stacks. Inside all stacks you have logical resources and examples of these that you've seen so far include S3 buckets, a virtual private cloud or VPC and maybe even a DynamoDB table. Now you can also have a cloud formation stack as a logical resource and you define it using the type of AWS double colon cloud formation double colon stack and this is a logical resource just like any other only it creates a stack of its own. So you have to give the nested stack a URL to the cloud formation template which will be used to create it.

      So that template will contain its own resources it's just a normal cloud formation template as I've just mentioned it could even contain its own nested stack. So in this case HTTPS colon forward slash forward slash some URL dot com forward slash template dot yaml is the URL to a template which will be used to create this nested stack the stack that's called VPC stack. Now you can also provide nested stack resources with some parameters in this example we're creating a nested stack called VPC stack and if the template for VPC stack had three parameters so param one, param two, param three then we would need to provide values into that stack as it gets created. For every parameter that the template has for this nested stack we need to provide a value as we create it if not the stack creation process will fail.

      So in this particular case the template dot yaml file that's used for VPC stack has three parameters param one, two and three and when we're creating it as a nested stack we need to supply parameters for those values that are used to create that stack. Now the exception to this is if the VPC stack template had default values for its parameters if it has default values then we wouldn't have to provide those when creating it as a nested stack but it's best practice to populate the VPC stack logical resource with parameters for everything which is parameterized within the template that's used to create that stack.

      So in this case we have the root stack it currently has one logical resource VPC stack this creates a nested stack resource and we're passing in these three parameter values. So when the VPC stack nested stack finishes creating then the logical VPC stack resource within the root stack moves into a create complete status and any outputs of that nested stack are returned to the root stack and these can be referenced using the logical resource name of the nested stack so VPC stack and then dot outputs and then the actual output name of the nested stack. So you can only reference outputs when using nested stacks you can't directly reference logical resources created in any of the nested stacks you can only reference the outputs that you make visible when creating the nested stack.

      Now we might also have other nested stacks contained within the root stack and because these are also logical resources they too would be created but they might have dependencies either ones which cloud formation calculates or one where we use the depends on directive to explicitly inform cloud formation that there is a dependency between different stacks for example we might have an active directory nested stack called AD stack which depends on the VPC stack whether it's a self-managed active directory or one provided by directory service it will need to run from a VPC and so it will depend on that VPC and that VPC is getting created within the VPC stack nested stack.

      Now the root stack can take the outputs from one nested stack and give them as parameters to another examples of this might be the VPC ID or the subnet IDs of the resources created inside the VPC stack. Once the AD stack finishes creating it too might have outputs which are then returned to the root stack then we might create another nested stack perhaps an application stack and this might depend on the AD stack maybe it uses active directory for user authentication and once complete this application stack can also provide its outputs back to the root stack maybe this is a login URL for the application itself as each of the nested stacks finished provisioning the resource in the root stack will be marked as create complete and once all of the logical resources for the nested stacks are complete then the root stack itself will be marked as create complete.

      Now there are two really important aspects to nested stacks that you need to understand in order to pick between nested stacks and cross stack references which I'll be talking about in the next lesson. First by breaking up solutions into modular templates it means that these templates can be reused in this example we have VPC stack which is probably something that can be used again and again for different deployments if you upload the template somewhere then many nested stack architectures can use that template and crucially this is reusing the code or the template for a stack it's not reusing the same stack itself so we're not reusing the VPC that's being created by VPC stack what it means is that we can reuse the template that created VPC stack so by uploading the template for the VPC stack other nested stacks can reuse this same YAML template but if you do reuse the same VPC template in another stack it will create a separate VPC the benefit is the ability to reuse the same templates you're not reusing the same stacks now the AD stack template can also probably be used for different projects which use Active Directory but again every time that this particular template is reused it will create a different Active Directory you're reusing the code not the actual resources this isn't the same when we use cross stack references which I'll be covering in the next lesson because then you're actually reusing resources that are created by a stack when using nested stacks you're reusing the template not the actual stack so you generally use nested stacks when you've created individual building blocks so modular templates and you can reuse each of these templates to form part of a single solution which is life cycle linked so you might be able to reuse the template which creates a VPC on lots of different nested stacks but crucially it would always create a dedicated VPC you would not be using the same VPC in the same stack you would be recreating a new stack and new resources each and every time now nested stacks are generally used when all of the infrastructure that you're creating is forming part of the same solution when it's life cycle linked in this example the application needs Active Directory which needs a VPC it's unlikely that one will exist without the other you aren't going to want to switch out Active Directory for another Active Directory or the VPC for another VPC it's likely that these will all be created together operate together and maybe someday be deleted together now nested stacks do allow for a few main benefits before we finish this lesson I just want to summarize them use nested stacks when you want to overcome the resource limit of using a single stack if you have five stacks together as a nested stack you can have 2,500 resources use nested stacks when you're modularizing your templates that way you can create a VPC template once and use it for many implementations but remember if you use nested stacks and each one of those projects will create its own physical VPC with nested stacks you're only reusing the template so the code you're not reusing resources themselves use nested stacks when you want to make stack installations easier this is because you can apply a root stack and have that root stack automatically orchestrate the application of many nested stacks the one single decision point between using nested stacks and cross stack references is only use nested stacks when everything is life cycle linked when everything in the stack structure needs to be created with each other updated with each other and eventually deleted with each other if you're anticipating needing one part long term but not others then nested stacks are the wrong choice if you imagine needing to use the same actual VPC across multiple implementations then cross stack references are probably better suited and we'll talk about that in the next lesson if you want to make frequent changes to one part of an application and not others then it's probably better to have individual non nested stacks and utilize cross stack references which we'll talk about next okay so that's everything I wanted to cover about nested stacks in the next lesson I'll be comparing this to cross stack references so go ahead and complete this lesson and when you're ready I look forward to you joining me in the next.

    1. reply to u/No-Entertainer-7217 at https://old.reddit.com/r/typewriters/comments/1k45o89/typewriters_needing_new_ribbons/

      "Working but needs new ribbon" is almost always seller's code for I have no idea if it really works, but I'm going to try to sell you this machine for the price of a fully functioning machine that was just serviced by a professional shop despite the fact that I just took it out of grandpa's barn and I'm not sure if the mouse inside is dead or not. Also, I can't afford $10 to replace an old ribbon to truly participate in the charade of the price I'm going to try to fleece you with.

      Educate yourself a bit: <br /> - https://boffosocko.com/2025/03/29/first-time-typewriter-purchases-with-specific-recommendations-for-writers/ - buying typewriter ribbon: https://site.xavier.edu/polt/typewriters/tw-faq.html#q1 and https://boffosocko.com/research/typewriter-collection/#Typewriter%20Ribbon - typewriter 101: https://www.youtube.com/playlist?list=PLJtHauPh529XYHI5QNj5w9PUdi89pOXsS

      If you want to tell people Tom Hanks taught you to change a typewriter ribbon: https://www.youtube.com/watch?v=GBbsNKaVAB0. There are lots of other YouTube videos out there for specific machines as well as reviews of them.

      You can definitely test out a machine without ribbon, but if they're trying to charge you over $100 for a machine, it really should have fresh ribbon.

  7. social-media-ethics-automation.github.io social-media-ethics-automation.github.io
    1. Mia Jankowicz. A TikToker said he wrote code to flood Kellogg with bogus job applications after the company announced it would permanently replace striking workers. Business Insider, December 2021. URL: https://www.businessinsider.com/tiktoker-wrote-code-spam-kellogg-strike-busting-job-ad-site-2021-12 (visited on 2023-12-05).

      Kellogg employees went on strike but the company decided to handle this by hiring replacement workers.

      A Twitter use shared some code that sent in fake applications to Kellogg. Although trolling, it seemed a somewhat of a tactical way to delay the hiring process in order to give the worker's fair pay and treatment. This is probably one of the better ways of trolling where it's going towards advocacy in worker rights and treatment.

    2. Assassination of Martin Luther King Jr. November 2023. Page Version ID: 1186577416. URL: https://en.wikipedia.org/w/index.php?title=Assassination_of_Martin_Luther_King_Jr.&oldid=1186577416#Alleged_government_involvement

      This source highlights a tweet that comedian Jaboukie Young-White made back in 2020, impersonating the FBI making a foul comment about the assassination of Martin Luther King. This is not the first time Young has impersonated a public figure, which highlights an important issue regarding digital media. Not only does trolling include leaving comments and insults for other users, but another form of trolling is impersonating someone you're not. This is especially interesting because in this class we are learning to code bots to make posts for us, would that be a form of impersonation?

    1. Welcome back and in this video I want to cover two things which are at the core of CloudFormation as a product - physical resources and logical resources. In covering both of those you're also going to be learning about templates and stacks, so this will be a good video to cover the basics of CloudFormation. Now we've got a lot to cover so let's jump in and get started.

      CloudFormation begins with a template which is a document written in either YAML or JSON, both of which you should now have an awareness of. Defined within a CloudFormation template are logical resources. Think of logical resources as what you want to create but not how you want them created. When using CloudFormation you focus on the what and let CloudFormation deal with the how.

      CloudFormation templates can be used to create CloudFormation stacks. A template can be used to create one stack, a hundred stacks or twenty stacks in different regions. The idea is that one template defines what resources you want, and defining good templates means a template can be used many times in many accounts in many regions. We refer to that as a portable template.

      The initial job of a stack is to create physical resources based on the logical resources defined within the template. For every logical resource in a template when a stack is created a physical resource is also created. If a stack's template is updated in some way and then the stack itself is updated the physical resources are also changed. The stack keeps the logical and physical resources in sync. If a stack is deleted then normally the physical resources are also deleted. So think about CloudFormation as a product which looks at a template specifically the logical resources within a template, and then it creates, modifies or deletes physical resources as required.

      So visually it looks like this. This is a CloudFormation template and this one has been written using YAML. The template contains logical resources. In this example instance is the name of the logical resource and this is the type, so AWS double colon EC2 double colon instance. Now logical resources are generally going to have properties which are used by CloudFormation when configuring the actual physical resources. In this example this sets the Amazon machine image to use, the type of the instance and the SSH key pair to use when connecting to the instance.

      So the collection of logical resources and other things which I'll be covering in future videos is called a CloudFormation template. This template can be used to create one or many CloudFormation stacks, and a stack when created also creates physical resources based on the logical resources. This means because we've set the AMI to use in the template and the SSH key to use these will be used when creating the physical resource, in this case an EC2 instance. So this physical EC2 instance is a representation of the logical resource defined in the CloudFormation template.

      Now the stack will also react to template changes to update or delete physical resources as required. Once a logical resource defined inside the CloudFormation template moves into a create complete state, meaning that the physical resource has been created, then the logical resource can be referenced by other logical resources to retrieve various physical configuration elements or IDs, for example in this case the physical machine ID of the EC2 instance.

      So in summary logical resources are contained inside CloudFormation templates. CloudFormation templates are used to create CloudFormation stacks and the stacks job is to create, update or delete physical resources based on what's contained in that template. CloudFormation as a product aims to keep the two in sync, so physical and logical resources.

      When you use a template to create a stack, CloudFormation will scan the template and create a stack with logical resources inside and then create physical resources which match those logical resources. If you update the template then you can use it to update that same stack. When you do that the stack's logical resources will change, either new logical resources will be added or existing ones are updated or deleted and CloudFormation will perform the same actions on the physical resources. So adding new ones, updating existing ones or removing physical resources entirely. If you delete a stack its logical resources are also deleted which causes it to delete the matching physical resources.

      CloudFormation is a really powerful tool which you'll be using extensively in the real world and this is the same whether you're a solutions architect, a developer or an engineer. I use CloudFormation constantly in all of the AWS courses that I create and so by taking the courses you'll be gaining a lot of practical and theory understanding of how CloudFormation works.

      Now if you're taking any of my courses with my CloudFormation mini deep dive then you'll be learning even more by talking about every important aspect of CloudFormation that's relevant for the course that you're taking as well as giving you plenty of practical examples. CloudFormation lets you automate infrastructure. Imagine that you host WordPress blogs. You can use one template to create one, ten, a hundred or more deployments rather than having to create a hundred individual sites.

      CloudFormation can also be used as part of change management. You can store templates in source code repositories, add changes and get approval before applying them. Or they can be used to just quickly spin up one-off deployments and if you're taking any of my AWS courses you'll be seeing that I'll be using CloudFormation extensively as part of any of the practical demo lessons in the course. We'll be using templates to spin up any of the infrastructure that will support the demo lesson that you're going to be taking.

      Now that's all of the theory that I wanted to cover about physical and logical resources within CloudFormation. It is a fairly theoretical topic but you need to understand what a physical resource is, what a logical resource is and how the two relate together as far as they're used within CloudFormation. Now at this point that's everything that I wanted you to cover in this video so go ahead and complete the video and when you're ready I'll look forward to you joining me in the next.

    1. Welcome back and in this lesson I want to talk about AWS config. Now let's just jump in and get started because we've got a lot to cover.

      AWS config is an interesting service because people often misunderstand what it does. This is especially important within exam situations where you don't have the benefit of Google and have to make architectural decisions quickly.

      Now AWS config has two main jobs. Its primary function is to record changes over time on resources within an AWS account. Once enabled, the configuration of every resource in the account is monitored. Every time a resource's configuration changes, a configuration item is created which stores the configuration of that resource at a specific point in time. The information which is stored is the configuration of the resource, the relationship to other resources and who makes any changes. So for example, if you had a security group attached to an instance and you added a rule to that security group, then it would track the pre-change state, the post-change state, the fact that you changed it and the fact that it was attached to that EC2 instance.

      Now this makes AWS config great for auditing changes and for checking if resources are compliant with standards defined by your organization. The most important thing to understand about AWS config is that it doesn't prevent changes happening. It's not a permissions product or a protection product. Even if you define standards for resources, it can check compliance against those standards but it doesn't prevent you from breaching those standards and creating non-compliant resources. An example of compliance might be a certain set of allowed ports within security groups. You can add additional ports exposing an instance to a certain amount of risk. Now AWS config won't stop you but that non-compliance, that additional port will be identified.

      Now config is a regional service so when enabled it monitors changes within a particular AWS region in a particular AWS account but it can be configured for cross-region and cross-account aggregation. It can also generate notifications via SNS and it can generate events via EventBridge and Lambda when resources change in terms of their compliance state so while AWS config won't prevent you changing something it can be used for automatic remediation.

      Now the product stores all of the configuration data and changes in a consistent format within the S3 config bucket and the product allows you to access that data so all of the configuration history of all of the resources and you can interact with them directly from that bucket or using the AWS config APIs.

      Now there are two sides to AWS config, the features which are standard and the parts of the product which are optional. Now the standard part is on the left and the optional part is on the right. So starting on the left we have some account resources and we have AWS config. To use the product we have to enable it and this enables the recorder functionality and this takes config information of all the resources and stores them in an S3 bucket, the config bucket and this is all part of the standard functionality provided by the product.

      Now you could just enable all of this functionality and leave this as it is. This would allow you to record and review all changes to resources over time. Every time a change happens a configuration item would be generated and all of these for all resources would be stored in a standard format in the config bucket.

      But we can do a lot more with the product and this is where the real power of AWS config comes from because we can use config rules. Now config rules are either AWS managed ones or you can define your own which uses Lambda. What happens is that these rules evaluate resources against a defined standard. Resources based on these rules are either compliant or non-compliant based on if they meet criteria specified within the config rule. Now custom rules use Lambda to evaluate if resources match criteria. The Lambda function does the evaluation using whatever things that you can code and then returns information back to AWS config.

      AWS config can then notify or work with other products for automatic remediation. For example, it can use SNS to send either a stream of changes or compliance notifications and these will either go to human operators or other applications to deal with. In addition though you can integrate AWS config with EventBridge. So for any changes in the state of config rules whenever anything becomes compliant or non-compliant this event can be sent to EventBridge and then EventBridge can be used to invoke Lambda functions to perform automatic remediation of any changes. So to fix the problems automatically.

      Now this isn't strictly part of AWS config. You're essentially using EventBridge to send any events from AWS config to targets to perform this automatic remediation. You can also fix these type of config changes using SSM. So AWS config can integrate with systems manager and apply fixes to remediate any issues. But Lambda can be more flexible for account level things whereas SSM can be effective for anything relating to the configuration of instances.

      Now that's all of the theory that I wanted to cover in this lesson. Go ahead and complete this lesson and then when you're ready I look forward to you joining me in the next lesson.

    1. Welcome back, and in this lesson, I want to cover something that starts to feature much more in AWS exams, and that's CloudFront Lambda at Edge. You don't need to have any experience implementing it, but you do need to know how it's architected and what it's capable of doing. So let's jump in and get started.

      Lambda at Edge is a feature of CloudFront that allows you to run lightweight Lambda functions at CloudFront Edge locations. These Lambda functions can adjust traffic between the viewer and the origin in a number of interesting ways, and we'll talk about some of those soon. However, there are some limitations that you need to be aware of because the Lambda functions are running at the Edge. They don't have the full Lambda feature set. Currently, only Node.js and Python are supported as runtimes. You can't access any VPC-based resources since the functions are running in the AWS public zone, and additionally, Lambda layers are not supported. Lastly, the functions have different size and execution time limits compared to normal Lambda.

      Now, you don't need to memorize all of these facts. What's more important is a good understanding of the architecture. So let's look at that next. Lambda at the Edge starts with the traditional CloudFront architecture. So, customers on the left, the Edge locations in the middle, and the origins on the right. Any interaction between a customer, Edge location, and origin consists of four individual parts of that communication: the connection between the customer and the Edge location, which is known as the viewer request; the connection between the Edge location and the origin, known as the origin request; when the origin responds, there's a connection between the origin and the CloudFront Edge, known as the origin response; and finally, the connection between the Edge location and the customer, known as the viewer response.

      Now, with Lambda at the Edge, each of these individual components of the wider communication can run a Lambda function, and this Lambda function can influence the traffic as part of that connection. A viewer request Lambda function runs after the CloudFront Edge location receives a request from a viewer. An origin request function runs before CloudFront forwards that request onto the origin. An origin response function runs after CloudFront receives a reply from the origin, and then finally, a viewer response function runs before the response is forwarded back to the viewer.

      Now there are limits on how the Lambda functions can run in each part of the architecture. At the viewer side, a Lambda at Edge function has a limit of 128 MB for memory allocation and a function timeout of five seconds. At the origin side, the memory limits are the same as a normal Lambda, but with a 30-second timeout. Again, don't worry too much about these limits. For now, try to focus at a high level on what these limits mean, what types of architectures, and what type of adjustments these Lambda functions can perform on each of these different components of the flow of data between a viewer and an origin and back again.

      Now, let's look at some example solutions which involve Lambda at the Edge. I've included a link attached to this lesson which gives a few examples of situations where you would use Lambda at the Edge. I can't give an exhaustive list in this lesson because, just like Lambda itself, you can pretty much do anything that you want as long as you can code it within a Lambda environment. But a couple of common examples that you might find useful for the exam: First, you can use Lambda at the Edge to perform A/B testing. This is generally done with a viewer request function. You can use a viewer request function in an A/B testing scenario to present two different versions of an image without creating redirects or changing the URL. In this architecture, the function views the viewer request and modifies the request URL based on which version of an image you want the viewer to receive. With this architecture, the Lambda function can modify the viewer request and change the URL based on any logic that you can define inside a Lambda function. It can be things like a percentage-based algorithm or it can be random chance.

      Another scenario which you might use Lambda at the Edge for is running a function as part of the origin request. This can be used to perform a gradual migration between different S3 origins. You can use a function running in this part of the architecture to gradually transfer traffic from an existing S3 origin over to a new one, and you can do so in a controlled way. For example, based on a weighted value, which represents a percentage of the traffic of your application, you can increase the percentage of traffic which goes to the new S3 origin versus the old. And all of this can be done in a controlled way without updating the CloudFront distribution.

      You can also use Lambda at the Edge to customize behavior based on the type of device that your customer has. Given a particular type of device or a particular capability of that device, you can display different objects. This might include different sizes of objects or objects with different quality levels. For example, if you have a device which has a high DPI screen, so a higher dots per inch, then you might want to display objects which themselves have a higher DPI value and save lower DPI objects for devices which can't support high DPI. So again, that's something that you can use Lambda at the Edge for rather than making any changes to the CloudFront distribution.

      You can also use Lambda at the Edge to vary the content displayed by country. So a Lambda function that's running in the origin request component of the communication can be used to adjust what gets displayed based on the country of the customer. Now, the link that I've included attached to this lesson gives a lot more examples of the types of scenarios that would benefit from Lambda at the Edge. In addition to the scenarios, most of these include example Lambda function code that you can implement in your own environments.

      So this is going to be something that's outside of the scope of this course. You only need to be aware of the architecture of Lambda at the Edge for this certification. But if you do want to experiment with this in your own time, then you can use the examples contained on this page, and again, the URL is attached to the lesson, and you can do your own experimentation. But at this point, that's all of the architecture that I wanted to cover in this lesson. I just want you to be aware of exactly what Lambda at Edge can be used for because you might get a question on it in the exam. So make sure that you're comfortable with all of the examples that I've given in this lesson and all of the examples which are included on the link that are attached to this lesson.

      Again, you won't need to remember all of the different facts and the execution limits and the memory amounts. It's all about the architecture. So if you familiarize yourself with all of the examples that are on screen now and the ones that are in the link attached to this lesson, then you'll have all of the information that you need to answer questions about this topic in the exam. But at this point, that's all of the theory that I wanted to cover in this lesson. So go ahead, complete the lesson, and then when you're ready, I'll look forward to you joining me in the next.

    1. Welcome back, and in this lesson, I want to talk about CloudFront TTL as well as CloudFront invalidations. Both of these features can be used to influence how long objects are stored at edge locations and when they're ejected. We've got a lot to cover, so let's jump in and get started.

      Now let's step through in detail exactly what happens with caching one image at one edge location. A simple architecture consists of an S3 origin on the left, an edge location in the middle, and three users at the top, middle, and bottom right. Let's say we have a photo of Whiskers the cat, and he's crying a little, so it's not his best-ever photo, but that's the one uploaded into the S3 bucket by the Categorum application. Our first customer makes a request for the picture of Whiskers. Since this image is not stored on the edge location, an origin fetch happens where the image is retrieved from the origin and placed on the edge location before being returned to the customer, which is the response.

      Now, let's say the image of Whiskers is replaced in the origin. We take away the picture of Whiskers the cat crying and replace it with a much better one, showing a much happier-looking picture of Whiskers. However, now that this image has been replaced on the origin, the copy on the edge location is still the old version. What happens when another customer makes a request to the same edge location? This time, the older or bad image of Whiskers is returned. Why? Because it's the one that's cached at the edge location, and from an object perspective, it's still viewed as valid. Even though the origin has a new version, it’s never checked because the cached copy in the edge location is viewed as valid by CloudFront.

      There are ways to influence this, which I'll explain later in the lesson, but for now, you need to understand that this architecture can be problematic. At some point, every object cached by CloudFront will expire. When that happens, it doesn't get immediately discarded, but it's viewed as stale, meaning it's no longer current. If another customer requests a copy of this object, the edge location doesn't immediately return the object. Instead, as with step two, it forwards the request to the origin, and the origin will respond in one of two ways. This depends on the version of the object cached at the edge location versus the one stored in the origin. It's either current, or the origin has an updated object.

      If the object is current, then a 304 "Not Modified" response is returned, and the object is delivered directly from the edge location to the customer, marking the copy in the edge location as current again. If there is a difference between the edge location and the origin—meaning the origin has a newer version—then a 200 "OK" message is returned along with the new version of the object, which replaces the one cached at the edge location. This is how an edge location behaves within the CloudFront network.

      An object stays in the cache ideally for the entire time that it's valid. However, if the edge location faces capacity issues, it could eject the object early. Even when an object expires, the next time it's accessed, assuming it's still within the edge location cache, the edge location checks the version of the object it has versus the one in the origin. If they're the same, a 304 code is returned, and the object is not updated but marked as current again. A new version is only transferred if the communication between the edge location and the origin determines that the version of the object in the origin has been updated.

      The problem to understand is step four on this diagram. The middle user received an old version of the object, even though a newer version was stored in the origin. This is an issue because even though the updated copy existed in the origin, the user received a copy of the old object. This is something to be aware of, as there are ways to influence this, which we will explore further in the lesson.

      Now, let's talk about object validity. An edge location views an object as not expired when it's within its TTL (Time to Live) period. Before we go further, it's important to understand that the more often the edge location delivers objects directly to your customer (a cache hit), the lower the load on your origin, which results in better performance for the user. So, where possible, we want to avoid edge locations needing to perform origin fetches, as that would degrade CloudFront's performance.

      Objects cached by CloudFront have a default validity period of 24 hours, defined on a behavior within a distribution. The default TTL is 24 hours, meaning any objects cached by CloudFront using this behavior will have a TTL of 24 hours. After this time, the object is viewed as expired. Additionally, you can set two other values: the minimum TTL and the maximum TTL. These values, on their own, do not influence caching behavior but set lower and upper bounds for the TTL of individual objects.

      It’s possible to define per-object TTL values. If you don't specify an object TTL, the default TTL attached to the behavior is used (the 24-hour default). An origin, whether an S3 bucket or a custom origin, can direct CloudFront to use object-specific TTL values using headers. These headers include Cache-Control: s-max-age and Cache-Control: max-age, both of which are set in seconds. These headers tell CloudFront to apply a TTL value in seconds for a particular object. Once the specified TTL has passed, the object is viewed as expired.

      We also have the Expires header, which works differently from the above headers. Instead of specifying a number of seconds, this header specifies a date and time when the object should be viewed as expired. For both the headers specifying seconds and the Expires header, the minimum and maximum TTL values set on the behavior act as limiters. If a per-object TTL is lower than the minimum TTL for the behavior, the minimum TTL is used, and if it's higher than the maximum TTL, the maximum TTL is applied.

      It’s important to understand this architecture. The default TTL on a behavior is 24 hours, and this applies to any object without a per-object TTL set. You can also set minimum and maximum TTL values that act as limiters for any per-object TTLs specified using headers like Cache-Control and Expires. These headers can be set using custom origins or S3. If you're using custom origins, the headers can be injected by your application or web server. If using S3, these headers are defined in the object metadata and can be set via the API, command line, or console UI.

      Let's now cover one last topic before we finish the lesson: cache invalidations. Cache invalidations are performed on a distribution, and whatever invalidation pattern you specify is applied to all edge locations within that distribution. Invalidation is not immediate but expires any objects regardless of their TTL based on the pattern you define. Examples include invalidating a specific object using a specific path (e.g., /images/whiskers1.jpeg), using a wildcard to invalidate objects starting with a pattern (e.g., /images/whiskers*), or invalidating all objects in a given path (e.g., /images/*).

      There is also the option to invalidate all objects cached by a distribution with the wildcard /*, affecting every edge location. Keep in mind that there is a cost to perform cache invalidations, and this cost is the same regardless of the number of objects matched by the pattern. Cache invalidation should only be used to correct errors, and if you're frequently updating or invalidating individual files, using versioned file names might be a better solution.

      For example, using versioned file names like whiskers1_v1.jpeg and replacing it with whiskers1_v2.jpeg allows you to update the file without needing invalidation. This approach avoids the cost of invalidations, ensures that even cached copies in browsers won't interfere with the updated version, and provides better logging and consistency across edge locations.

      Remember, versioned file names differ from S3 object versioning, which involves different data stored under the same name. Using versioned file names means different file names for each version of an object, ensuring they are cached independently on each edge location. This approach is cost-effective and consistent, making it the preferred choice when you're frequently updating objects.

      That's all the theory I wanted to cover in this lesson. Thanks for watching. Go ahead and complete the lesson, and when you're ready, I look forward to you joining me in the next one.

    1. Welcome back, and in this lesson, I want to go into some more depth about Amazon Cognito, which is one of the core identity products available within AWS. Now, we do have a lot to cover, so let's jump in and get started. This is going to be one of the most important non-graphical screens of information in the entire course. I want to make sure that you understand the terrible naming within the Cognito product. Cognito provides two main pieces of functionality. Both are very different, but both are essential to understand.

      The service as a whole provides authentication, authorization, and user management for web and/or mobile applications. Authentication means to log in to verify credentials, authorization means to manage access to services, and user management means to allow the creation and management of a serverless user database. Now, there are two parts of Cognito: user pools and identity pools, and the naming on these is terrible. This is why most students struggle to understand the detail of how Cognito works. The end goal of a user pool is to allow you to sign in, and if successful, you get a JSON web token, known as a JWT. This JWT can be used for authentication with applications, and certain AWS products such as API Gateway can even accept it directly. But, and this is crucial to understand, most AWS services cannot use JWTs. To access most AWS services, you need actual AWS credentials.

      Now, user pools do not grant access to AWS services; their job is to control sign-in and deliver a JWT. So they do things like sign-up and sign-in services. They also provide a built-in customizable web user interface to sign in users. They provide certain security features such as multi-factor authentication, check for compromised credentials, and offer account takeover protection, as well as phone and email verification. You can also implement customized workflows and user migration by using Lambda triggers, and we'll talk about that if applicable elsewhere in the course.

      Now, where it gets confusing is that user pools, as well as allowing sign-in from built-in users, they also allow social sign-in using identities provided by Facebook, Google, Amazon, Apple, as well as offering sign-in services using other identity types such as SAML identity providers. But the important thing to understand is this is about offering a joined-up user management experience. At no point can a user pool be used to directly access most AWS resources. When you think of user pools, imagine a database of users, which can include external identities. They sign in and they get a JWT. That's it. I'm stressing this point because it's really important to conceptually separate this from an identity pool, which is coming up next.

      Now, the aim of an identity pool is to exchange a type of external identity for a set of temporary AWS credentials, which can then be used to access AWS resources. Now, one option is unauthenticated identities, which can be used to offer guest access to AWS resources. Imagine you have a mobile application and want to allow high scores to be stored in a leaderboard, which is hosted using DynamoDB, and you want to offer this without a user having to sign up, and this is one way to do that. Identity pools can also be used to swap an external identity for temporary AWS credentials, and this means things like Google identity, Facebook, Twitter, SAML 2.0 for corporate logins, and even user pool identities. So from an identity pool perspective, user pools are just treated as another form of identity.

      Now, all of these are examples of authenticated identities. If another identity provider, which we trust, says that they have authenticated successfully, then identity pools will exchange that identity for temporary AWS credentials. Now, I hope at this point that you do see the difference. User pools are about offering a joined-up sign-up or sign-in experience with user directory and profile management services. So it's about login and about managing user identities. Identity pools are about swapping either an unauthenticated or authenticated identity for AWS credentials, and one possible type of identity is actually a user pool identity. And this is a reason why these two different components of Cognito are often difficult to separate because they can operate together.

      Now, identity pools work by assuming an IAM role on behalf of the identity. That assumption generates temporary credentials, and they're provided back in return in most cases to a mobile or web application. These IAM roles are configured within identity pools, and there's going to be a demo coming up very soon where you can experience that. Now, for the rest of this lesson, let's just step through a few architecture overviews, and I think doing this visually will help you to understand how the product works. First, let's step through an architecture that just uses user pools. Remember, user pools are about user management, sign-in, sign-up, and anything associated with that process.

      So we start with a web and mobile application and a Cognito user pool with both internal identities and social sign-in. So anyone can sign in to the pool with any type of identity, and the result is a Cognito user pool token, also known as a JSON web token or JWT. This user pool token proves that the identity has been used to sign in, and it now represents a Cognito user pool user. Whether an internal user is used or a social identity, the authenticated identity is now a user pool identity. And this token can then be used to access self-managed resources such as applications running on servers that you manage or accessing databases which you also manage. It can also be used with an API Gateway, which is capable of accepting user pool tokens directly. Remember, these are known as JWTs. An API Gateway is capable of accepting JWTs for authentication.

      So let's focus for a second on what's just happened. A user pool is a collection of identities of users. It's used to allow sign-up and sign-in both for internal users and social sign-ins. The tokens which are generated as a result can be used for self-managed systems, and the tokens can be used to authenticate for API Gateway. But, and this is the single biggest thing to remember about Cognito, these tokens cannot be used to access AWS resources. In general, that requires AWS credentials, and AWS credentials can be handed out via identity pools. So let's look at those next.

      So we have the same web and mobile application. This time, though, we aren't using user pools. We're allowing customers to log in directly using external identities. How this works is as follows. We start with a collection of supported external identities. And these include the same social identities, which I've demonstrated previously with user pools. Our application allows users to sign in with any of those external identities. So when they click on a sign-in button within our application, they're directed at an external ID provider sign-in page. You might have experienced one of these before. This is an example of the sign-in with Google page. After a customer authenticates with their Google credentials, which it's worth pointing out that we never have access to, because this sign-in takes place on the external identity provider, in any case, we receive a Google token as a result, but it could be a Facebook token, an Amazon token, or whatever external ID provider is used. Crucially, it can be one of many different types.

      If we want to support many different external identity providers, then we need to configure that support. But now that we have this external ID provider token, this proves that a user has logged in with an external ID provider. This token can't be used to access AWS resources, and that's where identity pools come in handy. Our application takes this token and passes it to an identity pool that we've configured. We've configured this to support every external identity that we want to allow logins from. This is a key thing to keep in mind. If we want to support five external ID providers, we need five different configurations, five different types of tokens to be supported.

      What happens next is that Cognito is configured with roles, at least one for authenticated identities and one for unauthenticated or guest identities. In this case, we have an authenticated identity, the Google token. And so, on our behalf, Cognito assumes a role and generates temporary AWS credentials, which are then passed back to the application. The application can then use these credentials to access AWS resources. Once they expire, the application renews them again using Cognito, and the process continues. The permissions the application has are based on the roles' permissions. At no point does the application store any credentials within code or any credentials permanently. So the process is that an external identity provider authenticates a user, Cognito identity pools swap the external ID token for temporary credentials, and these are used to authorize access to AWS resources.

      So once again, focus on the fact that user pools are about sign-in and sign-up for users, and identity pools are about swapping identity tokens from an external ID provider for temporary AWS credentials. These are two very different and isolated tasks. Now, you can use user pools and identity pools together to fix one small lingering problem. With this configuration, your application has to be able to deal with many different ID tokens from many different external providers. Now, one option is that we could use user pools to handle the many different types of identity, and then we can use identity pools to swap the Cognito user pool token for AWS credentials.

      Now, the swapping of any external ID provider token for AWS credentials is known as Web Identity Federation, and you're going to experience that term both in the real world and in the exam. So let's quickly step through the final architecture, which combines both user pools and identity pools. We start with a user pool, and this is configured to support external identities and its internal store of users. Whatever is used, whichever identity type is used to log in, the identity that's authenticated is now a Cognito user pool user. So there's only one type of token which is generated, whether sign-in is using internal users or social sign-in. This is the user pool token or JWT.

      By using a user pool, we've abstracted away from all of the configuration of many different external ID providers. We have conceptually one user store to manage, one set of user profiles all provided via a Cognito user pool. So if you log in with a user pool user or if you log in via the user pool but using Google credentials, the outcome is the same. A user pool token is returned to the application, so the user pool simplifies the management of identity tokens. Next, the application can pass this user pool token into an identity pool, and this assumes an IAM role defined in the identity pool, which generates temporary AWS credentials, and these temporary credentials are returned to the application. The benefit to this approach is that the identity pool need only be configured with a single external identity provider, the user pool. But otherwise, the process is the same as using an identity pool directly, just with less admin overhead. The application can then use those AWS credentials to access AWS resources, and that's pretty much everything I wanted to cover.

      Now, in summary, user pools manage user sign-up and user sign-in, either internal or using social logins. And what you get as a result is a user pool token, also known as a JSON web token or JWT, and that is the output of any form of sign-in using user pools. Now, identity pools swap external identity tokens for AWS credentials. This process is called federation. External identity tokens can be direct external identity tokens, such as Google, Amazon, Facebook, and many others, or they can be user pool tokens, which can themselves represent external ID logins. Once an application uses an identity pool to gain access to temporary AWS credentials, it can access AWS resources.

      Now, this process allows for a near-unlimited number of users. An unlimited is much more than the 5000 IAM user limit, which means this is great for web-scale applications. Now you're going to get experience of identity pools in an upcoming advanced demo. For now, though, that's everything that I wanted to cover. Really try to focus on understanding the two different parts of Cognito really well. I promise it will be helpful for both the exam and for the real world. Now, at this point, that's everything I wanted to cover, so thanks for watching. Go ahead and complete this video, and when you're ready, I'll look forward to you joining me in the next.

    1. Welcome back and in this lesson I want to cover Amazon Kinesis Data Analytics. This is a real-time data processing product, and it's critical that you understand its features together with when you should and shouldn't use it for the exam. Before I start talking about Kinesis Data Analytics, I want to position the product relative to everything else. Kinesis data streams are used to allow the large-scale ingestion of data into AWS and the consumption of that data by other compute resources known as consumers. Kinesis Data Firehose provides delivery services. It accepts data in and then delivers it to supported destinations in near real-time and it can also use Lambda to perform transformation of that data as it passes through. Kinesis Data Analytics is a service that provides real-time processing of data which flows through it using the structured query language known as SQL. Data inputs at one side, queries run against that data in real-time, and then data is output to destinations at the other.

      The product ingests from either Kinesis data streams or Kinesis Firehose and can optionally pull in static reference data from S3, but I'll show you how that works visually in a moment. Now after data is processed, it can be sent on in real-time to destinations, and currently, the supported destinations are Firehose and indirectly any of the destinations which Firehose supports. But keep in mind, if you're using Firehose, then the data becomes near real-time rather than real-time. The product also directly supports AWS Lambda as a destination, as well as Kinesis Data Streams, and in both of those cases, the data delivery is real-time. So you only have near real-time if you choose Firehose or any of those indirect destinations. If you use Lambda or Kinesis Data Streams, then you keep the real-time nature of the data. Conceptually, the product fits between two streams of data: input streams and output streams, and it allows you, in real-time, to use SQL queries to adjust the data from the input to the output.

      Now let's look at it visually because it will be easier to see how all of the various components fit together. So on the left, we start with the inputs, the source streams, and this can be Kinesis Streams or Kinesis Firehose. In the middle, we create a Kinesis Analytics application; this is a real-time product, and I'll explain what that means in a second. The Kinesis Analytics application can also take data in from a static reference source, an S3 bucket, and then the Kinesis Analytics application will output to destination streams on the right, so Kinesis Streams or Kinesis Firehose. Remember, all of these are external sources or destinations; they exist outside of Kinesis Data Analytics. Kinesis Data Analytics doesn't actually modify the sources in any way. What actually happens is this: inside the analytics application, you define sources and destinations known as inputs and outputs.

      So conceptually, what happens is for the input side, objects called in-application input streams are created based on the inputs. Now you can think of these like normal database tables, but they contain a constantly updated stream of data from the input sources, the actual Kinesis Streams or Firehose. These exist inside the analytics application, but they always match what's happening on the streams which are outside of the application. Now the reference table is a table which matches data contained within an S3 bucket and it can be used to store static data which can enrich the data coming in over the streams. Consider the example of a popular online game where a Kinesis Stream has all of the data about player scores and player activities. In this particular case, the reference table might contain data on player information which can augment the stuff coming in via the stream. So if the stream only contains the raw score and activity data, then the reference data will contain other metadata about those players, so maybe player names, certain items the player has, or awards, and these can all be used to enrich the data that's coming in real-time from Kinesis Streams.

      Now the core to the Kinesis Analytics application is the application code, and this is coded using the structured query language, or SQL. It processes inputs and it produces outputs. So in this case, it operates on data in the in-application input stream table and the reference table, and any output from the SQL statement is added to in-application output streams. Again, think of these like tables which exist within the Kinesis Analytics application; only these tables map onto real external streams, so any data that's outputted into those tables by the Kinesis Analytics application is entered onto the Kinesis Stream or Kinesis Firehose, and then these will feed into any consumers of the stream or destinations of the firehose. Additionally, any errors generated by the SQL query can be added to an in-application error stream, and all of this happens in real time. So data is captured from the source streams via the in-application input stream, the virtual tables. It's manipulated by the analytics application using the SQL query, and then stored into the in-application output streams which put that data into either the external Kinesis Stream or external Kinesis Firehose.

      All of this just to stress it again happens in real time, and if the output data is delivered into a Kinesis Stream, then it stays real-time. If the output data is delivered into a Kinesis Firehose, then it becomes near real-time, delivering to all of those supported destinations. Now, you only pay for the data processed by the application, but it is not cheap, so you should only use it for scenarios which really fit this type of need. Before we finish this lesson, let's talk about the scenarios where you might choose to use Kinesis Data Analytics. There are some particular use cases or scenarios which fit using Kinesis Data Analytics. At a high level, this is anything which uses streaming data that needs real-time SQL-based processing, so things like time series analytics, maybe election data and e-sports, things like real-time dashboards for games, high score tables or leaderboards, and even things like real-time metrics for security and response teams. Anything which needs real-time stream-based SQL processing is an ideal candidate for Kinesis Data Analytics.

      Now, I mentioned in the previous lesson that Data Firehose can also support transformation of data using Lambda, but remember the key differentiator is that Data Firehose is not a real-time product, and using Lambda you're restricted to relatively simple manipulations of data. Using Kinesis Data Analytics, you can create complex SQL queries and use those queries to manipulate input data into whatever format you want for the output data. So it has a lot more in terms of features than Data Firehose, so if you're dealing with any exam questions which need really complex manipulation of data in real-time, then Kinesis Data Analytics is the product to choose. Okay, so with that being said, that's everything that I wanted to cover in this theory lesson. Go ahead and complete the lesson, and then when you're ready, I look forward to you joining me in the next lesson.

    1. Welcome back and in this lesson I want to go into a little bit more depth about API Gateway. Now we've got a lot to cover in a single lesson so let's jump in and get started. API Gateway is a service which lets you create and manage APIs. Now an API is an application programming interface, it's a way that applications communicate with each other. So for example if you run the Netflix application on your TV then it's using an API to communicate with the Netflix back-end services.

      API Gateway acts as an endpoint or an entry point for applications looking to talk to your services, and architecturally it sits between applications which utilize APIs and the integrations which are the back-end services which provide the functionality of that API. Now API Gateway is highly available and scalable so you don't have to worry about either, it's delivered as a managed service. It handles authorizations so you can define who can access your APIs using the API Gateway, it can be configured to handle throttling so how often individuals can use APIs, it can perform caching to reduce the amount that your back-end services are called as part of the usage of your API, it supports cores so you can control security of cross-domain calls within browsers and it supports transformations and all of this within the API Gateway product.

      It also supports the open API spec which makes it easy to create definition files for APIs so APIs can be imported into API Gateway and it also supports direct integration with AWS services. So for things like writing into DynamoDB, starting a step function or anything through to sending messages to SNS topics you might not even need any back-end compute so it's capable of directly integrating with a range of AWS services.

      Now API Gateway is a public service and so it can act as the front-end for services running within AWS or on-premises and it can also be an effective migration product to provide a consistent front-end while the backing services are being moved from on-premises into AWS or even re-architected moving from monolithic compute services such as virtual servers through to serverless architectures using Lambda. Now lastly it can provide APIs that use HTTP, REST or even web socket-based APIs.

      Now visually this is how the high-level architecture of API Gateway looks. We have API Gateway in the middle here and this is acting as the endpoint for the consumers of our API and this could be mobile applications or the APIs or even web applications loaded from static hosting within an S3 bucket. In any case these all connect to the API running on the API Gateway using the endpoint DNS name.

      Now it's actually the API Gateway's job to act as an intermediary between clients and what are called integrations and these are the back-end services which provide the functionality to API Gateway. API Gateway is capable of connecting to HTTP endpoints running in AWS or on-premises, it can use Lambda for compute and this is something that's typically used within serverless architectures and as I mentioned previously it can even directly integrate with some AWS services such as DynamoDB, SNS and step functions.

      Now there are three phases in most API Gateway interactions: the request phase which is where the client makes a request to the API Gateway and then this is moved through API Gateway to the service provided by the integrations, and then finally the response phase where the response is provided back to the client. The request phase at a high level does three things: it authorizes, validates and transforms incoming requests from the client into a form that the integration can handle and then the response takes the output from the integration, it transforms it, prepares it and then returns it through to the client.

      API Gateway also integrates with CloudWatch to store logging and metric-based data for request and response side operations and it also provides a cache which improves performance for clients and also reduces the number of requests made to the back-end integrations. So that's the high-level architecture and through the remainder of this lesson I want to touch on a number of the pieces of functionality in a little bit more detail and we'll start with authentication.

      API Gateway supports a range of authentication methods. Now you can allow APIs to be complete open access so no authentication is required but there are different types of authentication which are supported by the product and let's use the example of the Categorum application which is now serverless. API Gateway can use Cognito user pools for authentication, this is one of the supported methods. If this method is used then the client authenticates with Cognito and receives a Cognito token in return assuming a successful authentication, it passes that token in with the request to API Gateway and because of the tight integration which API Gateway has with Cognito it can natively validate the token.

      So that's Cognito but API Gateway can also be extended to use Lambda-based authorization which used to be called custom authorization. With this flow we assume that the client has some form of bearer token something which asserts an identification and it passes this into API Gateway with the request. Now at this point API Gateway not knowing how to natively validate this authentication or authorization it calls a Lambda authorizer and it's the job of this function to validate the request. So it either does some custom compute maybe checking a local user store or it calls an ID provider an external provider of identification to check the ID. If this all comes back okay and the Lambda function is happy it returns to API Gateway an IAM policy and a principal identifier, API Gateway then evaluates the policy and it either sends the request on to a Lambda function so invoking the Lambda function or it returns a 403 access denied error if the access is denied.

      Now IAM can also be used to authenticate and authorize with API Gateway by passing credentials in the headers but this level of detail is beyond what's required for the exam. I just think it's useful to give you the architecture visually so you can picture how all the components fit together.

      At this point let's move on and talk about endpoint types. With API Gateway it's possible to configure a number of different endpoint types for your APIs. First we've got edge optimized and with edge optimized endpoint types any incoming requests are routed to the nearest cloud front pop or point of presence. We've also got regional endpoints and these are used when you have clients in the same region so this doesn't deploy out using the cloud front network instead you get a regional endpoint which clients can connect into so this is relatively low overhead it doesn't use the cloud front network and this is generally suitable when you have users or other services which consume your APIs in the same AWS region. Lastly we have private endpoint types and these are endpoints which are only accessible within a VPC via an interface endpoint so this is how you can deploy completely private APIs if you use the private endpoint type.

      The next concept I want to talk about are API Gateway stages. When you deploy an API configuration in API Gateway you do so to a stage for example you might have the prod and dev stage for the Categorum application. Most things within API Gateway are defined based on a stage so in this case you could have the production application connecting to the prod stage and developers testing new additions via the dev stage. Each of these stages has its own unique endpoint URL as well as its own settings. Each of these stages can be deployed onto individually so you might have version one of the API configuration deployed into production and this uses version one of a lambda function as a backing integration and then we might have version two which is currently under development deployed into the dev stage and this also could use a separate backing lambda function containing the new code.

      Now you can roll back deployments on a stage so they can be used for some pretty effective isolation and testing but what you can also do with API Gateway stages is to enable canary deployments on stages. What this means is that when enabled any new deployments which you make to that stage are actually deployed on a sub part of that stage the canary part of that stage and not the stage itself. So traffic distribution can be altered between the base stage and the canary based on a user configurable value and eventually the canary can be promoted to be the base stage and the process repeated.

      In this example it means that version two of the API configuration can be tested by the development team and then canary can be enabled on production, version two can be deployed onto production and this will be deployed into the canary because canary is enabled on the production stage. We can adjust the distribution of traffic between the main production stage and its canary until we can completely happy and then we can promote the canary to be the full base stage and this process of development production cycles can then continue. If you're not happy with how a canary is performing, if it's got bugs or if it's negative in terms of performance then you can always remove it and return back to the base stage.

      Now at this point I have to apologize I hate getting you to remember facts and figures but for the exam I genuinely think these facts and figures might help so do your best to note them down and remember them even if you only do it at a high level even if you only get the basics I think it will help you answer certain exam questions quicker and with less thought.

      So to start with error codes generated by API gateway are generally in one of two categories: first we have 400 series error codes and these are client errors this suggests that something is wrong on the client side so something wrong either on the client or in terms of how it's making a request through to API gateway maybe permissions are wrong maybe headers are malformed anything that's on the client side, then we have 500 series errors and these are server errors so this indicates that there's a valid request but there's a back-end issue.

      Now inside both of these categories there are a number of important requests that you need to remember the error code number four and I want to step through these on this part of the lesson. So 400 – 400 this is one that's really hard to diagnose because it can actually have many different root causes but if you do see a 400 error then you should at least be aware that it's a generic client side error, we've got 403 and this suggests an access denied error so either that the authorizer has executed and then indicates to API gateway that the request should be denied or the request has been filtered by something like the web application firewall.

      Next we've got a 429 error code and this is an indication that throttling is occurring, I mentioned earlier that API gateway can be configured to throttle requests so if you're getting a 429 error it means that you've exceeded a configured throttling amount so 429 associate that with throttling. Now if you get a 502 error this is a bad gateway exception and this indicates that a bad output has been returned by whatever is providing the backing services so if you've got a lambda function servicing request your API then a 502 error suggests that that lambda is returning something that's invalid. A 503 error indicates service unavailable so this could indicate that the backing endpoint is offline or you're having some form of major service issues so 503 is definitely one to remember I have seen that come up in the exam. 504 indicates an integration failure now there is a limit of 29 seconds for any requests to API gateway so even though lambda has a timeout of 15 minutes if lambda is providing backing compute for an API gateway API then if that request takes longer than 29 seconds then this can generate a 504 error so you need to make sure that any lambda functions that are backing your APIs are capable of responding within that 29 second limit otherwise you might get 504 errors.

      And I've included a link that's attached to this lesson which details all of the error codes as well as a little bit more detail if you do want to use it for extra reading. Now one final thing before we finish up with this in-depth lesson for API gateway and that's to talk about caching. Now you should be familiar with the general concept of caching at this point in the course as it relates to API gateway. We start in the middle with an API gateway stage and this is important because caching is configured per stage, this matters both for the exam and if you're developing this infrastructure for production situations.

      Now what happens without a cache is that any users at the application make requests to the API gateway stage and there are some back-end integrations which service those requests. Without a cache those services would be used on each and every request. With caching though you define a cache on that stage, it can be anywhere from 500 mb to 237 gb in size, it caches things by default for 300 seconds and this can be configured from zero meaning disabled through to a maximum of 3600 seconds, and a point that you should know for the exam is that this cache can be encrypted.

      Now using a cache means that calls will only be made to the back end when there's a cache miss and this means reduced load, reduced cost and improved performance because of the lower latency that caching provides. Okay so that's everything I wanted to cover in this in-depth lesson on API gateway, this is definitely a service where you need to be aware of much more in the developer and operation streams of AWS certifications. At this point though that is everything that I'm going to be talking about so go ahead complete this lesson and when you're ready I'll look forward to you joining me in the next.

    1. Welcome back, and in this lesson, I want to cover the serverless architecture. Serverless is a type of architecture which is relatively commonplace within AWS, mainly because AWS includes many products and services which support its use. The key thing to understand about the serverless architecture, aside from the fact that there are really servers running behind the scenes, is that it's not one single thing, and while serverless is an architecture, it's more a software architecture than a hardware architecture.

      The aim with the serverless architecture—and where its name comes from—is that as a developer, architect, or administrator, you're aiming to manage few, if any, servers, because servers are things which carry overhead, such as cost, administration, and risk, and the serverless architecture aims to remove as much of that as possible. In many ways, serverless takes the best bits from a few different architectures, mostly microservices and event-driven architectures, and within serverless you break an application down into as many tiny pieces as possible, even beyond microservices, into collections of small and specialized functions.

      These functions start up, do one thing really, really well, and then they stop. In AWS, logically, because of this, Lambda is used, but there are other platforms such as Microsoft Azure which has their own equivalent, namely Azure Functions, and from an architecture perspective, the actual technology which is used is less relevant. These functions which make up your application run in stateless and ephemeral environments, and why this matters is because if the application is architected to assume a clean and empty environment, then these functions can run anywhere.

      Every time they run, they obtain the data that they need, they do something, and then optionally, they store the result persistently somehow or deliver that output to something else. The reason why Lambda is cheap is because it's scalable, each environment is easy to provision, and each environment is the same, so the serverless architecture uses this to its advantage, where each function that runs does so in an ephemeral and stateless environment.

      Another key concept within serverless is that generally everything is event-driven, which means that nothing is running until it's required, and any function code that your application uses is only running on hardware when it's processing a system or customer interaction—an event. Serverless environments should use fast products such as Lambda for any general processing needs, since Lambda as a service is built based on execution duration, and functions only run when some form of execution is happening.

      Because serverless is event-driven, it means that while not being used, a serverless architecture should be very close to zero cost until something in that environment generates an event, so serverless environments generally have no persistent usage of compute within that system. Now, where you need other systems beyond normal compute, a serverless environment should use, where possible, managed services—it shouldn't reinvent the wheel.

      Examples are using S3 for any persistent object storage, or DynamoDB (which we haven't covered yet) for any persistent data storage, and third-party identity providers such as Google, Twitter, Facebook, or even corporate identities such as Active Directory, instead of building your own. Other services that AWS provides, such as Elastic Transcode, can be used to convert media files or manipulate these files in other ways.

      With the serverless architecture, your aim should be to consume as a service whatever you can, code as little as possible, use function as a service for any general-purpose compute needs, and then use all of those building blocks together to create your application. Now, let's look at this visually, because I think an architecture diagram might make it easier to understand exactly what a serverless architecture looks like, so let's step through a simple serverless architecture, and we're going to do so visually.

      I want your default position to be that unless we state otherwise, you're not using any self-managed compute—so no servers and no EC2 instances—unless we discuss otherwise, and that should be your starting position. At each step throughout this architecture, I'll highlight exactly why the parts are serverless and why it matters.

      Now, we're going to use a slightly more inclusive example—this time, we're going to use PetTube, which was rebranded to be a little more inclusive after an uproar about it only being for cats. To start with, we've got Julie using her laptop, and she wants to upload some woofy holiday videos, so she browses to an S3 bucket that's running as a static website for the PetTube application, downloads some HTML, and that HTML has some JavaScript included within it.

      One crucial part of the serverless architecture is that modern web browsers are capable of running client-side JavaScript inside the browser, and this is what actually provides the front end for the PetTube application—JavaScript that's running in the browser of the user that's downloaded from a static website S3 bucket. So at this point, the application has no self-managed compute that's being used—we've simply downloaded HTML from an S3 bucket with some included JavaScript that's now running in Julie's web browser.

      Now, PetTube uses third-party identity providers for its authentication; like all good serverless applications, it doesn't use its own store of identity or its own store of users, which results in lower admin overhead, and also avoids the limit on the number of IAM users that can exist inside one AWS account, which is 5,000 IAM users per account. So if we used IAM users for authentication, then PetTube would be limited to 5,000 users, and each user of the application would need one additional account—one additional username and one additional password.

      Instead of doing that, we use a third-party identity provider, and one that our users are already likely to have an account inside, so that reduces the number of accounts that our users are required to maintain. The JavaScript that's running in Julie's browser communicates with the third-party identity provider, and we're going to assume that we're using Google, and you'll have seen the screen that's generated if you've ever logged into Gmail or anything that uses Gmail logins—but this could just as easily be Twitter, Facebook, or any other third-party identity provider.

      The key thing to understand is that Julie logs into this identity provider, and it's this identity provider that validates that the user claiming to be Julie is in fact Julie, so it checks her username and password, and if it's happy with the process—or if it's happy with the username and password combination that Julie's provided—then it returns to Julie an identity token, which proves that she's authenticated with the Google identity provider.

      Now, AWS can't directly use third-party identities, and so the JavaScript that's running in Julie's browser communicates with an AWS service called Cognito, and Cognito swaps this Google identity token for temporary AWS credentials, which can be used to access AWS resources. So the JavaScript in Julie's browser now has available some temporary AWS credentials that it can use to interact with AWS, and so it uses these temporary credentials to upload a video of Woofy to an S3 bucket—this is the original bucket of our application, the bucket where the master videos go that our customers upload.

      Notice that so far in this process, no self-managed compute or servers have been used to provision this service—we've performed all of these activities without using any compute servers or compute instances that we need to manage or design as solutions architects, since it's all delivered by using managed services such as S3, Cognito, and the Google identity provider.

      Now, when the Woofy video arrives inside the originals bucket, that bucket is configured to generate an event which contains the details of the object that was uploaded, and it's set to send that event to and invoke a Lambda function to process that video. That Lambda function takes in the event and creates jobs within the Elastic Transcoder service, which is a managed service offered by AWS that can take in media and manipulate that media—one of the things it can do is transcode the media, generating media of different sizes from one master video file.

      Multiple jobs get created, one for each size of video that's required, and the Elastic Transcoder gets the location of the original video as part of the initiation of the job and loads in that video at the start of each job processing cycle, so each job outputs an object to a transcoder bucket—one object for each different size of the original video—and in addition, details on each of the new videos are added to a database, in this case DynamoDB.

      Again at this stage, notice that we still have no self-managed servers—the only resources that are consumed are storage space in S3 and DynamoDB, and any processing time used for the Lambda function and Elastic Transcoder jobs. With this architecture so far, we've allowed a customer to upload a master video, transcoded it into different video sizes, and at no point have we consumed any self-managed compute, EC2 instances, or any other long-running compute services—it’s all managed services or compute that’s used in Julie’s browser.

      Now the last part of the architecture is where Julie, by clicking another part of the client site that's running inside her browser, can interact with another Lambda function—we’ll call this My Media—and this Lambda function will load data from the database, identify which objects in the transcode bucket are Julie’s, and return URLs for Julie to access, and this is how Julie can load up a web page which shows all of the videos that she’s uploaded to the PetTube application.

      Now this is a simplified diagram—in reality, it’s a little bit more complex—for example, API Gateway would generally be used between any client-side processing and the Lambda functions, but conceptually this is actually how it works. We’ve got no self-managed servers, no self-managed database servers, and little, if any, costs that are incurred for base usage—it’s a fully consumption-based model that consumes compute only when it’s being used, when events are generated either from a system-side or a client-side, and it uses third-party services as much as possible.

      Now there are many third-party services to choose from, and you can never expect to know them all end-to-end, but the key thing to understand about serverless is the way to do things, and I’ve covered that in this lesson. Later in the section, you’ll experience how to implement a serverless application within the demo lesson called PetCuddleatron, and this will show you how to implement a serverless application just like the one that’s on screen—it’s slightly less complex, but it’s one that uses many of the same architectural fundamentals, and it should start to really cement the theory that you’re learning right now.

      Now before we move on to this demo, there are a few more services that I need to cover which the PetCuddleatron demo lesson will utilize. So for now, that’s it for this lesson—thanks for watching, go ahead and complete this video, and then when you’re ready, I’ll look forward to you joining me in the next.

    1. Welcome back and in part three of this series, I want to finish off and talk about some advanced elements of Lambda. Now we've got a lot to cover, so let's jump in and get started.

      First, I want to talk about the ways a Lambda function can be invoked. We've got three different methods for invoking a Lambda function: we've got synchronous invocation, asynchronous invocation, and invocation using event source mappings. And I want to step through each of them visually so that you can understand in detail how they work because this is essential for the exam.

      So let's start off with synchronous invocation of Lambda. With this model, you might start off with a command line or API directly invoking a Lambda function; the Lambda function is provided with some data and it executes that data. Now all this time, the command line or API is waiting for a response because it's synchronous — it needs to wait here until the Lambda function completes its execution. So the Lambda function finishes and it returns that data, whether it's a success or a failure.

      Now synchronous invocation also happens if Lambda is used indirectly via the API gateway, which is the use case for many serverless architectures. So we might have some clients using a web application via API gateway and this proxies through to one or more Lambda functions; again, the Lambda function performs some processing all the while the client is waiting for a response within their web application. And then when the Lambda function responds, this goes back via the API gateway and back through to the client.

      The common factors with both of these approaches is that the client sends a request which invokes Lambda and the result, be it a success or failure, is returned during that initial request — the client is waiting for any data to be returned. Another implication of a synchronous invocation is that any errors or retries have to be handled within the client. The Lambda function runs once, it returns something, and then it stops; if there's a problem or data isn't processed correctly, then the client needs to rerun that request — and this happens at the client side.

      So synchronous invocation is generally used when it's a human directly or indirectly invoking a Lambda function. Next, let's look at asynchronous invocation, and this is typically used when AWS services invoke Lambda functions on your behalf. Let's use an example: an S3 bucket with S3 events enabled — so we upload a new image of whiskers to this S3 bucket, which causes an event to be generated and sent through to Lambda, and this is an asynchronous invocation.

      So S3 isn't waiting around for any kind of response — it basically just forgets about it at this point. Once it sent that event through to Lambda, it doesn't continue waiting; it doesn't worry about this event at all. Now maybe as part of processing this image, it's generating a thumbnail or maybe performing some kind of analysis and storing that data into DynamoDB, but again, S3 isn't waiting around for any of this — it's asynchronous. Lambda is responsible for any reprocessing in the event that there's a failure, and this reprocessing value is configurable between zero and two times.

      Now a key requirement for this is that the function code needs to be idempotent — and this is important. If you've never heard this term before, let me explain. Let's say that you had $10 in your bank account and I wanted to increase this value to $20. Now there are two ways that I could do this if I operated the bank: I could simply add $10 to your balance, increasing it from 10 to 20, or I could explicitly set the balance to 20.

      Now if I set the balance to 20 and this operation failed at some undetermined point in this process, then I could simply rerun the process, safe in the knowledge that even running it again on your balance would only at worst set the value to $20 again — this is known as an idempotent operation. You can run it as many times as you want and the outcome will be the same.

      Now if I performed the operation where I added $10 to your account and the operation failed, it could have failed before it added the $10 or after; if it failed after and I rerun the operation, well now you'd have $30 — and this is an example of something which is not idempotent.

      When Lambda retries an operation it doesn't really provide any other information — the function just reruns. So logically in this example you would need to make sure that your function code isn't additive or subtractive — it just needs to perform its intended task; with this example it needs to set your balance to $20.

      Generally when designing a Lambda function which is used in this way, the Lambda function needs to finish with a desired state — it needs to make something true. If you're using Lambda functions which are designed in a non-idempotent way, you can end up with some questionable results.

      Now Lambda can be configured to send any events which it can't process after those automatic retries to a dead letter queue, which can be used for diagnostic processing. And a new feature of Lambda is the ability to create destinations — so events processed by Lambda functions can be delivered to another destination such as SQS, SNS, another Lambda function, and even EventBridge; and separate destinations can be configured based on successful processing or failures.

      So this is asynchronous invocation — it's generally used by AWS services which are capable of generating events and sending those events to Lambda. It means that Lambda can automatically reprocess failed events and the original source of the event isn't waiting for processing to complete.

      But there is a third type of invocation. The last type of invocation is known as Event Source Mapping, and this is typically used on streams or queues which don't generate events — so things where some kind of polling is required. Let's look at an example: let's say that we have a Kinesis data stream, and into this stream, a fleet of producer vans driving around scanning with LIDAR and imaging equipment are all producing data which is being put into a Kinesis stream.

      Now Kinesis is a stream based product — generally consumers can read from a stream but it doesn't generate events when data is added, so historically this wouldn't have been an ideal fit for Lambda which is an event driven service. So what happens is that we have a hidden component called an event source mapping which is polling queues or streams looking for new data and getting back source batches — so batches of source data from this data source.

      Now these source batches are then broken up as required based on a batch size and sent into a Lambda function as event batches. Now a single Lambda function invocation could in theory receive hundreds of events in a batch — it depends on how long each event takes to process. Remember Lambda has a 15 minute timeout so you need to carefully control this event batch size to ensure that the Lambda function doesn't terminate before completing this batch.

      Now there's one really important thing that you need to understand about event source mapping. With a synchronous invocation an event is delivered to Lambda from the source and Lambda doesn't need permissions to the source service unless it actually wants to read more data from that source — for example, if an object is added to an S3 bucket, then S3 generates and delivers an event which contains details of that event (so which object was uploaded and perhaps some other metadata).

      But unless you need to read additional data from S3 maybe to get the actual object, well then the Lambda function doesn't need S3 permissions. With event source mapping invocation, the source service isn't delivering an event — the event source mapping is reading from that source. And so the event source mapping uses permissions from the Lambda execution role to access the source service.

      And this is really important to know because it does come up in the exam. So even if a Lambda function receives an event batch containing Kinesis data, even though the Lambda function doesn't directly read from Kinesis, the execution role needs Kinesis permissions because the event source mapping uses them on its behalf to retrieve that data.

      Now any batches which consistently fail can be sent to an SQS queue or an SNS topic for further processing or analysis. Now that's the third type of invocation — this is event source mapping invocation, and that's the method used when Lambda functions are processing SQS queues, Kinesis streams, DynamoDB streams and even Amazon managed streaming for Apache Kafka. And this last one is something that we won't be covering within the course, but it's important to know all of the different types of products that use event source mapping based invocation.

      With that being said that's all of the three types of invocation I wanted to cover — so let's move on to a different topic, this time Lambda versions. With Lambda functions it's possible to define specific versions of Lambda functions — so you could have different versions of the given function, for example, version one, version two and version three.

      Now as it relates to Lambda, a version of a function is actually the code plus the configuration of that Lambda function — so the resources and any environment variables in addition to any other configuration information. Now when you publish a version, that version is immutable — it never changes once it's published, and it even has its own Amazon resource name. So once you publish a version you can no longer change that version.

      There's also the concept of dollar latest, and dollar latest points at the latest version of a Lambda function — now this can obviously change as you publish later and later versions of the function, so this is not immutable. You can also create aliases — so for example, dev, stage and prod, and these can point at a particular version of a Lambda function, and these can be changed — so these aliases are not immutable.

      So generally with large scale deployments of Lambda you'd be producing Lambda function versions for all of the major changes, and using aliases so that different components of your serverless application can point at those specific immutable version numbers — so that's important to know for the exam.

      So the last thing I want to talk about is Lambda startup times, and to understand that you need to understand how Lambda functions are actually executed. Lambda code runs inside a runtime environment — and this is also referred to as an execution context; think of this as a small container which is allocated an amount of resource which runs your Lambda code.

      When a Lambda function is first invoked — let's say by receiving an S3 event — this execution context needs to be created and configured, and this takes time. First the environment itself is created and this requires physical hardware; then any runtimes which are required are downloaded and installed — let's say this is for Python 3.8; then the deployment package is downloaded and then installed — and this takes time.

      Now this process is known as a cold start, and all in, this process can take hundreds of milliseconds or more — which can be significant if a Lambda function is performing a task which touches a human who is expecting a response. Now if this is an S3 event, then maybe this extra time isn't such a big deal — but you need to be aware that this cold start occurs because an execution context is being created and configured, any prerequisites are being downloaded and installed, the deployment package is being downloaded and installed — and that's all before the function itself can execute.

      Now if the same Lambda function is invoked again without too much of a gap, then it's possible that Lambda will use the same execution context — and this is known as a warm start. It doesn't need to set up the environment or download the deployment package because all of that is already contained within the execution context; this time the context just receives the event and immediately begins processing.

      A warm start means the code can be running within milliseconds because there's no lengthy build process. A Lambda function which invokes again fairly soon after a cold start can reuse an execution context — but if too long a time period goes between invocations, then the context can be deleted which results in another cold start.

      Also one function invocation runs at a time per context — so if you need 20 invocations of a function at once, then this can result in 20 cold starts. Now you can make this process more efficient — you can actually use a feature known as provisioned concurrency where you can inform AWS in advance. An execution context can be provisioned for you in advance for Lambda invocations.

      You might use these when you know that you have periods of high load on a serverless application or if you're preparing for a new production release of a serverless application and want to pre-create all of these execution environments. Now there are also other things that you can do to improve performance — you can use the temp space to pre-download things within an execution context.

      For example, maybe you're using some animal images as part of your processing — well, if another function uses the same execution context, then it too will have access to those same animal images without having to download them a second time. Now you do need to be careful because your functions need to be able to cope with the environment being new and clean every time — they can never assume the presence of anything.

      From a code perspective, you can create other things like database connections outside of the Lambda function handler code. So when you create a Lambda function, generally most things go within the Lambda function handler — but if you create anything outside of the Lambda function handler, then these will be made available for any future function invocations in the same context.

      So anything that you define within a Lambda function handler is limited to that one specific invocation of that Lambda function, but for anything which you anticipate there being a potential for reuse, you can declare that outside of the Lambda function handler — and in theory, that will be available for any other invocations of the Lambda function which occur within that same execution context.

      But again, you need to make sure that your function doesn't require or expect that — every single time a function invokes, it should be absolutely fine with recreating everything. You should by default assume that execution contexts are stateless and any invocation of a Lambda function is going to be operating in a completely freshly created environment, but if you want to be efficient, your functions should also be able to reuse common aspects that persist through different function invocations.

      Now again, these are all deep dive things that you need to be aware of for the exam. I've covered a lot of these elements across all three parts of this Lambda deep dive mini series, but at this point, that's everything I wanted to cover in part three — and this is the last part of this mini series. So thanks for watching. Go ahead and complete this video, and when you're ready, I look forward to you joining me in the next lesson.

    1. Welcome back to part two of this lesson series going into a little bit more depth on Lambda. In this part of the series I'm going to be talking about Lambda networking, Lambda permissions and Lambda monitoring. Now this is a lot to cover in one lesson so let's jump in and get started.

      Lambda has two networking modes and you need to be aware of both of them for the exam. First we have public which is the default and then second we have VPC networking. Now you need to understand the architecture of both of them so let's step through them in a little bit more detail.

      For public networking we start with an AWS environment and inside it a single Lambda function. Now this is part of a wider application, let's say the Categorum Enterprise application running in a VPC, which uses Aurora for the database, EC2 for compute, and the Elastic file system for shared file storage. Now this is the default configuration for Lambda where it's running in the public AWS network, so Lambda using this configuration can access public space AWS services such as SQS and DynamoDB or internet-based services such as IMDB if the Lambda function wanted to fetch the latest details of cat-themed movies and TV shows.

      So Lambda running by default using public networking means that it has network connectivity to public space AWS services and the public internet; it can connect to both of those from a networking perspective, and as long as it has the required methods of authentication and authorization then it can access all of those services. Now public networking offers the best performance for Lambda because no customer-specific networking is required—Lambda functions can run on shared hardware and networking with nothing specific to one particular customer—but this does mean that any Lambda functions running with this default have no access to services running within a VPC unless those services are configured with public addressing as well as security rules to allow external access, so this is a big limitation that you need to understand for the exam.

      So the architecture on screen now—this Lambda function could not access Aurora, EC2, or the Elastic File system unless they had public addressing and the security was configured to allow that access, so in this example without configuration changes the Lambda function could access public services but would have no access to anything running inside the VPC. Now in most cases in my experience Lambda is used with this public networking model, but there are situations where this isn't enough and for those situations Lambda can be configured to run inside a VPC.

      Let's look at how. This time we have the same architecture—so a VPC running within AWS—but this time the Lambda function is configured to run inside a private subnet at the bottom. Now this is the same subnet where the Catergram Enterprise infrastructure is running from, and for the exam specifically the key thing to understand about Lambdas running inside a VPC is that they obey all of the same rules as anything else running in a VPC because they're actually running within that VPC.

      So to start with, this means that Lambda functions running inside a VPC can freely access other VPC-based resources assuming any network ACLs and security groups allow that access, but the flip side of this means they can't access things outside of the VPC unless networking configuration exists within the VPC to allow this external access. So by default with this architecture the Lambda function couldn't access DynamoDB or any internet-based endpoints such as with this example IMDB.

      Now if you face any exam questions or you need to design any solutions which involve Lambda functions running within a VPC, then just treat them like anything else running in that VPC. So this means that you could use a VPC endpoint, for example a gateway endpoint, to provide access to DynamoDB; because the Lambda function is running within the VPC it could utilize a gateway endpoint to access DynamoDB, or in the case that the Lambda function needed access to AWS public services or the internet, you could deploy a NAT gateway in a public subnet and then attach an internet gateway to the VPC.

      Remember Lambda running within a VPC behaves like any other VPC-based service—the same gateways and configurations are needed to allow VPC-based Lambda functions to communicate with the AWS public zone and the public internet. Now you also need to give your Lambda functions EC2 network permissions via the execution role, which I'll cover very soon because the Lambda service needs to create network interfaces within your VPC, it requires these permissions, and this architecture of using network interfaces within a VPC is what I want to quickly cover now.

      Now there used to be disadvantages to running Lambda in a VPC—significant disadvantages—and the reason was the networking architecture that Lambda used. VPC-based Lambda functions don't actually run within your VPC; the way they work is similar to Fargate, so we have AWS and there's a Lambda service VPC and a customer VPC.

      Now let's keep things simple and say that we only have three Lambda functions. Now the way that this historically worked is that each of these Lambda functions when invoked would create an elastic network interface within the customer VPC and traffic would flow between this service VPC and the customer VPC. Now the problem is that configuring these elastic network interfaces on a per-function, per-invocation basis would take time and add delay to the execution of the Lambda function code.

      In addition, this architecture doesn't scale well because parallel function executions or concurrency required additional elastic network interfaces, and the more popular a system became the worse the problem became—with larger systems you had more and more performance issues and more and more issues with keeping VPC capacity available for larger and larger numbers of ENIs.

      Now luckily this is the old architecture—this is the way that Lambda used to handle this private networking—it's not how it works anymore. With the new way, instead of requiring an elastic network interface per function execution, AWS analyzes all of the functions running in a region in an account and builds up a set of unique combinations of security groups and subnets.

      So for every unique one of those, one ENI is required in the VPC. So if all your functions used a collection of subnets but the same security groups, then one network interface would be required per subnet; if they all used the same subnet and all used the same security group, then all of your Lambda functions could use the single elastic network interface. So a single connection between the Lambda Service VPC and your VPC is created for every unique combination of security groups and subnets used by your Lambda functions.

      Now the network interfaces using this architecture are created when you configure the Lambda function, and typically this might take 90 seconds, but this is done once—so when you create the function or when you update the network and configuration, this network and configuration is created or updated—and that means that it isn't required every single time a Lambda function is invoked, so it doesn't delay your function invocations.

      Now this means that you can use private networking at scale without increasing the number of elastic network interfaces required, so where it used to be a bad idea performance-wise to use VPC-based Lambdas, this is no longer the case.

      So that's networking—so this is how you configure Lambda functions if you need them to have access to private VPC services—and it's important that you understand both the public and VPC networking model especially for the exam, because you will face questions on the exam about executing Lambda functions within a VPC.

      Again one really important hint that I will provide is just treat Lambda functions running in a VPC like any other VPC-based resource, and by now you should know how to architect a VPC so that services running in that VPC have access to everything that they need, so just treat Lambda functions in the same way.

      Now let's look at the security of Lambda functions. When it comes to Lambda permissions, there are actually two key parts of the permissions model that you need to understand—one of them is pretty well known and that's covered at the associate level, the other not so much.

      Now let's start with a typical Lambda environment—this is a runtime environment, the thing where your Lambda functions execute within—so this is running a runtime, in this case Python 3.8, it's allocated some resources and the code loads and runs within this environment.

      Now for this environment to access any AWS products and services it needs to be provided with an execution role—this is a role which is assumed by Lambda and by doing so the code within the environment gains the permissions of that role based on the role's permissions policy—so a role is created which has a trust policy which trusts Lambda, and the permissions policy that that role has is used to generate the temporary credentials that the Lambda function uses to interact with other resources.

      So in many ways this is just the same as an EC2 instance role—so this governs what permissions the function receives, which might be something like loading data from DynamoDB and storing output data into S3.

      Now this is the most well-known aspect of Lambda permissions, but there is another part—Lambda actually has resource policies. Now this in many ways is like a bucket policy on S3; it controls who can interact with a specific Lambda function.

      It's this resource policy which can be used to allow external accounts to invoke a Lambda function or certain services to use a Lambda function such as SNS or S3. The resource policy is something changed when you integrate other services with Lambda, and you can manually change it via the CLI or the API; unless something's changed between creating this lesson and when you're watching it, it currently can't be changed using the console UI, so this is only something which can be manipulated using the CLI or the API.

      So that's how security works within a Lambda function. Now one more thing that I want to cover before finishing up with part two is logging.

      So Lambda uses CloudWatch, CloudWatch Logs, and X-Ray for various aspects of its logging and monitoring—so any logging information generated from Lambda executions, that goes into CloudWatch Logs, so the output of Lambda functions, any messages that you output to the log, any errors, details on the duration of the execution—that's all stored into CloudWatch Logs.

      Any metrics—so details such as invocation successes or failure numbers, any retries, anything to do with latency—that's all stored in CloudWatch, so CloudWatch is the thing that stores metrics, and this is important to understand: logging goes into CloudWatch Logs, and any details on the number of invocations, successes or failures, anything around metrics goes straight into CloudWatch.

      Now Lambdas can also be integrated into X-Ray, which I cover elsewhere in the course, and this can be used to add distributed tracing capability—so if you need to trace the path of a user or the path of a session through a serverless application which uses Lambda, then you can use the X-Ray service.

      Now I don't expect this to feature heavily on the exam but just remember the terms X-Ray and distributed tracing because that might come in handy for one or two exam questions if these topics do crop up.

      Now one really important thing to remember for the exam is that for Lambda to be able to log into CloudWatch Logs to generate the output of any of the executions, you need to give Lambda permissions via the execution role—so there's actually a pre-built policy and role within AWS specifically designed to give Lambda functions the basic permissions that they require to log information into CloudWatch Logs.

      And one really common exam scenario is where you're trying to diagnose why a Lambda function is not working—there's nothing in CloudWatch Logs—and one possible answer is that it doesn't have the required permissions via the execution role.

      Now that's everything I wanted to cover in part two of this Lambda in-depth mini series—so we've covered networking, both public and private, we've covered security, and we've covered logging—so go ahead and complete this lesson and when you're ready I look forward to you joining me in part three.

    1. Welcome back and in this multi-part lesson mini-series, I want to talk about AWS Lambda. Lambda is a function as a service or a fast product. This means that you provide specialized short running and focused code to Lambda and it takes care of running it and billing you only for what you consume. So a Lambda function is a piece of code which Lambda runs and every Lambda function is using a supported runtime. So an example of a supported runtime is Python 3.8. So when you create a Lambda function, you need to define which runtime that piece of code uses. Now, when you provide your code to Lambda, it's loaded into and executed within a runtime environment. And this runtime environment is specifically created to run code using a certain runtime, a certain language. So when you create a Lambda function that uses the Python 3.8 runtime, then the runtime environment that's created is itself specifically designed to run Python 3.8 code.

      Now, when you create a Lambda function, you also define the amount of resource that a runtime environment is provided with. So you directly allocate a certain amount of memory and based on that amount of memory, a certain amount of virtual CPU is allocated, but this is indirect. You don't get to choose the amount of CPU. This is based on the amount of memory. Now, the key thing to understand about Lambda as a service, because it's a function as a service product, because it's designed for short running and focused functions, you only actually build for the duration that a function runs. So based on the amount of resource allocated to an environment and based on the duration that that function runs for per invocation, that determines how much you'll build for the Lambda product. So you'll build for the duration of function executions.

      Now, Lambda is a key part of serverless architectures running within AWS. And over this section of the course, you're going to get some experience of how you can use Lambda to create serverless or event-driven architectures. Architecturally, the way that Lambda works is this. You define a Lambda function. Now, you can think of a Lambda function as a unit of configuration. Yes, you can also use the term Lambda function to describe the actual code. But when you think of a Lambda function, think of it as the code plus all the associated wrappings and configuration. Your Lambda function at its most basic is a deployment package which Lambda executes. So when you create a Lambda function, you define the language which the function is written in. You provide Lambda with a deployment package and you set some resources. And whenever the Lambda function is invoked, what actually happens is the deployment package is downloaded and executed within this runtime environment.

      Now, Lambda supports lots of different runtimes. Some of the common ones are various different versions of Python. We also have Ruby. We've got Java. We've also got Go and there's also C# as well as various versions of Node.js. Now, you can also create custom ones using Lambda layers. And many of these are created by the community. For the exam though, one really important point is that if you see or hear the term Docker, consider this to mean not Lambda. So Docker is an anti-pattern for Lambda. Now, Lambda does now support using Docker images, but this is distinct from the word Docker. If you hear the term Docker in the exam, then it generally will be referring to traditional containerized computing. So that's using a specific Docker image to spin up a container and use it in a containerized compute environment such as ECS.

      Now, you can also use container images with Lambda. Now, that's a different process. That means that you're using your existing container build processes, the same ones that you use to create Docker images. But instead, you're creating specific images designed to run inside the Lambda environment. So don't confuse Docker container images and Docker with images used for Lambda. They're two different things. The only thing that they share is that you can use your existing build processes to build Lambda images. Now, custom runtimes could allow languages such as Rust, which is a very popular community-based language to work within the product. So if you search using Google or any other popular search engine, you'll be able to find lots of languages which have been added by the community using the Lambda layer functionality. And I'll be talking about that elsewhere in the course.

      Now, you select the runtime to use when creating the function, and this determines the components which are available inside the runtime environment. So Python code, for instance, requires Python of a certain version to be installed in addition to various Python modules. Conceptually, think about it like this. Every time a Lambda function is invoked, which means to execute that function, a new runtime environment is created with all of the components that that Lambda function needs. Let's say, for example, a Python 3.8-based Lambda function. So the code loads, it's executed, and then it terminates. Next time, a new clean environment is created, it does the same thing, and then it terminates. Lambda functions are stateless, which means no data is left over from a previous invocation. Every time a function is invoked, it's a brand new invocation, a brand new environment. Now, I'm going to be talking about this in part 3 of this series, because this isn't always the case, but you have to assume that it is architecturally. So your code running within Lambda needs to be able to work 100% of the time if it's a new environment. Lambda runtime environments have no state. Now, there are some situations where a function might be invoked multiple times within the same environment. And I'll be talking about that in part 3 of this series. But as a base level, a default, assume that every time a Lambda function is invoked, it's inside a brand new runtime environment.

      Now, you also define the resources that Lambda functions use, and this determines how much resource the runtime environment gets. Now, you directly define the memory. And this is anywhere from 128 MB to 10,240 MB in one MB steps. Now, you don't directly control the amount of virtual CPU. This scales with the memory. So 1769 MB of memory gives you one VCPU of allocation, and it's linear. So the less memory means less virtual CPUs, and more memory means additional VCPU capacity. The runtime environment also has some disk space allocation. 512 MB is mounted as forward slash TMP within the runtime environment. This is the default amount, but it can scale to 10,240 MB. Now, you can use this, but keep in mind, you have to assume that it's blank every single time a Lambda function is invoked. This should only be viewed as temporary space.

      Lambda functions can run for up to 900 seconds or 15 minutes. And this is known as the function timeout. This is important because for anything beyond 15 minutes, you can't use Lambda directly. And that's a really important figure to know for the exam. You know by now I'm not a fan of people memorizing facts and figures, but this is definitely one that you need to remember for the exam. So 15 minutes is a critical amount of time for a Lambda function. You can use other things, such as step functions, to create longer running workflows, but one invocation of one function has a maximum of 15 minutes or 900 seconds.

      Now, we're going to be covering security in more detail in part two, as well as networking. But the security for a Lambda function is controlled using execution roles. And these are IAM roles, assumed by the Lambda function, which provides permissions to interact with other AWS products and services. So any permissions which a Lambda function needs to be provided with are delivered by creating an execution role and attaching that to a specific Lambda function.

      Now, just a few final things before we finish up some common uses of Lambda. So Lambda forms a core part of the delivery of serverless applications within AWS. And generally this uses products such as S3, API gateway, and Lambda. So these three together are often used to deliver serverless applications. Lambda can also be used for file processing, using S3, S3 events, and Lambda. So a very common example that's used in training is watermarking images. So have images uploaded to S3, generate an S3 event, invoke a Lambda function, which applies a watermark, and then terminates. And you're only billed for the compute resources used during those Lambda function invocations. You can also use Lambda for database triggers. So this is using DynamoDB, as well as DynamoDB streams, and then Lambda. So Lambda can be invoked any time data is inserted, modified, or deleted from a DynamoDB table with streams enabled. And this is another powerful architecture.

      You can also use Lambda to implement a form of serverless cron. So you can use EventBridge or CloudWatch events to invoke Lambda functions at certain times of day, or certain days of week, to perform certain scripted activities. And this is something that traditionally you would need to run on something like an EC2 instance, but using Lambda means that you're only billed for the amount of time that these functions are executing. So this is another really common use case. And then finally, you can perform real-time stream data processing. So Lambda's can be configured to invoke whenever data is added to a Kinesis stream. And this can be useful because Lambda is really scalable. And so it can scale with the amount of data being streamed into a Kinesis stream. And again, this is another really common architecture for any businesses that are streaming large quantities of data into AWS, and they require some form of real-time processing.

      Now that's everything that I wanted to cover in part one of this series. Remember, it's a three-part mini-series, part two and part three, are going to introduce some more advanced concepts. Specifically, though, is that you'll need for the exam. But at this point, go ahead, complete this lesson, and then when you're ready, I'll look forward to you joining me in the next.

    1. Welcome back, and in this first technical lesson of this section of the course, we'll be stepping through what an event-driven architecture is and comparing it to other architectures available within AWS. As a solutions architect, this matters because you're the one who needs to design a solution using a specific architecture around a given set of business requirements, so you need to have a good base level understanding of all of the different types of architectures available to you within AWS. You can't build something unless you fully understand the architectures, so let's jump in and get started because we've got a lot to cover.

      Now, to help illustrate how an event-driven architecture works, let's consider an example. And the example that I want to use is a popular online video sharing platform that you've all probably heard of. Yes, that's right, it's CatTube. One of the popular ways that CatTube is used is for people to upload holiday videos of their cats. So Bob uploads a 4k quality video of whiskers on holiday to CatTube. Now at this point, CatTube begins some processing and it generates lots of different versions of that video at various different quality levels, for example, 1080p, 720p, and 480p. Now this is only part of the application, but it happens to be the most intensive in terms of resource usage. The website also needs to display videos, manage playlists and channels, and store and retrieve data to and from a database.

      Now, there are a few ways that we could architect this solution. Historically, the most popular systems architecture was known as a monolithic architecture. Now think of this as a single black box with all of the components of the application within it. So in this example, I'm just showing a subset, but we've got the upload component where Bob uploads his collection of videos where whiskers is on holiday, the processing component which does the conversion of videos, and then we have the store and manage component which interacts with the underlying persistent storage. Now this architecture has a number of considerations, a number of important things to keep in mind. Because it's all one entity, it fails together as an entity. If one component fails, it impacts the whole thing end to end. If uploading fails, it could also affect processing as well as store and manage. Logically, you know that they're separate things, you know that uploading is different than processing, which is different than store and manage, but if they're all contained in a single monolithic architecture, one code base, one big monolithic component, then the failure of any part of that monolith can affect everything else.

      The other thing to consider when talking about monoliths is they also scale together. They're highly coupled. All of the components generally expect to be on the same server directly connected and have the same code base. You can't scale one without the other. Generally, with monolithic architectures, you need to vertically scale the system because everything expects to be running on the same piece of compute hardware. And finally, and this is one of the most important aspects of monolithic architectures that you need to be aware of, they generally build together. All of the components of a monolithic architecture are always running and because of that, they always incur charges. Even if the processing engine is doing nothing, even if no videos are being uploaded, the system capacity has to be enough to run all of them. And so they always have allocated resources, even if they aren't consuming them. So using a monolithic architecture tends to be one of the least cost-effective ways to architect systems, ranging from small to enterprise scale.

      Now we've seen earlier in the course how we can evolve a monolithic design into a tiered one. With a tiered architecture, the monolith is broken apart. What we have now is a collection of different tiers and each of these tiers can be on the same server or different servers. With this architecture, the different components are still completely coupled together because each of the tiers connects to a single endpoint of another tier. The upload tier needs to be able to send data directly at the processing tier, and again, this could be on the same server or a different server. With the WordPress example that you looked at earlier in the course, we separated the database component of the monolithic application onto its own RDS instance and left the EC2 instance running the Apache web server and the WordPress application. But both of those services still needed to communicate with each other. They were very tightly coupled.

      Now, the immediate benefit of a tiered architecture versus a monolith is that these individual tiers can be vertically scaled independently. Put simply, you can increase the size of the server that's running each of these application tiers. What this means is that if the processing tier, for example, requires more CPU capacity, then it can be increased in size to cope with that additional load without having to increase the size of the upload or the store and manage tiers. But this architecture can be evolved even more. Instead of each tier directly connecting to each other tier, we can utilize load balances located between each of the tiers. Remember in the previous section I mentioned internal load balances. This is an example of when internal load balances are useful. It means that in this example the upload tier is no longer communicating with a specific instance of the processing tier. And it means that the store and manage tier is not communicating with a specific instance of the processing tier. Both of them are going via a load balancer. And if you remember from the section of the course where I talked about load balances, this means it's abstracted. It allows for horizontal scaling, meaning additional processing tier instances can be added. Communication occurs via the load balances, so the upload and store and manage tiers have no exposure to the architecture of the processing tier, whether it's one instance or a hundred. This means that the processing tier is now able to be scaled horizontally by adding additional instances, and it's now highly available. If one instance fails, the load balancer just redistributes the connections across the working instances. So by abstracting away from individual instance architecture for the individual tiers, using load balances now means we can scale each tier independently, either vertically or horizontally.

      Now, this architecture isn't perfect for two main reasons. First, the tiers are still coupled. The upload tier, for example, expects and requires the processing tier to exist and to respond. While the load balancer means that we can have multiple instances for the processing tier, for example, the processing tier has to exist. If it fails completely, then the upload tier itself will fail because the upload tier expects at least one instance of the processing tier to answer it. If there's a backlog in processing, if the processing tier slows down and it starts to take longer to accept jobs for processing, then that can also impact the upload tier and the customer experience. The other issue with this architecture is that even if there's no jobs to be processed, the processing tier has to have something running. Otherwise, there'll be a failure when the upload tier attempts to add an upload job. So it's not possible to scale the individual tiers of the application back down to zero because the communication is synchronous. The upload tier expects to perform a synchronous communication with the processing tier. It expects to ask for a job to be entered and it requires an answer. So while the tiered architecture improves things, it doesn't solve all of the problems.

      Okay, so this is the end of part one of this lesson. It was getting a little bit on the long side and so I wanted to add a break. It's an opportunity just to take a rest or grab a coffee. Part two will be continuing immediately from the end of part one. So go ahead, complete the video, and when you're ready, join me in part two.

    1. Welcome back, and in this lesson, I want to cover application and network load balances in a little bit more detail. It's critical for the exam that you understand when to pick application load balances and when to pick network load balances, as they're both used for massively different situations. Now, we do have a lot to cover, so let's jump in and get started.

      I want to start by talking about consolidation of load balances. Historically, when using classic load balances, you connected instances directly to the load balancer or you integrated an auto scaling group directly with that load balancer, an architecture which looked something like this: a single domain name, categor.io, using a single classic load balancer with an attached single SSL certificate for that domain, and then an auto scaling group is attached to that, with the classic load balancer distributing connections over those instances.

      The problem is that this doesn't scale because classic load balancers don't support SNI, and you can't have multiple SSL certificates per load balancer, meaning every single unique HTTPS application that you have requires its own classic load balancer, which is one of the many reasons that classic load balancers should be avoided. In this example, we have Catergram and Dogagram, both of which are HTTPS applications, and the only way to use these is to have two different classic load balancers.

      Compare this to the same application architecture, with both applications—Catergram and Dogagram—this time using a single application load balancer. This one handles both applications, and we can use listener-based rules, which I’ll talk about later in the lesson, where each of these listener-based rules can have an SSL certificate handling HTTPS for both domains. Then we can have host-based rules which direct incoming connections at multiple target groups that forward these on to multiple auto scaling groups, which is a two-to-one consolidation—halving the number of load balancers required to deliver these two different applications.

      But imagine how this would look if we had a hundred legacy applications and each of these used a classic load balancer; moving from version one to version two offers significant advantages, and one of those is consolidation. So now I just want to focus on some of the key points about application load balancers—things which are specific to the version two or application load balancer.

      First, it's a true layer seven load balancer and it's configured to listen on either HTTP or HTTPS protocols, which are layer seven application protocols that an application load balancer understands and can interpret information carried using both. Now, the flip side is that the application load balancer can't understand any other layer seven protocols—so things such as SMTP, SSH, or any custom gaming protocols are not supported by a layer seven load balancer like the application load balancer, and that's important to understand.

      Additionally, the application load balancer has to listen using HTTP or HTTPS listeners; it cannot be configured to directly listen using TCP, UDP, or TLS, and that does have some important limitations and considerations that you need to be aware of, which I’ll talk about later on in this lesson.

      Because it's a layer seven load balancer, it can understand layer seven content—so things like the type of the content, any cookies used by your application, custom headers, user location, and application behavior—meaning the application load balancer is able to inspect all of the layer seven application protocol information and make decisions based on that, something that the network load balancer cannot do. It has to be a layer seven load balancer, like the application load balancer, to understand all of these individual components.

      An important consideration about the application load balancer is that any incoming connections—HTTP or HTTPS (and remember HTTPS is just HTTP transiting using SSL or TLS)—in all of these cases, whichever type of connection is used, are terminated on the application load balancer. This means that you can't have an unbroken SSL connection from your customer through to your application instances—it’s always terminated on the load balancer, and then a new connection is made from the load balancer through to the application.

      This matters to security teams, and if your business operates in a strict security environment, this might be very important and, in some cases, can exclude using an application load balancer. It can't do end-to-end unbroken SSL encryption between a client and your application instances, and it also means that all application load balancers which use HTTPS must have SSL certificates installed on the load balancer, because the connection has to be terminated there and then a new connection made to the instances.

      Application load balancers are also slower than network load balancers because additional levels of the networking stack need to be processed, and the more levels involved, the more complexity and the slower the processing. So if you're facing any exam questions that are really strict on performance, you might want to look at network load balancers instead.

      A benefit of application load balancers is that, because they're layer seven, they can evaluate the application health at layer seven—in addition to just testing for a successful network connection, they can make an application layer request to the application to ensure that it's functioning correctly.

      Application load balancers also have the concept of rules, which direct connections arriving at a listener—so if you make a connection to a load balancer, what it does with that connection is determined by rules, which are processed in priority order. You can have many rules affecting a given set of traffic, and they’re processed in priority order, with the last one being the default catch-all rule, though you can add additional rules, each of which can have conditions.

      Conditions inside rules include checking host headers, HTTP headers, HTTP request methods, path patterns, query strings, and even source IP, meaning these rules can take different actions depending on the domain name requested (like categor.io or dogogram.io), the path (such as images or API), the query string, or even the source IP address of any customers connecting to that application load balancer.

      Rules can also have actions—these are the things the rules do with traffic: they can forward that traffic to a target group, redirect it to something else (maybe another domain name), provide a fixed HTTP response (like an error or success code), or perform authentication using OpenID or Cognito.

      Visually, this is how it looks: a simple application load balancer deployment with a single domain, categor.io, using one host-based rule with an attached SSL certificate. The rule uses host header as a condition and forward as an action, forwarding any connections for categor.io to the target group for the categor application.

      If you want additional functionality, let’s say that you want to use the same application load balancer for a corporate client trying to access categor.io—maybe users of Bowtie Incorporated using the 1.3.3.7 IP address are attempting to access it, and you want to present them with an alternative version of the application. You can handle that by defining a listener rule where the condition is the source IP address of 1.3.3.7, and the action forwards traffic to a separate target group—an auto scaling group handling a second set of instances dedicated to this corporate client.

      Because the application load balancer is a layer seven device, it can see inside the HTTP protocol and make decisions based on anything in that protocol or up to layer seven. Also, the connection from the load balancer to the instances for target group two will be a separate set of connections—highlighted by a slightly different color—because HTTP connections from enterprise users are terminated on the load balancer, with a separate connection to the application instances. There’s no option to pass through encrypted connections to the instances—it must be terminated—so if you need unbroken encrypted connections, you must use a network load balancer.

      Since it’s a layer seven load balancer, you can use rules that work on layer seven protocol elements, like routing based on paths or headers, or redirecting traffic at the HTTP level. For example, if this ALB also handles traffic for dogogram.io, you could define a rule that matches dogogram.io and, as an action, configure a redirect toward categor.io—the obviously superior website. These are just a small subset of features available within the application load balancer, and because it's layer seven, you can perform routing decisions based on anything observable at that level, making it a really flexible product.

      Before finishing, let’s take a look at network load balancers. They function at layer four, meaning they can interpret TCP, TLS, and UDP protocols, but have no visibility or understanding of HTTP or HTTPS. They can't interpret headers, see cookies, or handle session stickiness from an HTTP perspective, as that requires cookie awareness—which a layer four device doesn’t have.

      Network load balancers are incredibly fast, capable of handling millions of requests per second with about 25% of the latency of application load balancers, since they don't deal with upper layers of the stack. They’re ideal for non-HTTP or HTTPS protocols—such as SMTP (email), SSH, game servers, or financial applications that don’t use web protocols.

      If exam questions refer to non-web or non-secure web traffic that doesn’t use HTTP/HTTPS, default to network load balancers. One downside is that health checks only verify ICMP and basic TCP handshaking, not application awareness, so no detailed health checking is possible.

      A key benefit is that they can be allocated static IPs, which is useful for white-listing—corporate clients can white-list NLB IPs to let them pass through firewalls, which is great for strict security environments. They can also forward TCP directly to instances, and because network layers build on top of each other, the network load balancer doesn’t interrupt any layers above TCP, allowing unbroken encrypted channels from clients to application instances.

      This is essential to remember for the exam—using network load balancers with TCP listeners is how you achieve end-to-end encryption. They're also used for PrivateLink to provide services to other VPCs—another crucial exam point.

      To wrap up, let’s do a quick comparison. I find it easier to remember when to use a network load balancer, and if it’s not one of those cases, default to application load balancers for their added flexibility. Use network load balancers if you need unbroken encryption between client and instance, static IPs for white-listing, the best performance (millions of RPS and low latency), non-HTTP/HTTPS protocols, or PrivateLink.

      For everything else, use application load balancers—their functionality is often valuable in most scenarios. That’s everything I wanted to cover about application and network load balancers for the exam. Go ahead and complete this video, and when you're ready, I'll look forward to you joining me in the next.

    1. Briefing Document : L'Enfermement des Enfants - Thèmes et Idées Clés

      Ce document de briefing synthétise les principaux thèmes, idées et faits saillants ressortant des sources fournies concernant l'enfermement des enfants, tant dans un contexte migratoire que pénal, ainsi que les enjeux liés à leur prise en charge psychiatrique.

      1. L'Enfermement des Enfants : Une Exception qui Devient Norme ?

      La première source, un colloque sur l'enfermement des enfants, pose d'emblée le principe que l'enfermement d'un enfant devrait être une exception.

      "l'enfermement euh d'un enfant euh par essence et euh quelque chose de qui doit être exceptionnel et qui euh qui peut en tout cas au regard des des textes qui s'applique textes internationaux comme de nos textes fondamentaux euh être une exception si ce n'est une contrariété ou quelque chose de d'assez euh contreintuitif."

      L'enfant est défini comme un être en devenir, en construction, que la société pousse à l'ouverture et à la progression. L'enfermement, au contraire, constitue un arrêt et une fermeture, ce qui apparaît comme un non-sens au regard de la nature de l'enfance.

      Le colloque distingue deux formes principales d'enfermement :

      Hors champ pénal : Concerne les enfants en contexte migratoire (risque de menace à l'ordre public, demande d'asile à la frontière, mineurs étrangers en transit).

      Dans le champ pénal : Conséquence d'une décision de justice suite à une infraction.

      2. L'Enfermement des Enfants en Contexte Migratoire : Chiffres et Conséquences

      Les chiffres présentés soulignent une réalité préoccupante :

      Au niveau international : Environ 330 000 enfants sont privés de liberté chaque année dans un contexte migratoire.

      En France (estimation depuis 2012) : 40 000 enfants placés en rétention, dont plus de 1500 en métropole et l'immense majorité à Mayotte.

      En 2023 : 3349 enfants placés en rétention en France, dont 3262 à Mayotte.

      En zone d'attente (chiffres 2022) : Au moins 5338 enfants enfermés, contre 372 en 2021.

      L'âge moyen des enfants concernés est très faible.

      La durée de rétention peut être brève, mais ses conséquences sont réelles et graves sur la santé des enfants :

      "la communauté scientifique est assez unanime sur le fait que le enfermer un enfant particulier pour des raisons migratoires a de graves conséquences sur sur son état de santé"

      Ces conséquences incluent : repli sur soi, refus de s'alimenter, insomnies, stress, stress post-traumatique.

      Les enfants peuvent être exposés à des événements traumatisants en rétention (tentatives de suicide, automutilations, violences, éloignements sous contrainte).

      L'environnement global de la rétention est décrit comme anxiogène (bruits, grillages, barbelés, présence policière constante).

      Des psychiatres observent que même les bébés ressentent l'anxiété de cet environnement.

      La situation à Mayotte est particulièrement critique, avec un nombre d'enfants enfermés 37 fois supérieur à celui de l'Hexagone.

      Les conditions de prise en charge dans les centres de rétention administrative y sont qualifiées de "juste au-dessus du niveau terrain vague", avec des problèmes d'accès à l'eau et une violence moins organisée qu'en métropole mais bien présente en raison de l'ennui et du manque d'activités.

      La confusion entre zones d'attente et rétention administrative y règne.

      Sur le territoire hexagonal, les zones d'attente sont une notion juridique parfois sans existence physique concrète (chambres d'hôtels).

      La non-admission signifie que fictivement, les personnes en zone d'attente ne sont pas considérées comme étant en France.

      Le réacheminement vers le pays d'origine n'est pas autorisé pour les enfants.

      L'accès aux droits est problématique, notamment l'accès aux interprètes pour les personnes non francophones.

      Des situations absurdes sont relevées, comme des mineurs non contestés recevant des Obligations de Quitter le Territoire Français (OQTF).

      La question des nourrissons en zone d'attente est également soulevée.

      Un protocole départemental à Menton concernant l'accueil des mineurs non accompagnés a fait l'objet de critiques de la Défenseure des Droits et pourrait conduire à un contentieux.

      Le recours à l'incarcération dans le champ pénal est parfois utilisé par des magistrats comme une mesure de protection pour les mineurs non accompagnés, notamment victimes de trafic, par manque d'alternatives adéquates.

      Cela a pu entraîner une saturation des quartiers mineurs.

      Une circulaire ministérielle de 2023 est mentionnée, rappelant l'interdiction de placer en CRA ou LRA les familles accompagnées de mineurs (sauf à Mayotte) et incitant à transformer les anciennes places familles en places adultes.

      Elle rappelle également la nécessité d'exécuter les mesures d'éloignement et d'envisager l'assignation à résidence, tout en soulignant la compatibilité des obligations avec la vie privée et familiale (scolarisation des enfants).

      La possibilité d'éloignement de la famille lors d'une visite domiciliaire sur autorisation du juge est également évoquée, suscitant une vigilance particulière.

      La question des rattachements fictifs de mineurs à des adultes pour justifier leur éloignement est condamnée par la CEDH.

      Des modifications de dates de naissance pour transformer des mineurs en majeurs sont également signalées, notamment en zone d'attente en métropole.

      3. La Protection de l'Enfance et la Dimension Sémantique

      Un éducateur de la PJJ souligne la dimension sémantique préoccupante dans le débat public actuel, où la notion de protection des enfants en CRA ou en prison tend à disparaître au profit d'un discours sur les "enfants dangereux", notamment en lien avec les trafics de drogue.

      La situation à Mayotte est perçue comme dérogeant à un certain nombre de droits fondamentaux, posant la question de l'égalité des droits sur le territoire de la République.

      L'évolution des pratiques d'éloignement, passant de l'expulsion des parents laissant les enfants seuls à l'éloignement de toute la famille, est également source d'inquiétude au regard des besoins fondamentaux de l'enfant (liens avec ses parents).

      4. Soins Psychiatriques des Mineurs : Consentement, Isolement et Contention La question des soins sans consentement chez les mineurs est abordée, soulignant une "idiotie juridique" puisque, par définition, les soins (y compris l'hospitalisation) sont décidés par les parents ou les détenteurs de l'autorité parentale, sur indication médicale.

      La seule exception est la décision du représentant de l'État (SDRE). Le consentement du mineur n'est pas requis stricto sensu pour l'hospitalisation.

      Cependant, la dimension clinique et humaine reste essentielle, et l'adhésion du mineur et de ses parents aux soins est un enjeu important, tout comme la préparation de la sortie et le maintien de l'adhésion.

      Le pédopsychiatre intervenant souligne que, bien que le consentement juridique soit parental, la question de la prise en compte de l'opinion de l'enfant et de l'adolescent dans les décisions de soins est fondamentale.

      L'isolement et la contention sont des pratiques utilisées, mais dans des contextes spécifiques et pas toujours en urgence, notamment pour des troubles importants du comportement ou des troubles du spectre autistique.

      La DGOS travaille à une réforme des autorisations d'activités en psychiatrie, entrée en vigueur en juin 2023, visant à homogénéiser la qualité de la prise en charge, notamment en pédopsychiatrie.

      Désormais, un établissement souhaitant faire de la pédopsychiatrie devra être autorisé spécifiquement, de même que pour les soins sans consentement.

      L'offre de soins devra comprendre l'ambulatoire, l'hospitalisation partielle et à temps complet. La prise en charge en pédopsychiatrie s'étend désormais jusqu'à 18 ans.

      La décision médicale d'isolement ou de contention doit être une mesure de dernier recours, non une sanction ou une facilitation pour les soignants, et ne peut concerner que les patients en soins sans consentement.

      Un contrôle du juge est prévu, avec une durée maximale et des réévaluations médicales régulières.

      Cependant, des situations persistent où des mineurs en soins libres font l'objet de mesures d'isolement ou de contention, soulevant un flou jurisprudentiel quant à la compétence du juge dans ces cas.

      Le ministère de la Santé reste convaincu que ces décisions sont justifiées médicalement. Un travail est nécessaire avec le ministère de la Justice, le CGLPL et les professionnels pour clarifier et améliorer cette situation.

      La question du statut du mineur hospitalisé en psychiatrie à la demande de ses parents mérite d'être réexaminée.

      5. Le Contexte de la Psychiatrie Infantile et les Défis La pédopsychiatrie en France est confrontée à une crise démographique majeure et à un modèle psychiatrique qui peine à s'aligner sur les directives internationales prônant la désinstitutionnalisation.

      La diversité doctrinale complexifie la compréhension et l'application du droit.

      La situation est encore plus critique en pédopsychiatrie, avec des départements entiers sans service d'hospitalisation à temps complet, entraînant une hospitalisation massive en état de crise par manque de prévention.

      Le statut juridique des mineurs en crise diffère de celui des adultes.

      Dès lors qu'il y a une décision d'autorité parentale (ou une ordonnance de placement provisoire), le mineur est considéré comme étant en soins libres, même en cas d'hospitalisation.

      Cette fiction juridique signifie qu'il ne bénéficie pas des mêmes droits et protections que les patients en soins sans consentement (droits de la défense, contrôle du JLD).

      L'isolement et la contention ne sont pas autorisés en soins libres. Cependant, le CGLPL constate que cette règle n'est pas toujours respectée, avec un flou jurisprudentiel persistant et des juges parfois réticents à se déclarer compétents ou à lever les mesures illégales.

      Le CGLPL, bien que reconnaissant le manque de base légale de ces pratiques, peine à formuler des recommandations concrètes et hésite à recourir à l'article 40 du Code de procédure pénale (signalement d'un acte de séquestration).

      Le rôle des avocats est crucial pour la progression du droit de la psychiatrie. Il est nécessaire de continuer à saisir les tribunaux, y compris le juge administratif sur la question des conditions indignes d'hospitalisation, en s'inspirant des avancées obtenues pour les conditions de détention.

      Un JLD spécialisé pourrait être une piste à explorer, bien que la tendance actuelle soit à la déspécialisation.

      La prise en compte de l'opinion de l'enfant dans les décisions de soins est un principe fondamental (article 12 de la Convention des droits de l'enfant), qui doit être mieux intégré dans les procédures, y compris par le JLD.

      Le rôle de l'assistance éducative et de l'administrateur ad hoc peut être clé pour garantir la protection des droits de l'enfant.

      6. L'Action du Contrôleur Général des Lieux de Privation de Liberté (CGLPL)

      Le CGLPL visite régulièrement les quartiers mineurs (QM) et les établissements pénitentiaires pour mineurs (EPM), parfois plusieurs fois. Il reçoit très peu de saisines directes de la part des mineurs incarcérés.

      Les contrôleurs auditionnent les mineurs seuls en cellule ou en salle d'audience, inspectent les conditions matérielles de détention et discutent des conditions de vie. Ils constatent souvent un manque d'accès à des activités adaptées et un ennui important.

      La violence est présente mais moins organisée qu'en milieu adulte. L'offre de formation et d'enseignement est essentielle mais doit être adaptée.

      La question des transferts vers des établissements pour majeurs à la majorité est un point de vigilance.

      Le CGLPL dispose d'une ligne téléphonique confidentielle pour les détenus. Il publie des rapports et fait des recommandations pour améliorer les conditions de détention et le respect des droits.

      7. Le Rôle du Bâtonnier et l'Opération "Diego"

      Depuis 2021, l'article 719 du Code de procédure pénale confère au Bâtonnier un droit de visite dans les lieux de privation de liberté. Un guide pratique a été élaboré pour faciliter l'exercice de ce droit.

      L'opération "Diego", menée par la Conférence des Bâtonniers, vise à rendre effectif ce droit de visite et à vérifier le respect des droits fondamentaux des mineurs en détention (éducation, santé, dignité, respect).

      Des affiches informant les mineurs de la possibilité de consultations gratuites avec des avocats sont installées dans les lieux de parloir.

      De nombreux barreaux se sont mobilisés et ont intégré cette prérogative. Cependant, des difficultés persistent, notamment en ce qui concerne l'accès aux établissements pénitentiaires (circulaires limitant l'accompagnement du Bâtonnier) et aux Centres Éducatifs Fermés (CEF), où l'accès peut être plus complexe.

      Les visites permettent de constater des améliorations concrètes, comme des travaux de rénovation d'urgence. Le Bâtonnier peut également saisir les autorités compétentes en cas de manquements constatés.

      8. La Situation des Mineurs Incarcérés : Perspectives de la Protection Judiciaire de la Jeunesse (PJJ)

      La PJJ a une mission conjointe avec l'administration pénitentiaire pour accompagner les mineurs détenus et organiser l'individualisation de leur parcours.

      L'intervention éducative est assurée soit par des services éducatifs en détention (EPM, gros QM) soit par les services de milieu ouvert (autres établissements).

      Les services de la PJJ sont chargés de construire et de proposer au magistrat un projet de sortie individualisé.

      Un travail est mené avec les familles pour maintenir le lien et les impliquer dans le projet de sortie.

      Des initiatives concrètes sont développées en détention (cafés des parents, goûters préparés par les mineurs, brochures d'information, réunions de remise de bulletins).

      La PJJ favorise l'accès aux droits des mineurs détenus et l'information aux familles, notamment en participant à la grande consultation du Défenseur des Droits et en élaborant un guide d'accès aux droits adapté aux jeunes.

      Le travail sur le rapport à l'altérité et la citoyenneté est également important.

      L'accès à l'enseignement et à la formation est une obligation (Code de l'éducation).

      Des personnels de l'Éducation Nationale sont présents en détention, mais les spécificités de ce public nécessitent des adaptations.

      Des certifications peuvent être obtenues en détention. Des projets d'orientation sont mis en place.

      9. Peut-on Rendre Utile la Prison pour les Enfants ? Débats et Perspectives

      La question centrale est de savoir si la prison peut être utile pour les enfants.

      L'avocate Amélie Mourino exprime un doute profond, notamment en ce qui concerne la détention provisoire, qui ne semble utile qu'en termes de prévention de la réitération immédiate, sans prendre en compte le sens pour le mineur. Les délais courts de la détention provisoire ne permettent pas un travail significatif avec le jeune.

      Concernant la prison comme peine, elle rejoint la notion de désistance, un processus long et complexe, marqué par des rechutes et des ambivalences, qui dépend de nombreux facteurs environnementaux et de la création de nouveaux liens sociaux pro-sociaux.

      La prison a tendance à isoler et à couper des alliés potentiels. Le temps de la détention est rarement celui de la désistance.

      Les mécanismes d'aménagement de peine, souvent automatiques et rapides, peuvent perdre de leur sens. Rares sont les mineurs pour lesquels la prison a été un coup d'arrêt définitif à la délinquance ; ceux-là auraient probablement pu être aidés par la seule rencontre avec la justice des mineurs.

      Les conditions d'incarcération des mineurs sont également un sujet préoccupant.

      La juge pour enfants Aurélie Champion aborde la question de l'utilité de la détention à travers la notion de parcours du mineur.

      L'utilité ne se mesure pas à l'instant de la détention mais sur un temps plus long, si on parvient à lui donner du sens et à partager ce sens avec le mineur.

      La pensée symbolique de la prison comme menace peut avoir un impact pour certains mineurs, mais pas pour les plus fragiles.

      Il est essentiel de connaître le parcours du mineur et les facteurs de vulnérabilité et de protection pour prendre des décisions cohérentes, notamment celle du placement en détention, et éviter qu'elle n'arrive au "mauvais moment".

      L'audience unique, dans le cadre du CJPM, confie la décision du placement en détention provisoire à des magistrats qui ne connaissent pas le mineur, marquant une rupture potentielle dans son parcours. Des aménagements existent pour un second examen par le juge des enfants.

      La détention provisoire sert souvent à préparer un projet de sortie, parfois sans réelle participation du mineur.

      La question du post-sentenciel, avec l'application automatique des aménagements de peine, pose la question du sens de la détention si le suivi n'est pas effectif.

      La PJJ souligne que, malgré la contrainte de la détention, un travail de fond est mené pour donner du sens à cette période, en se fondant sur le droit commun et en tendant vers le respect de l'obligation de scolarité et de formation.

      L'individualisation du parcours, le maintien du lien familial et la préparation de la sortie sont des axes essentiels. Des initiatives concrètes sont développées en détention pour soutenir ces objectifs.

      Un débat émerge concernant l'efficacité des dispositifs de "sentinelles" (jeunes pairs aidant) en détention, soulignant la nécessité d'une approche collective et de la sensibilisation de l'ensemble de la communauté éducative.

      Face au discours ambiant favorisant la répression et l'incarcération des mineurs, il est rappelé l'importance de considérer le parcours de fragilité de ces jeunes et le rôle essentiel de la justice restaurative, encore trop peu développée pour les mineurs en France.

      Le soutien du Conseil National des Barreaux à la PJJ, face aux contraintes budgétaires, souligne l'importance des moyens dédiés à la prise en charge éducative.

      En conclusion, la question de l'utilité de la prison pour les enfants reste ouverte et complexe.

      Si la nécessité de la contrainte peut être admise dans certains cas, un consensus semble émerger sur l'importance de privilégier des approches éducatives individualisées, de maintenir le lien familial, de préparer activement la sortie et de favoriser l'accès aux droits et à la formation, afin de donner un sens à cette période et de favoriser la désistance à long terme.

      Les conditions d'enfermement et le respect des droits fondamentaux des mineurs doivent être une priorité constante.

    1. Briefing Document : Justice Pénale, Justice Restaurative et leurs Applications Diverses

      Ce document de briefing synthétise les principaux thèmes et idées issus des sources fournies, explorant la justice restaurative dans le contexte de la justice pénale, ses modalités de mise en œuvre, ses bénéfices potentiels et les défis rencontrés dans son application, notamment dans des cas d'infractions graves et en matière environnementale.

      Thèmes Principaux et Idées Clés

      1. Définition et Champ d'Application de la Justice Restaurative :

      La justice restaurative est présentée comme une réflexion alternative à la justice pénale traditionnelle, cherchant à impliquer les auteurs et les victimes d'infractions dans un processus de réparation et de dialogue.

      Bien que la loi française n'exclue aucune infraction du champ de la justice restaurative, des limitations peuvent exister dans la pratique au niveau local en raison de considérations de moyens, de disponibilité des intervenants, de politiques locales et de la nature de certaines infractions (par exemple, les violences conjugales font souvent débat).

      "même si la loi ne n'interdit pas enfin n'exclut aucune infraction du champ de la justice restaurative certains acteurs locaux qui vont considérer que elle n'est pas adapté pour tel ou tel tel ou telle nature d'infraction euh notamment les violences conjugal ça peut faire souvent débat"

      La participation des auteurs et des victimes doit être pleinement consentie et ils ont la possibilité de se retirer du dispositif à tout moment.

      La reconnaissance des faits par l'auteur est une condition nécessaire, non pas dans les moindres détails, mais dans leur principe et leurs éléments principaux, y compris l'élément intentionnel.

      "c'est un dispositif qui implique nécessairement la reconnaissance des faits par son auteur alors la loi et le le code de procédure pénale on trouve un peu partout cette cette référence à la reconnaissance des faits"

      2. Autonomie et Complémentarité avec la Justice Pénale :

      La justice restaurative est autonome par rapport à la procédure pénale et peut être mise en œuvre à n'importe quel stade de la procédure (avant, pendant, après) voire même en l'absence de procédure (infractions prescrites, classement sans suite).

      En théorie, la participation à une mesure de justice restaurative n'a pas d'influence directe sur la procédure pénale, l'exécution de la peine ou l'indemnisation de la victime. Une décision judiciaire ne peut être motivée sur cette seule base.

      Cependant, une incidence psychologique sur les décisions des magistrats n'est pas totalement exclue.

      "on évoque régulièrement l'absence d'influence sur la procédure judiciaire en ce que effectivement la justice restaurative n'aura aucune conséquence sur la procédure pénale sur l'exécution de la peine"

      3. Acteurs et Mise en Œuvre de la Justice Restaurative :

      La mise en œuvre de la justice restaurative repose sur un partenariat entre divers acteurs : les services de la Protection Judiciaire de la Jeunesse (PJJ), les Services Pénitentiaires d'Insertion et de Probation (SPIP), les associations d'aide aux victimes, la juridiction (principalement en tant que prescripteur), le barreau (de plus en plus impliqué).

      Au niveau local, ce partenariat se formalise souvent par des conventions et des réunions régulières (groupes projets) pour coordonner les mesures.

      Au niveau national, le Ministère de la Justice joue un rôle central à travers le Comité National de la Justice Restaurative, instance interdirectionnelle chargée du développement de la justice restaurative. Le ministère finance également la formation des acteurs et soutient les associations spécialisées (comme la FGR).

      4. Défis et Freins au Développement de la Justice Restaurative :

      Manque de visibilité et caractère encore confidentiel : Malgré un cadre légal favorable, le nombre de mesures de justice restaurative reste très faible par rapport au volume des condamnations pénales.

      En 2023, on comptait 302 mesures pour plus de 500 000 condamnations délictuelles et 2400 condamnations criminelles.

      Comment rendre la justice restaurative plus visible et accessible aux bénéficiaires potentiels reste une question cruciale.

      Manque de moyens (notamment budgétaires) : Le budget alloué à la justice restaurative est jugé largement insuffisant pour permettre un développement à grande échelle.

      Le coût est estimé à environ 1 200 000 € en France, une fraction infime du budget de la justice.

      Le nombre de professionnels formés et certifiés (environ 1000 animateurs) est également insuffisant pour répondre à une demande potentielle beaucoup plus importante.

      Le financement par l'auteur à travers la "contribution citoyenne" est principalement destiné à l'aide aux victimes et non directement au financement des mesures de justice restaurative pour cet auteur.

      Résistances et manque d'implication de certains acteurs : Les autorités judiciaires ne sont pas toujours motrices dans la prescription de mesures de justice restaurative, ce qui peut s'expliquer par la nature complémentaire du dispositif par rapport à leur office traditionnel.

      Difficultés opérationnelles : Des problèmes logistiques, comme le manque d'information sur les personnes formées dans différents établissements pénitentiaires, peuvent entraver la continuité des mesures.

      Craintes spécifiques liées à certaines infractions : En matière de terrorisme, par exemple, des freins importants peuvent exister au niveau de la magistrature, des avocats et de l'administration pénitentiaire.

      5. Bénéfices et Potentiel de la Justice Restaurative :

      La justice restaurative est perçue comme un outil puissant de réparation pour les victimes, favorisant l'apaisement, la compréhension et potentiellement le pardon. Le témoignage de Nathalie illustre profondément ce potentiel de guérison et de réappropriation de son histoire.

      Pour les auteurs, elle peut favoriser la prise de conscience des conséquences de leurs actes, la reconnaissance de leur responsabilité et leur réinsertion sociale.

      Sur le plan économique et social, des études suggèrent que la justice restaurative pourrait générer des économies significatives en termes de coûts de justice pénale et de réduction de la récidive. Le rapport coût-bénéfice est estimé à 1 € investi pour 8 € économisés.

      Elle permet de restaurer le lien social et de reconstruire un rapport à l'État et à la justice parfois fragilisé.

      6. Application à des Contextes Spécifiques : Terrorisme et Environnement :

      Terrorisme : Bien que représentant une infime partie des mesures de justice restaurative, l'expérience montre que l'approche est applicable aux infractions à caractère terroriste, sans limitation légale.

      L'accent est mis sur la reconnaissance de responsabilité de l'auteur, l'accompagnement individualisé et sécurisé, et la prise en compte de toutes les personnes touchées (y compris la famille des auteurs).

      Les craintes liées aux justifications idéologiques ou à l'absence de remords existent, mais sont similaires à celles rencontrées dans d'autres types d'infractions.

      Les expériences partagées soulignent le potentiel de dialogue et de prise de conscience, même dans des contextes d'extrême violence.

      Environnement : La justice restaurative est envisagée comme une réponse possible aux infractions environnementales, axée sur la réparation des préjudices (souvent envers la nature, mais avec des conséquences pour les humains).

      L'intégration de la justice restaurative dans le cadre institutionnel existant (pôles régionaux environnement, magistrats référents) est possible, mais nécessite une mobilisation des associations environnementales et une meilleure reconnaissance de leur rôle.

      Des défis persistent, notamment en termes de reconnaissance de culpabilité, d'efficacité de la justice environnementale et de la place des victimes (souvent les associations de protection de l'environnement).

      L'exemple de la justice transitionnelle en Colombie, reconnaissant la nature comme victime, offre des perspectives intéressantes, bien que difficilement transposables en l'état. Conclusion :

      La justice restaurative représente une approche prometteuse et potentiellement transformatrice de la justice, complémentaire à la justice pénale traditionnelle.

      Bien qu'elle soit applicable à un large éventail d'infractions, y compris les plus graves, son développement se heurte à des obstacles significatifs, principalement liés au manque de moyens et à des résistances culturelles ou institutionnelles.

      Les expériences menées, notamment dans le domaine du terrorisme, soulignent son potentiel de réparation et de dialogue.

      L'intégration de la justice restaurative dans des domaines spécifiques comme l'environnement offre également des perspectives intéressantes, à condition de surmonter les défis propres à ces contextes.

      Un engagement politique et des moyens accrus sont nécessaires pour que la justice restaurative puisse pleinement déployer son potentiel et devenir une composante essentielle du paysage judiciaire français.

    1. Briefing Doc : Soins Psychiatriques Sans Consentement - Module 1 : Hospitalisations Complètes et Programmes de Soins

      Date : 18 fev 2025

      Source : Excerpts du "Colloque - Les soins sans consentement 1/2" avec Anne Sophie Lpinard (CNB)

      Introduction

      Ce document présente une synthèse des principaux thèmes et idées abordés lors du premier module d'un colloque consacré aux soins psychiatriques sans consentement, animé par Anne Sophie Lpinard, Présidente de la commission accès au droit et à la justice du Conseil National des Barreaux (CNB) et membre de la commission libertés et droits de l'homme.

      Ce premier module s'est concentré sur les hospitalisations complètes et les programmes de soins, tandis que le second module abordera l'isolement et la contention.

      L'intervention d'Anne Sophie Lpinard a souligné l'importance de cette thématique au regard de la privation de liberté qu'impliquent ces mesures et de leur impact profond sur la vie et les droits fondamentaux des personnes concernées.

      Elle a également insisté sur la variabilité des pratiques territoriales et sur le rôle essentiel de l'avocat dans ce contexte de vulnérabilité, en rappelant que l'avocat ne doit pas être un observateur passif mais un acteur actif (CEDH, arrêt MS contre Croatie, 19 février 2015).

      Plan de l'Intervention

      L'intervention a suivi le plan suivant :

      Les différents types de mesures de soins psychiatriques sans consentement. Les rôles des différents professionnels (avec un focus particulier sur l'avocat et le magistrat). Les jurisprudences pertinentes (nationales et européennes).

      Points Clés et Idées Principales

      1. Évolution Législative

      Loi du 5 juillet 2011 : Cette loi a créé le contentieux de l'hospitalisation complète et du programme de soins tel qu'il existe aujourd'hui, remplaçant la loi de 1990.

      Elle a introduit le terme de "soins psychiatriques sans consentement" et créé les "soins en péril imminent".

      Le principe reste celui des soins libres, l'intervention de l'avocat étant initialement facultative, devenant obligatoire en cas d'impossibilité de comparution du patient.

      Citation : "les soins psychiatriques sans consentement constituent vous le savez une privation de liberté c'est euh un principe qui a été reconnu par le Conseil constitutionnel et à ce titre affecte du coup profondément la vie et les droits fondamentaux des personnes qui font l'objet de ces mesures."

      Citation : "On parle à partir de 2011 de soins psychiatrique sans consentement et non plus de l'hospitalisation d'office ou à la demande d'Er comme on le faisait avant et cette loi est également venu créer les soins en péril imminant."

      Loi du 27 septembre 2013 : Cette loi a rendu l'assistance de l'avocat obligatoire dans le cadre des contrôles des mesures d'hospitalisation complète et a ramené le délai de contrôle systématique des hospitalisations complètes de 15 à 12 jours.

      Citation : "la loi du 27 septembre 2013 qui a rendu l'assistance de l'avocat obligatoire dans le cadre des contrôles des mesures d'hospitalisation complète."

      Loi du 20 novembre 2023 (article 44) : Cette loi a transféré certaines compétences civiles du Juge des Libertés et de la Détention (JLD), notamment en matière de soins psychiatriques sans consentement, au profit du "magistrat du siège du tribunal judiciaire", avec une entrée en vigueur au 1er septembre 2024.

      2. Typologie des Mesures de Soins Psychiatriques Sans Consentement

      Principe : Consentement aux soins. L'exception réside dans les soins sans consentement.

      Soins à la demande du représentant de l'État (SDRE) : Prononcés par le préfet sur la base d'un certificat médical circonstancié (émanant d'un psychiatre extérieur à l'établissement) et d'un arrêté préfectoral. Les critères sont la nécessité de soins et la compromission de la sûreté des personnes ou une atteinte grave à l'ordre public.

      Soins à la demande d'un tiers (SDT) : Ordonnés par le directeur de l'établissement lorsque les troubles mentaux rendent impossible le consentement et que l'état mental impose des soins immédiats avec surveillance médicale constante justifiant une hospitalisation complète. Nécessitent une demande d'un tiers (famille ou personne justifiant de relations antérieures) et deux certificats médicaux circonstanciés de moins de 15 jours.

      Soins à la demande d'un tiers en urgence (SDTU) : Possible en cas d'urgence et de risque grave d'atteinte à l'intégrité du malade, sur la base d'un seul certificat médical (pouvant être établi par un médecin de l'établissement).

      Soins en péril imminent (SPI) : Nécessitent l'impossibilité de consentir et la nécessité d'une surveillance médicale constante. Le médecin établissant le certificat ne doit pas exercer dans l'établissement, et les certificats de 24h et 72h doivent être établis par des médecins distincts.

      Soins ordonnés dans le cadre d'une décision d'irresponsabilité pénale : Règles similaires à la SDRE avec des spécificités, notamment la transmission de la décision d'irresponsabilité.

      3. Modalités de Mise en Œuvre de la Contrainte Hospitalisation complète : Le patient est hospitalisé à temps complet dans un établissement habilité. Des aménagements sont possibles (sorties courtes, accompagnées ou non, avec des durées maximales), sans remettre en cause la nature de l'hospitalisation complète.

      Programme de soins : Toute forme de soins autre que l'hospitalisation complète (ambulatoire, à domicile, séjours courts à temps complet). Peut succéder à une hospitalisation complète, maintenant la contrainte et la compétence du magistrat du siège.

      Modification de la prise en charge : Le psychiatre peut à tout moment proposer la transformation d'une hospitalisation complète en programme de soins ou, inversement, la réintégration en hospitalisation complète si le programme de soins n'est plus adapté (non-respect du traitement, dégradation de l'état).

      4. Droits Généraux des Patients

      Applicables à toutes les formes de soins sans consentement (SDRE, SDT, SPI) et soulignant la nécessité d'adaptation, de nécessité et de proportionnalité des restrictions aux libertés individuelles :

      Adaptation, nécessité et proportionnalité des restrictions : Les restrictions doivent être adaptées à l'état mental et au traitement requis (article L3211-3 du Code de la Santé Publique).

      Citation : "les restrictions qui sont imposées à la personne faisant l'objet des soins donc les restrictions à l'exercice des libertés individuelles de la personne doivent être adapté nécessaires et proportionné à son état mental et à la mise en œuvre du traitement requis."

      Respect de la dignité et recherche de réinsertion.

      Droit à l'information : Information sur le projet de décision, la suite envisagée, et possibilité de faire valoir ses observations. L'avis de la personne doit être recherché.

      Autres droits : Communiquer avec les autorités consulaires, saisir la commission départementale, saisir le Contrôleur général des lieux de privation de liberté, prendre conseil auprès d'un médecin et d'un avocat de son choix, émettre et recevoir du courrier, consulter le règlement intérieur, exercer son droit de vote, pratiquer une activité religieuse ou philosophique de son choix.

      Préservation du logement (article L3211-7 du CSP).

      Droit à l'oubli : Conservation de tous les droits et devoirs de citoyen à l'issue des soins, sans que les antécédents psychiatriques puissent être opposés (article L3211-5 du CSP).

      5. Rôles des Professionnels du Droit

      Magistrat du siège du tribunal judiciaire (ex-JLD) :Contrôle systématique : Des hospitalisations complètes (saisine par le directeur ou le représentant de l'État) dans les délais de 8 jours (saisine) et 12 jours (statut) à compter de l'admission, puis tous les 6 mois (saisine 15 jours avant l'échéance, statut avant l'échéance). Le même délai de 12 jours s'applique en cas de réintégration.

      Contrôle facultatif : Sur saisine de la personne, de son entourage, de son tuteur ou du procureur, ou d'office. Pas de délai spécifique pour la saisine. Le magistrat contrôle la régularité formelle et le bien-fondé de la mesure (non pas l'aspect médical).

      Avocat :Rôle essentiel : Assurer la défense des droits de la personne vulnérable.

      Obligatoire : Pour les contrôles systématiques d'hospitalisation complète (si la personne ne peut comparaître, l'avocat la représente).

      Analyse du dossier : Vérification des pièces (certificats, décisions, arrêtés), des délais, de la régularité des décisions administratives (délégations de signature).

      Entretien avec le patient : Même si inaudible pour le magistrat, l'avocat doit s'entretenir avec la personne (si possible) pour l'informer, recueillir ses observations et porter sa parole. Confidentialité de l'entretien. Adaptation du discours à l'état de santé.

      Préparation de l'audience : Conclusions écrites en cas d'irrégularités.

      Débat devant le magistrat : Soulever les arguments de procédure et de fond (adaptation, nécessité, proportionnalité), sans se substituer au médecin. Information sur la décision et les voies de recours (appel dans les 10 jours de la notification).

      Mandat de l'avocat : Articulation entre le mandat du client et l'obligation légale d'assistance.

      6. Voies de Recours

      Appel : Dans les 10 jours de la notification de la décision du magistrat du siège. Interjeté auprès du Premier Président de la Cour d'Appel ou de son délégué.

      L'appel n'est pas suspensif, sauf demande du Ministère Public (procédure spécifique). Déclaration d'appel motivée obligatoire (sauf pour le patient). Délai pour statuer en appel : 12 jours (sauf expertise : 25 jours ; appel suspensif : 3 jours).

      Pourvoi en Cassation : Ouvert à toute partie ayant intérêt et au Ministère Public. Règles habituelles du Code de Procédure Civile et du Code de la Santé Publique applicables.

      7. Jurisprudence Pertinente (Points Saillants)

      Cour de Cassation :26 octobre 2022 (eurodattage) : Exigence d'eurodattage (heure et minute) des certificats de 24h et 72h pour vérifier le respect des délais.

      18 mai 2022 (avis - mineurs) : Clarification sur l'application des soins sans consentement aux mineurs (autorité parentale privilégiée, SDRE comme seule mesure de contrainte).

      26 octobre 2022 (SPI - information famille) : Obligation d'informer la famille dans les 24h (sauf difficulté particulière, notamment refus de la personne).

      Jurisprudence constante (absence de certificat mensuel) : Entraîne la main levée de la mesure.

      Jurisprudence sur la motivation de l'arrêté préfectoral (SDRE) : Possibilité de renvoi au certificat médical à condition de s'en approprier le contenu et de caractériser l'atteinte à l'ordre public.

      Tribunal des Conflits, 3 juillet 2023 : L'autorité judiciaire est compétente pour statuer sur les demandes de sortie d'UMD et de transfert.

      Jurisprudence sur le mandat de l'avocat : L'avocat n'a pas à justifier de son mandat pour interjeter appel.

      26 octobre 2022 (avis Cour d'Appel) : Le non-respect du délai de 48h pour la transmission de l'avis psychiatrique à la Cour d'Appel n'entraîne pas de main levée automatique si un débat contradictoire a été possible.

      Cour Européenne des Droits de l'Homme (CEDH) :Article 5 de la Convention Européenne des Droits de l'Homme : Base du contrôle de la privation de liberté des personnes aliénées.

      Arrêt MS contre Croatie (2015) et Coutura contre Croatie (2019) : Rôle actif de l'avocat exigé. L'avocat ne doit pas être un observateur passif mais interagir avec la personne et avoir un rôle juridique effectif.

      Jurisprudence sur la nécessité de la mesure : Critères de contrôle pour vérifier l'action de l'État.

      Importance d'une prise en charge complète et adaptée : Nécessité d'une prise en charge pluridisciplinaire axée sur la réinsertion.

      8. Modalités d'Intervention de l'Avocat

      Aide Juridictionnelle Garantie : La mission de soins psychiatriques sans consentement (hospitalisation complète, isolement, contention) est couverte par l'AJ garantie en cas de commission d'office.

      Honoraires : Libres en cas de choix par le patient (prudence sur les conventions d'honoraires). AJ possible en cas de choix si la personne est éligible.

      Conclusion Ce premier module a permis de dresser un panorama complet des hospitalisations complètes et des programmes de soins dans le cadre des soins psychiatriques sans consentement, en abordant les aspects législatifs, les différents types de mesures, les droits des patients, les rôles des professionnels du droit et les jurisprudences clés.

      L'accent a été mis sur le rôle actif et essentiel de l'avocat dans la défense des droits des personnes vulnérables soumises à ces mesures de privation de liberté.

      La jurisprudence de la CEDH offre des perspectives intéressantes pour faire évoluer les pratiques et renforcer le contrôle de ces mesures.

    1. #if defined(CONFIG_ENERGY_MODEL) && defined(CONFIG_CPU_FREQ_GOV_SCHEDUTIL) /* Build perf. domains: */ for (i = 0; i < ndoms_new; i++) { for (j = 0; j < n && !sched_energy_update; j++) { if (cpumask_equal(doms_new[i], doms_cur[j]) && cpu_rq(cpumask_first(doms_cur[j]))->rd->pd) { has_eas = true; goto match3; } } /* No match - add perf. domains for a new rd */ has_eas |= build_perf_domains(doms_new[i]);

      Policy that creates performance domains if EAS is enabled. This means that the groups of CPUs inside each domain is governed by schedutil + EAS. Schedutil is the only CPU governor that fits EAS.

    1. Briefing Document : Éducation à la Sexualité en France

      Source : Excerpts de "Texte collé" (Transcription d'un webinaire)

      Date du document : Inconnu (le webinaire semble être postérieur au 3 février 2024, date de publication du programme d'éducation à la sexualité)

      Objectif du document : Analyser les principaux thèmes, idées et faits saillants concernant l'éducation à la sexualité en France, tels qu'exprimés par les intervenants du webinaire.

      Intervenants principaux :

      • Marc Pelletier : (Fonction non explicitement mentionnée, mais semble être impliqué dans l'élaboration des programmes d'éducation à la sexualité au niveau national - probablement au Conseil Supérieur des Programmes).
      • Sarah Durocher : Représentante du Planning Familial.
      • Adeline Gouttenoire : Animatrice du webinaire.
      • Samira Meguellati : (Rôle non explicitement mentionné, pose des questions).

      Thèmes Principaux et Idées Clés :

      1. Contexte et Nécessité de l'Éducation à la Sexualité :

      • Obligation légale : L'éducation à la sexualité est une éducation obligatoire en France, inscrite par le législateur. ("le législateur a choisi de l'inscrire parmi les éducations obligatoires." - 00:04:17-00:04:19).

      • Évolution sociétale : La prise de conscience et l'importance accordée à cette question ont grandi avec l'évolution de la société, notamment l'autonomie croissante de la jeunesse, l'évolution des modes de socialisation, l'importance des médias et l'évolution des droits, en particulier les droits des femmes. ("à mesure que la jeunesse est devenue un peu plus autonome, que les modalités de socialisation de la jeunesse ont également évolué... les médias sont devenus progressivement plus importants à mesure également que la société se transformait, notamment à travers l'évolution des droits et notamment des droits des femmes.

      Et bien de nouvelles questions sont apparues et progressivement, on a eu une attention...” - 00:04:37-00:05:08).

      • Réponse aux enjeux actuels : L'éducation à la sexualité est essentielle pour répondre aux questions que se posent les jeunes aujourd'hui dans la société. ("très directement finalement aux questions qui se posent aujourd'hui dans la société." - 00:11:16-00:11:21).

      • Lutte contre la désinformation : Il existe de nombreuses idées reçues et une désinformation importante concernant la sexualité, rendant l'éducation cruciale. (Sarah Durocher souligne la désinformation autour du Planning Familial - 00:12:39-00:12:41).

      2. Évolution du Concept : De l'Information Sexuelle à l'Éducation à la Sexualité :

      • Élargissement du champ : L'approche a évolué d'une simple information biologique (corps humain, procréation) vers une éducation plus holistique intégrant des dimensions psychologiques (respect de soi, intimité), affectives, sociales (égalité fille-garçon, prévention des violences, droits).

      ("Ce n'est que progressivement dans les années 70 80 90, que cette information s'est élargie et progressivement devenue une éducation à la sexualité, c'est-à-dire comme lorsqu'on passe d'une information sexuelle à une éducation à la sexualité, on aborde de nouvelles dimensions au champ biologique... on ajoute des dimensions psychologiques... affectives... sociales...” - 00:06:11-00:06:55).

      • Valeurs fondamentales : L'éducation à la sexualité est centrée autour de la tolérance, la liberté, le respect de soi et d'autrui, et vise à aider les élèves à adopter des attitudes responsables.

      ("et qui est d'emblée, d'ailleurs, dès les années 90, qui est centrée autour de la tolérance, la liberté, le respect de soi et d'autrui et d'aider les individus, les élèves à s'intégrer finalement dans à adopter des attitudes responsables en termes individuels, familiales et sociales." - 00:07:09-00:07:27).

      3. Le Nouveau Programme d'Éducation à la Sexualité :

      • Impulsion législative : La loi de 2001 relative à l'interruption volontaire de grossesse et à la contraception a marqué une étape importante en étant déclinée dans le code de l'éducation, rendant l'éducation à la sexualité obligatoire à l'école.

      • Élaboration du programme : Un nouveau programme a été élaboré par le Conseil Supérieur des Programmes (CSP) suite à une lettre de saisine de 2023.

      Un groupe d'experts a été constitué de manière transparente.

      • Objectifs principaux de la lettre de saisine :Attention particulière au premier degré.

      • Articulation avec les trois piliers de l'éducation à la sexualité

      • dimensions biologiques,
      • socio-culturelles et
      • psycho-affectives).

      • Inscription dans le cadre du développement des compétences psycho-sociales des élèves.

      • Adoption et publication : Le programme a été adopté en janvier 2024 et publié le 3 février 2024, à la demande de la ministre.

      • Réponse aux besoins essentiels : Le programme vise à répondre aux besoins actuels des jeunes en matière de relations, de respect, de bien-être et de prévention des violences.

      • Structure autour de trois questions :

      Le programme s'articule autour de trois grandes questions :

      • Comment devenir soi ?
      • Comment rencontrer les autres ?
      • Comment trouver sa place dans la société ?

      4. Mise en Œuvre et Intervenants :

      • Distinction premier et second degré : Le terme "sexualité" n'apparaît explicitement dans le programme qu'à partir de la classe de quatrième.

      Au premier degré, on parle d'éducation à la vie affective et relationnelle.

      • Rôle des enseignants : Les enseignants sont les premiers acteurs de cette éducation, en lien avec les valeurs de l'école.

      • Rôle des intervenants extérieurs (associations agréées) : Des associations comme le Planning Familial, agréées par l'Éducation Nationale, interviennent depuis des décennies et sont un complément important.

      Elles sont formées et ont une expertise spécifique. ("le planning familial, c'est une association féministe d'éducation populaire qui va bientôt avoir 70 ans... on intervient aussi... dans le cadre scolaire ou ça fait des décennies maintenant. Et on a un agrément pour ça...” - 00:12:31-00:13:16).

      • Formation des intervenants : Les intervenants des associations agréées sont formés sur les questions de sexualité, de consentement, de lutte contre les discriminations, etc. Certains ont des formations spécifiques comme le conseil conjugal et familial (diplôme d'état).

      • Respect de la parole et neutralité : Les interventions doivent garantir un climat de confiance, une position neutre et bienveillante, respecter la parole de chacun et favoriser les échanges sans imposer de vues personnelles.

      • Information aux parents : Les parents doivent être informés des projets et des contenus abordés dans le cadre de l'éducation à la vie affective et relationnelle. Les établissements sont encouragés à présenter les projets aux parents lors de réunions.

      5. Réponses aux Questions et Préoccupations :

      • Droit des parents : L'éducation à la sexualité est obligatoire. Une absence ponctuelle pour des motifs philosophiques ou religieux peut être signalée, mais ne constitue pas un droit de retrait général.

      Il est important de dialoguer avec l'établissement. ("L'instruction est obligatoire, l'éducation à la sexualité est une de ses composantes.

      Donc, il n'y a pas de possibilité de se soustraire à cet enseignement." - 00:57:21-00:57:34).

      • Formation des enseignants : Un vaste programme de formation est mis en place au niveau national et académique pour accompagner la mise en œuvre du nouveau programme. Des ressources pédagogiques seront mises à disposition.

      • Notion de consentement et d'intimité : Le consentement est abordé dès le plus jeune âge comme le fait de demander l'autorisation avant de faire quelque chose à quelqu'un.

      L'intimité est la sphère privée de chacun, qu'il faut respecter dès l'enfance.

      • Impact sur la violence sexuelle : L'éducation à la sexualité vise à prévenir les violences en informant sur le consentement, le respect, l'égalité et les droits.

      Elle ne supprimera pas la violence à elle seule, mais contribue à une société plus informée et respectueuse.

      • Perception du programme : Il existe des perceptions diverses, avec des parents inquiets et d'autres qui y voient une nécessité pour la protection et l'information des enfants. Il est crucial de lire le programme et de dialoguer.

      • "Théorie du genre" : L'ancienne ministre de l'Éducation Nationale a affirmé que la théorie du genre n'existait pas et n'était pas présente dans les programmes.

      Conclusion :

      • Le webinaire met en lumière l'importance et l'évolution de l'éducation à la sexualité en France, désormais encadrée par un nouveau programme national.

      Les intervenants insistent sur la nécessité de cette éducation pour répondre aux enjeux sociétaux, lutter contre la désinformation et prévenir les violences.

      Ils soulignent le rôle complémentaire des enseignants et des associations agréées, ainsi que l'importance de la formation des professionnels et de l'information aux parents.

      Malgré certaines inquiétudes, le programme vise à instaurer un cadre clair et respectueux des valeurs de la République, en adaptant les contenus à l'âge des élèves et en favorisant le dialogue.

    2. Éducation à la sexualité : Guide d'étude

      Quiz : Questions courtes

      • Selon les intervenants, quand l'éducation à la sexualité est-elle devenue une action éducative en France ?
      • Quels éléments ont contribué à ce que la sensibilité à la question de l'éducation à la sexualité grandisse dans la société française ? Citez-en deux.
      • Quelle distinction est faite entre l'information sexuelle et l'éducation à la sexualité ?
      • Quels sont les trois objectifs principaux de la lettre de saisine concernant l'élaboration d'un programme d'éducation à la sexualité ?
      • Quand le nouveau programme d'éducation à la sexualité a-t-il été adopté par le Conseil supérieur des programmes et publié ?
      • Selon Sarah Durocher du Planning Familial, quel est l'objectif principal de leur association ?
      • Comment le Planning Familial intervient-il auprès des jeunes en matière d'éducation à la sexualité ?
      • Quelle distinction importante est soulignée concernant l'appellation du programme dans le premier et le second degré ?
      • Quels sont les trois axes qui structurent le nouveau programme d'éducation à la sexualité ?
      • Selon les intervenants, quel rôle essentiel jouent les associations agréées dans la mise en œuvre de l'éducation à la sexualité ?

      Clé de réponses du quiz

      • L'éducation à la sexualité est progressivement devenue une question éducative, notamment dans les années 70, 80 et 90, avec un élargissement des thèmes abordés au-delà des aspects biologiques. La loi de 2001 a ensuite inscrit l'éducation à la sexualité parmi les éducations obligatoires.
      • L'autonomie croissante de la jeunesse et l'évolution des modalités de socialisation (horizontalisation, importance du groupe de pairs et des médias) ont contribué à cette sensibilité accrue. L'évolution des droits, notamment des droits des femmes, et l'apparition de nouvelles questions sociétales ont également joué un rôle.
      • L'information sexuelle se concentre sur les aspects biologiques, la procréation et la physiologie. L'éducation à la sexualité élargit le champ en ajoutant des dimensions psychologiques (respect de soi, intimité), affectives et sociales (égalité, prévention des violences, droits des personnes).
      • Les trois objectifs principaux étaient une attention particulière au premier degré, une prise en compte des enjeux liés à l'égalité entre les filles et les garçons, et une inscription de l'éducation à la sexualité dans le cadre du développement des compétences psychosociales des élèves.
      • Le nouveau programme d'éducation à la sexualité a été adopté par le Conseil supérieur des programmes en janvier dernier (fin janvier) et publié le 3 février.
      • L'objectif principal du Planning Familial est de permettre aux personnes, y compris les jeunes, d'avoir accès à une information et à une éducation complète en matière de sexualité, de contraception et d'avortement, afin qu'elles puissent faire leurs propres choix et se protéger.
      • Le Planning Familial intervient dans le cadre scolaire depuis des décennies grâce à un agrément. Ils accueillent également des personnes dans leurs antennes et interviennent auprès des personnes éloignées du droit, en proposant des informations et un accompagnement sur les questions de sexualité et de droits.
      • Dans le premier degré, on parle d'éducation à la vie affective et relationnelle, tandis que dans le second degré, le programme s'intitule éducation à la vie affective et relationnelle et à la sexualité, cette dernière étant abordée dans ses dimensions juridique et sociale.

      • Les trois axes qui structurent le nouveau programme sont :

        • comment vivre ensemble dans le respect de l'égalité ?
        • Comment prendre soin de soi et des autres ?
        • Comment s'informer et développer son esprit critique ?
      • Les associations agréées apportent leur expertise de terrain, forment les professionnels de l'éducation et contribuent à la mise en œuvre des programmes en proposant des interventions adaptées et en luttant contre la désinformation.

      Pour approfondir

      • Analysez l'évolution de la perception et de l'intégration de l'éducation à la sexualité dans le système éducatif français, en vous appuyant sur les éléments historiques et sociétaux mentionnés dans l'extrait.
      • Discutez de l'importance de la distinction entre "information sexuelle" et "éducation à la sexualité" telle qu'elle est présentée, et expliquez comment cette distinction se traduit concrètement dans les objectifs et les contenus du programme.
      • Évaluez le rôle et l'apport des associations comme le Planning Familial dans la mise en œuvre de l'éducation à la sexualité en milieu scolaire, en tenant compte de leur expérience et de leur agrément.
      • Comment le nouveau programme d'éducation à la sexualité tente-t-il de répondre aux enjeux actuels de la société concernant les relations, le respect, l'égalité et la prévention des violences, notamment chez les jeunes ?
      • Réflexion sur les défis et les opportunités liés à la mise en œuvre d'un programme national d'éducation à la sexualité, en considérant les diverses réactions et les besoins de formation exprimés par les professionnels et les parents.

      Glossaire des termes clés

      • Éducation à la sexualité : Démarche éducative globale visant à informer, sensibiliser et développer les compétences des individus en matière de sexualité, incluant les aspects biologiques, psychologiques, affectifs, sociaux, juridiques et éthiques. Elle vise à promouvoir le bien-être, le respect, l'égalité et la responsabilité.
      • Information sexuelle : Transmission de connaissances factuelles concernant la biologie de la reproduction, le fonctionnement du corps et les risques sanitaires liés à la sexualité. Elle constitue une partie de l'éducation à la sexualité mais ne l'englobe pas entièrement.
      • Consentement : Accord libre, éclairé et sans équivoque d'une personne pour participer à une activité sexuelle. La notion de consentement est fondamentale dans la prévention des violences sexuelles et est abordée dès le plus jeune âge dans le cadre de l'éducation à la vie relationnelle.
      • Compétences psychosociales : Ensemble des aptitudes nécessaires pour adopter un comportement adapté et positif qui permette de faire face efficacement aux exigences et aux défis de la vie quotidienne. Elles incluent la conscience de soi, la gestion des émotions, l'empathie, la communication, la résolution de problèmes et la pensée critique, et sont liées à l'éducation à la sexualité.
      • Planning Familial : Association féministe d'éducation populaire qui œuvre pour le droit à la contraception, à l'avortement, à l'éducation à la sexualité et à l'égalité entre les femmes et les hommes. Elle propose des informations, des consultations et des actions de sensibilisation.
      • Développement affectif et relationnel : Processus par lequel les individus développent leur capacité à comprendre et à gérer leurs émotions, à établir et à maintenir des relations saines et respectueuses avec les autres. Cet aspect est central dans l'éducation à la sexualité, en particulier au premier degré.
      • Violence sexiste et sexuelle : Ensemble des actes de violence dirigés contre une personne en raison de son sexe ou de son orientation sexuelle. L'éducation à la sexualité joue un rôle dans la prévention de ces violences par la sensibilisation au consentement, au respect et à l'égalité.
      • Discrimination : Traitement inégalitaire et injuste basé sur des critères tels que le sexe, l'orientation sexuelle, l'identité de genre, etc. L'éducation à la sexualité vise à lutter contre les discriminations en promouvant la tolérance et l'acceptation de la diversité.
      • Cadre légal : Ensemble des lois et des réglementations qui encadrent une activité ou un domaine. En France, l'éducation à la sexualité est inscrite dans le code de l'éducation et fait l'objet de lois spécifiques (comme la loi de 2001).
      • Esprit critique : Capacité à analyser l'information de manière objective, à identifier les biais, à évaluer la fiabilité des sources et à se former un jugement éclairé. L'éducation à la sexualité vise à développer l'esprit critique des jeunes face aux informations et aux représentations de la sexualité.
    1. Arbitration

      Arbitrator was an Act in south Africa trying to help resolve different disputes outside of court. The advantages to be seen were, the ability to choose a suitable arbitrator or, if the arbitrator is appointed by an arbitration organization, the fact that generally suitable appointments are made; cost, when compared with litigation, although it is not always true that arbitration is less expensive that litigation; speed although it is not always true that they are always quicker; and the flexibility. Arbitration, unlike the history of the law, is not an account of growth and development of principles and doctrines that have come, through a long use, to have a general validity and force. Arbitration anticipates all the former legal systems, it has not developed any code of substantive principles, but is, with very few expectations, a matter of free decision, each case being viewed in t the light of practical advantages and decided in accord with the ethical or economic norm of some particular group.

      Source 1

      Source 2

    1. Reviewer #1 (Public review):

      Summary:

      Epiney et al. use single-nuclei RNA sequencing (snRNA-seq) to characterize the lineage of Type-2 (T2) neuroblasts (NBs) in the adult Drosophila brain. To isolate cells born from T2 NBs, the authors used a genetic tool that specifically allows the permanent labeling of T2-derived cell types, which are then FAC-sorted for snRNA-seq. This effective labeling approach also allows them to compare the isolated T2 lineage cells with T1-derived cell types by a simple exclusion method. The authors begin by describing a transcriptomic atlas for all T1 and T2-derived neuronal and glia clusters, reporting that the T2-derived lineage comprises 161 neuronal clusters, in contrast to the T1 lineage which comprises 114 of them. The authors then use the expression of VAChT, VGlut, Gad1, Tbh, Ple, SerT, and Tdc2 to show that T2 neuroblasts generate all major neuron classes of fast-acting neurotransmitters. Strikingly, they show that a subset of glia and neuronal clusters have disproportionate enrichment in males or females, suggesting that T2 neuroblasts generate sex-biased cell types. The authors then proceed to characterize neuropeptide expression across T2-derived neuronal clusters and argue that the same neuropeptide can be expressed across different cell types, while similar cell types can express distinct neuropeptides. The functional implication of both observations, however, remains to be tested. Furthermore, the authors describe combinatorial transcription factor (TF) codes that are correlated with neuropeptide expression for T2-derived neurons along with an overall TF code for all T2-derived cell types, both of which will serve as an important starting point for future investigations. Finally, the authors map well-studied neuronal types of the central complex to the clusters of their T2-derived snRNA-seq dataset. They use known marker combinations, bulk RNA-seq data and highly specific split-GAL4 driver lines to annotate their T2-derived atlas, establishing a comprehensive transcriptomic atlas that would guide future studies in this field.

      Strengths:

      This study provides an in-depth transcriptomic characterization of neurons and glia derived from Type-2 neuroblast lineages. The results of this manuscript offer several future directions to investigate the mechanisms of diversifying neuronal identity. The datasets of T1-derived and T2-derived cells will pave the way for studies focused on the functional analysis of combinatorial TF codes specifying cell identity, sex-based differences in neurogenesis and gliogenesis, the relationship between neuropeptide (co)expression and cell identity, and the differential contributions of distinct progenitor populations to the same cell type.

      Weaknesses:

      The study presents several important observations based on the characterization of Type II neuroblast-derived lineages. However, a mechanistic insight is missing for most observations. The idea that there is a sex-specific bias to certain T2-derived neurons and glial clusters is quite interesting, however, the functional significance of this observation is not tested or discussed extensively. Finally, the authors do not show whether the combinatorial TF code is indeed necessary for neuropeptide expression or if this is just a correlation due to cell identity being defined by TFs. Functional knockdown of some candidate TFs for a subset of neuropeptide-expressing cells would have been helpful in this case.

      Comments on revisions:

      The authors have addressed my recommendations.

    2. Author response:

      The following is the authors’ response to the original reviews

      Public Reviews:

      Reviewer #1 (Public review):

      Summary:

      Epiney et al. use single-nuclei RNA sequencing (snRNA-seq) to characterize the lineage of Type-2 (T2) neuroblasts (NBs) in the adult Drosophila brain. To isolate cells born from T2 NBs, the authors used a genetic tool that specifically allows the permanent labeling of T2-derived cell types, which are then FAC-sorted for snRNA-seq. This effective labeling approach also allows them to compare the isolated T2 lineage cells with T1-derived cell types by a simple exclusion method. The authors begin by describing a transcriptomic atlas for all T1 and T2-derived neuronal and glia clusters, reporting that the T2-derived lineage comprises 161 neuronal clusters, in contrast to the T1 lineage which comprises 114 of them. The authors then use the expression of VAChT, VGlut, Gad1, Tbh, Ple, SerT, and Tdc2 to show that T2 neuroblasts generate all major neuron classes of fast-acting neurotransmitters. Strikingly, they show that a subset of glia and neuronal clusters have disproportionate enrichment in males or females, suggesting that T2 neuroblasts generate sex-biased cell types. The authors then proceed to characterize neuropeptide expression across T2-derived neuronal clusters and argue that the same neuropeptide can be expressed across different cell types, while similar cell types can express distinct neuropeptides. The functional implication of both observations, however, remains to be tested. Furthermore, the authors describe combinatorial transcription factor (TF) codes that are correlated with neuropeptide expression for T2-derived neurons along with an overall TF code for all T2-derived cell types, both of which will serve as an important starting point for future investigations. Finally, the authors map well-studied neuronal types of the central complex to the clusters of their T2-derived snRNA-seq dataset. They use known marker combinations, bulk RNA-seq data and highly specific split-GAL4 driver lines to annotate their T2-derived atlas, establishing a comprehensive transcriptomic atlas that would guide future studies in this field.

      Thanks for the clear and accurate summary of our findings.

      Strengths:

      This study provides an in-depth transcriptomic characterization of neurons and glia derived from Type-2 neuroblast lineages. The results of this manuscript offer several future directions to investigate the mechanisms of diversifying neuronal identity. The datasets of T1-derived and T2-derived cells will pave the way for studies focused on the functional analysis of combinatorial TF codes specifying cell identity, sex-based differences in neurogenesis and gliogenesis, the relationship between neuropeptide (co)expression and cell identity, and the differential contributions of distinct progenitor populations to the same cell type.

      Thank you for the positive comments.

      Weaknesses:

      The study presents several important observations based on the characterization of Type II neuroblast-derived lineages. However, a mechanistic insight is missing for most observations. The idea that there is a sex-specific bias to certain T2-derived neurons and glial clusters is quite interesting, however, the functional significance of this observation is not tested or discussed extensively. Finally, the authors do not show whether the combinatorial TF code is indeed necessary for neuropeptide expression or if this is just a correlation due to cell identity being defined by TFs. Functional knockdown of some candidate TFs for a subset of neuropeptide-expressing cells would have been helpful in this case.

      We agree that we do not provide mechanistic or functional insights. Our goal was to produce hypothesis generating datasets for our lab and others to use to direct functional or mechanistic studies.

      Reviewer #2 (Public review):

      In this manuscript, Epiney et al., present a single-nucleus sequencing analysis of Drosophila adult central brain neurons and glia. By employing an ingenious permanent labeling technique, they trace the progeny of T2 neuroblasts, which play a key role in the formation of the central complex. This transcriptomic dataset is poised to become a valuable resource for future research on neurogenesis, neuron morphology, and behavior.

      Thank you for the positive comments.

      The authors further delve into this dataset with several analyses, including the characterization of neurotransmitter expression profiles in T2-derived neurons. While some of the bioinformatic analyses are preliminary, they would benefit from additional experimental validation in future studies.

      Thank you for the positive comments. We too hope that future research will benefit from this dataset.

      Reviewer #1 (Recommendations for the authors):

      Major points

      (1) In Figures 1E and 4A, the T1 and T2 glia subsets reveal sub-clusters for several cell types as seen by the distribution of points on the UMAP. This observation is never validated or discussed. Do these sub-clusters represent true differences in identities or are they artifacts of the single-nucleus preparation? For Figure 1E, it is not clear whether specific sub-clusters (see Ensheathing-4 vs Ensheathing-5 and Astrocyte-2 vs. Astrocyte-6) are differentially enriched between the T1 and T2 lineages. The existence of these sub-clusters must be discussed or dismissed.  

      We agree that this needs to be addressed more clearly in the manuscript and have made text changes in the Results and Discussion sections to clarify. We note that a recent glial cell atlas (Lago-Baldaia et al., 2023: PMID: 37862379) of the developing fly VNC and optic lobes found sub-clusters that mapped to the same subtype annotations. Interestingly, Lago-Baldaia and colleagues found that the transcriptional diversity of glia cell types did not match the morphological diversity of glia validated in vivo. See text changes below.

      Lines 131-133: “Similar to a previous glial cell atlas (Lago-Baldaia et al., 2023) we found some glial subtypes (astrocytes, ensheathing, and subperineurial) mapped to multiple clusters (Figure 1E, 1F).”

      Lines 206-208: “In line with our T1+T2 atlas and previous glia cell atlas (Lago-Baldaia et al., 2023), some subtypes mapped to several subclusters including ensheathing, astrocytes, and chiasm (Figure 4A-B).”

      Lines 397-401: “Similar to a recent glial cell atlas (Lago-Baldaia et al., 2023), we found glial subtypes like astrocytes, ensheathing, and subperineurial glia mapped to several sub-clusters (Figure 1E-F). It remains unclear if these sub-clusters with the same cell type annotation represent distinct glial identities or different transcriptional states within these populations.”

      (2) The authors present evidence for sex-specific neuronal and glia subtypes and find differential expression of specific yolk proteins and long non-coding RNAs. However, whether any of these differences are driven by other canonical sex-specific genes such as Fruitless (Fru) or Double-sex (Dbx) has not been reported or discussed. The authors must re-analyze their data for these genes and claim whether they have any contribution to sex-specific sub-clusters.

      Thank you for pointing this out. We have made text changes and clarifications to highlight the expression of other canonical sex-specific genes. Fru was enriched in male nuclei as expected. Interestingly, dbx was enriched in female nuclei. It remains to be determined if these genes are mechanisms that may be driving sex-specific changes.

      Lines 224-226: “Additionally, female nuclei were enriched for dbx (Supp Table 8). Male glial nuclei expressed higher levels of genes including the male-specific genes lncRNA:rox1/2 and fru (Figure 5C; Supp Table 8) (Ryner et al., 1996; Amrein and Axel, 1997; Meller et al., 1997).”

      Lines 237-239: “Male nuclei expressed higher levels of genes including the male-specific genes lncRNA:rox1/2 and fru (Figure 5G; Supp Table 9) (Ryner et al., 1996; Amrein and Axel, 1997; Meller et al., 1997).”

      Lines 428-431:” We found the expected differential expression of yolk proteins (yp1, yp2, yp3) enriched in female nuclei and the long non-coding RNAs rox1/2 and fru enriched in male neuronal nuclei (Ryner et al., 1996; Amrein and Axel, 1997; Meller et al., 1997; Warren et al., 1979). Interestingly, we found dbx to be enriched in both glial and neuronal female nuclei.”

      Lines 433-435: “It remains to be determined if these genes are driving these sex-specific differences in glia and neurons.”

      (3) In Figure 6C, it is unclear whether the Ms-2A-LexA-expressing neurons of clusters 157 and 160 project to two different neuropils or share projects to both neuropils. However, it is not explicitly shown in the immunostaining data whether indeed there are two populations to begin with. The authors must check for cluster 157 and 160 specific markers (such as Dh44 and ple) and test whether they appear mutually exclusively in the Ms-2A-LexA-expressing neurons. The same reasoning would apply to the data shown in Figures 6D and 6E, where the authors must test whether the NPF and AstA expressing cells are indeed neurons from clusters 100 and 128, using orthogonal cluster markers to conclude that they are similar (or the same) neurons.

      We changed the focus of the paragraph to confirm that these neurons indeed come from type II and that they target the central complex. Although due to the lack of reagents we cannot test the identity of each one of these neurons, we could make meaningful interpretations of the staining to validate our ideas about neuropeptidergic cells in the central complex. We made sure to mention the limitation of our experiment to avoid any wrong conclusions.

      Minor points

      (1) Line 115 - "cluster that represents optic lobe neurons". How was this cluster identified?

      We reexamined the most significant genes enriched in this cluster 124, and found they are Rh2, ninaC, trpl, and phototransduction related genes (Supplemental table 1). We reassigned the identity of this cluster as ocelli, which also express photoreceptor genes but can’t be easily removed during dissection. We modified the text as follows:

      "We used known markers (Croset et al., 2018; Davie et al., 2018; Supp Table 2) to identify distinct cell types in the central brain, including glia, mushroom body neurons, olfactory projection neurons, clock neurons, Poxn+ neurons, serotonergic neurons, dopaminergic neurons, octopaminergic neurons, corazonergic neurons, hemocytes, and ocelli (Figure 1B, Supp. Table 1)."

      (2) As the separation in Figure 1B is not obvious, annotated cell type clusters must be re-colored instead of being labelled as the exact dots are indistinguishable. This would especially be helpful for OCTY, SER, OPN, and CLK clusters.

      (3) Cluster labels in Figure 1C are barely visible and the font size must be increased for the reader. Recoloring the cluster identities and attaching a legend would again help in this case.

      We recolored the atlas in Figure 1B, 1C and 1C’ and increased the font size in Figure 1C’.

      (4) For Figure 4A, clusters should be labelled on the UMAP along with the legend as it is difficult for the reader to match identities using Seurat colors. The same is true for the UMAPs in Figure 5A.

      Yes, we agree that labeling would improve readability and have done so for UMAPs in Figure 4A and 5A-A’’.

      Reviewer #2 (Recommendations for the authors):

      In this manuscript, Epiney et al., present a single-nucleus sequencing analysis of adult central brain neurons and glia Through the use of a ingenious permanent labeling technique, they are able to trace the progeny of T2 neuroblasts, which contribute significantly to the formation of the central complex. This transcriptomic dataset is the first of its kind and will likely serve as a valuable resource for future studies.

      The authors further explore this dataset through several analyses, including the characterization of neurotransmitter expression profiles in T2-derived neurons. However, the approach used to identify the identity of each neuron cluster could be more clearly articulated, and some of the authors' conclusions are more generalized - either already well-established or lacking sufficient support.

      Detailed comments:

      Abstract - "Our data support the hypothesis that each transcriptional cluster represents one or a few closely related neuron subtypes. - Is this a novel finding? If so, it would be helpful if the authors could explain why this is the case more clearly.

      Our results are not generally novel, and many single cell/single nuclei RNA-seq papers have been published (more citations added to Introduction). Our work is novel in that we analyze Type 1 and Type 2 neuroblasts in the central brain.

      Line 53 - In the introduction the authors should also reference other single-cell studies done in the Drosophila brain.

      Done.

      Line 59 - There are some typos here. The authors could also mention type zero.

      Both done.

      Figure 1 and Sup Table 1 - Authors show in sup table 1 the top cell markers by cluster but there is no correspondence between cluster number and identity. The authors do not say which known markers were used to give the identity to each cluster.

      We have added the cell identity in the Supplemental Table 1. For the unknown cells, we left the column blank. We have also added a Supplemental Table 2 to show the markers we used to give identity to the clusters.

      Supplementary Tables - For each table, more detailed information should be provided regarding what is being compared and the methods used for these comparisons.

      We have added the methods we used in Seurat to generate each individual table.

      Line 138 - Differential gene expression analysis between T1 and T2 glial progeny did not show differences across any glial cell types (Supp Table 4). - Was this comparison done per cluster? Is differential gene expression of top markers, which are anyway the genes that define each glial cell type, enough for this type of analysis?

      Yes, we performed the differential expression analysis using all genes (i.e., not just marker defining) at a cluster-by-cluster resolution with results in Supplemental Table 4. We have edited the text to make this clarification.

      Lines 139-141: “Differential gene expression analysis for all genes between T1 and T2 glial progeny did not show differences across any glial cell types or clusters (Supp Table 4).”

      Line 146 - We identified T1-derived neurons by excluding cells co-expressing T2-specific. Markers FLP+/GFP+/RFP+ plus repo+ glial clusters. - Bioinformatically, correct?

      Yes. We clarified the sentence as follows:

      "We identified T1-derived neurons by bioinformatically excluding cells co-expressing T2-specific markers FLP+/GFP+/RFP+ plus repo+ glial clusters."

      Line 156 - We found that each cluster strongly expressed a unique combination of genes. - As they are grouped by seurat in different clusters, why is this surprising?

      Line 175 - "top 10 significantly enriched genes gathered from each T2 neuron cluster" - can these lists be included?

      Yes they are grouped by Seurat. We toned down the sentence and refer each combination of genes as cluster markers. We modified the sentences as follows:

      Each unique combination of enriched genes could be referred to as cluster markers.

      Line 211- How did the authors identify sex-biased clusters? How did the authors separate the samples/cells by sex? Was it done bioinformatically by the expression of certain genes? If so, which?

      We collected male and female nuclei separately. We have added text in the methods section as follows:

      "Equal amounts of male and female central brains (excluding optic lobes) were dissected at room temperature within 1 hour. The samples were flash-frozen in liquid nitrogen and stored separately at -80°.

      In the first round, we pooled male and female brains together to select GFP+ nuclei and used particle-templated instant partitions to capture single nuclei to generate cDNA library (Fluent BioSciences, Waterton, MA). In the second and third rounds, RFP+ nuclei from male and female brains were collected separately. The split-pool method was then used to generate barcoded cDNA libraries from each individual nucleus."

      Are there sex-specific differences in genes in glia other than genes that were previously known to be sex-specific?

      We report the comprehensive list of sex-specific differences in gene expression for both glia and neurons in Supp tables 8 and 9.

      Line 237 - When the authors mention "We conclude that male and female adult T2 neurons have sex-specific differences in gene expression within the same neuronal subtype" does this mean that these neurons are the same in male and in female brains, but they additionally specifically express sex-specific genes?

      Yes, we report that male and females contain the same neurons defined by their transcriptional profile. It remains to be seen if this sex-specific differences changes how these same neuronal subtypes function between male and females. We have added additional text in the discussion to expand on this thought.

      Lines 437-441: “It remains to be determined if these genes are driving sex-specific differences within glial and neuronal subtypes. These genes may reflect sex-specific differences in the adult central brain and may provide insight into how behavioral circuits are linked to sex-specific behaviors. Future work should aim to characterize and test these genes.”

      Line 250 - The idea behind these sections "What is the relationship between neuropeptide expression and cluster identity?" "relation between cluster and morphology" lacks clarity. As clusters are defined based on principal component analysis, and the genes used to define a cluster are dependent on this method, there is no assumption that each cluster represents only one type of neuron or that it should include only neurons expressing the same neurotransmitter genes. Even if some clusters consist of a single neuron type, this should not be generalized to all clusters (and vice-versa).

      Correct, we cannot determine from the transcriptome data whether distinct clusters will have different morphology. We have changed the focus of the question to address that we are confirming they come from type 2 and that they target the central complex while comparing to known cells that express the neuropeptide.

      Line 265 - We first assayed the neuronal morphology of Ms+ neurons - why did the authors choose these neurons?

      Resolved in main text: “we found that type II-derived Ms-2A-LexA-expressing neurons project to multiple layers of the dorsal fan-shaped body and the entire ellipsoid body, suggesting an unknown class of Ms+ neurons targeting to EB and/orFB".

      Line 268 - "Currently we can't determine whether Ms+ neurons in clusters 157 and 160 project to different CX neuropils, or whether neurons from both clusters share projections into both neuropils. " - The purpose of this point is unclear.

      Resolved in text: “we found that type II-derived Ms-2A-LexA-expressing neurons project to multiple layers of the dorsal fan-shaped body and the entire ellipsoid body, suggesting an unknown class of Ms+ neurons targeting to EB and/or FB”.

      Line 279 - This analysis could be more explored.

      Thank you for your feedback. As the comment was somewhat broad, we were unsure of the specific revisions needed and have therefore left the text unchanged.

      Line 301 - The text regarding this section, and the description and details of respective figures should be proofread to ensure clarity.

      Done.

      Line 386 - Alternatively, co-expression may be due to background from RNAs released during dissociation. - RNA in soup could be bioinformatically analysed.

      Correct. We opted to delete this sentence since our split-pool based method does not create background RNA expression. Additionally, the analysis is performed on scaled expression >2, and any background RNA is unlikely to yield such high expression.

      Discussion - Some of the conclusions are a bit too general, suggesting that the results might be meaningful, but also acknowledging the possibility of artifacts. If the authors could refine this, it would strengthen the manuscript.

      We are sorry but we are uncertain what you are asking; we don't know what you want us to refine. Our apologies for the misunderstanding.

    1. Author response:

      The following is the authors’ response to the original reviews

      eLife Assessment

      The authors present an algorithm and workflow for the inference of developmental trajectories from single-cell data, including a mathematical approach to increase computational efficiency. While such efforts are in principle useful, the absence of benchmarking against synthetic data and a wide range of different single-cell data sets make this study incomplete. Based on what is presented, one can neither ultimately judge if this will be an advance over previous work nor whether the approach will be of general applicability.

      We thank the eLife editor for the valuable feedback. Both benchmarking against other methods and validation on a synthetic dataset (“dyntoy”) are indeed presented in the Supplementary Note, although this was not sufficiently highlighted in the main text, which has now been improved.

      Our manuscript contains benchmarking against a challenging synthetic dataset in Figure 1; furthermore, both the synthetic dataset and the real-world thymus dataset have been analyzed in parallel using currently available TI tools (as detailed in the Supplementary Note). z other single-cell datasets (single-cell RNA-seq) were added in response to the reviewers' comments.

      One of the reviewers correctly points out that tviblindi goes against the philosophy of automated trajectory inference. This is correct; we believe that a new class of methods, complementary to fully automated approaches, is needed to explore datasets with unknown biology. tviblindi is meant to be a representative of this class of methods—a semi-automated framework that builds on features inferred from the data in an unbiased and mathematically well-founded fashion (pseudotime, homology classes, suitable low-dimensional representation), which can be used in concert with expert knowledge to generate hypotheses about the underlying dynamics at an appropriate level of detail for the particular trajectory or biological process.

      We would also like to mention that the algorithm and the workflow are not the sole results of the paper. We have thoroughly characterized human thymocyte development, where, in addition to expected biological endpoints, we found and characterized an unexpected activated thymic T-reg endpoint.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The authors present tviblindi, a computational workflow for trajectory inference from molecular data at single-cell resolution. The method is based on (i) pseudo-time inference via expecting hitting time, (ii) sampling of random walks in a directed acyclic k-NN where edges are oriented away from a cell of origin w.r.t. the involved nodes' expected hitting times, and (iii) clustering of the random walks via persistent homology. An extended use case on mass cytometry data shows that tviblindi can be used elucidate the biology of T cell development.

      Strengths:

      - Overall, the paper is very well written and most (but not all, see below) steps of the tviblindi algorithm are explained well.

      - The T cell biology use case is convincing (at least to me: I'm not an immunologist, only a bioinformatician with a strong interest in immunology).

      We thank the reviewer for feedback and suggestions that we will accommodate, we respond point-by-point below

      Weaknesses:

      - The main weakness of the paper is that a systematic comparison of tviblindi against other tools for trajectory inference (there are many) is entirely missing. Even though I really like the algorithmic approach underlying tviblindi, I would therefore not recommend to our wet-lab collaborators that they should use tviblindi to analyze their data. The only validation in the manuscript is the T cell development use case. Although this use case is convincing, it does not suffice for showing that the algorithms's results are systematically trustworthy and more meaningful (at least in some dimension) than trajectories inferred with one of the many existing methods.

      We have compared tviblindi to several trajectory inference methods (Supplementary note section 8.2: Comparison to state-of-the-art methods, namely Monocle3 (v1.3.1) Cao et al. (2019), Stream (v1.1) Chen et al. (2019), Palantir (v1.0.0) Setty et al. (2019), VIA (v0.1.89) Stassen et al. (2021), StaVia (Via 2.0) Stassen et al. (2024), CellRank 2 (v2.06) Weiler et al. (2024)  and PAGA (scanpy==1.9.3) Wolf et al. (2019). We added thorough and systematic comparisons to the other algorithms mentioned by reviewers. We included extended evaluation on publicly available datasets (Supplementary Note section 10).

      Also, in the meantime we have successfully used tviblindi to investigate human B-cell development in primary immunodeficiency (Bakardjieva M, et al. Tviblindi algorithm identifies branching developmental trajectories of human B-cell development and describes abnormalities in RAG-1 and WAS patients. Eur J Immunol. 2024 Dec;54(12):e2451004. doi: 10.1002/eji.202451004.).

      - The authors' explanation of the random walk clustering via persistent homology in the Results (subsection "Real-time topological interactive clustering") is not detailed enough, essentially only concept dropping. What does "sparse regions" mean here and what does it mean that "persistent homology" is used? The authors should try to better describe this step such that the reader has a chance to get an intuition how the random walk clustering actually works. This is especially important because the selection of sparse regions is done interactively. Therefore, it's crucial that the users understand how this selection affects the results. For this, the authors must manage to provide a better intuition of the maths behind clustering of random walks via persistent homology.

      In order to satisfy both reader types: the biologist and the mathematician, we explain the mathematics in detail in the Supplementary Note, section 4. We improved the Results text to better point the reader to the mathematical foundations in the Supplementary Note.  

      - To motivate their work, the authors write in the introduction that "TI methods often use multiple steps of dimensionality reduction and/or clustering, inadvertently introducing bias. The choice of hyperparameters also fixes the a priori resolution in a way that is difficult to predict." They claim that tviblindi is better than the original methods because "analysis is performed in the original high-dimensional space, avoiding artifacts of dimensionality reduction." However, in the manuscript, tviblindi is tested only on mass cytometry data which has a much lower dimensionality than scRNA-seq data for which most existing trajectory inference methods are designed. Since tviblindi works on a k-NN graph representation of the input data, it is unclear if it could be run on scRNA-seq data without prior dimensionality reduction. For this, cell-cell distances would have to be computed in the original high-dimensional space, which is problematic due to the very high dimensionality of scRNA-seq data. Of course, the authors could explicitly reduce the scope of tviblindi to data of lower dimensionality, but this would have to be stated explicitly.

      In the manuscript we tested the framework on the scRNA-seq data from Park et al 2020 (DOI: 10.1126/science.aay3224). To illustrate that tviblindi can work directly in the high-dimensional space, we applied the framework successfully on imputed 2000 dimensional data. Furthermore we successfully used tviblindi to investigate bone marrow atlas scRNA-Seq dataset Zhang et al. (2024) and atlas of mouse gastrulation Pijuan-Sala et al. (2019). The idea behind tviblindi is to be able to work without the necessity to use non-linear dimensionality reduction techniques, which reduce the dimensionality to a very low number of dimensions and whose effects on the data distribution are difficult to predict. On the other hand the use of (linear) dimensionality reduction techniques which effectively suppress noise in the data such as PCA is a good practice (see also response to reviewer 2). We have emphasized this in the revised version and added the results of the corresponding analysis (see Supplementary note, section 9).

      - Also tviblindi has at least one hyper-parameter, the number k used to construct the k-NN graphs (there are probably more hidden in the algorithm's subroutines). I did not find a systematic evaluation of the effect of this hyper-parameter.

      Detailed discussion of the topic is presented in the Supplementary Note, section 8.1, where Spearman correlation coefficient between pseudotime estimated using k=10 and k=50 nearest neighbors was 0.997.   The number k however does affect the number of candidate endpoints. But even when larger k causes spurious connection between unrelated cell fates, the topological clustering of random walks allows for the separation of different trajectories. We have expanded the “sensitivity to hyperparameters” section 8.1 also in response to reviewer 2.

      Reviewer #2 (Public Review):

      Summary:

      In Deconstructing Complexity: A Computational Topology Approach to Trajectory Inference in the Human Thymus with tviblindi, Stuchly et al. propose a new trajectory inference algorithm called tviblindi and a visualization algorithm called vaevictis for single-cell data. The paper utilizes novel and exciting ideas from computational topology coupled with random walk simulations to align single cells onto a continuum. The authors validate the utility of their approach largely using simulated data and establish known protein expression dynamics along CD4/CD8 T cell development in thymus using mass cytometry data. The authors also apply their method to track Treg development in single-cell RNA-sequencing data of human thymus.

      The technical crux of the method is as follows: The authors provide an interactive tool to align single cells along a continuum axis. The method uses expected hitting time (given a user input start cell) to obtain a pseudotime alignment of cells. The pseudotime gives an orientation/direction for each cell, which is then used to simulate random walks. The random walks are then arranged/clustered based on the sparse region in the data they navigate using persistent homology.

      We thank the reviewer for feedback and suggestions that we have accommodated, we responded point-by-point below.

      Strengths:

      The notion of using persistent homology to group random walks to identify trajectories in the data is novel.

      The strength of the method lies in the implementation details that make computationally demanding ideas such as persistent homology more tractable for large scale single-cell data. This enables the authors to make the method more user friendly and interactive allowing real-time user query with the data.

      Weaknesses:

      The interactive nature of the tool is also a weakness, by allowing for user bias leading to possible overfitting for a specific data.

      tviblindi is not designed as a fully automated TI tool (although it implements a fully automated module), but as a data driven framework for exploratory analysis of unknown data. There is always a risk of possible bias in this type of analysis - starting with experimental design, choice of hyperparameters in the downstream analysis, and an expert interpretation of the results. The successful analysis of new biological data involves a great deal of expert knowledge which is difficult to a priori include in the computational models. 

      tvilblindi tries to solve this challenge by intentionally overfitting the data and keeping the level of resolution on a single random walk. In this way we aim to capture all putative local relationships in the data. The on-demand aggregation of the walks using the global topology of the data allows researchers to use their expert knowledge to choose the right level of detail (as demonstrated in the Figure 4 of the manuscript) while relying on the topological structure of the high dimensional point cloud. At all times tviblindi allows to inspect the composition of the trajectory to assess the variance in the development, possible hubs on the KNN-graph etc.

      The main weakness of the method is lack of benchmarking the method on real data and comparison to other methods. Trajectory inference is a very crowded field with many highly successful and widely used algorithms, the two most relevant ones (closest to this manuscript) are not only not benchmarked against, but also not sited. Including those that specifically use persistent homology to discover trajectories (Rizvi et.al. published Nat Biotech 2017). Including those that specifically implement the idea of simulating random walks to identify stable states in single-cell data (e.g. CellRank published in Lange et.al Nat Meth 2022), as well as many trajectory algorithms that take alternative approaches. The paper has much less benchmarking, demonstration on real data and comparison to the very many other previous trajectory algorithms published before it. Generally speaking, in a crowded field of previously published trajectory methods, I do not think this one approach will compete well against prior work (especially due to its inability to handle the noise typical in real world data (as was even demonstrated in the little bit of application to real world data provided).

      We provided comparisons of tviblindi and vaevictis in the Supplementary Note, section 8.2, where we compare it to Monocle3 (v1.3.1) Cao et al. (2019), Stream (v1.1) Chen et al. (2019), Palantir (v1.0.0) Setty et al. (2019), VIA (v0.1.89) Stassen et al. (2021),  StaVia (Via 2.0) Stassen et al. (2024), CellRank 2 (v2.06) Weiler et al. (2024)  and PAGA (scanpy==1.9.3) Wolf et al. (2019). We added thorough and systematic comparisons to the other algorithms mentioned by reviewers. We included extended evaluation on publicly available datasets (Supplementary Note section 10).

      Beyond general lack of benchmarking there are two issues that give me particular concern. As previously mentioned, the algorithm is highly susceptible to user bias and overfitting. The paper gives the example (Figure 4) of a trajectory which mistakenly shows that cells may pass from an apoptotic phase to a different developmental stage. To circumvent this mistake, the authors propose the interactive version of tviblindi that allows users to zoom in (increase resolution) and identify that there are in fact two trajectories in one. In this case, the authors show how the author can fix a mistake when the answer is known. However, the point of trajectory inference is to discover the unknown. With so much interactive options for the user to guide the result, the method is more user/bias driven than data-driven. So a rigorous and quantitative discussion of robustness of the method, as well as how to ensure data-driven inference and avoid over-fitting would be useful.

      Local directionality in expression data is a challenge which is not, to our knowledge, solved. And we are not sure it can be solved entirely, even theoretically. The random walks passing “through” the apoptotic phase are biologically infeasible, but it is an (unbiased) representation of what the data look like based on the diffusion model. It is a property of the data (or of the panel design), which has to be interpreted properly rather than a mistake. Of note, except for Monocle3 (which does not provide the directionality) other tested methods did not discover this trajectory at all.

      The “zoom in” has in fact nothing to do with “passing through the apoptosis”. We show how the researcher can investigate the suggested trajectory to see if there is an additional structure of interest and/or relevance. This investigation is still data driven (although not fully automated). Anecdotally in this particular case this branching was discovered by a bioinformatician, who knew nothing about the presence of beta-selection in the data.  

      We show that the trajectory of apoptosis of cortical thymocytes consists of 2 trajectories corresponding to 2 different checkpoints (beta-selection and positive/negative selection). This type of a structure, where 2 (or more) trajectories share the same path for most of the time, then diverge only to be connected at a later moment (immediately from the point of view of the beta-selection failure trajectory) is a challenge for TI algorithms and none of tested methods gave a correct result. More importantly there seems to be no clear way to focus on these kinds of structures (common origin and common fate) in TI methods.

      Of note, the “zoom in” is a recommended and convenient method to look for an inner structure, but it does not necessarily mean addition of further homological classes. Indeed, in this case the reason that the structure is not visible directly is the limitation of the dendrogram complexity (only branches containing at least 10% of simulated random walks are shown by default). In summary, tviblindi effectively handled all noise in the data that obscured biologically valid trajectories for other methods. We have improved the discussion of the robustness in the current version.  

      Second, the paper discusses the benefit of tviblindi operating in the original high dimensions of the data. This is perhaps adequate for mass cytometry data where there is less of an issue of dropouts and the proteins may be chosen to be large independent. But in the context of single-cell RNA-sequencing data, the massive undersampling of mRNA, as well as high degree of noise (e.g. ambient RNA), introduces very large degree of noise so that modeling data in the original high dimensions leads to methods being fit to the noise. Therefore ALL other methods for trajectory inference work in a lower dimension, for very good reason, otherwise one is learning noise rather than signal. It would be great to have a discussion on the feasibility of the method as is for such noisy data and provide users with guidance. We note that the example scRNA-seq data included in the paper is denoised using imputation, which will likely result in the trajectory inference being oversmoothed as well.

      We agree with the reviewer. In our manuscript we wanted to showcase that tviblindi can directly operate in high-dimensional space (thousands of dimensions) and we used MAGIC imputation for this purpose. This was not ideal. More standard approach, which uses 30-50 PCs as input to the algorithm resulted in equivalent trajectories. We have added this analysis to the study (Supplementary note, section 9).

      In summary, the fact that tviblindi scales well with dimensionality of the data and is able to work in the original space does not mean that it is always the best option. We have added a corresponding comment into the Supplementary note.  

      Reviewer #3 (Public Review):

      Summary:

      Stuchly et al. proposed a single-cell trajectory inference tool, tviblindi, which was built on a sequential implementation of the k-nearest neighbor graph, random walk, persistent homology and clustering, and interactive visualization. The paper was organized around the detailed illustration of the usage and interpretation of results through the human thymus system.

      Strengths:

      Overall, I found the paper and method to be practical and needed in the field. Especially the in-depth, step-by-step demonstration of the application of tviblindi in numerous T cell development trajectories and how to interpret and validate the findings can be a template for many basic science and disease-related studies. The videos are also very helpful in showcasing how the tool works.

      Weaknesses:

      I only have a few minor suggestions that hopefully can make the paper easier to follow and the advantage of the method to be more convincing.

      (1) The "Computational method for the TI and interrogation - tviblindi" subsection under the Results is a little hard to follow without having a thorough understanding of the tviblindi algorithm procedures. I would suggest that the authors discuss the uniqueness and advantages of the tool after the detailed introduction of the method (moving it after the "Connectome - a fully automated pipeline".

      We thank the reviewer for the suggestion and we have accommodated it to improve readability of the text.

      Also, considering it is a computational tool paper, inevitably, readers are curious about how it functions compared to other popular trajectory inference approaches. I did not find any formal discussion until almost the end of the supplementary note (even that is not cited anywhere in the main text). Authors may consider improving the summary of the advantages of tviblindi by incorporating concrete quantitative comparisons with other trajectory tools.

      We provided comparisons of tviblindi and vaevictis in the Supplementary Note, section 8.2, where we compare it to Monocle3 (v1.3.1) Cao et al. (2019), Stream (v1.1) Chen et al. (2019), Palantir (v1.0.0) Setty et al. (2019), VIA (v0.1.89) Stassen et al. (2021),  StaVia (Via 2.0) Stassen et al. (2024), CellRank 2 (v2.06) Weiler et al. (2024)  and PAGA (scanpy==1.9.3) Wolf et al. (2019). We added thorough and systematic comparisons to the other algorithms mentioned by reviewers. We included extended evaluation on publicly available datasets (Supplementary Note section 10).

      (2) Regarding the discussion in Figure 4 the trajectory goes through the apoptotic stage and reconnects back to the canonical trajectory with counterintuitive directionality, it can be a checkpoint as authors interpret using their expert knowledge, or maybe a false discovery of the tool. Maybe authors can consider running other algorithms on those cells and see which tracks they identify and if the directionality matches with the tviblindi.

      We have indeed used the thymus dataset for comparison of all TI algorithms listed above. Except for Monocle 3 they failed to discover the negative selection branch (Monocle 3 does not offer directionality information). Therefore, a valid topological trajectory with incorrect (expert-corrected) directionality was partly or entirely missed by other algorithms. 

      (3) The paper mainly focused on mass cytometry data and had a brief discussion on scRNA-seq. Can the tool be applied to multimodality data such as CITE-seq data that have both protein markers and gene expression? Any suggestions if users want to adapt to scATAC-seq or other epigenomic data?

      The analysis of multimodal data is the logical next step and is the topic of our current research. At this moment tviblindi cannot be applied directly to multimodal data. It is possible to use the KNN-graph based on multimodal data (such as weighted nearest neighbor graph implemented in Seurat) for pseudotime calculation and random walk simulation. However, we do not have a fully developed triangulation for the multimodal case yet. 

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      Suggestions for improved or additional experiments, data or analyses:

      -  Benchmark against existing trajectory inference methods.

      -  Benchmark on scRNA-seq data or an explicit statement that, unlike existing methods, tviblindi is not designed for such data.

      We provided comparisons of tviblindi and vaevictis in the Supplementary Note, section 8.2, where we compare it to Monocle3 (v1.3.1) Cao et al. (2019), Stream (v1.1) Chen et al. (2019), Palantir (v1.0.0) Setty et al. (2019), VIA (v0.1.89) Stassen et al. (2021),  StaVia (Via 2.0) Stassen et al. (2024), CellRank 2 (v2.06) Weiler et al. (2024)  and PAGA (scanpy==1.9.3) Wolf et al. (2019). We added thorough and systematic comparisons to the other algorithms mentioned by reviewers. We included extended evaluation on publicly available datasets (Supplementary Note section 10).

      -  Systematic evaluation of the effetcs of hyper-parameters on the performance of tviblindi (as mentioned above, there is at least one hyper-parameter, the number k to construct the k-NN graphs).

      This is described in Supplementary Note section 8.1

      Recommendations for improving the writing and presentation:

      -  The GitHub link to the algorithm which is currently hidden in the Methods should be moved to the abstract and/or a dedicated section on code availability.

      -  The presentation of the persistent homology approach used for random walk clustering should be improved (see public comment above).

      This is described extensively in Supplementary Note  

      -  A very minor point (can be ignored by the authors): consider renaming the algorithm. At least for me, it's extremely difficult to remember.

      We choose to keep the original name

      Minor corrections to the text and figures:

      -  Labels and legend texts are too small in almost all figures.

      Reviewer #2 (Recommendations For The Authors):  

      (1) On page 3: "(2) Analysis is performed in the original high-dimensional space avoiding artifacts of dimensionality reduction." In mass cytometry data where there is no issue of dropouts, one may choose proteins such that they are not correlated with each other making dimensionality reduction techniques less relevant. But in the context of an unbiased assays such as single-cell RNA-sequencing (scRNA-seq), one measures all the genes in a cell so dimensionality reduction can help resolve the redundancy in the feature space due to correlated/co-regulated gene expression patterns. This assumption forms the basis of most methods in scRNA-seq. More importantly, in scRNA-seq data the dropouts and ambient molecules in mRNA counts result in so much noise that modeling cells in the full gene expression is highly problematic. So the authors are requested to discuss in detail how they would propose to deal with noise in scRNA-seq data.

      On this note, the authors mention in Supplementary Note 9 (Analysis of human thymus single-cell RNA-seq data): "Imputed data are used as the input for the trajectory inference, scaled counts (no imputation) are shown in line plots". The line plots indicate the gene expression trends along the obtained pseudotime. The authors use MAGIC to impute the data, and we request the authors to mention this in the Methods section (currently one must look through the code on Supplementary Note 1.3 to find this). Data imputation in single-cell RNA-seq data are intended to enable quantification of individual gene expression distribution or pairwise gene associations. But when all the genes in an imputed data are used for visualization, clustering or trajectory inference, the averaging effect will compound and result in severely smoothed data that misses important differences between cell states. Especially, in the case of MAGIC, which uses a transition matrix raised to a power, it is over-smoothing of the data to use a transition matrix smoothed data to obtain another transition matrix to calculate the hitting time (or simulate random walks). Second, the authors' proposal to use scaled counts to study gene trends cannot be generalized to other settings due to drop out issue. Given the few genes (and only one branch) that are highlighted in Figure 7D-G and Figure 31 in Supplementary Note, it is hard to say if scaling raw values would pick up meaningful biology robustly here for other branches.

      We recommend that this data be reanalyzed with non-imputed data used for trajectory inference and imputed gene expression used for line plots.

      As stated above in the public review, we reanalyzed the scRNA Seq data using a more standard approach (first 50 principal components). We have also analyzed two additional scRNA Seq datasets (Section 1 and section 10 of Supplementary Note)

      On the same note, the authors use Seurat's CellCycleScoring to obtain the cell cycle phase of each cell and later use ScaleData to regress them out. While we agree that it is valuable to remove cell cycle effect from the data for trajectory inference (and has been used previously in other methods), the regression approach employed in Seurat's ScaleData is not appropriate. It is an aggressive approach that severely changes expression pattern of many genes and can result in new artifacts (false positives) in the data. We recommend the authors to explore this more and consider using a more principled alternatives such as fscLVM (https://genomebiology.biomedcentral.com/articles/10.1186/s13059-017-1334-8). 

      Cell cycle correction is an open problem (Heumos, Nat Rev Genetics, 2023)

      Here we use an (arguably aggressive) approach to make the presentation more straightforward. The cells we are interested here (end #6) are not dividing and the regression does not change the conclusion drawn in the paper

      (2) The figures provided are extremely low in resolution that it is practically impossible to correctly interpret a lot of the conclusion and references made in the figure (especially Figure 3 in the main text).

      Resolution of the Figures was improved

      (3) There are many aspects of the method that enable easy user biases and can lead to substantial overfitting of the data.

      a. On page 7: "The topology of the point cloud representing human T-cell development is more complex ... and does not offer a clear cutoff for the choice of significant sparse regions. Interactive selection allows the user to vary the resolution and to investigate specific sparse regions in the data iteratively." This implies that the method enables user biases to be introduced into the data analysis. While perhaps useful for exploration, quantitative trajectory assessment using such approach can be faulty when the user (A) may not know the underlying dynamics (B) forces preconceived notion of trajectory.

      The authors should consider making the trajectory inference approach less dependent on interactive user input and show that the trajectory results are robust to any choices the user may make. It may also help if the authors provide an effective guide and mention clearly what issues could result due to the use of such thresholds.

      As explained in the response in public reviews, tviblindi is not designed as a fully automated TI tool, but as a data driven framework for exploratory analysis of unknown data. 

      There is always a risk of possible bias in this type of analysis - starting with experimental design, choice of hyperparameters in the downstream analysis, and an expert interpretation of the results. The successful analysis of new biological data involves a great deal of expert knowledge which is difficult to a priori include in the computational models.  To specifically address the points raised by the reviewer:

      “(A) may not know the underlying dynamics” - tviblindi is designed to perform exploratory analysis of the unknown underlying dynamics. We showcase in the study how this can be performed and we highlight possible cases which can be resolved expertly (spurious connections (doublets), different scales of resolution (beta selection)). Crucially, compared to other TI methods, tviblindi offers a clear mechanism on how to discover, focus and resolve these issues which would (and do) contaminate the trajectories discovered fully automatically by tested methods (cf. the beta selection, or the development of plasmacytoid dendritic cells (PDCs) (Supplementary note, section 10.1).

      “(B) forces preconceived notion of trajectory” - user interaction in tviblindi does not force a preconceived notion of the trajectory. The random walks are simulated before the interactive step in an unbiased manner. During the interactive step the user adjusts trajectory specific resolution - incorrect choice of the resolution may result in either merging distinct trajectories into one or over separating the trajectories (which is arguably much less serious). However the interactive step is designed to deal with exactly this kind of challenge. We showcase (e.g. beta selection, or PDCs development) how to address the issue - tviblindi allows us to investigate deeper structure in any considered trajectory.

      Thus, tviblindi represents a new class of methods that is complementary to fully automated trajectory inference tools. It offers a semi-automated tool that leverages features derived from data in an unbiased and mathematically rigorous manner, including pseudotime, homology classes, and appropriate low-dimensional representations. These can be integrated with expert knowledge to formulate hypotheses regarding the underlying dynamics, tailored to the specific trajectory or biological process under investigation.

      b. In Figure 4, the authors discuss the trajectory of cells emanating from CD3 negative double positive stage and entering apoptotic phase and mention tviblindi may give "the false impression that cells may pass through an apoptotic phase into a later developmental stage" and propose that the interactive version of tviblindi can help user zoom into (increase resolution) this phenomenon and identify that there are in fact two trajectories in one. Given this, how do the other trajectories in the data change if a user manually adjusts the resolution? A quantification of the robustness is important. Also, it appears that a more careful data clean up could avoid such pitfalls where the algorithm infers trajectory based on mixed phenotype and the user would not have to manually adjust the resolution to obtain clear biological conclusion. We not that the original publication of this data did such "data clean up" using simple diffusion map based dimensionality reduction which the authors boast they avoid. There is a reason for this dimensionality reduction (distinguishing signal from noise), even in CyTOF data, let alone its importance in single cell data.

      The reviewer is concerned about two different, but intertwined issues we wish to untangle here. First, data clean-up is typically done on the premise that dead cells are irrelevant and they are a source of false signals. In the case of the thymocytes in the human thymus this premise is not true. Apoptotic cells are a legitimate (actually dominant) fate of the development and thus need to be represented in the TI dataset. Their biological behavior is however complex as they stop expressing proteins and thus lose their surface markers gradually, as dictated by the particular protein degradation kinetics. So can we clean up dead and dying cells better? Yes, but we don't want to do it since we would lose cells we want to analyze. Second, do trajectories change when we zoom into the data? No, only the level of detail presented visually changes. Since we calculate 5000 trajectories in the dataset, we need to aggregate them already for the hierarchical clustering visualization. Note that Figure 4, panel A highlights 159 trajectories selected in V. group. Zooming in means that the hierarchy of trajectories within V. group is revealed (panel D, groups V.a and Vb.) and can be interpreted on the vaevictis and lineplot graphs (panel E, F). 

      c. In the discussion, the authors write "[tviblindi] allows the selection and grouping of similar random walks into trajectories based on visual interaction with the data". This counters the idea of automated trajectory inference and can lead to severe overfitting.

      As explained in reply to Q3, our aim was NOT to create a fully automated trajectory inference tool. Even more, in our experience we realized that all current tools are taking this fully  automated approach with a search for an “ideal” set of hyperparameters. This, in our experience,  leads to a “blackbox” tool that is difficult to interpret for the expert in the biological field. To respond to this need we designed a modular approach where the results of the TI are presented and the expert can interact with them to focus the visualization and to derive interpretation. Our interactive concept is based on 15 years of experience with the data analysis in flow cytometry, where neither manual gating nor full automation is the ultimate solution but smart integration of both approaches eventually wins the game.

      Thus, tviblindi represents a new class of methods that is complementary to fully automated trajectory inference tools.  It offers a semi-automated tool that leverages features derived from data in an unbiased and mathematically rigorous manner. These features include pseudotime, homology classes, and appropriate low-dimensional representations. These features can be integrated with expert knowledge to formulate hypotheses regarding the underlying dynamics, tailored to the specific trajectory or biological process under investigation.

      d. The authors provide some comment on the robustness to the relaxation parameter for witness complex construction in Supplementary Note Section 8.1.2 but it is limited given the importance of this parameter and a more thorough investigation is recommended. We request the authors to provide concrete examples with figures of how changing alpha2 parameter leads to simplicial complexes of different sizes and an assessment of contexts in which the parameter is robust and when not (in both simulated and publicly available real data). Of note, giving the users a proper guide for parameter choice based on these examples and offering them ways to quantify robustness of their results may also be valuable.

      Section 8 in Supplementary Note was extended as requested.

      e. The authors are requested for an assessment of possible short-circuits (e.g. cells of two distantly related phenotypes that get connected erroneously in the trajectory) in the data, and how their approach based on persistent homology deals with it.

      If a short circuit results in a (spurious) alternative trajectory, the persistent homology approach allows us to distinguish it from genuine trajectories that do not follow the short circuit. This prevents contamination of the inferred evolution by erroneous connections. The ability to distinguish and separate distinct trajectories with the same fate is a major strength of this approach (e.g., the trajectory through doublets or the trajectories around checkpoints in thymocytes’ evolution).

      (4) The authors propose vaevictis as a new visualization tool and show its performance compared to the standard UMAP algorithm on a simulated data set (Figure 1 in Supplementary Notes). We recommend a more comprehensive comparison between the two algorithms on a wide array of publicly available single-cell datasets. As well as comparison to other popular dimensionality reduction approaches like force directed layouts, which are the most widely used tool specifically to visualize trajectories.

      We added Section 10 to Supplementary Note that presents multiple comparisons of this kind. It is important to note that tviblindi works independently of visualization and any preferred visualization can be used in the interactive phase (multiple visualisation methods are implemented).

      (5) In Supplementary Note 8.2, the authors compare tviblindi against the other methods. We recommend the authors to quantify the comparison or expand on their assesments in real biological data. For example, in comparison against Palantir and VIA the authors mention "... discovers candidate endpoints in the biological dataset but lacks toolbox to interrogate subtle features such as complex branching" and "fails to discover subtle features (such as Beta selection)" respectively. We recommend the authors to make these comparisons more precise or provide quantification. While the added benefit of interactive sessions of tviblindi may make it more user friendly, the way tviblindi appears to enable analysis of subtle features (e.g. Figure 1H) should be possible in Palantir or VIA as well.

      We extended the comparisons and presented them in Section 8 and 10 in Supplementary Note.  

      (6) The notion of using random walk simulations to identify terminal (and initial states) has been previously used in single-cell data (CellRank algorithm: https://www.nature.com/articles/s41592-021-01346-6). We request the authors to compare their approach to CellRank.

      We compared our algorithm to the CellRank successor CellRank 2 (see section 8.2, Supplementary Note)

      (7) The notion of using persistent homology to discover trajectories has been previously used in single cell data https://pubmed.ncbi.nlm.nih.gov/28459448/. we request a comparison to this approach

      The proposed algorithm was not able to accommodate the large datasets we used.

      scTDA (Rizvi, Camara et al. Nat. Biotechnol. 2017) has not been updated for 6 years. It is not suited for complex atlas-sized datasets both in terms of performance and utility, with its limited visualization tools. It also lacks capabilities to analyze individual trajectories.

      (8) In Figure 3B, the authors visualize the endpoints and simulated random walks using the connectome. There is no edge from start to the apoptotic cells here. It is not clear why? If they are not relevant based on random walks, can the user remove them from analysis? Same for the small group of pink cells below initial point.

      The connectome is a fully automated approach (similar to PAGA) which gives a basic overview of the data. It is not expected to be able to compete with the interactive pipeline of tviblindi for the same reasons as the fully automated methods (difficult to predict the effect of hyperparameters).

      (9) In Supplementary Figure 3, in relation to "Variants of trajectories including selection processes" the author mention that there is a spurious connection between CD4 single positive, and the doublet set of cells. The authors mention that the presence of dividing cells makes it difficult to remove the doublets. We request the authors to discuss why. For example, the authors seem to have cell cycle markers (e.g. Ki67, pH3, Cyclin) and one would think that coupled with DNA intercalator 191/193lr one could further clean-up the data. Can the authors employ alternative toolkits such as doublet detection methods?

      To address this issue, we do remove doublets with illegitimate cell barcodes (e.g. we remove any two cells from two samples with different barcode which present with double barcode). Although there are computational doublet removal approaches for mass cytometry (Bagwell, Cytometry A 2020), mostly applied to peripheral blood samples (where cell division is not present under steady state immune system conditions), these are however not well suited for situations where dividing samples occur (Rybakowska P, Comput Struct Biotechnol J. 2021), which is the case of our thymocyte samples. Furthermore, there are other situations where doublet formation is not an accident, but rather a biological response (Burel JG, Cytometry A (2020). Thus, the doublet cell problem is similar to the apoptotic cell problem discussed earlier.

      We could remove cells with the double DNA signal, but this would remove not only accidental doublets but also the legitimate (dividing) cells. So the question is how to remove the illegitimate doublets but not the legitimate?

      Of note, the trajectory going through doublets does not affect the interpretation of other trajectories as it is readily discriminated by persistent homology and thus random walks passing through this (spurious) trajectory do not contaminate the markers’ evolution inferred for legitimate trajectories.

      We therefore prefer to remove only the barcode illegitimate and keep all others in analysis, using the expert analysis step also to identify (using the cell cycle markers plus other features) the artificially formed doublets and thus spurious connections.

      (10) The authors should discuss how the gene expression trend plots are made (e.g. how are the expression averaged? Rolling mean?).

      The development of those markers is shown as a line plot connecting the average values of a specific marker within a pseudotime segment. By default, the pseudotime values are divided into uniform segments (each containing the same number of points) whose number can be changed in the GUI. To focus on either early or late stages of the development, the segment division can be adjusted in GUI. See section 6 of the Supplementary Note.

      Reviewer #3 (Recommendations For The Authors):

      The overall figures quality needs to be improved. For example, I can barely see the text in Figure 3c.

      Resolution of the Figures was improved

    1. 非阿贝尔编织门 (braiding‑based gate):自然实现 Clifford + π/8 等容错操作

      非阿贝尔编织门:把 MZM 当毛线球来“打结”算数

      0. 一句话预热

      思路:把 4 只马约拉纳零模(γ₁…γ₄)排成一串,当作一根量子“橡皮筋”。只要用 绕圈圈 的手法把它们交换(braid),系统波函数就会像魔术方块一样自己旋转——这就是最“佛系”的量子逻辑门:不靠脉冲计时,而靠“打结”形状


      1. 如何把 4 只 MZM 变成 1 个量子比特?

      | 步骤 | 操作 | 结果 | 备注 | |---|---|---|---| | 编码 | 固定总费米子奇偶为 偶数(γ₁γ₂γ₃γ₄ = +1) | 得到两维简并空间:|0⟩, |1⟩ | 其实是把电子—空穴的 全局拓扑量子数 当比特 | | 逻辑 Z | 交换 γ₁ ↔ γ₂ 一次 | $U_{12}=e^{\frac{\pi}{4}\sigma_z}$ | 相位 +π/2 | | 逻辑 X | 交换 γ₂ ↔ γ₃ | $U_{23}=e^{\frac{\pi}{4}\sigma_x}$ | | | Hadamard | 顺序 γ₁↔γ₂,γ₂↔γ₃,γ₁↔γ₂ | $\text{H}=e^{\frac{\pi}{4}\sigma_z}e^{\frac{\pi}{4}\sigma_x}e^{\frac{\pi}{4}\sigma_z}$ | 把 Z、X 对调 |

      这些都是 Clifford 门,全程“纯拓扑”,对局域噪声近乎免疫。citeturn0search4


      2. 两量子比特:八只 Majorana 的“交叉辫子”

      • 布阵:每个比特各 4 只 MZM;中间共享一根 T‑junction 或“π 型”交叉线。
      • 受控相 (CZ):把一个比特的 γ₄ 绕另一比特的 γ₁ 一圈 → $CZ = \text{diag}(1,1,1,-1)$。
      • 用 Hadamard 把 CZ 夹心,即成 CNOT,至此 Clifford 族全员到齐
      • 最新测量‑只方案可用 无辅模式 (ancilla‑free) 测量序列 实现,兼具确定性与拓扑保护。citeturn0search1turn0search3

      3. π⁄8(T)门:Clifford “驾驶证”进阶到万能钥匙

      核心事实:Ising 任何子仅靠编织做不到 π⁄8;必须加一点“调味料”,才让计算机从 Clifford‑only → 通用

      三条主流“加料”路线

      | 路线 | 思想图 | 优点 | 隐忧 | |---|---|---|---| | (A) Magic‑State Injection | 先“炼”一个 noisy $|T⟩=(|0⟩+e^{iπ/4}|1⟩)/√2$,再用 魔法态蒸馏 | 理论成熟,适配任何 Ising 平台 | 蒸馏开销大,需要额外物理比特 | | (B) 受控库仑/点接耦合 | 利用量子点或库仑岛调节 γᵢγⱼ 有效耦合,直接给出 $e^{i\frac{π}{8}σ_z}$ | 全硬件,门时长短 | 必须精准控制耦合能;非纯拓扑,需主动校准 citeturn1search0 | | (C) 测量‑Only Trick | 通过 随机投影 + 反馈 把 Clifford 轨迹“拗弯”出 π⁄8 相位 | 全程拓扑;可与面码 (surface code) 无缝结合 | 测量结果随机 ⇒ 逻辑电路要容错重排 citeturn0search1 |

      段子版:Clifford 门好比会 平地转体 360° 的体操运动员,π⁄8 再加上“抬脚尖”45°,瞬间就能参加奥运全能决赛,但也更容易崴脚,需要防护垫(蒸馏/校准)。


      4. 编织 = “打结就算数”,为什么天然容错?

      1. 拓扑保护:演化只依赖于绕圈的顺序,与路径细节无关 ⇒ 本征退相 (dephasing) 指数抑制。
      2. 对偶误纠 (Dual Error‑correction)
      3. 空间维度:非局域编码避免单点热涨落。
      4. 时间维度:整个“打结”过程中,只要 MZM 不被合并或消失,逻辑信息就安全。
      5. 错误率底限:理论估计 $p \sim \exp(-L/ξ)$,与传统超导比特 ($10^{-3}$) 相比 潜力深不见底

      5. 生态位 & 未来挑战

      • 当前战场:纳米线‑三岔网、平面 Josephson‑junction 网格、魔角石墨烯可重构通道。
      • 编织演示的里程碑:真正“视频流”式的 3‑MZM ↔ 4‑MZM ↔ …动态交换仍在路上,预计下一阶段将与 低温扫描门阵列 联手。
      • π⁄8 门对标:谁先把 π⁄8 错误率压到 < 10⁻⁴,谁就能抢占 通用拓扑量子计算“样板间”。

      🌟 结语:

      Clifford + π⁄8 就像武侠里的“独孤九剑 + 破剑式”。前者招招自成体系、天然带护身罡气;后者虽只多半招,却让你真正杀遍天下无敌手。马约拉纳编织门正试图把这套功法写进硬件底层——一旦成功,“打个结就跑程序”将不再是江湖传说,而是芯片工程师的日常。


      深挖阅读<br /> 1. Aguado & Lutchyn, Majorana qubits for TQC, Phys. Today (2020) citeturn0search7<br /> 2. Zhang et al., Ancilla‑free deterministic Clifford gates (2023) citeturn0search1<br /> 3. Liu et al., Coulomb‑assisted π⁄8 gate proposal (2024) citeturn1search0<br /> 4. PostQuantum white paper on Majorana braiding (2023) citeturn0search4

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Public review): 

      Summary: 

      This fascinating manuscript studies the effect of education on brain structure through a natural experiment. Leveraging the UK BioBank, these authors study the causal effect of education using causal inference methodology that focuses on legislation for an additional mandatory year of education in a regression discontinuity design. 

      Strengths: 

      The methodological novelty and study design were viewed as strong, as was the import of the question under study. The evidence presented is solid. The work will be of broad interest to neuroscientists 

      Weaknesses: 

      There were several areas which might be strengthed from additional consideration from a methodological perspective. 

      We sincerely thank the reviewer for the useful input, in particular, their recommendation to clarify RD and for catching some minor errors in the methods (such as taking the log of the Bayes factors). 

      Reviewer #1 (Recommendations for the authors): 

      (1) The fuzzy local-linear regression discontinuity analysis would benefit from further description. 

      (2) In the description of the model, the terms "smoothness" and "continuity" appear to be used interchangeably. This should be adjusted to conform to mathematical definitions. 

      We have now added to our explanations of continuity regression discontinuity. In particular, we now explain “fuzzy”, and add emphasis on the two separate empirical approaches (continuity and local-randomization), along with fixing our use of “smoothness” and “continuity”.

      results:

      “Compliance with ROSLA was very high (near 100%; Sup. Figure 2). However, given the cultural and historical trends leading to an increase in school attendance before ROSLA, most adolescents were continuing with education past 15 years of age before the policy change (Sup Plot. 7b). Prior work has estimated 25 percent of children would have left school a year earlier if not for ROSLA 41. Using the UK Biobank, we estimate this proportion to be around 10%, as the sample is healthier and of higher SES than the general population (Sup. Figure 2; Sup. Table 2) 46–48.”

      methods:

      “RD designs, like ours, can be ‘fuzzy’ indicating when assignment only increases the probability of receiving it, in turn, treatment assigned and treatment received do not correspond for some units 33,53. For instance, due to cultural and historical trends, there was an increase in school attendance before ROSLA; most adolescents were continuing with education past 15 years of age (Sup Plot. 7b). Prior work has estimated that 25 percent of children would have left school a year earlier if not for ROSLA 41. Using the UK Biobank, we estimate this proportion to be around 10%, as the sample is healthier and of higher SES than the general population (Sup. Figure 2; Sup. Table 2) 46–48.”

      (3) The optimization of the smoother based on MSE would benefit from more explanation and consideration. How was the flexibility of the model taken into account in testing? Were there any concerns about post-selection inference? A sensitivity analysis across bandwidths is also necessary. Based on the model fit in Figure 1, results from a linear model should also be compared. 

      It is common in the RD literature to illustrate plots with higher-order polynomial fits while inference is based on linear (or at most quadratic) models (Cattaneo, Idrobo & Titiunik, 2019). We agree that this field-specific practice can be confusing to readers. Therefore, we have redone Figure 1 using local-linear fits better aligning with our analysis pipeline. Yet, it is still not a one-to-one alignment as point estimation and confidence are handled robustly while our plotting tools are simple linear fits. In addition, we updated Sup. Fig 3 and moved 3rd-order polynomial RD plots to Sup. Fig 4.

      Empirical RD has many branching analytical decisions (bandwidth, polynomial order, kernel) which can have large effects on the outcome. Fortunately, RD methodology is starting to become more standardized (Catteneo & Titiunik, 2022, Ann. Econ Rev) as there have been indications of publication bias using these methods (Stommes, Aronow & Sävje, 2023, Research and Politics (This paper suggest it is not researcher degrees of freedom, rather inappropriate inferential methods)). While not necessarily ill-intended, researcher degrees of freedom and analytic flexibility are major contributors to publication bias. We (self) limited our analytic flexibility by using pre-registration (https://osf.io/rv38z).

      One of the most consequential analytic decisions in RD is the bandwidth size as there is no established practice, they are context-specific and can be highly influential on the results. The choice of bandwidths can be framed as a ‘bias vs. variance trade-off’. As bandwidths increase, variance decreases since more subjects are added yet bias (misspecification error/smoothing bias) also increases (as these subjects are further away and less similar). In our case, our assignment (running/forcing) variable is ‘date of birth in months’; therefore our smallest comparison would be individuals born in August 1957 (unaffected/no treatment) vs September 1957 (affected/treated). This comparison has the least bias (subjects are the most similar to each other), yet it comes at the expense of very few subjects (high variance in our estimate). 

      MSE-derived bandwidths attempt to solve this issue by offering an automatic method to choose an analysis bandwidth in RD. Specifically, this aims to minimize the MSE of the local polynomial RD point estimator – effectively choosing a bandwidth by balancing the ‘bias vs. variance trade-off’ (explained in detail 4.4.2 Cattaneo et al., 2019 p 45 - 51 “A practical introduction to regression discontinuity designs: foundations”). Yet, you are very correct in highlighting potential overfitting issues as they are “by construction invalid for inference” (Calonico, Cattaneo & Farrell, 2020, p. 192). Quoting from Cattaneo and Titiunik’s Annual Review of Economics from 2022: 

      “Ignoring the misspecification bias can lead to substantial overrejection of the null hypothesis of no treatment effect. For example, back-of-the-envelop calculations show that a nominal 95% confidence interval would have an empirical coverage of about 80%.”

      Fortunately, modern RD analysis packages (such as rdrohust or RDHonest) calculate robust confidence intervals - for more details see Armstrong and Kolesar (2020). For a summary on MSE-bandwidths see the section “Why is it hard to estimate RD effects?” in Stommes and colleagues 2023 (https://arxiv.org/abs/2109.14526). For more in-depth handling see the Catteneo, Idrobo, and Titiunik primer (https://arxiv.org/abs/1911.09511).

      Lastly, with MSE-derived bandwidths sensitivity tests only make sense within a narrow window of the MSE-optimized bandwidth (5.5 Cattaneo et al., 2019 p 106 - 107). When a significant effect occurs, placebo cutoffs (artificially moving the cutoff) and donut-hole analysis are great sensitivity tests. Instead of testing our bandwidths, we decided to use an alternate RD framework (local randomization) in which we compare 1-month and 5-month windows. Across all analysis strategies, MRI modalities, and brain regions, we do not find any effects of the education policy change ROSLA on long-term neural outcomes.

      (4) In the Bayesian analysis, the authors deviated from their preregistered analytic plan. This whole section is a bit confusing in its current form - for example, point masses are not wide but rather narrow. Bayes factors are usually estimated; it is unclear how or why a prior was specified. What exactly is being modeled using a prior? Also, throughout - If the log was taken, as the methods seem to indicate for the Bayes factor, this should be mentioned in figures and reported estimates. 

      First, we would like to thank you for spotting that we incorrectly kept the log in the methods. We have fixed this and added the following sentence to the methods: 

      “Bayes factors are reported as BF<sub>10</sub> in support of the alternative hypothesis, we report Bayes factors under 1 as the multiplicative inverse (BF<sub>01</sub> = 1/BF)”

      All Bayesian analyses need to have a prior. In practice, this becomes an issue when you’re uncertain about 1) the location of the effect (directionality & center mass, defined by a location parameter), yet more importantly, the 2) confidence/certainty of the range-spread of possible effects (determined by a scale parameter). In normally distributed priors these two ‘beliefs’ are represented with a mean and a standard deviation (the latter impacts your confidence/certainty on the range of plausible parameter space). 

      Supplementary figure 6 illustrates several distributions (location = 0 for all) with varying scale parameters; when used as Bayesian priors this indicates differing levels of confidence in our certainty of the plausible parameter space. We illustrate our three reported, normally distributed priors centered at zero in blue with their differing scale parameters (sd = .5, 1 & 1.5).

      All of these five prior distributions have the same location parameter (i.e., 0) yet varying differences in the scale parameter – our confidence in the certainty of the plausible parameter space. At first glance it might seem like a flat/uniform prior (not represented) is a good idea – yet, this would put equal weight on the possibility of every estimate thereby giving the same probability mass to implausible values as plausible ones. A uniform prior would, for instance, encode the hypothesis that education causing a 1% increase in brain volume is just as plausible as it causing either a doubling or halving in brain volume. In human research, we roughly know a range of reasonable effect sizes and it is rare to see massive effects.

      A benefit of ‘weakly-informative’ priors is that they limit the range of plausible parameter values. The default prior in STAN (a popular Bayesian estimation program; https://mc-stan.org) is a normally distributed prior with a mean of zero and an SD of 2.5 (seen in orange in the figure; our initial preregistered prior). This large standard deviation easily permits positive and negative estimates putting minimal emphasis on zero. Contrast this to BayesFactor package’s (Morey R, Rouder J, 2023) default “wide” prior which is the Cauchy distribution (0, .7) illustrated in magenta (for more on the Cauchy see: https://distribution-explorer.github.io/continuous/cauchy.html). 

      These different defaults reflect differing Bayesian philosophical schools (‘estimate parameters’ vs ‘quantify evidence’ camps); if your goal is to accurately estimate a parameter it would be odd to have a strong null prior, yet (in our opinion) when estimating point-null BF’s a wide default prior gives far too much evidence in support of the null. In point-null BF testing the Savage-Dickey density ratio is the ratio between the height of the prior at 0 and the height of the posterior at zero (see Figure under section “testing against point null 0”). This means BFs can be very prior sensitive (seen in SI tables 5 & 6). For this reason, we thought it made sense to do prior sensitivity testing, to ensure our conclusions in favor of the null were not caused solely by an overly wide prior (preregistered orange distribution) we decided to report the 3 narrower priors (blue ones).

      Alternative Bayesian null hypotheses testing methods such as using Bayes Factors to test against a null region and ‘region of practical equivalence testing’ are less prior sensitive, yet both methods demand the researcher (e.g. ‘us’) to decide on a minimal effect size of practical interest. Once a minimal effect size of interest is determined any effect within this boundary is taken as evidence in support of the null hypothesis.

      (5) It is unclear why a different method was employed for the August / September data analysis compared to the full-time series. 

      We used a local-randomization RD framework, an entirely different empirical framework than continuity methods (resulting in a different estimate). For an overview see the primer by Cattaneo, Idrobo & Titiunik 2023 (“A Practical Introduction to Regression Discontinuity Designs: Extensions”; https://arxiv.org/abs/2301.08958).

      A local randomization framework is optimal when the running variable is discrete (as in our case with DOB in months) (Cattaneo, Idrobo & Titiunik 2023). It makes stronger assumptions on exchangeability therefore a very narrow window around the cutoff needs to be used. See Figure 2.1 and 2.2 (in the Cattaneo, Idrobo & Titiunik 2023) for graphical illustrations of 1) a randomized experiment, 2) a continuity RD design, and 3) local-randomization RD. Using the full-time series in a local randomization analysis is not recommended as there is no control for differences between individuals as we move further away from the cutoff – making the estimated parameter highly endogenous.

      We understand how it is confusing to have both a new framework and Bayesian methods (we could have chosen a fully frequentist approach) but using a different framework allows us to weigh up the aforementioned ‘bias vs variance tradeoff’ while Bayesian methods allow us to say something about the weight of evidence (for or against) our hypothesis.

      (6) Figure 1 - why not use model fits from those employed for hypothesis testing? 

      This is a great suggestion (ties into #3), we have now redone Figure 1.

      (7) The section on "correlational effect" might also benefit from additional analyses and clarifications. Indeed, the data come from the same randomized experiment for which minimum education requirements were adjusted. Was the only difference that the number of years of education was studied as opposed to the cohort? If so, would the results of this analysis be similar in another subsample of the UK Biobank for which there was no change in policy?

      We have clarified the methods section for the correlational/associational effect. This was the same subset of individuals for the local randomization analysis; all we did was change the independent variable from an exogenous dummy-coded ROSLA term (where half of the sample had the natural experiment) to a continuous (endogenous) educational attainment IV. 

      In principle, the results from the associational analysis should be exactly the same if we use other UK Biobank cohorts. To see if the association of education attainment with the global neuroimaging cohorts was similar across sub-cohorts of new individuals, we conducted post hoc Bayesian analysis on eight more subcohort of 10-month intervals, spaced 2 years apart from each other (Sup. Figure 7; each indicated by a different color). Four of these sub-cohorts predate ROSLA, while the other four are after ROSLA. Educational attainment is slowly increasing across the cohorts of individuals born from 1949 until 1965; intriguingly the effect of ROSLA is visually evident in the distributions of educational attainment (Sup. Figure 7). Also, as seen in the cohorts predating ROSLA more and more individuals were (already) choosing to stay in education past 15 years of age (see cohort 1949 vs 1955 in Sup. Figure 7).

      Sup. Figure 8 illustrates boxplots of the educational attainment posterior of the eight sub-cohorts in addition to our original analysis (s1957) using a normal distributed prior with a mean of 0 and a sd of 1. Total surface area shows a remarkably replicable association with education attainment. Yet, it is evident the “extremely strong” association we found for CSF was a statistical fluke – as the posterior of other cohorts (bar our initial test) crosses zero. The conclusions for the other global neuroimaging covariates where we concluded ‘no associational effect’ seems to hold across cohorts.

      We have now added methods, deviation from preregistration, and the following excerpt to the results:

      “A post hoc replication of this associational analysis in eight additional 10-month cohorts spaced two years apart (Sup. Figure 7) indicates our preregistered report on the associational effect of educational attainment on CSF to be most likely a false-positive (Sup. Figure 8). Yet, the positive association between surface area and educational attainment is robust across the additional eight replication cohorts.”

      Reviewer #2 (Public review): 

      Summary: 

      The authors conduct a causal analysis of years of secondary education on brain structure in late life. They use a regression discontinuity analysis to measure the impact of a UK law change in 1972 that increased the years of mandatory education by 1 year. Using brain imaging data from the UK Biobank, they find essentially no evidence for 1 additional year of education altering brain structure in adulthood. 

      Strengths: 

      The authors pre-registered the study and the regression discontinuity was very carefully described and conducted. They completed a large number of diagnostic and alternate analyses to allow for different possible features in the data. (Unlike a positive finding, a negative finding is only bolstered by additional alternative analyses). 

      Weaknesses: 

      While the work is of high quality for the precise question asked, ultimately the exposure (1 additional year of education) is a very modest manipulation and the outcome is measured long after the intervention. Thus a null finding here is completely consistent educational attainment (EA) in fact having an impact on brain structure, where EA may reflect elements of training after a second education (e.g. university, post-graduate qualifications, etc) and not just stopping education at 16 yrs yes/no. 

      The work also does not address the impact of the UK Biobank's well-known healthy volunteer bias (Fry et al., 2017) which is yet further magnified in the imaging extension study (Littlejohns et al., 2020). Under-representation of people with low EA will dilute the effects of EA and impact the interpretation of these results. 

      References: 

      Fry, A., Littlejohns, T. J., Sudlow, C., Doherty, N., Adamska, L., Sprosen, T., Collins, R., & Allen, N. E. (2017). Comparison of Sociodemographic and Health-Related Characteristics of UK Biobank Participants With Those of the General Population. American Journal of Epidemiology, 186(9), 1026-1034. https://doi.org/10.1093/aje/kwx246 

      Littlejohns, T. J., Holliday, J., Gibson, L. M., Garratt, S., Oesingmann, N., Alfaro-Almagro, F., Bell, J. D., Boultwood, C., Collins, R., Conroy, M. C., Crabtree, N., Doherty, N., Frangi, A. F., Harvey, N. C., Leeson, P., Miller, K. L., Neubauer, S., Petersen, S. E., Sellors, J., ... Allen, N. E. (2020). The UK Biobank imaging enhancement of 100,000 participants: rationale, data collection, management and future directions. Nature Communications, 11(1), 2624. https://doi.org/10.1038/s41467-020-15948-9 

      We thank the reviewer for the positive comments and constructive feedback, in particular, their emphasis on volunteer bias in UKB (similar points were mentioned by Reviewer 3). We have now addressed these limitations with the following passage in the discussion:

      “The UK Biobank is known to have ‘healthy volunteer bias’, as respondents tend to be healthier, more educated, and are more likely to own assets [71,72]. Various types of selection bias can occur in non-representative samples, impacting either internal (type 1) or external (type 2) validity. One benefit of a natural experimental design is that it protects against threats to internal validity from selection bias [43], design-based internal validity threats still exist, such as if volunteer bias differentially impacts individuals based on the cutoff for assignment. A more pressing limitation – in particular, for an education policy change – is our power to detect effects using a sample of higher-educated individuals. This is evident in our first stage analysis examining the percentage of 15-year-olds impacted by ROSLA, which we estimate to be 10% in neuro-UKB (Sup. Figure 2 & Sup. Table 2), yet has been reported to be 25% in the UK general population [41]. Our results should be interpreted for this subpopulation  (UK, 1973, from 15 to 16 years of age, compliers) as we estimate a ‘local’ average treatment effect [73]. Natural experimental designs such as ours offer the potential for high internal validity at the expense of external validity.”

      We also highlighted it both in the results and methods.

      We appreciate that one year of education may seem modest compared to the entire educational trajectory, but as an intervention, we disagree that one year of education is ‘a very modest manipulation’. It is arguably one of the largest positive manipulations in childhood development we can administer. If we were to translate a year of education into the language of a (cognitive) intervention, it is clear that the manipulation, at least in terms of hours, days, and weeks, is substantial. Prior work on structural plasticity (e.g., motor, spatial & cognitive training) has involved substantially more limited manipulations in time, intensity, and extent. There is even (limited) evidence of localized persistent long-term structural changes (Wollett & Maguire, 2011, Cur. Bio.).

      We have now also highlighted the limited generalizability of our findings since we estimate a ‘local’ average treatment effect. It is possible higher education (college, university, vocational schools, etc.) could impact brain structure, yet we see no theoretical reason why it would while secondary wouldn’t. Moreover, higher education education is even trickier to research empirically due to heightened self and administrative selection pressures. While we cannot discount this possibility, the impacts of endogenous factors such as genetics and socioeconomic status are most likely heightened. That being said, higher education offers exciting possibilities to compare more domain-specific processes (e.g., by comparing a philosophy student to a mathematics student). Causality could be tested in European systems with point entry into field-specific programs – allowing comparison of students who just missed entry criteria into one topic and settled for another.

      Regarding the amount of time following the manipulation, as we highlight in our discussion this is both a weakness and a strength. Viewed from a developmental neuroplasticity lens it would have been nice to have imaging immediately following the manipulation. Yet, from an aging perspective, our design has increased power to detect an effect.  

      Reviewer #2 (Recommendations for the authors): 

      (1) The authors assert there is no strong causal evidence for EA on brain structure. This overlooks work from Mendielian Randomisation, e.g. this careful work: https://pubmed.ncbi.nlm.nih.gov/36310536/ ... evidence from (good quality) MR studies should be considered. 

      We thank the reviewer for highlighting this well-done mendelian randomization study. We have now added this citation and removed previous claims on the “lack of causal evidence existing”. We refrain from discussing Mendelian randomization, as it it would need to be accompanied by a nuanced discussion on the strong limitations regarding EduYears-PGS in Mendelian randomization designs.

      (2) Tukey/Boxplot is a good name for your identification of outliers but your treatment of outliers has a well-recognized name that is missing: Windsorisation. Please add this term to your description to help the reader more quickly understand what was done. 

      Thanks, we have now added the term winsorized.

      (3) Nowhere is it plainly stated that "fuzzy" means that you allow for imperfect compliance with the exposure, i.e. some children born before the cut-off stayed in school until 16, and some born after the cut-off left school before 16. For those unfamiliar with RD it would be very helpful to explain this at or near the first reference of the term "fuzzy". 

      We have now clarified the term ‘fuzzy’ to the results and methods:

      methods:

      “RD designs, like ours, can be ‘fuzzy’ indicating when assignment only increases the probability of receiving it, in turn, treatment assigned and treatment received do not correspond for some units 33,53. For instance, due to cultural and historical trends, there was an increase in school attendance before ROSLA; most adolescents were continuing with education past 15 years of age (Sup Plot. 7b). Prior work has estimated that 25 percent of children would have left school a year earlier if not for ROSLA 41. Using the UK Biobank, we estimate this proportion to be around 10%, as the sample is healthier and of higher SES than the general population (Sup. Figure 2; Sup. Table 2) 46–48.”

      (4) Supplementary Figure 2 never states what the percentage actually measures. What exactly does each dot represent? Is it based on UK Biobank subjects with a given birth month? If so clarify. 

      Fixed!

      Reviewer #3 (Public review): 

      Summary: 

      This study investigates evidence for a hypothesized, causal relationship between education, specifically the number of years spent in school, and brain structure as measured by common brain phenotypes such as surface area, cortical thickness, total volume, and diffusivity. 

      To test their hypothesis, the authors rely on a "natural" intervention, that is, the 1972 ROSLA act that mandated an extra year of education for all 15-year-olds. The study's aim is to determine potential discontinuities in the outcomes of interest at the time of the policy change, which would indicate a causal dependence. Naturalistic experiments of this kind are akin to randomised controlled trials, the gold standard for answering questions of causality. 

      Using two complementary, regression-based approaches, the authors find no discernible effect of spending an extra year in primary education on brain structure. The authors further demonstrate that observational studies showing an effect between education and brain structure may be confounded and thus unreliable when assessing causal relationships. 

      Strengths: 

      (1) A clear strength of this study is the large sample size totalling up to 30k participants from the UK Biobank. Although sample sizes for individual analyses are an order of magnitude smaller, most neuroimaging studies usually have to rely on much smaller samples. 

      (2) This study has been preregistered in advance, detailing the authors' scientific question, planned method of inquiry, and intended analyses, with only minor, justifiable changes in the final analysis. 

      (3) The analyses look at both global and local brain measures used as outcomes, thereby assessing a diverse range of brain phenotypes that could be implicated in a causal relationship with a person's level of education. 

      (4) The authors use multiple methodological approaches, including validation and sensitivity analyses, to investigate the robustness of their findings and, in the case of correlational analysis, highlight differences with related work by others. 

      (5) The extensive discussion of findings and how they relate to the existing, somewhat contradictory literature gives a comprehensive overview of the current state of research in this area. 

      Weaknesses: 

      (1) This study investigates a well-posed but necessarily narrow question in a specific setting: 15-year-old British students born around 1957 who also participated in the UKB imaging study roughly 60 years later. Thus conclusions about the existence or absence of any general effect of the number of years of education on the brain's structure are limited to this specific scenario. 

      (2) The authors address potential concerns about the validity of modelling assumptions and the sensitivity of the regression discontinuity design approach. However, the possibility of selection and cohort bias remains and is not discussed clearly in the paper. Other studies (e.g. Davies et al 2018, https://www.nature.com/articles/s41562-017-0279-y) have used the same policy intervention to study other health-related outcomes and have established ROSLA as a valid naturalistic experiment. Still, quoting Davies et al. (2018), "This assumes that the participants who reported leaving school at 15 years of age are a representative sample of the sub-population who left at 15 years of age. If this assumption does not hold, for example, if the sampled participants who left school at 15 years of age were healthier than those in the population, then the estimates could underestimate the differences between the groups.". Recent studies (Tyrrell 2021, Pirastu 2021) have shown that UK Biobank participants are on average healthier than the general population. Moreover, the imaging sub-group has an even stronger "healthy" bias (Lyall 2022). 

      (3) The modelling approach used in this study requires that all covariates of no interest are equal before and after the cut-off, something that is impossible to test. Mentioned only briefly, the inclusion and exclusion of covariates in the model are not discussed in detail. Standard imaging confounds such as head motion and scanning site have been included but other factors (e.g. physical exercise, smoking, socioeconomic status, genetics, alcohol consumption, etc.) may also play a role. 

      We thank the reviewer for their numerous positive comments and have now attempted to address the first two limitations (generalizability and UKB bias) with the following passage in the discussion:

      “The UK Biobank is known to have ‘healthy volunteer bias’, as respondents tend to be healthier, more educated, and are more likely to own assets [71,72]. Various types of selection bias can occur in non-representative samples, impacting either internal (type 1) or external (type 2) validity. One benefit of a natural experimental design is that it protects against threats to internal validity from selection bias [43], design-based internal validity threats still exist, such as if volunteer bias differentially impacts individuals based on the cutoff for assignment. A more pressing limitation – in particular, for an education policy change – is our power to detect effects using a sample of higher-educated individuals. This is evident in our first stage analysis examining the percentage of 15-year-olds impacted by ROSLA, which we estimate to be 10% in neuro-UKB (Sup. Figure 2 & Sup. Table 2), yet has been reported to be 25% in the UK general population [41]. Our results should be interpreted for this subpopulation  (UK, 1973, from 15 to 16 years of age, compliers) as we estimate a ‘local’ average treatment effect [73]. Natural experimental designs such as ours offer the potential for high internal validity at the expense of external validity.”

      We further highlight this in the results section:

      “Compliance with ROSLA was very high (near 100%; Sup. Figure 2). However, given the cultural and historical trends leading to an increase in school attendance before ROSLA, most adolescents were continuing with education past 15 years of age before the policy change (Sup Plot. 7b). Prior work has estimated 25 percent of children would have left school a year earlier if not for ROSLA 41. Using the UK Biobank, we estimate this proportion to be around 10%, as the sample is healthier and of higher SES than the general population (Sup. Figure 2; Sup. Table 2) 46–48.”

      Healthy volunteer bias can create two types of selection bias; crucially participation itself can serve as a collider threatening internal validity (outlined in van Alten et al., 2024; https://academic.oup.com/ije/article/53/3/dyae054/7666749). Natural experimental designs are partially sheltered from this major limitation, as ‘volunteer bias’ would have to differentially impact individuals on one side of the cutoff and not the other – thereby breaking a primary design assumption of regression discontinuity. Substantial prior work (including this article) has not found any threats to the validity of the 1973 ROSLA (Clark & Royer 2010, 2013; Barcellos et al., 2018, 2023; Davies et al., 2018, 2023). While the Davies 2028 article did IP-weight with the UK Biobank sample, Barcellos and colleagues 2023 (and 2018) do not, highlighting the following “Although the sample is not nationally representative,  our estimates have internal validity because there is no differential selection on the two sides of the September 1, 1957 cutoff – see  Appendix A.”.

      The second (more acknowledged & arguably less problematic) type of selection bias results in threats to external validity (aka generalizability). As highlighted in your first point; this is a large limitation with every natural experimental design, yet in our case, this is further amplified by the UK Biobank’s healthy volunteer bias. We have now attempted to highlight this limitation in the discussion passage above.

      Point 3 – the inability to fully confirm design validity – is again, another inherent limitation of a natural experimental approach. That being said, extensive prior work has tested different predetermined covariates in the 1973 ROSLA (cited within), and to our knowledge, no issues have been found. The 1973 ROSLA seems to be one of the better natural experiments around (there was also a concerted effort to have an ‘effective’ additional year; see Clark & Royer 2010). For these reasons, we stuck with only testing the variables we wanted to use to increase precision (also offering new neuroimaging covariates that didn’t exist in the literature base). One additional benefit of ROSLA was that the cutoff was decided years later on a variable that happened (date of birth) in the past – making it particularly hard for adolescents to alter their assignments.

      Reviewer #3 (Recommendations for the authors): 

      (1) FMRIB's preprocessing pipeline is mentioned. Does this include deconfounding of brain measures? Particularly, were measures deconfounded for age before the main analysis? 

      This is such a crucial point that we triple-checked, brain imaging phenotypes were not corrected for age (https://biobank.ctsu.ox.ac.uk/crystal/crystal/docs/brain_mri.pdf) – large effects of age can be seen in the global metrics; older individuals have less surface area, thinner cortices, less brain volume (corrected for head size), more CSF volume (corrected for head size), more white matter hyperintensities, and worse FA values. Figure 1 shows these large age effects, which are controlled for in our continuity-based RD analysis.

      One’s date of birth (DOB) of course does not match perfectly to their age, this is why we included the covariate ‘visit date’; this interplay can now be seen in our updated SI Figure 1 (recommended in #3) which shows the distributions of visit date, DOB, and age of scan. 

      In a valid RD design covariates should not be necessary (as they should be balanced on either side of the cutoff), yet the inclusion of covariates does increase precision to detect effects. We tested this assumption, finding the effect of ‘visit date’ and its quadratic term to be not related to ROSLA (Sup. Table 1). This adds further evidence (specific to the UK Biobank sample) to the existing body of work showing the 1973 ROSLA policy change to not violate any design assumptions. Threats to internal validity would more than likely increase endogeneity and result in ‘false causal positive causal effects’ (which is not what we find).  

      (2) Despite the large overall sample size, I am wondering whether the effective number of samples is sufficient to detect a potentially subtle effect that is further attenuated by the long time interval before scanning. As stated, for the optimised bandwidth window (DoB 20 to 35 months around cut-off), N is about 5000. Does this mean that effectively about 250 (10%) out of about 2500 participants born after the cut-off were leaving school at 16 rather than 15 because of ROSLA? For the local randomisation analysis, this becomes about N=10 (10% out of 100). Could a power analysis show that these cohort sizes are large enough to detect a reasonably large effect? 

      This is a very valid point, one which we were grappling with while the paper was out for review. We now draw attention to this in the results and highlight this as a limitation in the discussion. While UKB’s non-representativeness limits our power (10% affected rather than 25% in the general population), it is still a very large sample. Our sample size is more in line with standard neuroimaging studies than with large cohort studies. 

      The novelty of our study is its causal design, while we could very precisely measure an effect of some phenotype (variable X) in 40,000 individuals. This effect is probably not what we think we are measuring. Without IP-weighting it could even have a different sign. But more importantly, it is not variable X – it is the thousands of things (unmeasured confounders) that lead an individual to have more or less of variable X. The larger the sample the easier it is for small unmeasured confounders to reach significance (Big data paradox) – this in no way invalidates large samples, it is just our thinking and how we handle large samples will hopefully change to a more casual lens.

      (3) Supplementary Figure 1: A similar raincloud plot of date of birth would be instructive to visualise the distribution of subjects born before and after the 1957 cut-off. 

      Great idea! We have done this in Sup Fig. 1 for both visit date and DOB.

      (4) p.9: Not sure about "extreme evidence", very strong would probably be sufficient. 

      As preregistered, we interpreted Bayes Factors using Jeffrey’s criteria. ‘Extreme evidence’ is only used once and it is about finding an associational effect of educational attainment on CSF (BF10 > 100). Upon Reviewer 1’s recommendation 7, we conducted eight replication samples (Sup. Figure 7 & 8) and have now added the following passage to the results:

      “A post hoc replication of this associational analysis in eight additional 10-month cohorts spaced two years apart (Sup. Figure 7) indicates our preregistered report on the associational effect of educational attainment on CSF to be most likely a false-positive (Sup. Figure 8). Yet, the positive association between surface area and educational attainment is robust across the additional eight replication cohorts.”

      (5) The code would benefit from a bit of clean-up and additional documentation. In its current state, it is not easy to use, e.g. in a replication study. 

      We have now further added documentation to our code; including a readme describing what each script does. The analysis pipeline used is not ideal for replications as the package used for continuity-based RD (RDHonest) initially could not handle covariates – therefore we manually corrected our variables after a discussion with Prof Kolesár (https://github.com/kolesarm/RDHonest/issues/7). 

      Prof Kolesár added this functionality recently and future work should use the latest version of the package as it can correct for covariates. We have a new preprint examining the effect of 1972 ROLSA on telomere length in the UK Biobank using the latest package version of RDHonest (https://www.biorxiv.org/content/10.1101/2025.01.17.633604v1). To ensure maximum availability of such innovations, we will ensure the most up-to-date version of this script becomes available on this GitHub link (https://github.com/njudd/EduTelomere).

  8. social-media-ethics-automation.github.io social-media-ethics-automation.github.io
    1. Code-switching. November 2023. Page Version ID: 1185649746. URL: https://en.wikipedia.org/w/index.php?title=Code-switching&oldid=1185649746 (visited on 2023-11-24).

      Wikipedia expands the meaning of code switching that I am used to. I also thought it was changing the way you talk based on your environment which is included as part of its description but apparently switching languages when talking is also considered code switching. I found this interesting since I've never heard code switching being described in that context.

    2. Text analysis of Trump's tweets confirms he writes only theAndroid half was published on. Text analysis of Trump's tweets confirms he writes only the (angrier) Android half. August 2016. URL: http://varianceexplained.org/r/trump-tweets/ (visited on 2023-11-24).

      This article is breaking down after analyzing trumps tweets that he tweets from an android and an iPhone on the same device. The non-hyperbolic is from iPhone (staff) and hyperbolic is from Android (trump himself). The article further breaks down the code that comes with it

    3. Code-switching. November 2023. Page Version ID: 1185649746. URL:

      Code switcing is something I have know about for a long time. It's something I do so much that sometimes I don't even notice. Sometimes it makes me wonder what my "actual" way of speaking really is

    4. Jordan Peele. December 2023. Page Version ID: 1188580582. URL: https://en.wikipedia.org/w/index.php?title=Jordan_Peele&oldid=1188580582 (visited on 2023-12-07).

      This source is a wikipedia page for the comedian/director, Jordan Peele. He is one half of the comedy duo Key & Peele, a few of their skit videos are in the bibliography, and display comedic examples of code-switching. Peele is also an accomplished psychological horror director, and has released many successful films.

    1. We also may change how we behave and speak depending on the situation or who we are around, which is called code-switching [f21]. While modified behaviors to present a persona or code switch may at first look inauthentic, they can be a way of authentically expressing ourselves in each particular setting

      I like how this part of the reading brings awareness to the negative reputation that code switching has but also shows how it can be very useful. I think it's similar to when people say there is a place and a time to do something, usually in the context that you shouldn't be misbehaving in important setting. I have personally code switched in different scenarios such as my friends and my professor will see very different versions of me since I talk more formally to a professor than I would with my friends.

    2. We also may change how we behave and speak depending on the situation or who we are around, which is called code-switching [f21].

      I found this definition of code-switching to be very resonating since, as a black man, I find myself often doing it. With the different environments I found myself in, I see myself changing my vernacular whether I am in a professional or academic settings.

    3. Do you feel like those changes or expressions are authentic to who you are, do they compromise your authenticity in some way?

      Code switching and changing how you express yourself in different situations I feel is fully natural to a degree. As a white male I don't have to deal with code switching in the same way it's talked about for AAVE but I still speak a different way at work than I do when I'm hanging out with my close friends. Everyone interacts with different people differently, and though one could argue I am less authentic to myself at work, I still can express myself as who I am without the exact same vocabulary.

    4. How do you notice yourself changing how you express yourself in different situations, particularly on social media?

      I had never particularly thought about code-switching outside of a social-media context, but I definitely do it. I find myself being a lot more authentic with the people I work with at my restaurant job than I am during my university classes. I think I try to look a little more professional, and I'm more careful about how I sound in classes.

      I don't post very often on social media, but I don't really post about the bad stuff. On Instagram, for example, I will only post exciting things happening to me, not my day-to-day life.

    1. Discuss how these results are consistent with the existence of a last universal commonancestor (LUCA).

      Genetic code and ratio of purines to pyrimidines is universal across species because all commonly inherited from LUCA (last common universal ancestor). Genetic code is universal and shows divergent evolution from a common ancestor (LUCA)

      that was a duplicate, so just add that the 4 bases of DNA are universal.

    2. Discuss the evidence of the relationship between genome size and the complexityof organisms.
      1. Increased genome sizes are not necessarily linked to more complexity of organisms.
      2. The genome contains coding but also many non-coding genes that do not code for polypeptides. The proteome of an organism may be smaller even with increased genome size.
      3. Introns, telomeres and DNA coding for tRNA and rRNA are examples of noncoding DNA.
      4. Gene expression determines what genes are expressed and not all coding genes are activated to be expressed into a polypeptide.

      However eukaryotic genomes tend to be larger than prokaryotic genomes.

    1. Author response:

      The following is the authors’ response to the original reviews

      Reviewer #1 (Recommendations for the authors):

      (1) Storyline and Narrative Flow:

      Consider revising the manuscript to create a more coherent and consistent narrative. Clarify how each section of the study-particularly the transition from multi-omics data integration to single-cell RNA-seq validation-contributes to the overall research question. This will help readers better understand the logical flow of the study.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We have modified some text, including the connections between different sections in the results part and the objectives and roles of various analyses in each section, thus enhancing the coherence between the contexts and clarifying the objectives and functions of each analysis, We believe this will help readers better understand the main content of the entire text.

      (2) Immune Cell Activity Analysis:

      Reevaluate the methods used to assess immune cell activities within the context of the tumor microenvironment. Consider providing additional justification for the relevance of using the cancer cell model for this analysis. If necessary, explore alternative methods or models that might offer more meaningful insights into immune-tumor interactions.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      Using RNA-Bulk data, we evaluated the tumor immune microenvironment through various methods to assess immune infiltration levels and responses to immunotherapy. We found that the results were largely consistent with those presented in the manuscript, providing strong support for our viewpoints. We also acknowledge the limitations of findings from bioinformatics analysis. In our upcoming research, we plan to develop organoid models with gene expression patterns of both CS1 and CS2 subtypes, using these models as a foundation for studying the tumor immune microenvironment.

      (3) Single-Cell RNA-Seq Validation:

      Expand the validation of your findings using single-cell RNA-seq data. This could include more in-depth analyses that explore the heterogeneity within the subtypes and confirm the robustness of your classification method at the single-cell level. This would strengthen the support for your claims about the relevance of the identified subtypes.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      In this manuscript, we employed the NTP algorithm to classify malignant cells identified by the CopyKAT algorithm using characteristic genes of CS1 and CS2 subtypes. This approach is similar to previous method that analyzed patients in the ICGC cohort with the same subtype genes. We consider this classification method valid.

      After classifying the malignant cells, we performed metabolic and cell communication analyses on the CS1 and CS2 subtype cells, revealing significant differences in biological pathways enriched by differential genes, metabolic levels, and cell signaling patterns. These differences align with variations observed in prior classifications and analyses based on RNA-Bulk data.

      We also acknowledge that validating the classification method solely with the single-cell dataset from this study is insufficient. We analyzed GSE202642 using the same processes and methods as GSE229772, finding that the results were generally consistent, indicating that our classification method exhibits a degree of robustness at the single-cell level.

      (4) Methodological Justification:

      Provide a more detailed rationale for the selection of machine learning algorithms and integration strategies used in the study. Explain why the chosen methods are particularly well-suited for this research, and discuss any potential limitations they might have.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We have updated the methodology section to enhance readers' understanding of the fundamental principles involved. This analysis has two key features: first, it combines 10 machine learning algorithms to generate 101 models and ultimately selects the prognostic prediction model with the highest C-index from these 101 algorithms; second, it utilizes the LOOCV method to analyze the training and validation sets. Compared to the conventional method of randomly dividing the training and validation sets by a fixed ratio, this approach significantly minimizes the bias and randomness introduced by the splitting process. Therefore, we believe this analysis can leverage the characteristic genes of the CS1 and CS2 subtypes, combined with existing clinical data from public databases, to yield results that are more accurate and reliable than the commonly used prognostic models in previous literature, such as COX regression and Lasso regression, as well as other individual algorithms. While this analysis presents advantages over some previous modeling methods, it is essential to recognize that it remains based on analyses conducted using public databases, which may obscure certain factors that might be clinically relevant to patient prognosis due to the mathematical logic of the algorithms.

      (5) Figures and Visualizations:

      Improve the clarity of your figures by addressing the following:

      a) Figure 3A: Cluster the pathways to make the comparisons clearer and more meaningful.

      b) Figure 4A: Clearly explain the significance of the blue bar.

      c) Figure 4B: Ensure this figure is discussed in the main text to justify its inclusion.

      d) Figure 7C: Enhance the figure legend to provide more informative details.

      Additionally, ensure that figure descriptions go beyond the captions and provide detailed explanations that help the reader understand the significance of each figure.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      Figure 3A: We clustered the samples based on CS1 and CS2 subtypes and displayed the immune-related cell scores of each sample as a heatmap.

      Figure 4A: The blue bars in the figure represent the average C-index of this algorithm combination in the training dataset TCGA and the validation dataset ICGC, which we have supplemented in the corresponding sections of the text.

      Figure 4B: We described this figure in the results section, which primarily aims to validate whether our prognostic prediction model can predict patient outcomes in the TCGA cohort. The results showed that after performing prognostic risk scoring on patients based on the prediction model and categorizing them into high-risk and low-risk groups, the two groups exhibited significant prognostic differences, with the high-risk group showing worse outcomes compared to the low-risk group. This indicates that our prognostic prediction model can effectively distinguish the prognostic risk differences among patients in the TCGA-LIHC cohort. We also discussed these findings in the discussion section.

      Figure 7C: We used both point color and size to visualize the levels of metabolic scores, resulting in two dimensions in the legend, which actually represent the same information. Therefore, we removed the results that used point size to indicate the levels of metabolic scores.

      (6) Supplementary Materials:

      Consider including more detailed supplementary materials that provide additional validation data, extended methodological descriptions, and any other information that would support the robustness of your findings.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      In the subsequent version of the record, we will upload the important results obtained during the research to GitHub, and in this revision, we have updated some figures that may better explain the results or the robustness of the findings as supplementary materials.

      (7) Recent Literature:

      a) Incorporate more recent studies in your discussion, especially those related to HCC subtypes and the application of machine learning in oncology. This will provide a more current context for your work and help position your findings within the broader field.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We have reviewed several studies related to HCC subtype classification and the application of machine learning in this field. In the discussion section, we summarize the significance and limitations of these studies. Additionally, we discuss the characteristics of our study in comparison to previous research in this field.

      (8) Data and Code Availability:

      Ensure that all data, code, and materials used in your study are made available in line with eLife's policies. Provide clear links to repositories where readers can access the data and code used in your analyses.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We have examined the relevant data, code, and materials. We confirm that we have indicated the sources of the data and tools used in the analysis within the manuscript. Moreover, these data and tools are accessible via the websites or references we have provided.

      Reviewer #2 (Recommendations for the authors):

      (1) While the computational findings are robust, further experimental validation of the two subtypes, particularly the role of the MIF signaling pathway, would strengthen the biological relevance of the findings. In vitro or in vivo validation could confirm the proposed mechanisms and their influence on patient prognosis.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We intend to verify our findings in future studies using tumor cell line models and animal models. We aim to identify and intervene with key molecules in the MIF signaling pathway. We will investigate how the MIF signaling pathway affects tumor sensitivity to treatment in both cell line and animal models, along with the underlying mechanisms.

      (2) Consider testing the model on additional independent cohorts beyond the TCGA and ICGC datasets to further demonstrate its generalizability and applicability across different patient populations.

      We thank the reviewer’s suggestion, which have highlighted the deficiencies in this area, and we have made appropriate modifications:

      We analyzed the GSE14520 study recorded in the GEO database, which uploaded a cohort consisting of 209 HCC patients and their corresponding RNA sequencing data. We validated the prognostic model obtained in this study using this cohort, and found that the model effectively distinguishes patients into high-risk and low-risk prognostic categories. Furthermore, there is a significant prognostic difference between the high-risk and low-risk patient groups. This is consistent with the results we obtained previously.

      (3) Review the manuscript for long or complex sentences, which can be broken down into shorter, more readable parts.

      We have made revisions to the long and complex sentences in the manuscript without compromising its academic integrity and rationality, with the hope that this will help readers better understand the content of this study.

      During the revision process, in addition to addressing the reviewer comments, we conducted a thorough review of the analysis. In the course of this review, we identified a few errors in the data usage and have since corrected the relevant data and figures:

      Figure 4: Due to space constraints, we adjusted the composition of the figures after incorporating the validation results from the GSE14520 dataset.

      Figure 5A: We rechecked the regression coefficients included in the model, updated several more recent prognostic models, and calculated the C-index for 20 prognostic models in the TCGA and ICGC cohorts using a method consistent with previous studies.

      Figure 5C-D: We adjusted the clarity of the figures.

      Figure 8: We reclassified the selected malignant cells and updated the subtypes results. Subsequently, based on the repeatedly confirmed typing results, we comprehensively updated the analysis results of the subsequent cell communication network construction, ensuring that the entire analysis process remains consistent with previous findings. We also adjusted the composition of the figure and presented the images that could not be conveniently merged due to space constraints as Figure 9.

    1. Compared to the easily interpretable “picture of the past” that Playfair designed his time-series charts to convey, the sequence of increasingly novel visual formsincreasingly novel visual forms that characterizes the Georgia Negro series charts seems to ask viewers to imagine future Black progress through the lens of Black creativity: what might come to be if the future of Black America, like the expressive possibilities of data visualization, were also not curtailed by convention and code.

      [Reaction] Seeing this distinction and the way DuBois uses his visualization not just to demonstrate black excellence decades after the civil war, but also to express Black progress and potential is something extremely unique to what we have come to understand in how data visualization can be used. It also changes my understanding of the power individuals have when creating these visualizations because of how Du Bois shows that they can be used as tools to combat dominant and discriminatory narratives. Reading this challenged my perspective because charts are typically used to explain facts but in this use case, they have evolved to become a microphone in amplifying groups who are oppressed.

    1. Starting with just the minimal thing we need for our particular use case, adding things immediately as we come across new requirements. Ending up with a tool that’s molded to our needs like a leather shoe,

      though, hazard: who knows whether the code quality's such that you'd comfortably iterate on it in the future!

    1. 5.5.2. 4Chan# 4Chan [e18] was created in 2003 by copying the code from a Japanese image-sharing bulletin board called Futaba or 2chan [e19]. 4Chan has various image-sharing bulletin boards, where users post anonymously. Perhaps the most infamous board is the “/b/” board for “random” topics. This board emphasizes “free speech” and “no rules” (with exceptions for child sexual abuse material [CSAM] and some other illegal content). In these message boards, users attempt to troll each other and post the most shocking content they can come up with. They also have a history of collectively choosing a target website or community and doing a “raid” where they all try to join and troll and offend the people in that community. Many memes, groups, and forms of internet slang come from 4Chan, such as: lolcats [e20] Rickroll [e21] ragefaces [e22] “Anonymous [e23]” the hacker group Bronies [e24] (male My Little Pony fans) much of trolling culture (we will talk more about in Chapter 7: Trolling) But one 4Chan user found 4chan to be too authoritarian and restrictive and set out to create a new “free-speech-friendly” image-sharing bulletin board, which he called 8chan. 5.5.3. 8Chan (now 8Kun)# 8Chan [e25] (now called 8Kun) is an image-sharing bulletin board site that was started in 2013. It has been host to white-supremacist, neo-nazi and other hate content. 8Chan has had trouble finding companies to host its servers and internet registration due to the presence of CSAM, and for being the place where various mass shooters spread their hateful manifestos. 8Chan is also the source and home of the false conspiracy theory QAnon [e26]

      This section highlights the blogging website, 4Chan which is a site that is pretty loose on restrictions so a lot of users will post harmful content and started multiple toxic internet trends. This serves as a primary example of toxic cyber environment, which has raised a lot of issues especially in recent years because of increased online trolling and exploitation of underaged users. This really shows how social media may not always be glamorous and good.