10,000 Matching Annotations
  1. Dec 2024
    1. Author response:

      We would like to thank the editors and reviewers for taking the time to help improve our manuscript. We appreciate the feedback and will definitely increase the level of methodological detail in a revised submission.

      Here is a brief summary of our plan to address the points raised by the reviewers. We will respond to the comments in a point-by-point manner when we resubmit a revised manuscript.

      Reviewer 1

      This reviewer raised a question about the 60 Hz frame rate for recording. We agree that increasing the number of cameras and frame rate would improve the tracking quality, but this would come at the cost of scalability. In the current study (and other concurrent studies in the lab), we recorded from 10-20 families simultaneously to try to sample the distribution of behavioral responses to stimuli observed in animals in our colony. This was only possible logistically because of the lightweight equipment design allowing us to record data from animals without large disruptions to their home-cage environment.

      One strategy for acquiring higher-resolution data is to build a small number of enclosures that are fully surrounded by cameras, and to cycle animals through these enclosures (1). However, this strategy limits throughput by reducing the number of animals per day that can be studied. If the size and cost of cameras and computers decreases in the future, then this recording strategy will be scalable to the whole-colony level. For our current study and analysis, we are limited by the resolution of our dataset. We do believe that our data (although not a perfect 3d reconstruction or an extremely high frame rate) is sufficient to label behavioral states with high accuracy. We will add a figure to more clearly show that behavioral state data can be accurately inferred from this imperfect data, which has also been recently highlighted by other groups (2).

      Additionally, with recent progress in the application of deep learning to animal pose tracking, new models can infer 3d pose dynamics from 2d data (3) and leverage spatiotemporal structure to clean up noisy data (4). We believe that other groups will be able to use these types of approaches to extract much more value from this dataset. So, in summary, we do understand the concern related to reconstruction quality and will 1) more clearly define the usefulness of our current models, 2) release our data and code so that others can build upon it or repurpose it, and 3) plan future experiments with higher camera count and frame rate as permitted by logistical constraints. 

      Reviewer 2

      This reviewer asked for an increased level of methodological detail. We will try to address this in a few ways:

      (1) Code and data sharing. We believe that many of the questions related to the methodology will be best answered by sharing the data and code directly. Because there is a large amount of code associated with this manuscript, it is impractical to list every step and every parameter in the paper. Along with our revised manuscript, we will make our data and code publicly available. That said, we will improve our description of key parameters in the paper as the reviewer suggested.

      (2) More detailed Methods section. The reviewer asked us to provide more methodological detail. We understand that this is currently a weakness of our manuscript, and we will focus on addressing it. For instance, the reviewer rightly points out that we did not describe the motion watches used to generate the data in Figure S7. We will address this.

      (3) Simplify the manuscript. The paper currently has 22 figures, and further analysis could be done based on the results shown in any of them. For instance, this reviewer asked us to add a comparison across females and males (similar to our comparison of juveniles and adults). While we plan to add that analysis, we recognize that there are several figures/panels that are not closely related to our intended goal of describing the patterns we found in our large dataset. We will simplify the manuscript by removing some excess figures/panels and focus on describing the parts of the analysis that are crucial to our conclusions in greater detail.

      (4) More careful language. This reviewer pointed out that there were some inaccuracies with our descriptive language. For instance, we used the term "natural" behavior to describe the behavior of animals in captivity, which may more accurately be described as their home-cage behavior. We will be more careful to align our language to the standard for the field. For instance, several studies refer to unrestrained behavior in a laboratory setting as "spontaneous" behavior rather than "natural" behavior (5). In our case, the data consists of both spontaneously occurring behavior and responses to a set of stimuli. We will make sure that the descriptions are more precise in the revised manuscript.

      (1) Bala, P. C. et al. Automated markerless pose estimation in freely moving macaques with OpenMonkeyStudio. Nat Commun 11, (2020).

      (2) Weinreb, C. et al. Keypoint-MoSeq: parsing behavior by linking point tracking to pose dynamics. bioRxiv (2023) doi:10.1101/2023.03.16.532307.

      (3) Gosztolai, A. et al. LiftPose3D, a deep learning-based approach for transforming two-dimensional to three-dimensional poses in laboratory animals. Nat Methods 18, 975–981 (2021).

      (4) Wu, A. et al. Deep Graph Pose: a semi-supervised deep graphical model for improved animal pose tracking. Adv Neural Inf Process Syst 33, 6040–6052 (2020).

      (5) Levy, D. R. et al. Mouse spontaneous behavior reflects individual variation rather than estrous state. Curr Biol 33, 1358-1364.e4 (2023).

    1. Reviewer #1 (Public review):

      Summary:

      The authors test the hypotheses, using an effort-exertion and an effort-based decision-making task, while recording brain dynamics with EEG, that the brain processes reward outcomes for effort differentially when they earned for themselves versus others.

      Strengths:

      The strengths of this experiment include what appears to be a novel finding of opposite signed effects of effort on the processing of reward outcomes when the recipient is self versus others. Also, the experiment is well-designed, the study seems sufficiently powered, and the data and code are publicly available.

      Weaknesses:

      Inferences rely heavily on the results of mixed effects models which may or may not be properly specified and are not supported by complementary analyses. Also, not all results hang together in a sensible way. For example, participants report feeling less subjective effort, but also more disliking of tasks when they were earning rewards for others versus self. Given that participants took longer to complete tasks when earning effort for others, it is conceivable that participants might have been working less hard for others versus themselves, and this may complicate the interpretation of results.

    1. Reviewer #2 (Public review):

      Summary:

      This work investigates transcriptional responses to varying levels of transcription factors (TFs). The authors aim for gradual up- and down-regulation of three transcription factors GFI1B, NFE2, and MYB in K562 cells, by using a CRISPRa- and a CRISPRi line, together with sgRNAs of varying potency. Targeted single-cell RNA sequencing is then used to measure gene expression of a set of 90 genes, which were previously shown to be downstream of GFI1B and NFE2 regulation. This is followed by an extensive computational analysis of the scRNA-seq dataset. By grouping cells with the same perturbations, the authors can obtain groups of cells with varying average TF expression levels. The achieved perturbations are generally subtle, not reaching half or double doses for most samples, and up-regulation is generally weak below 1.5-fold in most cases. Even in this small range, many target genes exhibit a non-linear response. Since this is rather unexpected, it is crucial to rule out technical reasons for these observations.

      Strengths:

      The work showcases how a single dataset of CRISPRi/a perturbations with scRNA-seq readout and an extended computational analysis can be used to estimate transcriptome dose responses, a general approach that likely can be built upon in the future.

      Weaknesses:

      (1) The experiment was only performed in a single replicate. In the absence of an independent validation of the main findings, the robustness of the observations remains unclear.

      (2) The analysis is based on the calculation of log-fold changes between groups of single cells with non-targeting controls and those carrying a guide RNA driving a specific knockdown. How the fold changes were calculated exactly remains unclear, since it is only stated that the FindMarkers function from the Seurat package was used, which is likely not optimal for quantitative estimates. Furthermore, differential gene expression analysis of scRNA-seq data can suffer from data distortion and mis-estimations (Heumos et al. 2023 (https://doi.org/10.1038/s41576-023-00586-w), Nguyen et al. 2023 (https://doi.org/10.1038/s41467-023-37126-3)). In general, the pseudo-bulk approach used is suitable, but the correct treatment of drop-outs in the scRNA-seq analysis is essential.

      (3) Two different cell lines are used to construct dose-response curves, where a CRISPRi line allows gene down-regulation and the CRISPRa line allows gene upregulation. Although both lines are derived from the same parental line (K562) the expression analysis of Tet2, which is absent in the CRISPRi line, but expressed in the CRISPRa line (Figure S3A) suggests substantial clonal differences between the two lines. Similarly, the PCA in S4A suggests strong batch effects between the two lines. These might confound this analysis.

      (4) The study uses pseudo-bulk analysis to estimate the relationship between TF dose and target gene expression. This requires a system that allows quantitative changes in TF expression. The data provided does not convincingly show that this condition is met, which however is an essential prerequisite for the presented conclusions. Specifically, the data shown in Figure S3A shows that upon stronger knock-down, a subpopulation of cells appears, where the targeted TF is not detected anymore (drop-outs). Also Figure 3B (top) suggests that the knock-down is either subtle (similar to NTCs) or strong, but intermediate knock-down (log2-FC of 0.5-1) does not occur. Although the authors argue that this is a technical effect of the scRNA-seq protocol, it is also possible that this represents a binary behavior of the CRISPRi system. Previous work has shown that CRISPRi systems with the KRAB domain largely result in binary repression and not in gradual down-regulation as suggested in this study (Bintu et al. 2016 (https://doi.org/10.1126/science.aab2956), Noviello et al. 2023 (https://doi.org/10.1038/s41467-023-38909-4)).

      (5) One of the major conclusions of the study is that non-linear behavior is common. This is not surprising for gene up-regulation, since gene expression will reach a plateau at some point, but it is surprising to be observed for many genes upon TF down-regulation. Specifically, here the target gene responds to a small reduction of TF dose but shows the same response to a stronger knock-down. It would be essential to show that his observation does not arise from the technical concerns described in the previous point and it would require independent experimental validations.

      (6) One of the conclusions of the study is that guide tiling is superior to other methods such as sgRNA mismatches. However, the comparison is unfair, since different numbers of guides are used in the different approaches. Relatedly, the authors point out that tiling sometimes surpassed the effects of TSS-targeting sgRNAs, however, this was the least fair comparison (2 TSS vs 10 tiling guides) and additionally depends on the accurate annotation of TSS in the relevant cell line.

      (7) Did the authors achieve their aims? Do the results support the conclusions?: Some of the most important conclusions are not well supported because they rely on accurately determining the quantitative responses of trans genes, which suffers from the previously mentioned concerns.

      (8) Discussion of the likely impact of the work on the field, and the utility of the methods and data to the community:<br /> Together with other recent publications, this work emphasizes the need to study transcription factor function with quantitative perturbations. Missing documentation of the computational code repository reduces the utility of the methods and data significantly.

    1. Write your code here.

      numerics = ['int16', 'int32', 'int64'] numerical_columns= adult_census.select_dtypes(include=numerics).columns.to_list() len(numerical_columns)

    1. One of the most common ways to handle potential errors is via return codes.

      The primary virtue of this approach is that it is extremely simple. However, using return codes has a number of drawbacks which can quickly become apparent when used in non-trivial cases:

      First, return values can be cryptic -- if a function returns -1, is it trying to indicate an error, or is that actually a valid return value? It’s often hard to tell without digging into the guts of the function or consulting documentation.

      Second, functions can only return one value, so what happens when you need to return both a function result and a possible error code? Consider the following function:

      double divide(int x, int y) { return static_cast<double>(x)/y; } This function is in desperate need of some error handling, because it will crash if the user passes in 0 for parameter y. However, it also needs to return the result of x/y. How can it do both? The most common answer is that either the result or the error handling will have to be passed back as a reference parameter, which makes for ugly code that is less convenient to use. For example:

      include <iostream>

      double divide(int x, int y, bool& outSuccess) { if (y == 0) { outSuccess = false; return 0.0; }

      outSuccess = true;
      return static_cast<double>(x)/y;
      

      }

      int main() { bool success {}; // we must now pass in a bool value to see if the call was successful double result { divide(5, 3, success) };

      if (!success) // and check it before we use the result
          std::cerr << "An error occurred" << std::endl;
      else
          std::cout << "The answer is " << result << '\n';
      

      } Third, in sequences of code where many things can go wrong, error codes have to be checked constantly. Consider the following snippet of code that involves parsing a text file for values that are supposed to be there:

      std::ifstream setupIni { "setup.ini" }; // open setup.ini for reading // If the file couldn't be opened (e.g. because it was missing) return some error enum if (!setupIni) return ERROR_OPENING_FILE;

      // Now read a bunch of values from a file if (!readIntegerFromFile(setupIni, m_firstParameter)) // try to read an integer from the file return ERROR_READING_VALUE; // Return enum value indicating value couldn't be read

      if (!readDoubleFromFile(setupIni, m_secondParameter)) // try to read a double from the file return ERROR_READING_VALUE;

      if (!readFloatFromFile(setupIni, m_thirdParameter)) // try to read a float from the file return ERROR_READING_VALUE; We haven’t covered file access yet, so don’t worry if you don’t understand how the above works -- just note the fact that every call requires an error-check and return back to the caller. Now imagine if there were twenty parameters of differing types -- you’re essentially checking for an error and returning ERROR_READING_VALUE twenty times! All of this error checking and returning values makes determining what the function is trying to do much harder to discern.

      Fourth, return codes do not mix with constructors very well. What happens if you’re creating an object and something inside the constructor goes catastrophically wrong? Constructors have no return type to pass back a status indicator, and passing one back via a reference parameter is messy and must be explicitly checked. Furthermore, even if you do this, the object will still be created and then has to be dealt with or disposed of.

      Finally, when an error code is returned to the caller, the caller may not always be equipped to handle the error. If the caller doesn’t want to handle the error, it either has to ignore it (in which case it will be lost forever), or return the error up the stack to the function that called it. This can be messy and lead to many of the same issues noted above.

    2. Exception handling provides a mechanism to decouple handling of errors or other exceptional circumstances from the typical control flow of your code. This allows more freedom to handle errors when and how ever is most useful for a given situation, alleviating most (if not all) of the messiness that return codes cause.

      Using return codes is simpler but ends up linked to the control flow, constraining both how the code is laid out, and how errors can be reasonably handled.

    1. Because there are multiple steps involved, the term building is often used to refer to the full process of converting source code files into an executable that can be run. A specific executable produced as the result of building is sometimes called a build.

      For complex projects, build automation tools (such as make or build2) are often used to help automate the process of building programs and running automated tests. While such tools are powerful and flexible, because they are not part of the C++ core language, nor do you need to use them to proceed, we’ll not discuss them as part of this tutorial series.

    1. Because source code is written using ASCII characters, programming languages use a certain amount of ASCII art to represent mathematical concepts. For example, ≠ is not part of the ASCII character set, so programming languages typically use != to represent mathematical inequality instead.

      Some programming fonts, such as Fira Code, use ligatures to combine such “art” back into a single character. For example, instead of displaying !=, Fira Code will display ≠ (using the same width as the two-character version). Some people find this easier to read, others prefer sticking with a more literal interpretation of the underlying characters.

    1. Software developers use Claude for tasks ranging from debugging code to explaining Git operations and concepts.

      How can they say "sofware devs" explicitly, wasn't the data anonymized? or to what extent?

    2. This revealed a particular emphasis on coding-related tasks

      This makes a lot of sense as it would reflect those first adopters being technical people (who code for instance), and or students (who where among the first appealed with the technology). However it is interesinf that there is an explicit focus on web and mobile app dev., is this an indicator that non-technical people are trying to leverage it for entreprenurial purposes for instace? again this notion of intent seems relevant in this context of evaluating use.

    1. Tips🔔:I created a Miro board for you to share and exchange your own emotional code creations, or you can leave your feelings in the comments section below.                   https://miro.com/app/board/uXjVLBRcOTg=/?share_link_id=580693888148

      You need to remove the upload link to Miro.

      Firstly, Miro is not open source, so the Miro component is not open and can't be part of the OER. Secondly, uploading the collage serves no stated purpose (why upload, to whom?) b) It creates the following ethical issues. Firstly, to collect data in this way you first must ask for and receive Informed Consent https://github.com/mrtayto/antart/wiki/Informed-Consent

      You don't explain why you might be collecting the data, nor do you ask for consent. There's then the danger that the uploaded data might include Sensitive Data https://github.com/mrtayto/antart/wiki/Sensitive-Data

      If you create or use such digital data, you need to manage that data according to the University's Research Data Management Policy (link). This means you have to inform the learner why you are collecting the data, how it will be securely stored, how it will be anonymised, how you will use it, when and how you will destroy the data. Since you haven't done any of these things, the link has to be removed.

    1. Embedding a Blog To embed a blog, use this code: https://blogname.tumblr.com/js Replace “blogname” with your username. If you’re using a custom domain, replace “blogname.tumblr.com” with your domain. For example, say you wanted to embed the Changes blog somewhere. You’d use this code: <script type='text/javascript' src='https://blogname.tumblr.com/js'></script>

      Tumblr Blog (Account) Embeds

      Relatively angry with myself that I somehow never made it to this page in the Tumblr docs, somehow.

      To embed a blog, use this code:

      https://blogname.tumblr.com/js

      Replace “blogname” with your username. If you’re using a custom domain, replace “blogname.tumblr.com” with your domain.

      For example, say you wanted to embed the Changes blog somewhere. You’d use this code:

      <script type='text/javascript' src='https://blogname.tumblr.com/js'></script>
      <footer>Tumblr Developer<cite> https://help.tumblr.com/embed-basics/</cite></footer>
      <script note="" src="https://cdn.jsdelivr.net/gh/Blogger-Peer-Review/quotebacks@1/quoteback.js"></script> <script type='text/javascript' src='https://asphaltapostle.tumblr.com/js'></script>

      Documentation

      https://help.tumblr.com/embed-basics/

      To embed a blog, use this code:

      [https://blogname.tumblr.com/js](https://blogname.tumblr.com/js)

      Replace “blogname” with your username. If you’re using a custom domain, replace “blogname.tumblr.com” with your domain.

      For example, say you wanted to embed the Changes blog somewhere. You’d use this code:

      ```

      <script type='text/javascript' src='https://blogname.tumblr.com/js'></script>

      ```

      <script type='text/javascript' src='https://asphaltapostle.tumblr.com/js'></script> `

    1. It is math – code – computers, built by people, owned by people, used by people, controlled by people.

      I think the fact we control it brings about these concerns much more than AI becoming evil in a way similar to that described above in the article. Humans are imperfect creatures. We make mistakes and are prone to bias in our actions. AI as a result will be built on our actions, including the mistakes and biases we have. It can already be observed in some ways. The reading "The Unseen Black Faces of AI Algorithms" we did for class discusses this in a rather direct fashion. Due to the algorithms we create for facial recognition being built on mainly white faces, it struggles to pick up faces of darker skinned individuals due to the lack of data the algorithms were provided in regard to differing races.

    1. Reviewer #1 (Public review):

      Summary:

      "Neural noise", here operationalized as an imbalance between excitatory and inhibitory neural activity, has been posited as a core cause of developmental dyslexia, a prevalent learning disability that impacts reading accuracy and fluency. This is study is the first to systematically evaluate the neural noise hypothesis of dyslexia. Neural noise was measured using neurophysiological (electroencephalography [EEG]) and neurochemical (magnetic resonance spectroscopy [MRS]) in adolescents and young adults with and without dyslexia. The authors did not find evidence of elevated neural noise in the dyslexia group from EEG or MRS measures, and Bayes factors generally informed against including the grouping factor in the models. Although the comparisons between groups with and without dyslexia did not support the neural noise hypothesis, a mediation model that quantified phonological processing and reading abilities continuously revealed that EEG beta power in the left superior temporal sulcus was positively associated with reading ability via phonological awareness. This finding lends support for analysis of associations between neural excitatory/inhibitory factors and reading ability along a continuum, rather than as with a case/control approach, and indicates the relevance of phonological awareness as an intermediate trait that may provide a more proximal link between neurobiology and reading ability. Further research is needed across developmental stages and over a broader set of brain regions to more comprehensively assess the neural noise hypothesis of dyslexia, and alternative neurobiological mechanisms of this disorder should be explored.

      Strengths:

      The inclusion of multiple methods of assessing neural noise (neurophysiological and neurochemical) is a major advantage of this paper. MRS at 7T confers an advantage of more accurately distinguishing and quantifying glutamate, which is a primary target of this study. In addition, the subject-specific functional localization of the MRS acquisition is an innovative approach. MRS acquisition and processing details are noted in the supplementary materials using according to the experts' consensus recommended checklist (https://doi.org/10.1002/nbm.4484). Commenting on rigor the EEG methods is beyond my expertise as a reviewer.<br /> Participants recruited for this study included those with a clinical diagnosis of dyslexia, which strengthens confidence in the accuracy of the diagnosis. The assessment of reading and language abilities during the study further confirms the persistently poorer performance of the dyslexia group compared to the control group.<br /> The correlational analysis and mediation analysis provide complementary information to the main case-control analyses, and the examination of associations between EEG and MRS measures of neural noise is novel and interesting.<br /> The authors follow good practice for open science, including data and code sharing. They also apply statistical rigor, using Bayes Factors to support conclusions of null evidence rather than relying only on non-significant findings. In the discussion, they acknowledge the limitations and generalizability of the evidence and provide directions for future research on this topic.

      Weaknesses:

      Though the methods employed in the paper are generally strong, the MRS acquisition was not optimized to quantify GABA, so the findings (or lack thereof) should be interpreted with caution. Specifically, while 7T MRS affords the benefit of quantifying metabolites, such as GABA, without spectral editing, this quantification is best achieved with echo times (TE) of 68 or 80 ms in order to minimize the spectral overlap between glutamate and GABA and reduce contamination from the macromolecular signal (Finkelman et al., 2022, https://doi.org/10.1016/j.neuroimage.2021.118810). The data in the present study were acquired at TE=28 ms, and are therefore likely affected by overlapping Glu and GABA peaks at 2.3 ppm that are much more difficult to resolve at this short TE, which could directly affect the measures that are meant to characterize the Glu/GABA+ ratio/imbalance. In future research, MRS acquisition schemes should be optimized for the acquisition of Glutamate, GABA, and their relative balance.

      As the authors note in the discussion, additional factors such as MRS voxel location, participant age, and participant sex could influence associations between neural noise and reading abilities and should be considered in future studies.

      Appraisal:

      The authors present a thorough evaluation of the neural noise hypothesis of developmental dyslexia in a sample of adolescents and young adults using multiple methods of measuring excitatory/inhibitory imbalances as an indicator of neural noise. The authors concluded that there was not support for the neural noise hypothesis of dyslexia in their study based on null significance and Bayes factors. This conclusion is justified, and further research is called for to more broadly evaluate the neural noise hypothesis in developmental dyslexia.

      Impact:

      This study provides an exemplar foundation for the evaluation of the neural noise hypothesis of dyslexia. Other researcher may adopt the model applied in this paper to examine neural noise in various populations with/without dyslexia, or across a continuum of reading abilities, to more thoroughly examine evidence (or lack thereof) for this hypothesis. Notably, the lack of evidence here does not rule out the possibility for a role of neural noise in dyslexia, and the authors point out that presentation with co-occurring conditions, such as ADHD, may contribute to neural noise in dyslexia. Dyslexia remains a multi-faceted and heterogenous neurodevelopmental condition, and many genetic, neurobiological and environmental factors play a role. This study demonstrates one step toward evaluating neurobiological mechanisms that may contribute to reading difficulties.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public Review):

      Summary:

      "Neural noise", here operationalized as an imbalance between excitatory and inhibitory neural activity, has been posited as a core cause of developmental dyslexia, a prevalent learning disability that impacts reading accuracy and fluency. This study is the first to systematically evaluate the neural noise hypothesis of dyslexia. Neural noise was measured using neurophysiological (electroencephalography [EEG]) and neurochemical (magnetic resonance spectroscopy [MRS]) in adolescents and young adults with and without dyslexia. The authors did not find evidence of elevated neural noise in the dyslexia group from EEG or MRS measures, and Bayes factors generally informed against including the grouping factor in the models. Although the comparisons between groups with and without dyslexia did not support the neural noise hypothesis, a mediation model that quantified phonological processing and reading abilities continuously revealed that EEG beta power in the left superior temporal sulcus was positively associated with reading ability via phonological awareness. This finding lends support for analysis of associations between neural excitatory/inhibitory factors and reading ability along a continuum, rather than as with a case/control approach, and indicates the relevance of phonological awareness as an intermediate trait that may provide a more proximal link between neurobiology and reading ability. Further research is needed across developmental stages and over a broader set of brain regions to more comprehensively assess the neural noise hypothesis of dyslexia, and alternative neurobiological mechanisms of this disorder should be explored.

      Strengths:

      The inclusion of multiple methods of assessing neural noise (neurophysiological and neurochemical) is a major advantage of this paper. MRS at 7T confers an advantage of more accurately distinguishing and quantifying glutamate, which is a primary target of this study. In addition, the subject-specific functional localization of the MRS acquisition is an innovative approach. MRS acquisition and processing details are noted in the supplementary materials according to the experts' consensus-recommended checklist (https://doi.org/10.1002/nbm.4484). Commenting on the rigor, the EEG methods is beyond my expertise as a reviewer.

      Participants recruited for this study included those with a clinical diagnosis of dyslexia, which strengthens confidence in the accuracy of the diagnosis. The assessment of reading and language abilities during the study further confirms the persistently poorer performance of the dyslexia group compared to the control group.

      The correlational analysis and mediation analysis provide complementary information to the main case-control analyses, and the examination of associations between EEG and MRS measures of neural noise is novel and interesting.

      The authors follow good practice for open science, including data and code sharing. They also apply statistical rigor, using Bayes Factors to support conclusions of null evidence rather than relying only on non-significant findings. In the discussion, they acknowledge the limitations and generalizability of the evidence and provide directions for future research on this topic.

      Weaknesses:

      Though the methods employed in the paper are generally strong, there are certain aspects that are not clearly described in the Materials & Methods section, such as a description of the statistical analyses used for hypothesis testing.

      Thank you for pointing this out. A description of the statistical models used in the analyses of EEG biomarkers has been added to the Materials and Methods:

      “First, exponent and offset values were averaged across all electrodes and analyzed using a 2x2 repeated measures ANOVA with group (dyslexic, control) as a between-subjects factor and condition (resting state, language task) as a within-subjects factor. Age was included in the analyses as a covariate due to the correlation between variables. Next, exponent and offset values were averaged across electrodes corresponding to the left (F7, FT7, FC5) and right inferior frontal gyrus (F8, FT8, FC6), and to the left (T7, TP7, TP9) and right superior temporal sulcus (T8, TP8, TP10). The electrodes were selected based on the analyses outlined by Giacometti and colleagues (2014) and Scrivener and Reader (2022). For these analyses, a 2x2x2x2 repeated measures ANOVA with age as a covariate was conducted with group (dyslexic, control) as a between-subjects factor and condition (resting state, language task), hemisphere (left, right), and region (frontal, temporal) as within-subjects factors. Results for the alpha and beta bands were calculated for the same clusters of frontal and temporal electrodes and analyzed with a similar 2x2x2x2 repeated measures ANOVA; however, for these analyses, age was not included as a covariate due to a lack of significant correlations.”

      We also expanded the description of the statistical models used in the analyses of MRS biomarkers:

      “To analyze the metabolite results, separate univariate ANCOVAs were conducted for Glu, GABA+, Glu/GABA+ ratio and Glu/GABA+ imbalance measures with group (control, dyslexic) as a between-subjects factor and voxel gray matter volume (GMV) as a covariate. Additionally, for the Glu analysis, age was included as a covariate due to a correlation between variables. Both frequentist and Bayesian statistics were calculated. Glu/GABA+ imbalance measure was calculated as the square root of the absolute residual value of a linear relationship between Glu and GABA+ (McKeon et al., 2024).”

      With regard to metabolite quantification, it is unclear why the authors chose to analyze and report metabolite values in terms of creatine ratios rather than quantification based on a water reference given that the MRS acquisition appears to support using a water reference.

      We have decided to use the ratio of Glu and GABA to total creatine (tCr), as this is still a common practice in MRS studies at 7T (e.g., Nandi et al., 2022; Smith et al., 2021). This approach normalizes the signal, reducing the impact of intensity variations across different regions and tissue compositions. Additionally, total creatine concentration is considered relatively stable across different brain regions, which is particularly important in our study, where a functional localizer was used to establish the left STS region individually. Our decision was further influenced by previous studies on dyslexia (Del Tufo et al., 2018; Pugh et al., 2014) which have reported creatine ratios and included GM volume as a covariate in their models, thus providing comparability. It is now indicated in the Results:

      “For comparability with previous studies in dyslexia (Del Tufo et al., 2018; Pugh et al., 2014) we report Glu and GABA as a ratio to total creatine (tCr).”

      and in the Method sections:

      “Glu and GABA+ concentrations were expressed as a ratio to total-creatine (tCr; Creatine + Phosphocreatine) following previous MRS studies in dyslexia (Del Tufo et al., 2018; Pugh et al., 2014).

      We did not estimate absolute concentrations using water signals as a reference, as this would require accounting for water relaxation times, which may vary across our age range. Nevertheless, our dataset has been made publicly available for future researchers to calculate and compare absolute values.

      Del Tufo, S. N., Frost, S. J., Hoeft, F., Cutting, L. E., Molfese, P. J., Mason, G. F., Rothman, D. L., Fulbright, R. K., & Pugh, K. R. (2018). Neurochemistry Predicts Convergence of Written and Spoken Language: A Proton Magnetic Resonance Spectroscopy Study of Cross-Modal Language Integration. Frontiers in Psychology, 9, 1507. https://doi.org/10.3389/fpsyg.2018.01507

      Nandi, T., Puonti, O., Clarke, W. T., Nettekoven, C., Barron, H. C., Kolasinski, J., Hanayik, T., Hinson, E. L., Berrington, A., Bachtiar, V., Johnstone, A., Winkler, A. M., Thielscher, A., Johansen-Berg, H., & Stagg, C. J. (2022). tDCS induced GABA change is associated with the simulated electric field in M1, an effect mediated by grey matter volume in the MRS voxel. Brain Stimulation, 15(5), 1153–1162. https://doi.org/10.1016/j.brs.2022.07.049

      Pugh, K. R., Frost, S. J., Rothman, D. L., Hoeft, F., Del Tufo, S. N., Mason, G. F., Molfese, P. J., Mencl, W. E., Grigorenko, E. L., Landi, N., Preston, J. L., Jacobsen, L., Seidenberg, M. S., & Fulbright, R. K. (2014). Glutamate and choline levels predict individual differences in reading ability in emergent readers. Journal of Neuroscience, 34(11), 4082–4089. https://doi.org/10.1523/JNEUROSCI.3907-13.2014

      Smith, G. S., Oeltzschner, G., Gould, N. F., Leoutsakos, J. S., Nassery, N., Joo, J. H., Kraut, M. A., Edden, R. A. E., Barker, P. B., Wijtenburg, S. A., Rowland, L. M., & Workman, C. I. (2021). Neurotransmitters and Neurometabolites in Late-Life Depression: A Preliminary Magnetic Resonance Spectroscopy Study at 7T. Journal of Affective Disorders, 279, 417–425. https://doi.org/10.1016/j.jad.2020.10.011

      GABA is typically quantified using J-editing sequences as lower field strengths (~3T), and there is some evidence that the GABA signal can be reliably measured at 7T without editing, however, the authors should discuss potential limitations, such as reliability of Glu and GABA measurements with short-TE semi-laser at 7T.

      In addition, MRS measurements of GABA are known to be influenced by macromolecules, and GABA is often denoted as GABA+ to indicate that other compounds contribute to the measured signal, especially at a short TE and in the absence of symmetric spectral editing.

      A general discussion of the strengths and limitations of unedited Glu and GABA quantification at 7T is warranted given the interest of this work to researchers who may not be experts in MRS.

      While we agree with the Reviewer that at 3T, it is recommended to use J-edited MRS to measure GABA (Mullins et al., 2014), the better spectral resolution at 7T allows for more reliable results for both metabolites using moderate echo-time, non-edited MRS (Finkelman et al., 2022). In this study, we used a short echo time (TE), which is optimal for Glu but not ideal for GABA, as it interferes with other signals. We are grateful to the Reviewer for suggesting the addition of a short paragraph to the Discussion, describing the practicalities of 3T and 7T MRS and changing the abbreviation to GABA+ to inform readers of possible macromolecule contamination:

      “We chose ultra-high-field MRS to improve data quality (Özütemiz et al., 2023), as the increased sensitivity and spectral resolution at 7T allows for better separation of overlapping metabolites compared to lower field strengths. Additionally, 7T provides a higher signal-to-noise ratio (SNR), improving the reliability of metabolite measurements and enabling the detection of small changes in Glu and GABA concentrations. Despite these theoretical advantages, several practical obstacles should be considered, such as susceptibility artifacts and inhomogeneities at higher field strengths that can impact data quality. Interestingly, actual methodological comparisons (Pradhan et al., 2015; Terpstra et al., 2016) show only a slight practical advantage of 7T single-voxel MRS compared to optimized 3T acquisition. For example, fitting quality yielded reduced estimates of variance in concentration of Glu in 7T (CRLB) and slightly improved reproducibility levels for Glu and GABA (at both fields below 5%). Choosing the appropriate MRS sequence involves a trade-off between the accuracy of Glu and GABA measurements, as different sequences are recommended for each metabolite. J-edited MRS is recommended for measuring GABA, particularly with 3T scanners (Mullins et al., 2014). However, at 7T, more reliable results can be obtained using moderate echo-time, non-edited MRS (Finkelman et al., 2022). We have opted for a short-echo-time sequence, which is optimal for measuring Glu. However, this approach results in macromolecule contamination of the GABA signal (referred to as GABA+).”

      Finkelman, T., Furman-Haran, E., Paz, R., & Tal, A. (2022). Quantifying the excitatory-inhibitory balance: A comparison of SemiLASER and MEGA-SemiLASER for simultaneously measuring GABA and glutamate at 7T. NeuroImage, 247, 118810. https://doi.org/10.1016/j.neuroimage.2021.118810

      Mullins, P. G., McGonigle, D. J., O'Gorman, R. L., Puts, N. A., Vidyasagar, R., Evans, C. J., Cardiff Symposium on MRS of GABA, & Edden, R. A. (2014). Current practice in the use of MEGA-PRESS spectroscopy for the detection of GABA. NeuroImage, 86, 43–52. https://doi.org/10.1016/j.neuroimage.2012.12.004

      Özütemiz, C., White, M., Elvendahl, W., Eryaman, Y., Marjańska, M., Metzger, G. J., Patriat, R., Kulesa, J., Harel, N., Watanabe, Y., Grant, A., Genovese, G., & Cayci, Z. (2023). Use of a Commercial 7-T MRI Scanner for Clinical Brain Imaging: Indications, Protocols, Challenges, and Solutions-A Single-Center Experience. AJR. American Journal of Roentgenology, 221(6), 788–804. https://doi.org/10.2214/AJR.23.29342

      Pradhan, S., Bonekamp, S., Gillen, J. S., Rowland, L. M., Wijtenburg, S. A., Edden, R. A., & Barker, P. B. (2015). Comparison of single voxel brain MRS AT 3T and 7T using 32-channel head coils. Magnetic Resonance Imaging, 33(8), 1013–1018. https://doi.org/10.1016/j.mri.2015.06.003

      Terpstra, M., Cheong, I., Lyu, T., Deelchand, D. K., Emir, U. E., Bednařík, P., Eberly, L. E., & Öz, G. (2016). Test-retest reproducibility of neurochemical profiles with short-echo, single-voxel MR spectroscopy at 3T and 7T. Magnetic Resonance in Medicine, 76(4), 1083–1091. https://doi.org/10.1002/mrm.26022

      Further, the single MRS voxel location is a limitation of the study as neurochemistry can vary regionally within individuals, and the putative excitatory/inhibitory imbalance in dyslexia may appear in regions outside the left temporal cortex (e.g., network-wide or in frontal regions involved in top-down executive processes). While the functional localization of the MRS voxel is a novelty and a potential advantage, it is unclear whether voxel placement based on left-lateralized reading-related neural activity may bias the experiment to be more sensitive to small, activity-related fluctuations in neurotransmitters in the CON group vs. the DYS group who may have developed an altered, compensatory reading strategy.

      We agree that including only one region of interest for the MRS measurements is a potential limitation of our study, and we have now added this information to the Discussion:

      “Moreover, since the MRS data was collected only from the left STS, it is plausible that other areas might be associated with differences in Glu or GABA concentrations in dyslexia.”

      However, differences in Glu and GABA concentrations in this region were directly predicted by the neural noise hypothesis of dyslexia. We acknowledge that this information was missing in the previous version of the manuscript. It is now included in the Results:

      “Moreover, the neural noise hypothesis of dyslexia identifies perisylvian areas as being affected by increased glutamatergic signaling, and directly predicts associations between Glu and GABA levels in the superior temporal regions and phonological skills (Hancock et al., 2017).”

      as well as in the Discussion:

      “Nevertheless, the neural noise hypothesis predicted increased glutamatergic signaling in perisylvian regions, specifically in the left superior temporal cortex (Hancock et al., 2017).”

      Figure 1 contains a lot of information, and it may be helpful to split it into 2 figures (EEG vs. MRS) so that the plots could be made larger and the reader could more easily digest the information.

      (a) I would also recommend displaying separate metabolite fit plots for each group, since the current presentation in panel F makes it appear that the MRS data is examined by testing differences between groups across the full spectrum (where the lines diverge), which really isn't the case.

      (b) The GABA peak is not visible in the spectrum, and Glutamate and GABA both have multiple peaks that should be shown on the spectrum. This may be best achieved by displaying the individual metabolite sub-spectra below the full spectrum

      Thank you for these suggestions. We have split the information into two Figures following the Reviewer’s recommendations.

      It is not clear why the 3T structural images were used for segmentation and calculation of tissue fraction if 7T structural images were also acquired (which would presumably have higher resolution).

      Generally, T1-weighted images from the 7T scanner exhibit more artifacts than those from the 3T scanner due to higher magnetic field inhomogeneity. These artifacts are especially pronounced in regions near air-tissue interfaces, such as the temporal lobes. Therefore, we chose the 3T structural images for segmentation and tissue fraction calculations and clarified this in the Method section:

      “Voxel segmentation was performed on structural images from a 3T scanner, coregistered to 7T structural images in SPM12, as the latter exhibited excessive artifacts and intensity bias in the temporal regions”.

      The basis set includes a large number of metabolites (27), including many low-concentration metabolites/compounds (e.g., bHG, bHB, Citrate, Threonine, ethanol) that are typically only included in studies targeting specific metabolites in disease/pathology. Please justify the inclusion of this maximal set of metabolites in the basis set, given that the inclusion of overlapping low-concentration metabolites may influence metabolite measurements of interest (https://doi.org/10.1002/mrm.10246).

      There is still no consensus in the MR community on which metabolites should be included in the model of human cerebral 1H-MR spectra. Typically, only major contributors such as NAA, Cr, Cho, Lac, mI, and possibly Glx are evaluated. Some studies also include additional metabolites like Ace, Ala, Asp, GABA, Glc, Gly, sI, NAAG, and Tau. In this study, as in a few others, further metabolites such as PCh, GPC, PCr, GSH, PE, and Thr were introduced and this approach seems suitable for high-field spectra (Hofmann et al., 2002).

      Hofmann, L., Slotboom, J., Jung, B., Maloca, P., Boesch, C., & Kreis, R. (2002). Quantitative 1H-magnetic resonance spectroscopy of human brain: Influence of composition and parameterization of the basis set in linear combination model-fitting. Magnetic Resonance in Medicine, 48(3), 440–453. https://doi.org/10.1002/mrm.10246

      Please provide a figure indicating the localization of the MRS voxel for a sample subject.

      A figure indicating the localization of the MRS voxel for a sample subject was added to the MRS checklist.

      It would be helpful to include Table S1 in the main article.

      Table S1 from the Supplementary Material has now been added to the main manuscript as Table 1 in the Results section.

      Please report descriptive statistics for EEG and MRS measures in Table S1.

      We have added a new Table S1 in the Supplementary Material, providing descriptive statistics for EEG and MRS E/I balance measures, presented separately for the dyslexic and control groups.

      I recommend avoiding using the terms "direct" and "indirect" to contrast MRS and EEG measures of E/I balance. Both of these measures are imperfect and it is misleading to say that MRS is a "direct" measure of neurotransmitters. There is also ambiguity in what is meant by "direct": in contrast to EEG, MRS does not measure neural activity and does not provide high-resolution temporal information, so in a sense, it is less direct.

      Thank you for this suggestion. We have replaced the terms 'direct' and 'indirect' biomarkers with 'MRS' and 'EEG' biomarkers throughout the text.

      There are many cases throughout the results in which Bayes and frequentist stats seem to contradict each other in terms of significance and what should be included in the models, especially with regard to the interaction effects (the Bayes factors appear to favor non-significant interactions). I think this is worth considering and describing to offer more clarity for the readers.

      We agree that a discussion of the divergent results between Bayesian and frequentist models was missing in the previous version of the manuscript. To provide greater clarity for the readers, we have conducted follow-up Bayesian t-tests in every case where the results indicated the inclusion of non-significant interactions with the effect of group in the model. These additional analyses have been performed for the exponent, offset, as well as for beta bandwidth in the Supplementary Material. We have also added a paragraph addressing these discrepancies in the Discussion:

      “Remarkably, in some models, results from Bayesian and frequentist statistics yielded divergent conclusions regarding the inclusion of non-significant effects. This was observed in more complex ANOVA models, whereas no such discrepancies appeared in t-tests or correlations. Given reports of high variability in Bayesian ANOVA estimates across repeated runs of the same analysis (Pfister, 2021), these results should be interpreted with caution. Therefore, following the recommendation to simplify complex models into Bayesian t-tests for more reliable estimates (Pfister, 2021), we conducted follow-up Bayesian t-tests in every case that favored the inclusion of non-significant interactions with the group factor. These analyses provided further evidence for the lack of differences between the dyslexic and control groups. Another source of discrepancy between the two methods may stem from the inclusion of interactions between covariates and within-subject effects in frequentist ANOVA, which were not included in Bayesian ANOVA to adhere to the recommendation for simpler Bayesian models (Pfister, 2021).”

      Pfister, R. (2021). Variability of Bayes factor estimates in Bayesian analysis of variance. The Quantitative Methods for Psychology, 17(1), 40-45. doi:10.20982/tqmp.17.1.p040

      It would be helpful to indicate whether participants in the DYS group had a history of reading intervention/remediation. In addition to showing that the DYS group performed lower than the CON group on reading assessments as a whole and given their age, was the performance on the reading assessments at an individual level considered for inclusion in the study? (i.e., were participants' persistent poor reading abilities confirmed with the research assessments?)

      We were unable to assess individual reading skills due to the lack of standardized diagnostic norms for adult dyslexia in Poland. Therefore, participants in the dyslexic group were recruited based on a previous clinical diagnosis of dyslexia, and reading and reading-related tasks were used for group-level comparisons only. This information has been added to the Methods section:

      “Since there are no standardized diagnostic norms for dyslexia in adults in Poland, individuals were assigned to the dyslexic group based on a past diagnosis of dyslexia.”

      Unfortunately, we did not collect information about participants' history of reading intervention or remediation. In this context, we acknowledge that including a sample of adult participants is a potential limitation of our study, however, this was already mentioned in the Discussion.

      Regarding the fMRI task, please indicate whether the participants whose threshold and/or contrast was changed for localization were from the DYS or CON group.

      This information is now added to the Method section:

      “For 6 participants (DYS n = 2, CON n = 4), the threshold was lowered to p < .05 uncorrected, while for another 6 participants (DYS n = 3, CON n = 3) the contrast from the auditory run was changed to auditory words versus fixation cross due to a lack of activation for other contrasts.”

      Reviewer #2 (Public Review):

      Summary:

      This study utilized two complementary techniques (EEG and 7T MRI/MRS) to directly test a theory of dyslexia: the neural noise hypothesis. The authors report finding no evidence to support an excitatory/inhibitory balance, as quantified by beta in EEG and Glutamate/GABA ratio in MRS. This is important work and speaks to one potential mechanism by which increased neural noise may occur in dyslexia.

      Strengths:

      This is a well-conceived study with in-depth analyses and publicly available data for independent review. The authors provide transparency with their statistics and display the raw data points along with the averages in figures for review and interpretation. The data suggest that an E/I balance issue may not underlie deficits in dyslexia and is a meaningful and needed test of a possible mechanism for increased neural noise.

      Weaknesses:

      The researchers did not include a visual print task in the EEG task, which limits analysis of reading-specific regions such as the visual word form area, which is a commonly hypoactivated region in dyslexia. This region is a common one of interest in dyslexia, yet the researchers measured the I/E balance in only one region of interest, specific to the language network.

      We agree with the Reviewer that including different tasks for the EEG biomarkers assessment would be valuable. However, this limitation was already addressed in the Discussion:

      “Importantly, our study focused on adolescents and young adults, and the EEG recordings were conducted during rest and a spoken language task. These factors may limit the generalizability of our results. Future research should include younger populations and incorporate a broader array of tasks, such as reading and phonological processing, to provide a more comprehensive evaluation of the E/I balance hypothesis.”

      Further, this work does not consider prior studies reporting neural inconsistency; a potential consequence of increased neural noise, which has been reported in several studies and linked with candidate-dyslexia gene variants (e.g., Centanni et al., 2018, 2022; Hornickel & Kraus, 2013; Neef et al., 2017). While E/I imbalance may not be a cause of increased neural noise, other potential mechanisms remain and should be discussed.

      Thank you for referring us to other works reporting neural variability in dyslexia. We agree that a broader context regarding sources of reduced neural synchronization, beyond E/I imbalance, was missing in the previous version of the manuscript. We have now included these references in the Discussion:

      “Furthermore, although our results do not support the idea of E/I balance alterations as a source of neural noise in dyslexia, they do not preclude other mechanisms leading to less synchronous neural firing posited by the hypothesis. In this context, there is evidence showing increased trial-to-trial inconsistency of neural responses in individuals with dyslexia (Centanni et al., 2022) or poor readers (Hornickel and Kraus, 2013) and its associations with specific dyslexia risk genes (Centanni et al., 2018; Neef et al., 2017). At the same time, the observed trial-to-trial inconsistency was either present only in a subset of participants (Centanni et al., 2018), limited to some experimental conditions (Centanni et al., 2022), or specific brain regions – e.g., brainstem in Hornickel and Kraus (2013), left auditory cortex in Centanni et al. (2018), or left supramarginal gyrus in Centanni et al. (2022).”

      A better description of the exponent and offset components is needed at the beginning of the results, given that the methods are presented in detail at the end. I also do not see a clear description of these components in the methods.

      A description of the aperiodic components is now included in the Results:

      “In the initial step of the analysis, we analyzed the aperiodic (exponent and offset) components of the EEG spectrum. The exponent reflects the steepness of the EEG power spectrum, with a higher exponent indicating a steeper signal; while the offset represents a uniform shift in power across frequencies, with a higher offset indicating greater power across the entire EEG spectrum (Donoghue et al., 2020).”

      as well as in the Materials and Methods:

      “Two broadband aperiodic parameters were extracted: the exponent, which quantifies the steepness of the EEG power spectrum, and the offset, which indicates signal’s power across the entire frequency spectrum.”

      Reviewer #3 (Public Review):

      Summary:

      This study by Glica and colleagues utilized EEG (i.e., Beta power, Gamma power, and aperiodic activity) and 7T MRS (i.e., MRS IE ratio, IE balance) to reevaluate the neural noise hypothesis in Dyslexia. Supported by Bayesian statistics, their results show solid 'no evidence' of EI balance differences between groups, challenging the neural noise hypothesis. The work will be of broad interest to neuroscientists, and educational and clinical psychologists.

      Strengths:

      Combining EEG and 7T MRS, this study utilized both the indirect (i.e., Beta power, Gamma power, and aperiodic activity) and direct (i.e., MRS IE ratio, IE balance) measures to reevaluate the neural noise hypothesis in Dyslexia.

      Weaknesses:

      The authors may need to provide more data to assess the quality of the MRS data.

      We have addressed the following specific recommendations of the Reviewer providing more data about the quality of the MRS data.

      The authors may need to explain how the number of subjects is determined in the MRS section.

      We have clarified the MRS sample description in the Results section:

      “Due to financial and logistical constraints, 59 out of the 120 recruited subjects, selected progressively as the study unfolded, were examined with MRS. Subjects were matched by age and sex between the dyslexic and control groups. Due to technical issues and to prevent delays and discomfort for the participants, we collected 54 complete sessions. Additionally, four datasets were excluded based on our quality control criteria, and three GABA+ estimates exceeded the selected CRLB threshold. Ultimately, we report 50 estimates for Glu (21 participants with dyslexia) and 47 for GABA+ and Glu/GABA+ ratios (20 participants with dyslexia).”

      Is there a reason why theta and gamma peaks were not observed in the majority of participants? What are the possible reasons that likely caused the discrepancy between this study and previously reported relevant studies?

      We have now added a discussion about the absence of oscillatory peaks in the theta and gamma bands to the Discussion section:

      “We could not perform analyses for the gamma oscillations since in the majority of participants the gamma peak was not detected above the aperiodic component. Due to the 1/f properties of the EEG spectrum, both aperiodic and periodic components should be disentangled to analyze ‘true’ gamma oscillations; however, this approach is not typically recognized in electrophysiology research (Hudson and Jones, 2022). Indeed, previous studies that analyzed gamma activity in dyslexia (Babiloni et al., 2012; Lasnick et al., 2023; Rufener and Zaehle, 2021) did not separate the background aperiodic activity. For the same reason, we could not analyze results for the theta band, which often does not meet the criteria for an oscillatory component manifested as a peak in the power spectrum (Klimesch, 1999). Moreover, results from a study investigating developmental changes in both periodic and aperiodic components suggest that theta oscillations in older participants are mostly observed in frontal midline electrodes (Cellier et al., 2021), which were not analyzed in the current study.”

      Hudson, M. R., & Jones, N. C. (2022). Deciphering the code: Identifying true gamma neural oscillations. Experimental Neurology357, 114205. https://doi.org/10.1016/j.expneurol.2022.114205

      Klimesch, W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: A review and analysis. Brain Research Reviews29(2-3), 169-195. https://doi.org/10.1016/S0165-0173(98)00056-3

      Based on Figure 1F, the quality of the MRS data may be contaminated by the lipid signal, especially for the DYS group. To better evaluate the MRS data, especially the GABA measurements, the authors need to show:

      (a) the placement of the MRS voxel on the anatomical images;

      Averaged MRS voxel placement was already presented in Figure 1 (now Figure 2) in the manuscript. Now, we have also added exemplary single-subject images to the MRS checklist in the Supplement.

      (b) Glu and GABA model functions

      We have now provided more meaningful Glu and GABA indications in Figure 2.

      (c) CRLB for GABA

      We have added respective estimates to the Supplement:

      %CRLB of Glu: mean 2.96, SD = 0.79

      %CRLB of GABA: mean 10.59, SD = 2.76

      %CRLB of NAA: 1.76 SD = 0.46

      Further, the authors added voxel's gray matter volume as a covariate when performing separate ANCOVAs. The authors may need to use alpha correction or 1-fCSF correction to corroborate these results.

      We chose to use the ratio of Glu and GABA to total creatine (tCr), as this remains a common practice in MRS studies at 7T (e.g., Nandi et al., 2022; Smith et al., 2021). This decision was also influenced by previous dyslexia studies (Del Tufo et al., 2018; Pugh et al., 2014) and is now clarified in the Results and Methods sections.

      Regarding alpha correction, a recent paper (García-Pérez et al., 2023) recommends: 'In general, avoid corrections for multiple testing if statistical claims are to be made for each individual test, in the absence of an omnibus null hypothesis.' Since we report null findings, further alpha correction would not significantly impact the results.

      García-Pérez, M. A. (2023). Use and misuse of corrections for multiple testing. Methods in Psychology8, 100120. https://doi.org/10.1016/j.metip.2023.100120

    1. Reviewer #2 (Public review):

      Summary:

      It is generally believed that higher-order areas in the prefrontal cortex guide selection during working memory and attention through signals that selectively recruit neuronal populations in sensory areas that encode the relevant feature. In this work, Parto-Dezfouli and colleagues tested how these prefrontal signals influence activity in visual area V4 using a spatial working memory task. They recorded neuronal activity from visual area V4 and found that information about visual features at the behaviorally relevant part of space during the memory period is carried in a spatially selective manner in the timing of spikes relative to a beta oscillation (phase coding) rather than in the average firing rate (rate code). The authors further tested whether there is a causal link between prefrontal input and the phase encoding of visual information during the memory period. They found that indeed inactivation of the frontal eye fields, a prefrontal area known to send spatial signals to V4, decreased beta oscillatory activity in V4 and information about the visual features. The authors went one step further to develop a neural model that replicated the experimental findings and suggested that changes in the average firing rate of individual neurons might be a result of small changes in the exact beta oscillation frequency within V4. These data provide important new insights into the possible mechanisms through which top-down signals can influence activity in hierarchically lower sensory areas and can therefore have a significant impact on the Systems, Cognitive, and Computational Neuroscience fields.

      Strengths:

      This is a well-written paper with a well-thought-out experimental design. The authors used a smart variation of the memory-guided saccade task to assess how information about the visual features of stimuli is encoded during the memory period. By using a grating of various contrasts and orientations as the background the authors ensured that bottom-up visual input would drive responses in visual area V4 in the delay period, something that is not commonly done in experimental settings in the same task. Moreover, one of the major strengths of the study is the use of different approaches including analysis of electrophysiological data using advanced computational methods of analysis, manipulation of activity through inactivation of the prefrontal cortex to establish causality of top-down signals on local activity signatures (beta oscillations, spike locking and information carried) as well as computational neuronal modeling. This has helped extend an observation into a possible mechanism well supported by the results.

      Weaknesses:

      Although the authors provide support for their conclusions from different approaches, I found that the selection of some of the analyses and statistical assessments made it harder for the reader to follow the comparison between a rate code and a phase code. Specifically, the authors wish to assess whether stimulus information is carried selectively for the relevant position through a firing rate or a phase code. Results for the rate code are shown in Figures 1B-G and for the phase code are shown in Figure 2. Whereas an F-statistic is shown over time in Figure 1F (and Figure S1) no such analysis is shown for LFP power. Similarly, following FEF inactivation there is no data on how that influences V4 firing rates and information carried by firing rates in the two conditions (for positions inside and outside the V4 RF). In the same vein, no data are shown on how the inactivation affects beta phase coding in the OUT condition.

      Moreover, some of the statistical assessments could be carried out differently including all conditions to provide more insight into mechanisms. For example, a two-way ANOVA followed by post hoc tests could be employed to include comparisons across both spatial (IN, OUT) and visual feature conditions (see results in Figures 2D, S4, etc.). Figure 2D suggests that the absence of selectivity in the OUT condition (no significant difference between high and low contrast stimuli) is mainly due to an increase in slope in the OUT condition for the low contrast stimulus compared to that for the same stimulus in the IN condition. If this turns out to be true it would provide important information that the authors should address.

      There are also a few conceptual gaps that leave the reader wondering whether the results and conclusion are general enough. Specifically,

      (1) the authors used microstimulation in the FEF to determine RFs. It is thus possible that the FEF sites that were inactivated were largely more motor-related. Given that beta oscillations and motor preparatory activity have been found to be correlated and motor sites show increased beta oscillatory activity in the delay period, it is possible that the effect of FEF inactivation on V4 beta oscillations is due to inactivation of the main source of beta activity. Had the authors inactivated sites with a preponderance of visual neurons in the FEF would the results be different?

      (2) Somewhat related to this point and given the prominence of low-frequency activity in deeper layers of the visual cortex according to some previous studies, it is not clear where the authors' V4 recordings were located. The authors report that they do have data from linear arrays, so it should be possible to address this.

      (3) The authors suggest that a change in the exact frequency of oscillation underlies the increase in firing rate for different stimulus features. However, the shift in frequency is prominent for contrast but not for orientation, something that raises questions about the general applicability of this observation for different visual features.

      (4) One of the major points of the study is the primacy of the phase code over the rate code during the delay period. Specifically, here it is shown that information about the visual features of a stimulus carried by the rate code is similar for relevant and irrelevant locations during the delay period. This contrasts with what several studies have shown for attention in which case information carried in firing rates about stimuli in the attended location is enhanced relative to that for stimuli in the unattended location. If we are to understand how top-down signals work in cognitive functions it is inevitable to compare working memory with attention. The possible source of this difference is not clear and is not discussed. The reader is left wondering whether perhaps a different measure or analysis (e.g. a percent explained variance analysis) might reveal differences during the delay period for different visual features across the two spatial conditions.

      The use of the memory-guided saccade task has certain disadvantages in the context of this study. Although delay activity is interpreted as memory activity by the authors, it is in principle possible that it reflects preparation for the upcoming saccade, spatial attention (particularly since there is a stimulus in the RF), etc. This could potentially change the conclusion and perspective.

      For the position outside the V4 RF, there is a decrease in both beta oscillations and the clustering of spikes at a specific phase. It is therefore possible that the decrease in information about the stimuli features is a byproduct of the decrease in beta power and phase locking. Decreased oscillatory activity and phase locking can result in less reliable estimates of phase, which could decrease the mutual information estimates.

      The authors propose that coherent oscillations could be the mechanism through which the prefrontal cortex influences beta activity in V4. I assume they mean coherent oscillations between the prefrontal cortex and V4. Given that they do have simultaneous recordings from the two areas they could test this hypothesis on their own data, however, they do not provide any results on that.

      The authors make a strong point about the relevance of changes in the oscillation frequency and how this may result in an increase in firing rate although it could also be the reverse - an increase in firing rate leading to an increase in the frequency peak. It is not clear at all how these changes in frequency could come about. A more nuanced discussion based on both experimental and modeling data is necessary to appreciate the source and role (if any) of this observation.

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      This study investigates what happens to the stimulus-driven responses of V4 neurons when an item is held in working memory. Monkeys are trained to perform memory-guided saccades: they must remember the location of a visual cue and then, after a delay, make an eye movement to the remembered location. In addition, a background stimulus (a grating) is presented that varies in contrast and orientation across trials. This stimulus serves to probe the V4 responses, is present throughout the trial, and is task-irrelevant. Using this design, the authors report memory-driven changes in the LFP power spectrum, changes in synchronization between the V4 spikes and the ongoing LFP, and no significant changes in firing rate.

      Strengths:

      (1) The logic of the experiment is nicely laid out.

      (2) The presentation is clear and concise.

      (3) The analyses are thorough, careful, and yield unambiguous results.

      (4) Together, the recording and inactivation data demonstrate quite convincingly that the signal stored in FEF is communicated to V4 and that, under the current experimental conditions, the impact from FEF manifests as variations in the timing of the stimulus-evoked V4 spikes and not in the intensity of the evoked activity (i.e., firing rate).

      Weaknesses:

      I think there are two limitations of the study that are important for evaluating the potential functional implications of the data. If these were acknowledged and discussed, it would be easier to situate these results in the broader context of the topic, and their importance would be conveyed more fairly and transparently.

      (1) While it may be true that no firing rate modulations were observed in this case, this may have been because the probe stimuli in the task were behaviorally irrelevant; if anything, they might have served as distracters to the monkey's actual task (the MGS). From this perspective, the lack of rate modulation could simply mean that the monkeys were successful in attending the relevant cue and shielding their performance from the potentially distracting effect of the background gratings. Had the visual probes been in some way behaviorally relevant and/or spatially localized (instead of full field), the data might have looked very different.

      Any task design involves tradeoffs; if the visual stimulus was behaviorally relevant, then any observed neurophysiological changes would be more confounded by possible attentional effects. We cannot exclude the possibility that a different task or different stimuli would produce different results; we ourselves have reported firing rate enhancements for other types of visual probes during an MGS task (Merrikhi et al. 2017). We have added an acknowledgement of these limitations in the discussion section (lines 311-319). At minimum, our results show a dissociation between the top-down modulation of phase coding, which is enhanced during WM even for these task-irrelevant stimuli, and rate coding. Establishing whether and how this phase coding is related to perception and behavior will be an important direction for future work.

      With this in mind, it would be prudent to dial down the tone of the conclusions, which stretch well beyond the current experimental conditions (see recommendations).

      We have edited the title (removing the word ‘primarily’) and key sentences throughout to tone down the conclusions, generally to state that the importance of a phase code in WM modulations is *possible* given the observed results, rather than certain (see abstract line 27, introduction lines 58-60, results line 215, conclusion lines 294-295).

      (2) Another point worth discussing is that although the FEF delay-period activity corresponds to a remembered location, it can also be interpreted as an attended location, or as a motor plan for the upcoming eye movement. These are overlapping constructs that are difficult to disentangle, but it would be important to mention them given prior studies of attentional or saccade-related modulation in V4. The firing rate modulations reported in some of those cases provide a stark contrast with the findings here, and I again suspect that the differences may be due at least in part to the differing experimental conditions, rather than a drastically different encoding mode or functional linkage between FEF and V4.

      We have added a paragraph to the discussion section addressing links to attention and motor planning (lines 301-322), and specifically acknowledging the inherent difficulties of fully dissociating these effects when interpreting our results (lines 311-319).

      Reviewer #2 (Public review):

      Summary:

      It is generally believed that higher-order areas in the prefrontal cortex guide selection during working memory and attention through signals that selectively recruit neuronal populations in sensory areas that encode the relevant feature. In this work, Parto-Dezfouli and colleagues tested how these prefrontal signals influence activity in visual area V4 using a spatial working memory task. They recorded neuronal activity from visual area V4 and found that information about visual features at the behaviorally relevant part of space during the memory period is carried in a spatially selective manner in the timing of spikes relative to a beta oscillation (phase coding) rather than in the average firing rate (rate code). The authors further tested whether there is a causal link between prefrontal input and the phase encoding of visual information during the memory period. They found that indeed inactivation of the frontal eye fields, a prefrontal area known to send spatial signals to V4, decreased beta oscillatory activity in V4 and information about the visual features. The authors went one step further to develop a neural model that replicated the experimental findings and suggested that changes in the average firing rate of individual neurons might be a result of small changes in the exact beta oscillation frequency within V4. These data provide important new insights into the possible mechanisms through which top-down signals can influence activity in hierarchically lower sensory areas and can therefore have a significant impact on the Systems, Cognitive, and Computational Neuroscience fields.

      Strengths:

      This is a well-written paper with a well-thought-out experimental design. The authors used a smart variation of the memory-guided saccade task to assess how information about the visual features of stimuli is encoded during the memory period. By using a grating of various contrasts and orientations as the background the authors ensured that bottom-up visual input would drive responses in visual area V4 in the delay period, something that is not commonly done in experimental settings in the same task. Moreover, one of the major strengths of the study is the use of different approaches including analysis of electrophysiological data using advanced computational methods of analysis, manipulation of activity through inactivation of the prefrontal cortex to establish causality of top-down signals on local activity signatures (beta oscillations, spike locking and information carried) as well as computational neuronal modeling. This has helped extend an observation into a possible mechanism well supported by the results.

      Weaknesses:

      Although the authors provide support for their conclusions from different approaches, I found that the selection of some of the analyses and statistical assessments made it harder for the reader to follow the comparison between a rate code and a phase code. Specifically, the authors wish to assess whether stimulus information is carried selectively for the relevant position through a firing rate or a phase code. Results for the rate code are shown in Figures 1B-G and for the phase code are shown in Figure 2. Whereas an F-statistic is shown over time in Figure 1F (and Figure S1) no such analysis is shown for LFP power. Similarly, following FEF inactivation there is no data on how that influences V4 firing rates and information carried by firing rates in the two conditions (for positions inside and outside the V4 RF). In the same vein, no data are shown on how the inactivation affects beta phase coding in the OUT condition.

      We plan to incorporate statistical analysis of this point in the revised version.

      Moreover, some of the statistical assessments could be carried out differently including all conditions to provide more insight into mechanisms. For example, a two-way ANOVA followed by post hoc tests could be employed to include comparisons across both spatial (IN, OUT) and visual feature conditions (see results in Figures 2D, S4, etc.). Figure 2D suggests that the absence of selectivity in the OUT condition (no significant difference between high and low contrast stimuli) is mainly due to an increase in slope in the OUT condition for the low contrast stimulus compared to that for the same stimulus in the IN condition. If this turns out to be true it would provide important information that the authors should address.

      We plan to incorporate statistical analysis of this point in the revised version.

      There are also a few conceptual gaps that leave the reader wondering whether the results and conclusion are general enough. Specifically,

      (1) the authors used microstimulation in the FEF to determine RFs. It is thus possible that the FEF sites that were inactivated were largely more motor-related. Given that beta oscillations and motor preparatory activity have been found to be correlated and motor sites show increased beta oscillatory activity in the delay period, it is possible that the effect of FEF inactivation on V4 beta oscillations is due to inactivation of the main source of beta activity. Had the authors inactivated sites with a preponderance of visual neurons in the FEF would the results be different?

      We do not believe this to be likely based on what is known anatomically and functionally about this circuitry. Anatomically, the projections from FEF to V4 arise primarily from the supragranular layers, not layers which contain the highest proportion of motor activity (Barone et al. 2000, Pouget et al. 2009, Markov et al. 2013). Functionally, based on electrical identification of V4-projecting FEF neurons, we know that FEF to V4 projections are predominantly characterized by delay rather than motor activity (Merrikhi et al. 2017). We have now tried to emphasize these points when we introduce the inactivation experiments (lines 180-182).

      Experimentally, the spread of the pharmacological effect with our infusion system is quite large relative to any clustering of visual vs. motor neurons within the FEF, with behavioral consequences of inactivation spreading to cover a substantial portion of the visual hemifield (e.g., Noudoost et al. 2014, Clark et al. 2014), and so our manipulation lacks the spatial resolution to selectively target motor vs. other FEF neurons.

      (2) Somewhat related to this point and given the prominence of low-frequency activity in deeper layers of the visual cortex according to some previous studies, it is not clear where the authors' V4 recordings were located. The authors report that they do have data from linear arrays, so it should be possible to address this.

      Unfortunately our chamber placement for V4 has produced linear array penetration angles which do not reliably allow identification of cortical layers. We are aware of previous results showing layer-specific effects of attention in V4 (e.g., Pettine et al. 2019, Buffalo et al. 2011), and it would indeed be interesting to determine whether our observed WM-driven changes follow similar patterns. We may be able to analyze a subset of the data with current source density analysis to look for layer-specific effects in the future, but are not able to provide any information at this time.

      (3) The authors suggest that a change in the exact frequency of oscillation underlies the increase in firing rate for different stimulus features. However, the shift in frequency is prominent for contrast but not for orientation, something that raises questions about the general applicability of this observation for different visual features.

      We plan to incorporate statistical analysis of this point in the revised version.

      (4) One of the major points of the study is the primacy of the phase code over the rate code during the delay period. Specifically, here it is shown that information about the visual features of a stimulus carried by the rate code is similar for relevant and irrelevant locations during the delay period. This contrasts with what several studies have shown for attention in which case information carried in firing rates about stimuli in the attended location is enhanced relative to that for stimuli in the unattended location. If we are to understand how top-down signals work in cognitive functions it is inevitable to compare working memory with attention. The possible source of this difference is not clear and is not discussed. The reader is left wondering whether perhaps a different measure or analysis (e.g. a percent explained variance analysis) might reveal differences during the delay period for different visual features across the two spatial conditions.

      We have added discussion regarding the relationship of these results to previous findings during attention in the discussion section (lines 301-322).

      The use of the memory-guided saccade task has certain disadvantages in the context of this study. Although delay activity is interpreted as memory activity by the authors, it is in principle possible that it reflects preparation for the upcoming saccade, spatial attention (particularly since there is a stimulus in the RF), etc. This could potentially change the conclusion and perspective.

      We have added a new discussion paragraph addressing the relationship to attention and motor planning (lines 301-322). We have also moderated the language used to describe our conclusions throughout the manuscript in light of this ambiguity.

      For the position outside the V4 RF, there is a decrease in both beta oscillations and the clustering of spikes at a specific phase. It is therefore possible that the decrease in information about the stimuli features is a byproduct of the decrease in beta power and phase locking. Decreased oscillatory activity and phase locking can result in less reliable estimates of phase, which could decrease the mutual information estimates.

      We plan to incorporate statistical analysis of this point in the revised version.

      The authors propose that coherent oscillations could be the mechanism through which the prefrontal cortex influences beta activity in V4. I assume they mean coherent oscillations between the prefrontal cortex and V4. Given that they do have simultaneous recordings from the two areas they could test this hypothesis on their own data, however, they do not provide any results on that.

      This paper only includes inactivation data. We are working on analyzing the simultaneous recording data for a future publication.

      The authors make a strong point about the relevance of changes in the oscillation frequency and how this may result in an increase in firing rate although it could also be the reverse - an increase in firing rate leading to an increase in the frequency peak. It is not clear at all how these changes in frequency could come about. A more nuanced discussion based on both experimental and modeling data is necessary to appreciate the source and role (if any) of this observation.

      As the reviewer notes, it is difficult to determine whether the frequency changes drive the rate changes, vice versa, or whether both are generated in parallel by a common source. We have adjusted our language to reflect this (lines 277-278). Future modeling work may be able to shed more light on the causal relationships between various neural signatures.

      Reviewer #3 (Public review):

      Summary:

      In this report, the authors test the necessity of prefrontal cortex (specifically, FEF) activity in driving changes in oscillatory power, spike rate, and spike timing of extrastriate visual cortex neurons during a visual-spatial working memory (WM) task. The authors recorded LFP and spikes in V4 while macaques remembered a single spatial location over a delay period during which task-irrelevant background gratings were displayed on the screen with varying orientation and contrast. V4 oscillations (in the beta range) scaled with WM maintenance, and the information encoded by spike timing relative to beta band LFP about the task-irrelevant background orientation depended on remembered location. They also compared recorded signals in V4 with and without muscimol inactivation of FEF, demonstrating the importance of FEF input for WM-induced changes in oscillatory amplitude, phase coding, and information encoded about background orientations. Finally, they built a network model that can account for some of these results. Together, these results show that FEF provides meaningful input to the visual cortex that is used to alter neural activity and that these signals can impact information coding of task-irrelevant information during a WM delay.

      Strengths:

      (1) Elegant and robust experiment that allows for clear tests for the necessity of FEF activity in WM-induced changes in V4 activity.

      (2) Comprehensive and broad analyses of interactions between LFP and spike timing provide compelling evidence for FEF-modulated phase coding of task-irrelevant stimuli at remembered location.

      (3) Convincing modeling efforts.

      Weaknesses:

      (1) 0% contrast background data (standard memory-guided saccade task) are not reported in the manuscript. While these data cannot be used to consider information content of spike rate/time about task-irrelevant background stimuli, this condition is still informative as a 'baseline' (and a more typical example of a WM task).

      We plan to incorporate statistical analysis of this point in the revised version.

      (2) Throughout the manuscript, the primary measurements of neural coding pertain to task-irrelevant stimuli (the orientation/contrast of the background, which is unrelated to the animal's task to remember a spatial location). The remembered location impacts the coding of these stimulus variables, but it's unclear how this relates to WM representations themselves.

      Indeed, here we have focused on how maintaining spatial WM impacts visual processing of incoming sensory information, rather than on how the spatial WM signal itself is represented and maintained. Behaviorally, this impact on visual signals could be related to the effects of the content of WM on perception and reaction times (e.g., Soto et al. 2008, Awh et al. 1998, Teng et al. 2019), but no such link to behavior is shown in our data.

    1. eLife Assessment

      This important study uses reinforcement learning to study how turbulent odor stimuli should be processed to yield successful navigation. They find that there is an optimal memory length over which an agent should ignore blanks in the odor to discriminate whether the agent is still inside the plume or outside of it, complementing recent studies using RNNs and finite state controllers to identify optimal strategies for navigating a turbulent plume. While the overall strength of evidence is convincing, better justification for using Brownian motion as a recovery strategy and the addition of accompanying code for reproducibility would add to this strength.

    2. Reviewer #2 (Public review):

      Summary:

      The authors investigate the problem of olfactory search in turbulent environments using artificial agents trained using tabular Q-learning, a simple and interpretable reinforcement learning (RL) algorithm. The agents are trained solely on odor stimuli, without access to spatial information or prior knowledge about the odor plume's shape. This approach makes the emergent control strategy more biologically plausible for animals navigating exclusively using olfactory signals. The learned strategies show parallels to observed animal behaviors, such as upwind surging and crosswind casting. The approach generalizes well to different environments and effectively handles the intermittency of turbulent odors.

      Strengths:

      (1) The use of numerical simulations to generate realistic turbulent fluid dynamics sets this paper apart from studies that rely on idealized or static plumes.

      (2) A key innovation is the introduction of a small set of interpretable olfactory states based on moving averages of odor intensity and sparsity, coupled with an adaptive temporal memory.

      (3) The paper provides a thorough analysis of different recovery strategies when an agent loses the odor trail, offering insights into the trade-offs between various approaches.

      (4) The authors provide a comprehensive performance analysis of their algorithm across a range of environments and recovery strategies, demonstrating the versatility of the approach.

      (5) Finally, the authors list an interesting set of real-world experiments based on their findings, that might invite interest from experimentalists across multiple species.

      Weaknesses:

      (1) The inclusion of Brownian motion as a recovery strategy, seems odd since it doesn't closely match natural animal behavior, where circling (e.g. flies) or zigzagging (ants' "sector search") could have been more realistic.

      (2) Using tabular Q-learning is both a strength and a limitation. It's simple and interpretable, making it easier to analyze the learned strategies, but the discrete action space seems somewhat unnatural. In real-world biological systems, actions (like movement) are continuous rather than discrete. Additionally, the ground-frame actions may not map naturally to how animals navigate odor plumes (e.g. insects often navigate based on their own egocentric frame).

      (3) The lack of accompanying code is a major drawback since nowadays open access to data and code is becoming a standard in computational research. Given that the turbulent fluid simulation is a key element that differentiates this paper, the absence of simulation and analysis code limits the study's reproducibility.

    3. Author response:

      We thank the Editor and Reviewers for their work on our manuscript, and are happy to receive their positive comments, as well as their questions and suggestions. We are currently revising the manuscript and are planning to de-emphasize Brownian recovery as a simple yet biologically irrelevant benchmark and include comparisons with other biologically inspired strategies suggested by the reviewers. As for sharing the code and data: we completely agree: dataset 1 is already public and we will share the other dataset as well as the code. In a nutshell, we will be addressing the referee’s suggestions as follows:

      (1)   As Referee 1 points out, even if the algorithm does not require a map of space, the agent is still required to tell apart North, East, South and West relative to the wind direction which is implicitly assumed known. We will better clarify the spatial encoding required to implement these strategies.

      (2)   Referee 1 remarks that the learned recovery strategy works best and suggests to give it a more prominent role and better characterize it. We agree that what is done in the void state is definitely key and more work is needed to understand it. In the revised manuscript, we are planning to further substantiate the statistics of the learned recovery by repeating training several times and comparing several trajectories. Note that this strategy is much more flexible than the others and could potentially mix aspects of recovery to aspects of exploitation: we defer a more in-depth analysis that disentangles these two aspects elsewhere.

      (3)   Referee 1 asks whether an optimal, minimal representation of the olfactory states exists. Q learning defines the olfactory states prior to training and does not allow to systematically optimize odor representation for the task. Given the odor features, we can however discretize them in more or less olfactory states. We expect that decreasing the number of olfactory states provides less positional information and potentially degrades performance, although loss in performance may be overshadowed by noise or by efficient recovery. We are planning to re-train our model with a smaller numer of non-void states and will provide the comparison. The number of void states does not need further testing: we chose 50 void states because it matches the time agents typically remain in the void and indeed achieves very high performance (less than 50 void states results in no convergence and more than 50 introduces states that are rarely visited)

      (4)   Both reviewers correctly remark that Brownian motion is not biologically relevant. We will make sure to further clarify that this is a rather simple --but biologically irrelevant-- benchmark. We are planning to include results with both circling and zigzaging as biologically inspired recovery strategies.

      (5)   We agree with reviewer 2 that animal locomotion does not look like a series of discrete displacements on a checkerboard. However, to overcome this limitation, one has to first focus on a specific system to define actions in a way that best adheres to a species’ motor controls. Second, these actions are likely continuous, which makes reinforcement learning notoriously more complex. While we agree that more realistic models are definitely needed for a comparison with real systems, this remains outside the scope of the current work.

      (6)   We agree with the referees and editor that it is important to publish the code and data alongside with the manuscript. It was already planned and we will make sure to share the links within the revised version of the manuscript.

    1. Reviewer #3 (Public review):

      The author presents a novel theory and computational model suggesting that grid cells do not encode space, but rather encode non-spatial attributes. Place cells in turn encode memories of where those specific attributes occurred. The theory accounts for many experimental results and generates useful predictions for future studies. The model's simplicity and potential explanatory power will interest others in the field. There are, however, a few weaknesses outlined below which undermine the theory.

      Main criticisms:

      (1) A crucial assumption of the model is that grid cells express grid-like firing patterns if and only if the content of experience is constant in space. It is difficult to imagine a real world example that satisfies this assumption. Odors and sounds are used as examples. While they are often more spatially diffuse than an object on the ground, odors and sounds have sources that are readily detectable and thus are not constant in space. Animals can easily navigate to a food source or to a vocalizing conspecific. This assumption is especially problematic because it predicts that all grid cells should become silent when their preferred non-spatial attribute (e.g. a specific odor) is missing. I'm not aware of any experimental data showing that grid cells become silent. On the contrary, grid cells are known to remain active across all contexts that have been tested, including across sleep/wake states. Unlike place cells, grid cells have never been shown to turn off. Since grid cells are active in all contexts, their preferred attribute must also be present in all contexts, and therefore they would not convey any information about the specific content of an experience. The author lists many attributes that could in theory be constant in a laboratory setting, but there is no data I'm aware of that shows this is true in practice. As it stands, this crucial assumption of the model remains mere speculation.

      (2) The proposed novelty of this theory is that other models all assume that grid cells encode space. This is not quite true of models based on continuous attractor networks, the discussion of which is essentially absent. More specifically, attractor models focus on the importance of intrinsic dynamics within entorhinal cortex in generating the grid pattern. While this firing pattern is aligned to space during navigation and therefore can be used a representation of that space, the neural dynamics are preserved even during sleep. Similarly, it is because the grid pattern does not strictly encode physical space that grid-like signals are also observed in relation to other two-dimensional continuous variables.

      (3) The use of border cells or boundary vector cells as the main (or only) source of spatial information in the hippocampus is not well supported by experimental data. Border cells in entorhinal cortex are not active in the center of an environment. Boundary-vector cells can fire farther away from the walls, but are not found in entorhinal cortex. They are located in the subiculum, a major output of the hippocampus. While the entorhinal-hippocampal circuit is a loop, the route from boundary-vector cells to place cells is much less clear than from grid cells. Moreover, both border cells and boundary-vector cells (which are conflated in this paper) comprise a small population of neurons compared to grid cells.

      Minor comments:

      (1) There is substantial theoretical and experimental work supporting the idea that grid cell modules instantiate continuous attractor networks, yet this class of models is largely ignored:

      p. 7 "In contrast, most grid cell models (Bellmund et al., 2016; Bush et al., 2015; Castro & Aguiar, 2014; Hasselmo, 2009; Mhatre et al., 2012; Solstad et al., 2006; Sorscher et al., 2023; Stepanyuk, 2015; Widloski & Fiete, 2014) are domain specific models of spatial navigation"

      The following references should be added:

      McNaughton, B. L., Battaglia, F. P., Jensen, O., Moser, E. I. & Moser, M.-B. Path integration and the neural basis of the 'cognitive map'. Nat. Rev. Neurosci. 7, 663-678 (2006).

      Fuhs, M. C. & Touretzky, D. S. A spin glass model of path integration in rat medial entorhinal cortex. J. Neurosci. 26, 4266-4276 (2006).

      Burak, Y. & Fiete, I. R. Accurate path integration in continuous attractor network models of grid cells. PLoS Comput. Biol. 5, e1000291 (2009).

      Guanella, A., Kiper, D. & Verschure, P. A model of grid cells based on a twisted torus topology. Int. J. Neural Syst. 17, 231-240 (2007).

      Couey, J. J. et al. Recurrent inhibitory circuitry as a mechanism for grid formation. Nat. Neurosci. 16, 318-324 (2013).

      (Note: the Bellmund et al. (2016) citation is likely a mistake and was intended to be Bellmund et al. (2018).)

      (2) The author claims in two places that this model is the first to explain that grid cell population activity lies on a torus. While it may be the first explicit computational account of why grid cell activity is mapped onto a torus, these claims should be moderated for clarity, for example by adding "but see McNaughton et al. (2006) and others".

      Box 1. Results Uniquely Explained by this Memory Model - the population code of grid cells lies on a torus

      p.11 "In addition, this simplifying assumption allows the model to capture the finding that the population of grid cells lies on a torus (Gardner et al., 2022), although I note that the model was developed before this result was known."

      (3) Lateral entorhinal cortex is largely ignored in this memory model. It should be considered that the predominance of spatial representations reported in the literature is due to historical reasons. Namely, the discovery of hippocampal place cells spurred interest in looking upstream for the source of spatial information, which was later abundantly clear in medial entorhinal cortex. Lateral entorhinal cortex is relatively understudied, but is known to encode odors, objects, and time in a way that medial entorhinal cortex does not. It is therefore confusing to assume that these attributes are instead encoded by grid cells.

    2. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews

      Reviewer #1 (Public Review): 

      (1) Although the theory is based on memory, it also is based on spatially-selective cells.

      Not all cells in the hippocampus fulfill the criteria of place/HD/border/grid cells, and place a role in memory. E.g., Tonegawa, Buszaki labs' work does not focus on only those cells, and there are certainly a lot of non-pure spatial cells in monkeys (Martinez-Trujillo) and humans (iEEG). Does the author mainly focus on saying that "spatial cells" are memory, but do not account for non-spatial memory cells? This seems to be an incomplete account of memory - which is fine, but the way the model is set up suggests that *all* memory is, place (what/where), and non-spatial attributes ("grid") - but cells that don't fulfil these criteria in MTL (Diehl et al., 2017, Neuron; non-grid cells; Schaeffer et al., 2022, ICML; Luo et al., 2024, bioRxiv) certainly contribute to memory, and even navigation. This is also related to the question of whether these cell definitions matter at all (Luo et al., 2024). The authors note "However, this memory conjunction view of the MTL must be reconciled with the rodent electrophysiology finding that most cells in MTL appear to have receptive fields related to some aspect of spatial navigation (Boccara et al., 2010; Grieves & Jeffery, 2017). The paucity of non-spatial cells in MTL could be explained if grid cells have been mischaracterized as spatial." Is the author mainly talking about rodent work?

      There is a new section in the introduction that deals with these issues, titled ‘Why Model the Rodent Navigation Literature with a Memory Model?’ That section reads:

      “Spatial navigation is inherently a memory problem – learning the spatial arrangement of a new enclosure requires memory for the conjunction of what and where. This has long been realized and in the introduction to ‘Hippocampus as a Cognitive Map’, O’Keefe and Nadel (1978) wrote “We shall argue that the hippocampus is the core of a neural memory system providing an objective spatial framework within which the items and events of an organism's experience are located and interrelated” (emphasis added). Furthermore, in the last chapter of their book, they extended cognitive map theory to human memory for non-spatial characteristics. However, in the decades since the development of cognitive map theory, the rodent spatial navigation and human memory literatures have progressed somewhat independently.

      The ideas proposed in this model are an attempt to reunify these literatures by returning to the original claim that spatial navigation is inherently a memory problem. The goal of the current study is to explain the rodent spatial navigation literature using a memory model that has the potential to also explain the human memory literature. In contrast, most grid cell models (Bellmund et al., 2016; Bush et al., 2015; Castro & Aguiar, 2014; Hasselmo, 2009; Mhatre et al., 2012; Solstad et al., 2006; Sorscher et al., 2023; Stepanyuk, 2015; Widloski & Fiete, 2014) are domain specific models of spatial navigation and as such, they do not lend themselves to explanations of human memory. Thus, the reason to prefer this model is parsimony. Rather than needing to develop a theory of memory that is separate from a theory of spatial navigation, it might be possible to address both literatures with a unified account.

      This study does not attempt to falsify other theories of grid cells. Instead, this model reaches a radically different interpretation regarding the function of grid cells; an interpretation that emerges from viewing spatial navigation as a memory problem. All other grid cell models assume that an entorhinal grid cell displaying a spatially arranged grid of firing fields serves the function of spatial coding (i.e., spatial grid cells exist to support a spatial metric). In contrast, the proposed memory model of grid cells assumes that the hexagonal tiling reflects the need to keep memories separate from each other to minimize confusion and confabulation – the grid pattern is the byproduct of pattern separation between memories rather than the basis of a spatial code. 

      It is now understood that grid-like firing fields can occur for non-spatial twodimensional spaces. For instance, human entorhinal cortex exhibits grid-like responses to video morph trajectories in a two-dimensional bird neck-length versus bird leg-length space (Constantinescu et al., 2016). As a general theory of learning and memory, the proposed memory model of grid cells is easily extended to explain these results (e.g., relabeling the border cell inputs in the model as neck-length and leg-length inputs). However, there are other grid cell models that can explain both spatial grid cells as well as non-spatial grid-like responses (Mok & Love, 2019; Rodríguez-Domínguez & Caplan, 2019; Stachenfeld et al., 2017; Wei et al., 2015). Similar to this memory model of grid cells, these models are also positioned to explain both the rodent spatial navigation and human memory literatures. Nevertheless, there is a key difference between this model and other grid cell models that generalize to non-spatial representations. Specifically, these other models assume that grid cells exhibiting spatial receptive fields serve the function of identifying positions in the environment (i.e., their function is spatial). As such, these models do not explain why most of the input to rodent hippocampus appears to be spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). This memory model of grid cells provides an answer to the apparent paucity of nonspatial cell types in rodent MTL by proposing that grid cells with spatial receptive fields have been misclassified as spatial (they are what cells rather than where cells) and that place cells are fundamentally memory cells that conjoin what and where.”

      (2) Related to the last point, how about non-grid multi-field mEC cells? In theory, these also should be the same; but the author only presents perfect-look grid cells. In empirical work, clearly, this is not the case, and many mEC cells are multi-field non-grid cells (Diehl et al., 2017). Does the model find these cells? Do they play a different role? As noted by the author "Because the non-spatial attributes are constant throughout the two-dimensional surface, this results in an array of discrete memory locations that are approximately hexagonal (as explained in the Model Methods, an "online" memory consolidation process employing pattern separation rapidly turns an approximately hexagonal array into one that is precisely hexagonal). " If they are indeed all precisely hexagonal, does that mean the model doesn't have non-grid spatial cells? 

      Grid cells with irregular firing fields are now considered in the discussion with the following paragraphs

      “According to this model, hexagonally arranged grid cells should be the exception rather than the rule when considering more naturalistic environments. In a more ecologically valid situation, such as with landmarks, varied sounds, food sources, threats, and interactions with conspecifics, there may still be remembered locations were events occurred or remembered properties can be found, but because the non-spatial properties are non-uniform in the environment, the arrangement of memory feedback will be irregular, reflecting the varied nature of the environment. This may explain the finding that even in a situation where there are regular hexagonal grid cells, there are often irregular non-grid cells that have a reliable multi-location firing field, but the arrangement of the firing fields is irregular (Diehl et al., 2017). For instance, even when navigating in an enclosure that has uniform properties as dictated by experimental procedures, they may be other properties that were not well-controlled (e.g., a view of exterior lighting in some locations but not others), and these uncontrolled properties may produce an irregular grid (i.e., because the uncontrolled properties are reliably associated with some locations but not others, hippocampal memory feedback triggers retrieval of those properties in the associations locations).

      In this memory model, there are other situations in which an irregular but reliable multilocation grid may occur, even when everything is well controlled. In the reported simulations, when the hippocampal place cells were based on variation in X/Y (as defined by Border cells), nothing else changed as a function of location, and the model rapidly produced a precise hexagonal arrangement of hippocampal place cell memories. When head direction was included (i.e., real-world variation in X, Y, and head direction), the model still produced a hexagonal arrangement as per face-centered cubic packing of memories, but this precise arrangement was slower to emerge, with place cells continuing to shift their positions until the borders of the enclosure were sufficiently well learned from multiple viewpoints. If there is real-world variation in four or more dimensions, as is likely the case in a more ecologically valid situation, it will be even harder for place cell memories to settle on a precise regular lattice. Furthermore, in the case of four dimensions, mathematicians studying the “sphere packing problem” recently concluded that densest packing is irregular (Campos et al., 2023). This may explain why the multifield grid cells for freely flying bats have a systematic minimum distance between firing fields, but their arrangement is globally irregular (Ginosar et al., 2021). Assuming that the memories encoded by a bat include not just the three real-world dimensions of variation, but also head direction, the grid will likely be irregular even under optimal conditions of laboratory control.”

      (3) Theoretical reasons for why the model is put together this way, and why grid cells must be coding a non-spatial attribute: Is this account more data-driven (fits the data so formulated this way), or is it theoretical - there is a reason why place, border, grid cells are formulated to be like this. For example, is it an efficient way to code these variables? It can be both, like how the BVC model makes theoretical sense that you can use boundaries to determine a specific location (and so place cell), but also works (creates realistic place cells). 

      The motivation for this model is now articulated in the new section, quoted above, titled ‘Why Model the Rodent Navigation Literature with a Memory Model?’ Regarding the assumption that border cells provide a spatial metric, this assumption is made for the same reasons as in the BVC model. Regarding this, the text said: “These assumptions regarding border cells are based on the boundary vector cell (BVC) model of Barry et al. (2006). As in the BVC model, combinations of border cells encode where each memory occurred in the realworld X/Y plane.”. A new sentence is added to model methods, stating: “This assumption is made because border cells provide an efficient representation of Euclidean space (e.g., if the animal knows how far it is from different walls of the enclosure, this already available information can be used to calculate location).”

      But in this case, the purpose of grid cell coding a non-spatial attribute, and having some kind of system where it doesn't fire at all locations seems a little arbitrary. If it's not encoding a spatial attribute, it doesn't have to have a spatial field. For example, it could fire in the whole arena - which some cells do (and don't pass the criteria of spatial cells as they are not spatially "selective" to another location, related to above).  

      Some cells have a constant high firing rate, but they are the exception rather than the rule. More typically, cells habituate in the presence of ongoing excitatory drive and by doing so become sensitive to fluctuations in excitatory drive. Habituation is advantageous both in terms of metabolic cost and in terms of function (i.e., sensitivity to change). This is now explained in the following paragraph:

      “In theory, a cell representing a non-spatial attribute found at all locations of an enclosure (aka, a grid cell in the context of this model), could fire constantly within the enclosure. However, in practice, cells habituate and rapidly reduce their firing rate by an order of magnitude when their preferred stimulus is presented without cessation (Abbott et al., 1997; Tsodyks & Markram, 1997). After habituation, the firing rate of the cell fluctuates with minor variation in the strength of the excitatory drive. In other words, habituation allows the cell to become sensitive to changes in the excitatory drive (Huber & O’Reilly, 2003). Thus, if there is stronger top-down memory feedback in some locations as compared to others, the cell will fire at a higher rate in those remembered locations rather than in all locations even though the attribute is found at all locations. In brief when faced with constant excitatory drive, the cell accommodates, and becomes sensitive to change in the magnitude of the excitatory drive. In the model simulation, this dynamic adaptation is captured by supposing that cells fire 5% of the time on-average across the simulation, regardless of their excitatory inputs.”

      (4) Why are grid cells given such a large role for encoding non-spatial attributes? If anything, shouldn't it be lateral EC or perirhinal cortex? Of course, they both could, but there is less reason to think this, at least for rodent mEC.  

      This is a good point and the following paragraph has been added to the introduction to explain that lateral EC is likely part of the explanation. But even when including lateral EC, it still appears that most of the input to hippocampus is spatial.

      “One possible answer to the apparent lack of non-spatial cells in MTL is to highlight the role of the lateral entorhinal cortex (LEC) as the source of non-spatial what information for memory encoding (Deshmukh & Knierim, 2011). LEC can be contrasted with mEC, which appears to only provide where information (Boccara et al., 2010a; Diehl et al., 2017). Although it is generally true that LEC is involved in non-spatial processing, there is evidence that LEC provides some forms of spatial information (Knierim et al., 2014). The kind of non-spatial information provided by LEC appears to be in relation to objects (Connor & Knierim, 2017; Wilson et al., 2013). However, in a typical rodent spatial navigation study there are no objects within the enclosure. Thus, although the distinction between mEC and LEC is likely part of the explanation, it is still the case that rodent entorhinal input to hippocampus appears to heavily favor spatial information.”

      (5) Clarification: why do place cells and grid cells differ in terms of stability in the model? Place cells are not stable initially but grid cells come out immediately. They seem directly connected so a bit unclear why; especially if place cell feedback leads to grid cell fields. There is an explanation in the text - based on grid cells coding the on-average memories, but these should be based on place cell inputs as well. So how is it that place fields are unstable then grid fields do not move at all? I wonder if a set of images or videos (gifs) showing the differences in spatial learning would be nice and clarify this point.  

      In this revision, I provide a new video focused on learning of place cell memories that include head direction. This second video is in relation to the results reported in Figure 9. The short answer is that the grid fields for the non-spatial cell are based on the average across several view-dependent memories (i.e., across several place cells that have head direction sensitivity) and the average is reliable even if the place cells are unstable. The text of this explanation now reads:

      “Why was the grid immediately apparent for the non-spatial attribute cell whereas the grid took considerable prior experience for the head direction cells? The answer relates to memory consolidation and the shifting nature of the hippocampal place cells. Head direction cells only produced a reliable grid once the hippocampal place cells (aka, memory cells) assumed stable locations. During the first few sessions, the hippocampal place cells were shifting their positions owing to pattern separation and consolidation. But once the place cells stabilized, they provided reliable top-down memory feedback to the head direction cells in some places but not others, thus producing a reliable grid arrangement to the firing maps of the head direction cells. In other words, for the head direction cells, the grid only appeared once the place cells stabilized. This slow stabilization of place fields is a known property (Bostock et al., 1991; Frank et al., 2004).

      In the simulation, the place cells did not stabilize until a sufficient number of place cells were created (Figure 9C). Specifically, these additional memories were located immediately outside the enclosure, around all borders (Figure 9D). These “outside the box” memories served to constrain the interior place cells, locking them in position despite ongoing consolidation. This dynamic can be seen in a movie showing a representative simulation. The movie shows the positions of the head direction sensitive place cells during initial learning, and then during additional sessions of prior experience as the movie speeds up (see link in Figure 9 capture).

      Why did the non-spatial grid cell (k) produce a grid immediately, before the place cells stabilized? As discussed in relation to Figure 8, the non-spatial grid cell is the projection through the 3D volume of real-world coordinates that includes X, Y, and head direction. Each grid field of a non-spatial grid cell reflects feedback from several place cells that each have a different head direction sensitivity (see for instance the allocentric pairs of memories illustrated in Figure 8C and 8D). Thus, each grid field is the average across several memories that entail different viewpoints and this averaging across memories provides stability even if the individual memories are not yet stable. This average of unstable memories produces a blurry sort of grid pattern without any prior experience.

      A final piece of the puzzle relies on the same mechanism that caused the grid pattern to align with the borders as reported in the results of Figures 6 and 7. Specifically, there are some “sticky” locations with ongoing consolidation because the connection weights are bounded. Because weights cannot go below their minimum or above their maximum, it is slightly more difficult for consolidation to push or pull connection weights over the peak value or under the minimum value of the tuning curve. Thus, the place cells tend to linger in locations that correspond to the peak or trough of a border cell. There are multiple peak and trough locations but for the parameter values in this simulation, the grid pattern seen in Figure 9C shows the set of peak/trough locations that satisfy the desired spacing between memories. Thus, the average across memories shows a reliable grid field at these locations even though the memories are unstable.”

      (6) Other predictions. Clearly, the model makes many interesting (and quite specific!) predictions. But does it make some known simple predictions? 

      • More place cells at rewarded (or more visited) locations. Some empirical researchers seem to think this is not as obvious as it seems (e.g., Duvellle et al., 2019; JoN; Nyberg et al., 2021, Neuron Review).  

      • Grid cell field moves toward reward (Butler et al., 2019; Boccera et al., 2019).  

      • Grid cells deform in trapezoid (Krupic et al., 2015) and change in environments like mazes (Derikman et al., 2014).  

      Thank you for these suggestions and I have added the following paragraph to the discussion:

      “In terms of the animal’s internal state, all locations in the enclosure may be viewed as equally aversive and unrewarding, which is a memorable characteristic of the enclosure. Reward, or lack thereof, is arguably one of the most important nonspatial characteristics and application of this model to reward might explain the existence of goal-related activity in place cells (Hok et al., 2007; although see Duvelle et al., 2019), reflecting the need to remember rewarding locations for goal directed behavior. Furthermore, if place cell memories for a rewarding location activate entorhinal grid cells, this may explain the finding that grid cells remap in an enclosure with a rewarded location such that firing fields are attracted to that location (Boccara et al., 2019; Butler et al., 2019). Studies that introduce reward into the enclosure are an important first step in terms of examining what happens to grid cells when the animal is placed in a more varied environment.”

      Regarding the changes in shape of the environment, this was discussed in the section of the paper that reads “As seen in Figure 12, because all but one of the place cells was exterior when the simulated animal was constrained to a narrow passage, the hippocampal place cell memories were no longer arranged in a hexagonal grid. This disruption of the grid array for narrow passages might explain the finding that the grid pattern (of grid cells) is disrupted in the thin corner of a trapezoid (Krupic et al., 2015) and disrupted when a previously open enclosure is converted to a hairpin maze by insertion of additional walls within the enclosure (Derdikman et al., 2009).” This particular section of the paper now appears in the Appendix and Figure 12 is now Appendix Figure 2.

      Reviewer #2 (Public Review): 

      The manuscript describes a new framework for thinking about the place and grid cell system in the hippocampus and entorhinal cortex in which these cells are fundamentally involved in supporting non-spatial information coding. If this framework were shown to be correct, it could have high impact because it would suggest a completely new way of thinking about the mammalian memory system in which this system is non-spatial. Although this idea is intriguing and thought-provoking, a very significant caveat is that the paper does not provide evidence that specifically supports its framework and rules out the alternate interpretations. Thus, although the work provides interesting new ideas, it leaves the reader with more questions than answers because it does not rule out any earlier ideas. 

      Basically, the strongest claim in the paper, that grid cells are inherently non-spatial, cannot be specifically evaluated versus existing frameworks on the basis of the evidence that is shown here. If, for example, the author had provided behavioral experiments showing that human memory encoding/retrieval performance shifts in relation to the predictions of the model following changes in the environment, it would have been potentially exciting because it could potentially support the author's reconceptualization of this system. But in its current form, the paper merely shows that a new type of model is capable of explaining the existing findings. There is not adequate data or results to show that the new model is a significantly better fit to the data compared to earlier models, which limits the impact of the work. In fact, there are some key data points in which the earlier models seem to better fit the data.  

      Overall, I would be more convinced that the findings from the paper are impactful if the author showed specific animal memory behavioral results that were only supported by their memory model but not by a purely spatial model. Perhaps the author could run new experiments to show that there are specific patterns of human or animal behavior that are only explained by their memory model and not by earlier models. But in its current form, I cannot rule out the existing frameworks and I believe some of the claims in this regard are overstated. 

      As previously detailed in Box 1 and as explained in the text in several places, the model provides an explanation of several findings that remain unexplained by other theories (see “Results Uniquely Explained by the Memory Model”). But more generally this is a good point, and the initial draft failed to fully articulate why a researcher might choose this model to guide future empirical investigations. A new section in the introduction that deals with these issues, titled ‘Why Model the Rodent Navigation Literature with a Memory Model?’ That section reads:

      “Spatial navigation is inherently a memory problem – learning the spatial arrangement of a new enclosure requires memory for the conjunction of what and where. This has long been realized and in the introduction to ‘Hippocampus as a Cognitive Map’, O’Keefe and Nadel (1978) wrote “We shall argue that the hippocampus is the core of a neural memory system providing an objective spatial framework within which the items and events of an organism's experience are located and interrelated” (emphasis added). Furthermore, in the last chapter of their book, they extended cognitive map theory to human memory for non-spatial characteristics. However, in the decades since the development of cognitive map theory, the rodent spatial navigation and human memory literatures have progressed somewhat independently.

      The ideas proposed in this model are an attempt to reunify these literatures by returning to the original claim that spatial navigation is inherently a memory problem. The goal of the current study is to explain the rodent spatial navigation literature using a memory model that has the potential to also explain the human memory literature. In contrast, most grid cell models (Bellmund et al., 2016; Bush et al., 2015; Castro & Aguiar, 2014; Hasselmo, 2009; Mhatre et al., 2012; Solstad et al., 2006; Sorscher et al., 2023; Stepanyuk, 2015; Widloski & Fiete, 2014) are domain specific models of spatial navigation and as such, they do not lend themselves to explanations of human memory. Thus, the reason to prefer this model is parsimony. Rather than needing to develop a theory of memory that is separate from a theory of spatial navigation, it might be possible to address both literatures with a unified account.

      This study does not attempt to falsify other theories of grid cells. Instead, this model reaches a radically different interpretation regarding the function of grid cells; an interpretation that emerges from viewing spatial navigation as a memory problem. All other grid cell models assume that an entorhinal grid cell displaying a spatially arranged grid of firing fields serves the function of spatial coding (i.e., spatial grid cells exist to support a spatial metric). In contrast, the proposed memory model of grid cells assumes that the hexagonal tiling reflects the need to keep memories separate from each other to minimize confusion and confabulation – the grid pattern is the byproduct of pattern separation between memories rather than the basis of a spatial code. 

      It is now understood that grid-like firing fields can occur for non-spatial twodimensional spaces. For instance, human entorhinal cortex exhibits grid-like responses to video morph trajectories in a two-dimensional bird neck-length versus bird leg-length space (Constantinescu et al., 2016). As a general theory of learning and memory, the proposed memory model of grid cells is easily extended to explain these results (e.g., relabeling the border cell inputs in the model as neck-length and leg-length inputs). However, there are other grid cell models that can explain both spatial grid cells as well as non-spatial grid-like responses (Mok & Love, 2019; Rodríguez-Domínguez & Caplan, 2019; Stachenfeld et al., 2017; Wei et al., 2015). Similar to this memory model of grid cells, these models are also positioned to explain both the rodent spatial navigation and human memory literatures. Nevertheless, there is a key difference between this model and other grid cell models that generalize to non-spatial representations. Specifically, these other models assume that grid cells exhibiting spatial receptive fields serve the function of identifying positions in the environment (i.e., their function is spatial). As such, these models do not explain why most of the input to rodent hippocampus appears to be spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). This memory model of grid cells provides an answer to the apparent paucity of nonspatial cell types in rodent MTL by proposing that grid cells with spatial receptive fields have been misclassified as spatial (they are what cells rather than where cells) and that place cells are fundamentally memory cells that conjoin what and where.”

      - The paper does not fully take into account all the findings regarding grid cells, some of which very clearly show spatial processing in this system. For example, findings on grid-bydirection cells (e.g., Sargolini et al. 2006) would seem to suggest that the entorhinal grid system is very specifically spatial and related to path integration. Why would grid-bydirection cells be present and intertwined with grid cells in the author's memory-related reconceptualization? It seems to me that the existence of grid-by-direction cells is strong evidence that at least part of this network is specifically spatial.

      Head by direction grid cells were a key part of the reported results. These grid cells naturally arise in the model as the animal forms memories (aka, hippocampal place cells) that conjoin location (as defined by border cells), head direction at the time of memory formation, and one or more non-spatial properties found at that location. In this revision, I have attempted to better explain how including head direction in hippocampal memories naturally gives rise to these cell types. The introduction to the head direction module simulations now reads:

      “According to this memory model of spatial navigation, place cells are the conjunction of location, as defined by border cells, and one or more properties that are remembered to exist at that location. Such memories could, for instance, allow an animal to remember the location of a food cache (Payne et al., 2021). The next set of simulations investigates behavior of the model when one of the to-be-remembered properties is head direction at the time when the memory was formed (e.g., the direction of a pathway leading to a food cache). Indicating that head direction is an important part of place cell representations, early work on place cells in mazes found strong sensitivity to head direction, such that the place field is found in one direction of travel but not the other (McNaughton et al., 1983; Muller et al., 1994). Place cells can exhibit a less extreme version of head direction sensitivity in open field recordings (Rubin et al., 2014), but the nature of the sensitivity is more complicated, depending on location of the animal relative to the place field center (Jercog et al., 2019).

      It is possible that some place cell memories do not receive head direction input, as was the case for the simulations reported in Figures 6/7 – in those simulations, place cells were entirely insensitive to head direction, owing to a lack of input from head direction cells. However, removal of head direction input to hippocampus affects place cell responses (Calton et al., 2003) and grid cell responses (Winter et al., 2015), suggesting that head direction is a key component of the circuit. Furthermore, if place cells represent episodic memories, it seems natural that they should include head direction (i.e., viewpoint at the time of memory formation).

      In the simulations reported next, head direction is simply another property that is conjoined in a hippocampal place cell memory. In this case, a head direction cell should become a head direction conjunctive grid cell (i.e., a grid cell, but only when the animal is heading in a particular direction), owing to memory feedback from the hexagonal array of hippocampal place cell memories. When including head direction, the real-world dimensions of variation are across three dimensions (X, Y, and head direction) rather than two, and consolidation will cause the place cells to arrange in a three-dimensional volume. The simulation reported below demonstrates that this situation provides a “grid module”.”

      - I am also concerned that the paper does not do enough to address findings regarding how the elliptical shape of grid fields shifts when boundaries of an environment compress in one direction or change shape/angles (Lever et al., & Krupic et al). Those studies show compression in grid fields based on boundary position, and I don't see how the authors' model would explain these findings.  

      This finding was covered in the original submission: “For instance, perhaps one egocentric/allocentric pair of mEC grid modules is based on head direction (viewpoint) in remembered positions relative to the enclosure borders whereas a different egocentric/allocentric pair is based on head direction in remembered positions relative to landmarks exterior to the enclosure. This might explain why a deformation of the enclosure (moving in one of the walls to form a rectangle rather than a square) caused some of the grid modules but not others to undergo a deformation of the grid pattern in response to the deformation of the enclosure wall (see also Barry et al., 2007). More specifically, if there is one set of non-orthogonal dimensions for enclosure borders and the movement of one wall is too modest as to cause avoid global remapping, this would deform the grid modules based the enclosure border cells. At the same time, if other grid modules are based on exterior properties (e.g., perhaps border cells in relation to the experimental room rather than the enclosure), then those grid modules would be unperturbed by moving the enclosure wall.”

      I apologize for being unclear in describing how the model might explain this result. The paragraph has been rewritten and now reads:

      “Consider the possibility that one mEC grid modules is based on head direction (viewpoint) in remembered positions relative to the enclosure borders (e.g., learning the properties of the enclosure, such as the metal surface) while a different grid module is based on head direction in remembered positions relative to landmarks exterior to the enclosure (e.g., learning the properties of the experimental room, such as the sound of electronics that the animal is subject to at all locations). This might explain why a deformation of the enclosure (moving one of the walls to form a rectangle rather than a square) caused some of the grid modules but not others to undergo a deformation of the grid pattern in response to the deformation of the enclosure wall (see also Barry et al., 2007). More specifically, suppose that the movement of one wall is modest and after moving the wall, the animal views the enclosure as being the same enclosure, albeit slightly modified (e.g., when a home is partially renovated, it is still considered the same home). In this case, the set of non-orthogonal dimensions associated with enclosure borders would still be associated with the now-changed borders and any memories in reference to this border-determined space would adjust their positions accordingly in real-world coordinates (i.e., the place cells would subtly shift their positions owing to this deformation of the borders, producing a corresponding deformation of the grid). At the same time, there may be other sets of memories that are in relation to dimensions exterior to the enclosure. Because these exterior properties are unchanged, any place cells and grid cells associated with the exterior-oriented memories would be unchanged by moving the enclosure wall.”

      - Are findings regarding speed modulation of grid cells problematic for the paper's memory results? 

      - A further issue is that the paper does not seem to adequately address developmental findings related to the timecourses of the emergence of different cell types. In their simulation, researchers demonstrate the immediate emergence of grid fields in a novel environment, while noting that the stabilization of place cell positions takes time. However, these simulation findings contradict previous empirical developmental studies (Langston et al., 2010). Those studies showed that head direction cells show the earliest development of spatial response, followed by the appearance of place cells at a similar developmental stage. In contrast, grid cells emerge later in this developmental sequence. The gradual improvement in spatial stability in firing patterns likely plays a crucial role in the developmental trajectory of grid cells. Contrary to the model simulation, grid cells emerge later than place cells and head direction cells, yet they also hold significance in spatial mapping. 

      - The model simulations suggest that certain grid patterns are acquired more gradually than others. For instance, egocentric grid cells require the stabilization of place cell memories amidst ongoing consolidation, while allocentric grid cells tend to reflect average place field positions. However, these findings seemingly conflict with empirical studies, particularly those on the conjunctive representation of distance and direction in the earliest grid cells. Previous studies show no significant differences were found in grid cells and grid cells with directional correlates across these age groups, relative to adults (Wills et al., 2012). This indicates that the combined representation of distance and direction in single mEC cells is present from the earliest ages at which grid cells emerge. 

      These are good points and they have been addressed in a new section of the introduction titled ‘The Scope of the Proposed Model’. That section reads:

      “The reported simulations explain why most mEC cell types in the rodent literature appear to be spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). Assuming that rodents can form non-spatial memories, rodent hippocampus must receive non-spatial input from entorhinal cortex. These simulations suggest that characterization of the rodent mEC cortex as primarily spatial might be incorrect if most grid cells (except perhaps head direction conjunctive grid cells) have been mischaracterized as spatial. Other literatures with other species find non-spatial representations in MTL (Gulli et al., 2020; Quiroga et al., 2005; Wixted et al., 2014) and non-spatial hippocampal memory encoding has been found in rodents (Liu et al., 2012; McEchron & Disterhoft, 1999). The proposed memory model is compatible with these results – the ideas contained in this model could be applied to nonspatial memory representations. However, surveys of cell types in rodent entorhinal cortex seem to indicate that most cells are spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). How can the rodent hippocampus encode nonspatial memories if most of its input is spatial? The goal of the reported simulations is to explain the apparent paucity of non-spatial cells in rodent entorhinal cortex by proposing that grid cells have been misclassified as spatial (see also Luo et al., 2024).

      Given the simplicity of the proposed model, there are important findings that the model cannot address -- it is not that the model makes the wrong predictions but rather that it makes no predictions. The role of running speed (Kraus et al., 2015) is one such variable for which the model makes no predictions. Similarly, because the model is a rate-coded model rather than a model of oscillating spiking neurons, it makes no predictions regarding theta oscillations (Buzsáki & Moser, 2013). The model is an account of learning and memory for an adult animal, and it makes no predictions regarding the developmental (Langston et al., 2010; Muessig et al., 2015; Wills et al., 2012) or evolutionary (Rodrıguez et al., 2002) time course of different cell types. This model contains several purely spatial representations such as border cells, head direction cells, and head direction conjunctive grid cells and it may be that these purely spatial cell types emerged first, followed by the evolution and/or development of non-spatial cell types. However, this does not invalidate the model. Instead, this is a model for an adult animal that has both episodic memory capabilities and spatial navigation capabilities, irrespective of the order in which these capabilities emerged.

      This model has the potential to explain context effects in memory (Godden & Baddeley, 1975; Gulli et al., 2020; Howard et al., 2005). According to this model, different grid cells represent different non-spatial characteristics and place cells represent the combination of these “context” factors and location. In the simulation, just one grid cell is simulated but the same results would emerge when simulating hundreds of different non-spatial inputs provided that all of the simulated non-spatial inputs exist throughout the recording session. However, there is evidence that hippocampus can explicitly represent the passage of time (Eichenbaum, 2014), and time is assuredly an important factor in defining episodic memory (Bright et al., 2020). Thus, although the current model addresses unique combinations of what and where, it is left to future work to incorporate representations of when in the memory model.”

      Reviewer #3 (Public Review): 

      A crucial assumption of the model is that the content of experience must be constant in space. It's difficult to imagine a real-world example that satisfies this assumption. Odors and sounds are used as examples. While they are often more spatially diffuse than an objects on the ground, odors and sounds have sources that are readily detectable. Animals can easily navigate to a food source or to a vocalizing conspecific. This assumption is especially problematic because it predicts that all grid cells should become silent when their preferred non-spatial attribute (e.g. a specific odor) is missing. I'm not aware of any experimental data showing that grid cells become silent. On the contrary, grid cells are known to remain active across all contexts that have been tested, including across sleep/wake states. Unlike place cells, grid cells do not seem to turn off. Since grid cells are active in all contexts, their preferred attribute must also be present in all contexts, and therefore they would not convey any information about the specific content of an experience.  

      These are good points and in this revision I have attempted to explain that there is a great deal of contextual similarity across all recording sessions. One paragraph in the discussion now reads

      “In a typical rodent spatial navigation study, the non-spatial attributes are wellcontrolled, existing at all locations regardless of the enclosure used during testing (hence, a grid cell in one enclosure will be a grid cell in a different enclosure). Because labs adopt standard procedures, the surfaces, odors (e.g., from cleaning), external lighting, time of day, human handler, electronic apparatus, hunger/thirst state, etc. might be the same for all recording sessions. Additionally, the animal is not allowed to interact with other animals during recording and this isolation may be an unusual and highly salient property of all recording sessions. Notably, the animal is always attached to wires during recording. The internal state of the animal (fear, aloneness, the noise of electronics, etc.) is likely similar across all recording situations and attributes of this internal state are likely represented in the hippocampus and entorhinal input to hippocampus. According to this model, hippocampal place cells are “marking” all locations in the enclosure as places where these things tend to happen.”

      The proposed novelty of this theory is that other models all assume that grid cells encode space. This isn't quite true of models based on continuous attractor networks, the discussion of which is notably absent. More specifically, these models focus on the importance of intrinsic dynamics within the entorhinal cortex in generating the grid pattern. While this firing pattern is aligned to space during navigation and therefore can be used as a representation of that space, the neural dynamics are preserved even during sleep. Similarly, it is because the grid pattern does not strictly encode physical space that gridlike signals are also observed in relation to other two-dimensional continuous variables. 

      These models were briefly discussed in the general discussion section and in this revision they are further discussed in the introduction in a new section, titled ‘Why Model the Rodent Navigation Literature with a Memory Model?’ That section reads:

      “Spatial navigation is inherently a memory problem – learning the spatial arrangement of a new enclosure requires memory for the conjunction of what and where. This has long been realized and in the introduction to ‘Hippocampus as a Cognitive Map’, O’Keefe and Nadel (1978) wrote “We shall argue that the hippocampus is the core of a neural memory system providing an objective spatial framework within which the items and events of an organism's experience are located and interrelated” (emphasis added). Furthermore, in the last chapter of their book, they extended cognitive map theory to human memory for non-spatial characteristics. However, in the decades since the development of cognitive map theory, the rodent spatial navigation and human memory literatures have progressed somewhat independently.

      The ideas proposed in this model are an attempt to reunify these literatures by returning to the original claim that spatial navigation is inherently a memory problem. The goal of the current study is to explain the rodent spatial navigation literature using a memory model that has the potential to also explain the human memory literature. In contrast, most grid cell models (Bellmund et al., 2016; Bush et al., 2015; Castro & Aguiar, 2014; Hasselmo, 2009; Mhatre et al., 2012; Solstad et al., 2006; Sorscher et al., 2023; Stepanyuk, 2015; Widloski & Fiete, 2014) are domain specific models of spatial navigation and as such, they do not lend themselves to explanations of human memory. Thus, the reason to prefer this model is parsimony. Rather than needing to develop a theory of memory that is separate from a theory of spatial navigation, it might be possible to address both literatures with a unified account.

      This study does not attempt to falsify other theories of grid cells. Instead, this model reaches a radically different interpretation regarding the function of grid cells; an interpretation that emerges from viewing spatial navigation as a memory problem. All other grid cell models assume that an entorhinal grid cell displaying a spatially arranged grid of firing fields serves the function of spatial coding (i.e., spatial grid cells exist to support a spatial metric). In contrast, the proposed memory model of grid cells assumes that the hexagonal tiling reflects the need to keep memories separate from each other to minimize confusion and confabulation – the grid pattern is the byproduct of pattern separation between memories rather than the basis of a spatial code. 

      It is now understood that grid-like firing fields can occur for non-spatial two dimensional spaces. For instance, human entorhinal cortex exhibits grid-like responses to video morph trajectories in a two-dimensional bird neck-length versus bird leg-length space (Constantinescu et al., 2016). As a general theory of learning and memory, the proposed memory model of grid cells is easily extended to explain these results (e.g., relabeling the border cell inputs in the model as neck-length and leg-length inputs). However, there are other grid cell models that can explain both spatial grid cells as well as non-spatial grid-like responses (Mok & Love, 2019; Rodríguez-Domínguez & Caplan, 2019; Stachenfeld et al., 2017; Wei et al., 2015). Similar to this memory model of grid cells, these models are also positioned to explain both the rodent spatial navigation and human memory literatures. Nevertheless, there is a key difference between this model and other grid cell models that generalize to non-spatial representations. Specifically, these other models assume that grid cells exhibiting spatial receptive fields serve the function of identifying positions in the environment (i.e., their function is spatial). As such, these models do not explain why most of the input to rodent hippocampus appears to be spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). This memory model of grid cells provides an answer to the apparent paucity of nonspatial cell types in rodent MTL by proposing that grid cells with spatial receptive fields have been misclassified as spatial (they are what cells rather than where cells) and that place cells are fundamentally memory cells that conjoin what and where.”

      The use of border cells or boundary vector cells as the main (or only) source of spatial information in the hippocampus is not well supported by experimental data. Border cells in the entorhinal cortex are not active in the center of an environment. Boundary-vector cells can fire farther away from the walls but are not found in the entorhinal cortex. They are located in the subiculum, a major output of the hippocampus. While the entorhinalhippocampal circuit is a loop, the route from boundary-vector cells to place cells is much less clear than from grid cells. Moreover, both border cells and boundary-vector cells (which are conflated in this paper) comprise a small population of neurons compared to grid cells.

      AUTHOR RESPONSE: The model can be built without assuming between-border cells (early simulations with the model did not make this assumption). Regarding this issue, the text reads “Unlike the BVC model, the boundary cell representation is sparsely populated using a basis set of three cells for each of the three dimensions (i.e., 9 cells in total), such that for each of the three non-orthogonal orientations, one cell captures one border, another the opposite border, and the third cell captures positions between the opposing borders (Solstad et al., 2008). However, this is not a core assumption, and it is possible to configure the model with border cell configurations that contain two opponent border cells per dimension, without needing to assume that any cells prefer positions between the borders (with the current parameters, the model predicts there will be two border cells for each between-border cell). Similarly, it is possible to configure the model with more than 3 cells for each dimension (i.e., multiple cells representing positions between the borders).” The Solstad paper found a few cells that responded in positions between borders, but perhaps not as many as 1 out of 3 cells, such as this particular model simulation predicts. If the paucity of between-border cells is a crucial data point, the model can be reconfigured with opponent-border cells without any between border cells. The reason that 3 border cells were used rather than 2 opponent border cells was for simplicity. Because 3 head direction cells were used to capture the face-centered cubic packing of memories, the simulation also used 3 border cells per dimensions to allow a common linear sum metric when conjoining dimensions to form memories. If the border dimensions used 2 cells while head direction used 3 cells, a dimensional weighting scheme would be needed to allow this mixing of “apples and oranges” in terms of distances in the 3D space that includes head direction.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors): 

      Specific questions/clarifications:  

      (1) Assumption of population-based vs single unit link to biological cells: At the start, the author assumes that each unit here can be associated with a population: "the simulated activation values can be thought of as proportional to the average firing rate of an ensemble of neurons with similar inputs and outputs (O'Reilly & Munakata, 2000)." But is a 'grid cell' found here a single cell or an average of many cells? Does this mean the model assumes many cells that have different fields that are averaged, which become a grid-like unit in the model? But in biology, these are single cells? Or does it mean a grid response is an average of the place cell inputs? 

      I apologize for being unclear about this. The grid cells in the model are equivalent to real single cells except that the simulation uses a ratecoded cell rather than a spiking cell. The averaging that was mentioned in the paper is across identically behaving spiking cells rather than across cells with different grid field arrangements. To better explain this, I have added the following text:

      “For instance, consider a set of several thousand spiking grid cells that are identical in terms of their firing fields. At any moment, some of these identically-behaving cells will produce an action potential while others do not (i.e., the cells are not perfectly synchronized), but a snapshot of their behavior can be extracted by calculating average firing rate across the ensemble. The simulated cells in the model represent this average firing rate of identically-behaving ensembles of spiking neurons.” 

      This is a mathematical short-cut to avoid simulating many spiking neurons. Because this model was compared to real spike rate maps, this real-valued average firing rate is down-sampled to produce spikes by finding the locations that produced the top 5% of real-valued activation values across the simulation.

      (2) It is not clear to me why they are circular border cells/basis sets.  

      In the initial submission, there was a brief paragraph describing this assumption. In this revision, that paragraph has been expanded and modified for greater clarity. It now reads:

      “Because head direction is necessarily a circular dimension, it was assumed that all dimensions are circular (a circular dimension is approximately linear for nearby locations). This assumption of circular dimensions was made to keep the model relatively simple, making it easier to combine dimensions and allowing application of the same processes for all dimensions. For instance, the model requires a weight normalization process to ensure that the pattern of weights for each dimension corresponds to a possible input value along that dimension. However, the normalization for a linear dimension is necessarily different than for a circular dimension. Because the neural tuning functions were assumed to be sine waves, normalization requires that the sum of squared weights add up to a constant value. For a linear dimension, this sum of squares rule only applies to the subset of cells that are relevant to a particular value along the dimension whereas for a circular dimension, this sum of squares rule is over the entire set of cells that represent the dimension (i.e., weight normalization is easier to implement with circular dimensions). Although all dimensions were assumed to be circular for reasons of mathematical convenience and parsimony, circular dimensions may relate to the finding that human observers have difficultly re-orienting themselves in a room depending on the degree of rotational symmetry of the room (Kelly et al., 2008). In addition, this simplifying assumption allows the model to capture the finding that the population of grid cells lies on a torus (Gardner et al., 2022), although I note that the model was developed before this result was known.”

      (3) Why is it 3 components? I realise that the number doesn't matter too much, but I believe more is better, so is it just for simplicity? 

      In this revision, additional text has been added to explain this assumption: “To keep the model simple, the same number of cells was assumed for all dimensions and all dimensions were assumed to be circular (head direction is necessarily circular and because one dimension needed to be circular, all dimensions were assumed to be circular). Three cells per dimensions was chosen because this provides a sparse population code of each dimension, with few border cells responding between borders, with few border cells responding between borders, while allowing three separate phases of grid cells within a grid cell module (in the model, a grid cell module arises from combination of a third dimension, such as head direction, with the real-world X/Y dimensions defined by border cells).”

      As a reminder, the text explaining the sparse coding of border cells reads: “However, this is not a core assumption, and it is possible to configure the model with border cell configurations that contain two opponent border cells per dimension, without needing to assume that any cells prefer positions between the borders (with the current parameters, the model predicts there will be two border cells for each between-border cell). Similarly, it is possible to configure the model with more than 3 cells for each dimension (i.e., multiple cells representing positions between the borders).”

      The model can work with just two opponent cells or with more than three cells per basis set. In different simulations, I have explored these possibilities. Three was chosen because it is a convenient way to highlight the face-centered cubic packing of memories that tends to occur (FCP produces 3 alternating layers of hexagonally arranged firing fields). Thus, each of the three head direction cells captures a different layer of the FCP arrangement. A more realistic simulation might combine 6 different head direction cells tiling the head direction dimension with opponent border cells (just 2 cells for each border dimensions). Such a combination would produce responses at borders, but no responses between borders and, at the same time, the head direction cells would still reveal the FCP arrangement. However, it is not easy to find the right parameters for such a mix-and-match simulation in which different dimensions have different numbers of tuning functions (e.g., some dimensions having 2 cells while others have 3 or 6 and some dimensions being linear while others are circular). When all of the dimensions are of the same type, the simple sum that arises from multiplying the input by the weight values gives rise to Euclidean distance (see Figure 3B). With a mix-and-match model of different dimension-types, it should be possible to adjust the sum to nevertheless produce a monotonic function with Euclidean distance although I leave this to future work. To keep things simple, I assumed that all dimensions are of the same type (circular, with 3 cells per dimension).  

      (4) Confusion due to the border cells/box was unclear to me. "If the period of the circular border cells was the same as the width of the box, then a memory pushed outside the box on one side would appear on the opposite side of the box, in which case the partial grid field on one side should match up with its remainder on the other side. This would entail complete confusion between opposite sides of the box, and the representation of the box would be a torus (donut-shaped) rather than a flat two-dimensional surface. To reduce confusion ..." Is this confusion of the model? Of the animal?  

      This would be confusion of the animal (e.g., a memory field overlapping with one border would also appear at the opposite border in the corresponding location). At one point in model development, I made the assumption that one side of the box wraps to the other side, and I asked Trygve Solstad to run some analyses of real data to see if cells actually wrap around in this manner. He did not find any evidence of this, and so I decided to include outsidethe-box representational area which, as it turned out, allowed the model to capture other behaviors as detailed in the paper.

      This section of the paper now reads:

      “The cosine tuning curves of the simulated border cells represent distance from the border on both sides of the border (i.e., firing rate increases as the animal approaches the border from either the inside or the outside of the enclosure). Experimental procedures do not allow the animal to experience locations immediately outside the enclosure, but these locations remain an important part of the hypothetic representation, particularly when considering the modification of memories through consolidation (i.e., a memory created inside the enclosure might be moved to a location outside the enclosure). This symmetry about the border cell’s preferred location is needed to maintain an unbiased representation, with a constant sum of squares for the border cell inputs (see methods section). Rather than using linear dimensions, all dimensions were assumed to be circular to keep the model relatively simple. This assumption was made because head direction is necessarily a circular dimension and by having all dimensions be circular, it is easy to combine dimensions in a consistent manner to produce multidimensional hippocampal place cell memories. Thus, the border cells define a torus (or more accurately a three-torus) of possible locations. This provides a hypothetical space of locations that could be represented.

      In light of the assumption to represent border cells with a circular dimension, when a memory is pushed outside the East wall of the enclosure, it would necessarily be moved to the West wall of the enclosure if the period of the circular dimension was equal to the width of the enclosure. If this were true, then the partial grid field on one side of the enclosure would match up with its remainder on the other side. Such a situation would cause the animal to become completely confused regarding opposite sides of the enclosure (a location on the West wall would be indistinguishable from the corresponding location on the East wall). To reduce confusion between opposite sides of the enclosure, the width of the enclosure in which the animal navigated (Figure 5) was assumed to be half as wide as the full period of the border cells. In other words, although the space of possible representations was a three-torus, it was assumed that the real-world twodimensional enclosure encompassed a section of the torus (e.g., a square piece of tape stuck onto the surface of a donut). The torus is better thought of as “playing field” in which different sizes and shapes of enclosure can be represented (i.e., different sizes and shapes of tape placed on the donut). Furthermore, this assumption provides representational space that is outside the box without such locations wrapping around to the opposite side of the box.”

      (5) Figure 3 - This result seems to be related to whether you use Euclidean or city-block distance. If you use Euclidean distances in two dimensions wouldn't this work out fine?  

      Euclidean distance was the metric used in the analysis of the two-dimensional simulation, but this did not work out. To make this clear, I have changed the label on the x-axes to read “Euclidean distance” for both the two- and three-dimensional simulations. The two-dimensional simulation produced city block behavior rather than Euclidean behavior because memory retrieval is the sum of the two dimensions, as is standard in neural networks, rather than the Euclidian distance formula, which would require that memory retrieval be the square root of the sum of squares of the two dimensions. One way to address this problem with the two-dimensional simulation would be to use a specific Euclidean-mimicking activation function rather than a simple sum of dimensions. The very first model I developed used such an activation function as applied to opponent border cells with just two dimensions (so 4 cells in total – left/right and top/down). This produced Euclidean behavior, but the activation function was implausible and did not generalize to simulations that also included head direction. In contrast, with three non-orthogonal dimensions, the simple sum of dimensions is approximately Euclidean.

      (6) Final sentence of the Discussion: "However, unlike the present model, these models still assume that entorhinal grid cells represent space rather than a non-spatial attribute." I am not sure if the authors of the cited papers will agree with this. They consider the spatial cases, but most argue they can treat non-spatial features as well. What the author might mean is that they assume non-spatial features are in some metric space that, in a way, is spatial. However, I am not sure if the author would argue that non-spatial features cannot be encoded metrically (e.g., Euclidean distance based on the similarity of odours). 

      In this section, when referring to “entorhinal grid cells” I was specifically referring to traditional grid cells in a rodent spatial navigation experiment. I did not mean to imply that these other theories cannot explain nonspatial grid fields, such as in the two-dimensional bird space grid cells found with humans. The way in which the proposed memory model and these other models differ is in terms of what they assume regarding the function of grid cells that exhibit spatial grid fields. In this revision, I have changed this text to read:

      “These models can capture some of the grid cell results presented in the current simulations, including extension to non-spatial grid-like responses (e.g., grid field that cover a two-dimensional neck/leg length bird space). Furthermore, these models may be able to explain memory phenomena similar to the model proposed in this study. However, unlike the proposed model, these models assume that the function of entorhinal grid cells that exhibit spatial X/Y grid fields during navigation is to represent space. In contrast, the memory model proposed in this study assume that the function of spatial X/Y grid cells is to represent a non-spatial attribute; the only reason they exhibit a spatial X/Y grid is because memories of that non-spatial attribute are arranged in a hexagonal grid owing to the uncluttered/unvarying nature of the enclosure. Thus, these model do not explain why most of the input to rodent hippocampus appears to be spatial (Boccara et al., 2010b; Diehl et al., 2017; Grieves & Jeffery, 2017) whereas the proposed model can explain this situation as reflecting the miss-classification of grid cells with a spatial arrangement as providing spatial input to hippocampus.”

      (7) It would be interesting to see videos/gifs of the model learning, and an idea of how many steps of trials it takes (is it capturing real-time rodent cell firing whilst foraging, or is it more abstracted, taking more trials). 

      The short answer is “yes”, the model is capturing real-time rodent cell firing while foraging. This is particularly true when simulating place cell memories in the absence of head direction information, as was shown in a video provided in the initial submission in relation to Figure 4. In this revision, I have provided a second video of learning when simulating place cell memories that include head direction. This second video is in relation to the results reported in Figure 9. This shows that even when learning a three-dimensional real-world space (X, Y, and head direction), the model rapidly produces an on-average hexagonal arrangement of place cells memories owing to the slight tendency of the place cell memories to linger in some locations as compared to others during consolidation. More specifically, they are more likely to linger in the locations that are the intersections of the peaks and/or troughs of the border cells and it is this tendency that supports the immediate appearance of grid cells. However, because the place cell memories are still shifting, head direction conjunctive grid cells are slower to emerge (the head direction conjunctive grid cells require stabilization of the place cells). The video then speeds up the learning process to so how place cells eventually stabilize after sufficient learning of the borders of the enclosure from different head/view directions.

      (8) One question is whether all the results have to be presented in the main text. It was difficult to see which key predictions fit the data and do so better than a spatial/navigation account. 

      Thank you for this suggestion. To make the paper more readable and easier for different readers with different interests to choose different aspects of the results to read, the second half of the results have been put in an appendix. More specifically, the second half of the results concerned place cells rather than grid cells. Thus, in this revision, the main text concerns grid cell results and the appendix concerns place cell results.

      Reviewer #3 (Recommendations For The Authors):  

      The title could usefully be shortened to focus on the main argument that observed firing patterns could be consistent with mapping memories instead of space. It's a stretch to argue that memory is the primary role when no such data is presented (i.e., there is no comparison of competing models). 

      This is a good point (I do not present evidence that conclusively indicates the function of MTL). This original title was chosen to make clear how this account is a radical departure from other accounts of grid cells. The revised title highlights that: 1) a memory model can also explain rodent single cell recording data during navigation; and 2) grid cell may not be non-spatial. The revised title is: “A Memory Model of Rodent Spatial Navigation: Place Cells are Memories Arranged in a Grid and Grid Cells are Non-spatial”

      When arguing that the main role of the hippocampus is memory, I strongly suggest engaging with the work of people like Howard Eichenbaum who spent the better part of their career arguing the same (e.g. DOI:10.1152/jn.00005.2017.)  

      Thank you for pointing out this important oversight. Early in introduction, I now write: “The proposal that hippocampus represents the multimodal conjunctions that define an episode is not new (Marr et al., 1991; Sutherland & Rudy, 1989) and neither is the proposal that hippocampal memory supports spatial/navigation ability (Eichenbaum, 2017). This view of the hippocampus is consistent with “feature in place” results (O’Keefe & Krupic, 2021) in which hippocampal cells respond to the conjunction of a non-spatial attribute affixed to a specific location, rather than responding more generically to any instance of a non-spatial attribute. In other words, the what/where conjunction is unique. Furthermore, the uniqueness of the what/where conjunction may be the fundamental building block of spatial memory and navigation. In reviewing the hippocampal literature, Howard Eichenbaum (2017) concludes that ‘the hippocampal system is not dedicated to spatial cognition and navigation, but organizes experiences in memory, for which spatial mapping and navigation are both a metaphor for and a prominent application of relational memory organization.’”

      With a focus on episodic memory, there should be a mention of the temporal component of memory. While it may rightfully be beyond the scope of this model, it's confusing to omit time completely from the discussion. 

      This issue and several others are now addressed in a new section in the introduction titled ‘The Scope of the Proposed Model’. That section reads:

      “The reported simulations explain why most mEC cell types in the rodent literature appear to be spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). Assuming that rodents can form non-spatial memories, rodent hippocampus must receive non-spatial input from entorhinal cortex. These simulations suggest that characterization of the rodent mEC cortex as primarily spatial might be incorrect if most grid cells (except perhaps head direction conjunctive grid cells) have been mischaracterized as spatial. Other literatures with other species find non-spatial representations in MTL (Gulli et al., 2020; Quiroga et al., 2005; Wixted et al., 2014) and non-spatial hippocampal memory encoding has been found in rodents (Liu et al., 2012; McEchron & Disterhoft, 1999). The proposed memory model is compatible with these results – the ideas contained in this model could be applied to nonspatial memory representations. However, surveys of cell types in rodent entorhinal cortex seem to indicate that most cells are spatial (Boccara et al., 2010; Diehl et al., 2017; Grieves & Jeffery, 2017). How can the rodent hippocampus encode nonspatial memories if most of its input is spatial? The goal of the reported simulations is to explain the apparent paucity of non-spatial cells in rodent entorhinal cortex by proposing that grid cells have been misclassified as spatial (see also Luo et al., 2024).

      Given the simplicity of the proposed model, there are important findings that the model cannot address -- it is not that the model makes the wrong predictions but rather that it makes no predictions. The role of running speed (Kraus et al., 2015) is one such variable for which the model makes no predictions. Similarly, because the model is a rate-coded model rather than a model of oscillating spiking neurons, it makes no predictions regarding theta oscillations (Buzsáki & Moser, 2013). The model is an account of learning and memory for an adult animal, and it makes no predictions regarding the developmental (Langston et al., 2010; Muessig et al., 2015; Wills et al., 2012) or evolutionary (Rodrıguez et al., 2002) time course of different cell types. This model contains several purely spatial representations such as border cells, head direction cells, and head direction conjunctive grid cells and it may be that these purely spatial cell types emerged first, followed by the evolution and/or development of non-spatial cell types. However, this does not invalidate the model. Instead, this is a model for an adult animal that has both episodic memory capabilities and spatial navigation capabilities, irrespective of the order in which these capabilities emerged.

      This model has the potential to explain context effects in memory (Godden & Baddeley, 1975; Gulli et al., 2020; Howard et al., 2005). According to this model, different grid cells represent different non-spatial characteristics and place cells represent the combination of these “context” factors and location. In the simulation, just one grid cell is simulated but the same results would emerge when simulating hundreds of different non-spatial inputs provided that all of the simulated non-spatial inputs exist throughout the recording session. However, there is evidence that hippocampus can explicitly represent the passage of time (Eichenbaum, 2014), and time is assuredly an important factor in defining episodic memory (Bright et al., 2020). Thus, although the current model addresses unique combinations of what and where, it is left to future work to incorporate representations of when in the memory model.”

      I recommend explaining the motivation of the theory in more detail in the introduction. It reads as "what if it's like this?" It would be helpful to instead highlight the limitations of current theories and argue why this theory is either a better fit for the data or is logically simpler. 

      This issue and several others are now addressed in the new section in the introduction titled ‘Why Model the Rodent Navigation Literature with a Memory Model?’, which I quoted above in response to the public reviews.

      It's worth considering shortening the results section to include only those that most convincingly support the main claim. The manuscript is quite long and appears to lack focus at times. 

      Thank you for this suggestion. To make the paper more readable and easier for different readers with different interests to choose different aspects of the results to read, the second half of the results have been put in an appendix. More specifically, the second half of the results concerned place cells rather than grid cells. Thus, in this revision, the main text concerns grid cell results and the appendix concerns place cell results.

      The discussion of path dependence on the formation of the grid pattern is important but only briefly discussed. It may be useful to add simulations testing whether different paths (not random walks) produce distorted grid patterns. 

      The short answer is that the path doesn’t affect things in general. The consolidation rule ensures equally spaced memories even if, for instance, one side of the enclosure is explored much more than the other side. As just one example, I have run simulations with a radial arm maze and even though the animal is constrained to only run on the maze arms. The memories still arrange hexagonally as memories become pushed outside the arms. Rather than adding additional simulations to study, I now briefly describe this in the model methods:

      “Of note, the ability of the model to produce grid cell responses does not depend on this decision to simulate an animal taking a random walk – the same results emerge if the animal is more systematic in its path. All that matters for producing grid cell responses is that the animal visits all locations and that the animal takes on different head directions for the same location in the case of simulations that also include head direction as an input to hippocampal place cells.”

      I struggle to understand in Figure 3 why retrieval strength ought to scale monotonically with Euclidean distance, and why that justifies a more complex model (three non-orthogonal dimensions). 

      The introduction to this section now reads: “Animals can plan novel straight line paths to reach a known position and evidence suggests they do so by learning Euclidean representations of space (Cheng & Gallistel, 2014; Normand & Boesch, 2009; Wilkie, 1989). Thus, it was assumed that hippocampal place cells represent positions in Euclidean space (as opposed to non-Euclidean space, such a occurs with a city-block metric).”

      p.17 "although the representational space is a torus (or more specifically a three-torus), it is assumed that the real-world two-dimensional surface is only a section of the torus (e.g., a square piece of tape stuck onto the surface of a donut)." I fail to understand how the realworld surface is only a part of the torus. In the existing theoretical and experimental work on toroidal topology of grid cell activity, the torus represents a very small fraction of the real world, and repeating activity on the toroidal manifold is a crucial feature of how it maps 2D space in a regular manner. Why then here do you want the torus to be larger than the realworld? 

      This section has been rewritten to better explain these assumptions. The relevant paragraphs now read:

      “The cosine tuning curves of the simulated border cells represent distance from the border on both sides of the border (i.e., firing rate increases as the animal approaches the border from either the inside or the outside of the enclosure). Experimental procedures do not allow the animal to experience locations immediately outside the enclosure, but these locations remain an important part of the hypothetic representation, particularly when considering the modification of memories through consolidation (i.e., a memory created inside the enclosure might be moved to a location outside the enclosure). This symmetry about the border cell’s preferred location is needed to maintain an unbiased representation, with a constant sum of squares for the border cell inputs (see methods section). Rather than using linear dimensions, all dimensions were assumed to be circular to keep the model relatively simple. This assumption was made because head direction is necessarily a circular dimension and by having all dimensions be circular, it is easy to combine dimensions in a consistent manner to produce multidimensional hippocampal place cell memories. Thus, the border cells define a torus (or more accurately a three-torus) of possible locations. This provides a hypothetical space of locations that could be represented.

      In light of the assumption to represent border cells with a circular dimension, when a memory is pushed outside the East wall of the enclosure, it would necessarily be moved to the West wall of the enclosure if the period of the circular dimension was equal to the width of the enclosure. If this were true, then the partial grid field on one side of the enclosure would match up with its remainder on the other side. Such a situation would cause the animal to become completely confused regarding opposite sides of the enclosure (a location on the West wall would be indistinguishable from the corresponding location on the East wall). To reduce confusion between opposite sides of the enclosure, the width of the enclosure in which the animal navigated (Figure 5) was assumed to be half as wide as the full period of the border cells. In other words, although the space of possible representations was a three-torus, it was assumed that the real-world twodimensional enclosure encompassed a section of the torus (e.g., a square piece of tape stuck onto the surface of a donut). The torus is better thought of as “playing field” in which different sizes and shapes of enclosure can be represented (i.e., different sizes and shapes of tape placed on the donut). Furthermore, this assumption provides representational space that is outside the box without such locations wrapping around to the opposite side of the box.”

      p.28 "More specifically, egocentric grid cells (e.g., head direction conjunctive grid cells) require stabilization of the place cell memories in the face of ongoing consolidation whereas allocentric grid cells reflect on-average place field positions." and p.32 "if place cells represent episodic memories, it seems natural that they should include head direction (an egocentric viewpoint)." But the head direction signal is not egocentric, it is allocentric. I'm unsure whether this is a typo or a potentially more serious conceptual misunderstanding. 

      Any reference to egocentric has been removed in this revision. In the initial submission, when I used egocentric, I was referring to memories that depended on the head direction of the animal at the time of memory formation. I was using “egocentric” in relation to whether the memory was related to the animal’s personal bodily experience at the time of memory formation. But I concede that this is confusing since the ego/allo distinction is typically used to differentiate angular directions that are relative to the person (left/right) versus earth (East/West). Instead, throughout the manuscript I now refer to these as view-dependent memories since head direction would entail having a different view of the environment at the time of memory formation. I still refer to the stacking of multiple view-dependent memories on the same X/Y location as being the development of an allocentric representation however, since this can be thought of as one way to learn a cognitive map of the enclosure that is view independent.

      p.37 "But if the border cells had changed their alignment with the new enclosure (e.g., if the E border dimension aligned with the North-South borders), then the place cells would have appeared to undergo global remapping as their positions rotated by 90 degrees and the grid pattern would have also rotated." But this would not be interpreted as global remapping by standard analyses of place and grid cell responses. A coherent rotation of firing patterns is not interpreted as remapping. 

      This sentence now reads: “But if the border cells had changed their alignment with the new enclosure (e.g., if the E border dimension aligned with the North-South borders), then the place cells would remain in their same positions relative to the now-rotated borders (i.e., no remapping relative to the enclosure) and the corresponding grid cells would also retain their same alignment relative to the enclosure.”

      p.37 "this is more accurately described as partial remapping (nearly all place fields were unaffected)." If nearly all place fields were unaffected, this should be interpreted as a stable map. Partial remapping is a mix of stability, rate remapping, and global remapping within a population of place cells. 

      This sentence has been removed.

      p.40 "The dependence of grid cell responses on memory may help explain why grid cells have been found for bats crawling on a two-dimensional surface (Yartsev et al., 2011), but three-dimensional grid cells have never been observed for flying bats." This is not true. Ginosar et al. (2021) observed 3D grid cells in flying bats.  

      Thank you for highlighting this issue. In the initial submission I was using “grid cell” to mean a cell that produced a precise hexagonal grid, which is not the case for the 3D grid cells in bats. In this revision, I now discuss grid cell that produce irregular grid fields, writing:

      “According to this model, hexagonally arranged grid cells should be the exception rather than the rule when considering more naturalistic environments. In a more ecologically valid situation, such as with landmarks, varied sounds, food sources, threats, and interactions with conspecifics, there may still be remembered locations were events occurred or remembered properties can be found, but because the non-spatial properties are non-uniform in the environment, the arrangement of memory feedback will be irregular, reflecting the varied nature of the environment. This may explain the finding that even in a situation where there are regular hexagonal grid cells, there are often irregular non-grid cells that have a reliable multi-location firing field, but the arrangement of the firing fields is irregular (Diehl et al., 2017). For instance, even when navigating in an enclosure that has uniform properties as dictated by experimental procedures, they may be other properties that were not well-controlled (e.g., a view of exterior lighting in some locations but not others), and these uncontrolled properties may produce an irregular grid (i.e., because the uncontrolled properties are reliably associated with some locations but not others, hippocampal memory feedback triggers retrieval of those properties in the associations locations).

      In this memory model, there are other situations in which an irregular but reliable multi-location grid may occur, even when everything is well controlled. In the reported simulations, when the hippocampal place cells were based on variation in X/Y (as defined by Border cells), nothing else changed as a function of location, and the model rapidly produced a precise hexagonal arrangement of hippocampal place cell memories. When head direction was included (i.e., real-world variation in X, Y, and head direction), the model still produced a hexagonal arrangement as per face centered cubic packing of memories, but this precise arrangement was slower to emerge, with place cells continuing to shift their positions until the borders of the enclosure were sufficiently well learned from multiple viewpoints. If there is realworld variation in four or more dimensions, as is likely the case in a more ecologically valid situation, it will be even harder for place cell memories to settle on a precise regular lattice. Furthermore, in the case of four dimensions, mathematicians studying the “sphere packing problem” recently concluded that densest packing is irregular (Campos et al., 2023). This may explain why the multifield grid cells for freely flying bats have a systematic minimum distance between firing fields, but their arrangement is globally irregular (Ginosar et al., 2021). Assuming that the memories encoded by a bat include not just the three realworld dimensions of variation, but also head direction, the grid will likely be irregular even under optimal conditions of laboratory control.”

      Multiple typos are found on page 25, end of paragraph 3: "More specifically, if there is one set of non-orthogonal dimensions for enclosure borders and the movement of one wall is too modest as to cause avoid global remapping, this would deform the grid modules based the enclosure border cells."

      As detailed above in the response the public reviews, this paragraph has been rewritten.

    1. eLife Assessment

      The authors studied the development of mesentery borders in the rice coral Montipora, a new experimental system, to complement existing data from the sea anemone Nematostella. They make a solid case that in Montipora, there is a sequence of Hox-Gbx genes whose staggered expression in the unsegmented larva is suggestive of their role in subdividing the gastric cavity into repeated units bordered by mesenteries, as in the sea anemone Nematostella. Pharmacological experiments also point to the involvement of the BMP pathway in this process, but additional experiments validating this are necessary. This is a valuable contribution to the field of cnidarian evolution, suggesting that BMP- and "Hox-Gbx code"-dependent patterning of the directive axis was ancestral for Anthozoa.

    1. Reviewer #2 (Public review):

      Summary:

      This manuscript investigates how olfactory representations are transformed along the cortico-hippocampal pathway in mice during a non-associative learning paradigm involving novel and familiar odors. By recording single-unit activity in several key brain regions (AON, aPCx, LEC, CA1, and SUB), the authors aim to elucidate how stimulus identity and experience are encoded and how these representations change across the pathway.

      The study addresses an important question in sensory neuroscience regarding the interplay between sensory processing and signaling novelty/familiarity. It provides insights into how the brain processes and retains sensory experiences, suggesting that the earlier stations in the olfactory pathway, the AON aPCx, play a central role in detecting novelty and encoding odor, while areas deeper into the pathway (LEC, CA1 & Sub) are more sparse and encodes odor identity but not novelty/familiarity. However, there are several concerns related to methodology, data interpretation, and the strength of the conclusions drawn.

      Strengths:

      The authors combine the use of modern tools to obtain high-density recordings from large populations of neurons at different stages of the olfactory system (although mostly one region at a time) with elegant data analyses to study an important and interesting question.

      Weaknesses:

      (1) The first and biggest problem I have with this paper is that it is very confusing, and the results seem to be all over the place. In some parts, it seems like the AON and aPCx are more sensitive to novelty; in others, it seems the other way around. I find their metrics confusing and unconvincing. For example, the example cells in Figure 1C show an AON neuron with a very low spontaneous firing rate and a CA1 with a much higher firing rate, but the opposite is true in Figure 2A. So, what are we to make of Figure 2C that shows the difference in firing rates between novel vs. familiar odors measured as a difference in spikes/sec. This seems nearly meaningless. The authors could have used a difference in Z-scored responses to normalize different baseline activity levels. (This is just one example of a problem with the methodology.)

      (2) There are a lot of high-level data analyses (e.g., decoding, analyzing decoding errors, calculating mutual information, calculating distances in state space, etc.) but very little neural data (except for Figure 2C, and see my comment above about how this is flawed). So, if responses to novel vs. familiar odors are different in the AON and aPCx, how are they different? Why is decoding accuracy better for novel odors in CA1 but better for familiar odors in SUB (Figure 3A)? The authors identify a small subset of neurons that have unusually high weights in the SVM analyses that contribute to decoding novelty, but they don't tell us which neurons these are and how they are responding differently to novel vs. familiar odors.

      (3) The authors call AON and aPCx "primary sensory cortices" and LEC, CA1, and Sub "multisensory areas". This is a straw man argument. For example, we now know that PCx encodes multimodal signals (Poo et al. 2021, Federman et al., 2024; Kehl et al., 2024), and LEC receives direct OB inputs, which has traditionally been the criterion for being considered a "primary olfactory cortical area". So, this terminology is outdated and wrong, and although it suits the authors' needs here in drawing distinctions, it is simplistic and not helpful moving forward.

      (4) Why not simply report z-scored firing rates for all neurons as a function of trial number? (e.g., Jacobson & Friedrich, 2018). Figure 2C is not sufficient. For example, in the Discussion, they say, "novel stimuli caused larger increases in firing rates than familiar stimuli" (L. 270), but what does this mean? Odors typically increase the firing in some neurons and suppress firing in others. Where does the delta come from? Is this because novel odors more strongly activate neurons that increase their firing or because familiar odors more strongly suppress neurons?

      (5) Lines 122-124 - If cells in AON and aPCx responded the same way to novel and familiar odors, then we would say that they only encode for odor and not at all for experience. So, I don't understand why the authors say these areas code for a "mixed representation of chemical identity and experience." "On the other hand," if LEC, CA1, and SUB are odor selective and only encode novel odors, then these areas, not AON and aPCx, are the jointly encoding chemical identity and experience. Also, I do not understand why, here, they say that AON and PCx respond to both while LEC, CA1, and SUB were selective for novel stimuli, but the authors then go on to argue that novelty is encoded in the AON and PCx, but not in the LEC, CA1, and SUB.

      (6) Lines 132-140 - As presented in the text and the figure, this section is poorly written and confusing. Their use of the word "shuffled" is a major source of this confusion, because this typically is the control that produces outcomes at the chance level. More importantly, they did the wrong analysis here. The better and, I think, the only way to do this analysis correctly is to train on some of the odors and test on an untrained odor (i.e., what Bernardi et al., 2021 called "cross-condition generalization performance"; CCGP).

    1. And I had an idea for it: I wanted to construct a monospaced typeface—where the width of all glyphs are the same—that is ideal for writing code, but that would also have certain features of handwritten manuscripts that make it feel a bit like working with an old and mysterious text. I wanted programming to mingle with dusty tomes or spellwork. If programmers have been talking about the similarities between coding and magic for years, maybe we need a font that tries to make this more manifest.

      Virtuous project! I love the effect.

    1. n effet, même les États dotés de législations strictes, comme la France, ne parviennent pas toujours à protéger efficacement leurs citoyens contre le cyber-espionnage extérieur. La France, par exemple, dispose d’un cadre juridique précis inscrit dans le Code pénal, qui protège la vie privée des citoyens et interdit le cyber-espionnage, sauf lorsqu’il est réalisé à des fins de sécurité nationale. Toutefois, elle n’a pas pu se protéger de l’ingérence marocaine. Selon des médias ayant participé au Projet Pegasus, une trentaine de journalistes français auraient été visés par les autorités marocaines, dont Edwy Plenel, fondateur de Mediapart. Son téléphone aurait, d’après Amnesty International, été infecté pendant au moins trois mois, suite à des prises de position critiques sur le mouvement de protestation du Hirak marocain et la répression des manifestations. Plus grave encore, le Maroc aurait espionné en 2019 Emmanuel Macron, alors président, ainsi que le Premier ministre Édouard Philippe et quatorze autres membres du gouvernement français. Bien que ces révélations aient affecté les relations franco-marocaines, aucun procès n’a été intenté contre le Maroc. Ces révélations ont toutefois conduit le Maroc à entamer des démarches judiciaires, mais pas pour se défendre face aux accusations de cyber-espionnage : le pays a cherché à poursuivre en diffamation les médias et associations françaises ayant révélé l’affaire.

      ern fait je ne vois pas comment un cadre légal peut contraindre des pratiques qui par définition sont illicte. On peut mettre tous les cadres légaux en place, il n'auront aucun effet si un Etat décide de passer outre

    1. Reviewer #3 (Public review):

      In this study, O'Brien et al. address the need for scalable and cost-effective approaches to finding lead compounds for the treatment of the growing number of Mendelian diseases. They used state-of-the-art phenotypic screening based on an established high-dimensional phenotypic analysis pipeline in the nematode C. elegans.

      First, a panel of 25 C. elegans models was created by generating CRISPR/Cas9 knock-out lines for conserved human disease genes. These mutant strains underwent behavioral analysis using the group's published methodology. Clustering analysis revealed common features for genes likely operating in similar genetic pathways or biological functions. The study also presents results from a more focused examination of ciliopathy disease models.

      Subsequently, the study focuses on the NALCN channel gene family, comparing the phenotypes of mutants of nca-1, unc-77, and unc-80. This initial characterization identifies three behavioral parameters that exhibit significant differences from the wild type and could serve as indicators for pharmacological modulation.

      As a proof-of-concept, O'Brien et al. present a drug repurposing screen using an FDA-approved compound library, identifying two compounds capable of rescuing the behavioral phenotype in a model with UNC80 deficiency. The relatively short time and low cost associated with creating and phenotyping these strains suggest that high-throughput worm tracking could serve as a scalable approach for drug repurposing, addressing the multitude of Mendelian diseases. Interestingly, by measuring a wide range of behavioural parameters, this strategy also simultaneously reveals deleterious side effects of tested drugs that may confound the analysis.

      Considering the wealth of data generated in this study regarding important human disease genes, it is regrettable that the data is not made accessible to researchers less versed in data analysis methods. This diminishes the study's utility. It would have a far greater impact if an accessible and user-friendly online interface were established to facilitate data querying and feature extraction for specific mutants. This would empower researchers to compare their findings with the extensive dataset created here.

      Another technical limitation of the study is the use of single alleles. Large deletion alleles were generated by CRISPR/Cas9 gene editing. At first glance, this seems like a good idea because it limits the risk that background mutations, present in chemically-generated alleles, will affect behavioral parameters. However, these large deletions can also remove non-coding RNAs or other regulatory genetic elements, as found, for example, in introns. Therefore, it would be prudent to validate the behavioral effects by testing additional loss-of-function alleles produced through early stop codons or targeted deletion of key functional domains.

      Comments on revisions:

      In this final round of revisions, the authors have improved their manuscript and provide useful information about analysis procedures and code and updated figures.

    2. Author response:

      The following is the authors’ response to the previous reviews.

      This important study provides proof of principle that C. elegans models can be used to accelerate the discovery of candidate treatments for human Mendelian diseases by detailed high-throughput phenotyping of strains harboring mutations in orthologs of human disease genes. The data are compelling and support an approach that enables the potential rapid repurposing of FDA-approved drugs to treat rare diseases for which there are currently no effective treatments. The authors should provide a clearer explanation of how the statistical analyses were performed, as well as a link to a GitHub repository to clarify how figures and tables in the manuscript were generated from the phenotypic data.

      We have amended our description of the statistical analysis in the materials and methods section of the manuscript. We have also updated the GitHub repository link to a dedicated repository for this study, this contains all of the code needed to generated all the figures made from the phenotypic data provided. Additionally, we have updated the Zenodo repository to contain both the code and datasets within the same file.

      We have also updated the GitHub repository link to a dedicated repository for this manuscript, that contains all of the code needed to generate all figures from the phenotypic data provided. Additionally, we have updated the Zenodo repository link to contain both the code and datasets within the same folder structure. 

      Recommendations for the authors:

      Reviewer #1 (Recommendations for the authors):

      The authors have responded to previous review to improve the presentation of the work. The paper more than meets publication standards.

      No response required.

      Reviewer #2 (Recommendations for the authors):

      The authors have addressed all of my questions and concerns. I'm happy to see this updated paper of record.

      No response required.

      Reviewer #3 (Recommendations for the authors):

      Regarding the interactive heatmap

      The html version and the panel in Figure 2C appear not to coincide visually. Maybe the features are ordered in a different way?

      The html version of Figure 2C is for the entire feature set extract per strain and not the condensed Tierpsy256 set shown in the panel figure. We have now remade this figure to show this reduced feature set (aligning with what is shown in Figure 2C) and included both versions of the interactive heatmaps as static html files within the same repository.

      Regarding data accessibility overall

      More generally, the html file does not address my initial concern about the accessibility of the data to non-experts. Making the full dataset available was a necessary first step, but the hermetic nature of its format and the lack of a simple way to query the data remains an issue for me that limits the usefulness of this data to the broadest audience.

      We agree, but unfortunately do not currently have the resources to build a public-facing database to facilitate this.

    1. If a company is run primarily for profit, you’ll get entirely different outcomes than if it’s run for the public good—despite what the true believers in the “invisible hand” of the market preach. Social media provides the best example, and the experience of what happened with social media is a bad omen for what might happen (and is happening!) with AI. Two words—“maximize engagement,” code for “maximize profits”—were all that was needed to send social media into the abyss of spleen-venting hostility where it now wallows.

      Enshittification (Cory Doctorow)

    1. This is primarily becausethe growing problem of spam,

      In essence, the bad actors pushed toward centralization and reliance on large providers to cut the noise from their "signaling/code" surface.

    1. Arthurian romances, one of the most popular forms of literature in the High Middle Ages, frequently employed colour symbolism, particularly in the depiction of knights. Pastoureau notes that these narratives used colours to convey deeper meanings and character traits. He writes: The color code was recurrent and meaningful. A black knight was almost a character of primary importance (Tristan, Lancelot, Gawain) who wanted to hide his identity; he was generally motivated by good intentions and prepared to demonstrate his valor, especially by jousting or tournament. A red knight, on the other hand, was often hostile to the hero; this was a perfidious or evil knight, sometimes the devil’s envoy or a mysterious being from the Other World. Less prominent, a white knight was generally viewed as good; this was an older figure, a friend of protector or the hero, to who he gave wise council. Conversely, a green knight was a young knight, recently dubbed, whose audacious or insolent behavior was going to cause great disorder; he could be good or bad. Finally, yellow or gold knights were rare and blue knights nonexistent.

      We gotta bring back color coding for these prestige TV shows with thirty billion characters.

    1. Remplir, à la date d’admission à ce cycle, les conditions requises pour se présenter au concours externe d’entrée à l’INSP et les conditions de ressources fixées pour bénéficier d’une bourse d’enseignement supérieur sur critères sociaux prévue en application de l’article L.821-1 du code de l’éducation.

      manque la puce

    1. Justifier, à la date de clôture des inscriptions, du diplôme de doctorat défini à l'article L. 612-7 du code de l'éducation ou d'une qualification reconnue comme équivalente à ce diplôme dans les conditions fixées par le décret n° 2007-196 du 13 février 2007.

      Puce

    1. Pour les candidats titulaires d’un doctorat, sont prises en compte, pour la détermination de cette durée, les périodes pendant lesquelles ils ont bénéficié d’un contrat doctoral dans les conditions fixées au cinquième alinéa de l’article L. 412-1 du code de la recherche.

      mettre un espace (retour à la ligne) avant la 2e puce

  2. ontheroadtotheroad7.wordpress.com ontheroadtotheroad7.wordpress.com
    1. bad guys

      "In the world of The Road , there is a simple rule for distinguishing the good guys from the bad guys. Bad guys eat people; good guys don't. This is what remains of the Categorical Imperative: don't treat people as mere food. While this is the most obvious principle to which good guys are committed, it is not the only one. It is possible to discern in The Road a Code of the Good Guys, a set of principles to which good guys are committed. That Code includes the following rules:

      1. Don't eat people.
      2. Don't steal.
      3. Don't lie.
      4. Keep your promises.
      5. Help others.
      6. Never give up.

      The man tries to teach these principles to the child and he tries to follow them himself. Throughout the novel we witness the man's struggle to be a good guy, to do what is right in a world in which most people seem to have abandoned morality altogether." (Wielenberg 5-6).

      https://www.jstor.org/stable/42909407

      "Every social institution and convention that could serve as a hallmark of civilization has passed so far into oblivion that, as Ashley Kunsa argues, the names of places, road, and people have passed into meaninglessness, leaving only the deeds of individuals to providing meaning and morality to the world (61–63). The most important dividing line for the boy is the assurance from his father that they will not eat people." (Dominy 146).

      https://www.jstor.org/stable/10.5325/cormmccaj.13.1.0143

      More on Worldview here: https://ontheroadtotheroad7.wordpress.com/2024/11/26/worldview/

    1. Author response:

      The following is the authors’ response to the previous reviews.

      Reviewer #1 (Public Review):

      In this revision, the authors significantly improved the manuscript. They now address some of my concerns. Specifically, they show the contribution of end-effects on spreading the inputs between dendrites. This analysis reveals greater applicability of their findings to cortical cells, with long, unbranching dendrites than other neuronal types, such as Purkinje cells in the cerebellum.

      They now explain better the interactions between calcium and voltage signals, which I believe improve the take-away message of their manuscript. They modified and added new figures that helped to provide more information about their simulations.

      However, some of my points remain valid. Figure 6 shows depolarization of ~5mV from -75. This weak depolarization would not effectively recruit nonlinear activation of NMDARs. In their paper, Branco and Hausser (2010) showed depolarizations of ~10-15mV.

      More importantly, the signature of NMDAR activation is the prolonged plateau potential and activation at more depolarized resting membrane potentials (their Figure 4). Thus, despite including NMDARs in the simulation, the authors do not model functional recruitment of these channels. Their simulation is thus equivalent to AMPA only drive, which can indeed summate somewhat nonlinearly.

      In the current study, we used short sequences of 5 inputs, since the convergence of longer sequences is extremely unlikely in the network configurations we have examined. This resulted in smaller EPSP amplitudes of ~5mV (Figure 6 - Supplement 2A, B). Longer sequences containing 9 inputs resulted in larger somatic depolarizations of ~10mV (Figure 6 - Supplement 2E, F). Although we had modified the (Branco, Clark, and Häusser 2010) model to remove the jitter in the timing of arrival of inputs and made slight modifications to the location of stimulus delivery on the dendrite, we saw similar amplitudes when we tested a 9-length sequence using (Branco, Clark, and Häusser 2010)’s published code (Figure 6 - Supplement 2I, J). In all the cases we tested (5 input sequence, 9 input sequence, 9 input sequence with (Branco, Clark, and Häusser 2010) code repository), removal of NMDA synapses lowered both the somatic EPSPs (Figure 6 - Supplement 2C,D,G,H,K,L) as well as the selectivity (measured as the difference between the EPSPs generated for inward and outward stimulus delivery) (Figure 6 Supplement 2M,N,O). Further, monitoring the voltage along the dendrite for a sequence of 5 inputs showed dendritic EPSPs in the range of 20-45 mV (Figure 6 - Supplement 2P, Q), which came down notably (10-25mV) when NMDA synapses were abolished (Figure 6 - Supplement 2R, S). Thus, even sequences containing as few as 5 inputs were capable of engaging the NMDA-mediated nonlinearity to show sequence selectivity, although the selectivity was not as strong as in the case of 9 inputs.

      Reviewer #1 (Recommendations for the authors):

      Minor points:

      Figure 8, what does the scale in A represent? I assume it is voltage, but there are no units. Figure 8, C, E, G, these are unconventional units for synaptic weights, usually, these are given in nS / per input.

      We have corrected these. The scalebar in 8A represents membrane potential in mV. The units of 8C,E,G are now in nS.

      Reviewer #2 (Public Review):

      Summary:

      If synaptic input is functionally clustered on dendrites, nonlinear integration could increase the computational power of neural networks. But this requires the right synapses to be located in the right places. This paper aims to address the question of whether such synaptic arrangements could arise by chance (i.e. without special rules for axon guidance or structural plasticity), and could therefore be exploited even in randomly connected networks. This is important, particularly for the dendrites and biological computation communities, where there is a pressing need to integrate decades of work at the single-neuron level with contemporary ideas about network function.

      Using an abstract model where ensembles of neurons project randomly to a postsynaptic population, back-of-envelope calculations are presented that predict the probability of finding clustered synapses and spatiotemporal sequences. Using data-constrained parameters, the authors conclude that clustering and sequences are indeed likely to occur by chance (for large enough ensembles), but require strong dendritic nonlinearities and low background noise to be useful.

      Strengths:

      (1) The back-of-envelope reasoning presented can provide fast and valuable intuition. The authors have also made the effort to connect the model parameters with measured values. Even an approximate understanding of cluster probability can direct theory and experiments towards promising directions, or away from lost causes.

      (2) I found the general approach to be refreshingly transparent and objective. Assumptions are stated clearly about the model and statistics of different circuits. Along with some positive results, many of the computed cluster probabilities are vanishingly small, and noise is found to be quite detrimental in several cases. This is important to know, and I was happy to see the authors take a balanced look at conditions that help/hinder clustering, rather than to just focus on a particular regime that works.

      (3) This paper is also a timely reminder that synaptic clusters and sequences can exist on multiple spatial and temporal scales. The authors present results pertaining to the standard `electrical' regime (~50-100 µm, <50 ms), as well as two modes of chemical signaling (~10 µm, 100-1000 ms). The senior author is indeed an authority on the latter, and the simulations in Figure 5, extending those from Bhalla (2017), are unique in this area. In my view, the role of chemical signaling in neural computation is understudied theoretically, but research will be increasingly important as experimental technologies continue to develop.

      Weaknesses:

      (1) The paper is mostly let down by the presentation. In the current form, some patience is needed to grasp the main questions and results, and it is hard to keep track of the many abbreviations and definitions. A paper like this can be impactful, but the writing needs to be crisp, and the logic of the derivation accessible to non-experts. See, for instance, Stepanyants, Hof & Chklovskii (2002) for a relevant example.

      It would be good to see a restructure that communicates the main points clearly and concisely, perhaps leaving other observations to an optional appendix. For the interested but time-pressed reader, I recommend starting with the last paragraph of the introduction, working through the main derivation on page 7, and writing out the full expression with key parameters exposed. Next, look at Table 1 and Figure 2J to see where different circuits and mechanisms fit in this scheme. Beyond this, the sequence derivation on page 15 and biophysical simulations in Figures 5 and 6 are also highlights.

      We appreciate the reviewers' suggestions. We have tightened the flow of the introduction. We understand that the abbreviations and definitions are challenging and have therefore provided intuitions and summaries of the equations discussed in the main text.

      Clusters calculations

      Our approach is to ask how likely it is that a given set of inputs lands on a short segment of dendrite, and then scale it up to all segments on the entire dendritic length of the cell.

      Thus, the probability of occurrence of groups that receive connections from each of the M ensembles (PcFMG) is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative zone-length with respect to the total dendritic arbor (Z/L) and the number of ensembles (M).

      Sequence calculations

      Here we estimate the likelihood of the first ensemble input arriving anywhere on the dendrite, and ask how likely it is that succeeding inputs of the sequence would arrive within a set spacing.

      Thus, the probability of occurrence of sequences that receive sequential connections (PcPOSS) from each of the M ensembles is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative window size with respect to the total dendritic arbor (Δ/L) and the number of ensembles (M).

      (2) I wonder if the authors are being overly conservative at times. The result highlighted in the abstract is that 10/100000 postsynaptic neurons are expected to exhibit synaptic clustering. This seems like a very small number, especially if circuits are to rely on such a mechanism. However, this figure assumes the convergence of 3-5 distinct ensembles. Convergence of inputs from just 2 ense mbles would be much more prevalent, but still advantageous computationally. There has been excitement in the field about experiments showing the clustering of synapses encoding even a single feature.

      We agree that short clusters of two inputs would be far more likely. We focused our analysis on clusters with three of more ensembles because of the following reasons:

      (1) The signal to noise in these clusters was very poor as the likelihood of noise clusters is high.

      (2) It is difficult to trigger nonlinearities with very few synaptic inputs.

      (3) At the ensemble sizes we considered (100 for clusters, 1000 for sequences), clusters arising from just two ensembles would result in high probability of occurrence on all neurons in a network (~50% in cortex, see p_CMFG in figures below.). These dense neural representations make it difficult for downstream networks to decode (Foldiak 2003).

      However, in the presence of ensembles containing fewer neurons or when the connection probability between the layers is low, short clusters can result in sparse representations (Figure 2 - Supplement 2). Arguments 1 and 2 hold for short sequences as well.

      (3) The analysis supporting the claim that strong nonlinearities are needed for cluster/sequence detection is unconvincing. In the analysis, different synapse distributions on a single long dendrite are convolved with a sigmoid function and then the sum is taken to reflect the somatic response. In reality, dendritic nonlinearities influence the soma in a complex and dynamic manner. It may be that the abstract approach the authors use captures some of this, but it needs to be validated with simulations to be trusted (in line with previous work, e.g. Poirazi, Brannon & Mel, (2003)).

      We agree that multiple factors might affect the influence of nonlinearities on the soma. The key goal of our study was to understand the role played by random connectivity in giving rise to clustered computation. Since simulating a wide range of connectivity and activity patterns in a detailed biophysical model was computationally expensive, we analyzed the exemplar detailed models for nonlinearity separately (Figures 5, 6, and new figure 8), and then used our abstract models as a proxy for understanding population dynamics. A complete analysis of the role played by morphology, channel kinetics and the effect of branching requires an in-depth study of its own, and some of these questions have already been tackled by (Poirazi, Brannon, and Mel 2003; Branco, Clark, and Häusser 2010; Bhalla 2017). However, in the revision, we have implemented a single model which incorporates the range of ion-channel, synaptic and biochemical signaling nonlinearities which we discuss in the paper (Figure 8, and Figure 8 Supplement 1, 2,3). We use this to demonstrate all three forms of sequence and grouped computation we use in the study, where the only difference is in the stimulus pattern and the separation of time-scales inherent in the stimuli.

      (4) It is unclear whether some of the conclusions would hold in the presence of learning. In the signal-to-noise analysis, all synaptic strengths are assumed equal. But if synapses involved in salient clusters or sequences were potentiated, presumably detection would become easier? Similarly, if presynaptic tuning and/or timing were reorganized through learning, the conditions for synaptic arrangements to be useful could be relaxed. Answering these questions is beyond the scope of the study, but there is a caveat there nonetheless.

      We agree with the reviewer. If synapses receiving connectivity from ensembles had stronger weights, this would make detection easier. Dendritic spikes arising from clustered inputs have been implicated in local cooperative plasticity (Golding, Staff, and Spruston 2002; Losonczy, Makara, and Magee 2008). Further, plasticity related proteins synthesized at a synapse undergoing L-LTP can diffuse to neighboring weakly co-active synapses, and thereby mediate cooperative plasticity (Harvey et al. 2008; Govindarajan, Kelleher, and Tonegawa 2006; Govindarajan et al. 2011). Thus if clusters of synapses were likely to be co-active, they could further engage these local plasticity mechanisms which could potentiate them while not potentiating synapses that are activated by background activity. This would depend on the activity correlation between synapses receiving ensemble inputs within a cluster vs those activated by background activity. We have mentioned some of these ideas in a published opinion paper (Pulikkottil, Somashekar, and Bhalla 2021). In the current study, we wanted to understand whether even in the absence of specialized connection rules, interesting computations could still emerge. Thus, we focused on asking whether clustered or sequential convergence could arise even in a purely randomly connected network, with the most basic set of assumptions. We agree that an analysis of how selectivity evolves with learning would be an interesting topic for further work.

      References

      • Bhalla, Upinder S. 2017. “Synaptic Input Sequence Discrimination on Behavioral Timescales Mediated by Reaction-Diffusion Chemistry in Dendrites.” Edited by Frances K Skinner. eLife 6 (April):e25827. https://doi.org/10.7554/eLife.25827.

      • Branco, Tiago, Beverley A. Clark, and Michael Häusser. 2010. “Dendritic Discrimination of Temporal Input Sequences in Cortical Neurons.” Science (New York, N.Y.) 329 (5999): 1671–75. https://doi.org/10.1126/science.1189664.

      • Foldiak, Peter. 2003. “Sparse Coding in the Primate Cortex.” The Handbook of Brain Theory and Neural Networks. https://research-repository.st-andrews.ac.uk/bitstream/handle/10023/2994/FoldiakSparse HBTNN2e02.pdf?sequence=1.

      • Golding, Nace L., Nathan P. Staff, and Nelson Spruston. 2002. “Dendritic Spikes as a Mechanism for Cooperative Long-Term Potentiation.” Nature 418 (6895): 326–31. https://doi.org/10.1038/nature00854.

      • Govindarajan, Arvind, Inbal Israely, Shu-Ying Huang, and Susumu Tonegawa. 2011. “The Dendritic Branch Is the Preferred Integrative Unit for Protein Synthesis-Dependent LTP.” Neuron 69 (1): 132–46. https://doi.org/10.1016/j.neuron.2010.12.008.

      • Govindarajan, Arvind, Raymond J. Kelleher, and Susumu Tonegawa. 2006. “A Clustered Plasticity Model of Long-Term Memory Engrams.” Nature Reviews Neuroscience 7 (7): 575–83. https://doi.org/10.1038/nrn1937.

      • Harvey, Christopher D., Ryohei Yasuda, Haining Zhong, and Karel Svoboda. 2008. “The Spread of Ras Activity Triggered by Activation of a Single Dendritic Spine.” Science (New York, N.Y.) 321 (5885): 136–40. https://doi.org/10.1126/science.1159675.

      • Losonczy, Attila, Judit K. Makara, and Jeffrey C. Magee. 2008. “Compartmentalized Dendritic Plasticity and Input Feature Storage in Neurons.” Nature 452 (7186): 436–41. https://doi.org/10.1038/nature06725.

      • Poirazi, Panayiota, Terrence Brannon, and Bartlett W. Mel. 2003. “Pyramidal Neuron as Two-Layer Neural Network.” Neuron 37 (6): 989–99. https://doi.org/10.1016/S0896-6273(03)00149-1.

      • Pulikkottil, Vinu Varghese, Bhanu Priya Somashekar, and Upinder S. Bhalla. 2021. “Computation, Wiring, and Plasticity in Synaptic Clusters.” Current Opinion in Neurobiology, Computational Neuroscience, 70 (October):101–12. https://doi.org/10.1016/j.conb.2021.08.001.

    2. Author response:

      The following is the authors’ response to the previous reviews.

      Reviewer #1 (Public Review):

      In this revision, the authors significantly improved the manuscript. They now address some of my concerns. Specifically, they show the contribution of end-effects on spreading the inputs between dendrites. This analysis reveals greater applicability of their findings to cortical cells, with long, unbranching dendrites than other neuronal types, such as Purkinje cells in the cerebellum.

      They now explain better the interactions between calcium and voltage signals, which I believe improve the take-away message of their manuscript. They modified and added new figures that helped to provide more information about their simulations.

      However, some of my points remain valid. Figure 6 shows depolarization of ~5mV from -75. This weak depolarization would not effectively recruit nonlinear activation of NMDARs. In their paper, Branco and Hausser (2010) showed depolarizations of ~10-15mV.

      More importantly, the signature of NMDAR activation is the prolonged plateau potential and activation at more depolarized resting membrane potentials (their Figure 4). Thus, despite including NMDARs in the simulation, the authors do not model functional recruitment of these channels. Their simulation is thus equivalent to AMPA only drive, which can indeed summate somewhat nonlinearly.

      In the current study, we used short sequences of 5 inputs, since the convergence of longer sequences is extremely unlikely in the network configurations we have examined. This resulted in smaller EPSP amplitudes of ~5mV (Figure 6 - Supplement 2A, B). Longer sequences containing 9 inputs resulted in larger somatic depolarizations of ~10mV (Figure 6 - Supplement 2E, F). Although we had modified the (Branco, Clark, and Häusser 2010) model to remove the jitter in the timing of arrival of inputs and made slight modifications to the location of stimulus delivery on the dendrite, we saw similar amplitudes when we tested a 9-length sequence using (Branco, Clark, and Häusser 2010)’s published code (Figure 6 - Supplement 2I, J). In all the cases we tested (5 input sequence, 9 input sequence, 9 input sequence with (Branco, Clark, and Häusser 2010) code repository), removal of NMDA synapses lowered both the somatic EPSPs (Figure 6 - Supplement 2C,D,G,H,K,L) as well as the selectivity (measured as the difference between the EPSPs generated for inward and outward stimulus delivery) (Figure 6 Supplement 2M,N,O). Further, monitoring the voltage along the dendrite for a sequence of 5 inputs showed dendritic EPSPs in the range of 20-45 mV (Figure 6 - Supplement 2P, Q), which came down notably (10-25mV) when NMDA synapses were abolished (Figure 6 - Supplement 2R, S). Thus, even sequences containing as few as 5 inputs were capable of engaging the NMDA-mediated nonlinearity to show sequence selectivity, although the selectivity was not as strong as in the case of 9 inputs.

      Reviewer #1 (Recommendations for the authors):

      Minor points:

      Figure 8, what does the scale in A represent? I assume it is voltage, but there are no units. Figure 8, C, E, G, these are unconventional units for synaptic weights, usually, these are given in nS / per input.

      We have corrected these. The scalebar in 8A represents membrane potential in mV. The units of 8C,E,G are now in nS.

      Reviewer #2 (Public Review):

      Summary:

      If synaptic input is functionally clustered on dendrites, nonlinear integration could increase the computational power of neural networks. But this requires the right synapses to be located in the right places. This paper aims to address the question of whether such synaptic arrangements could arise by chance (i.e. without special rules for axon guidance or structural plasticity), and could therefore be exploited even in randomly connected networks. This is important, particularly for the dendrites and biological computation communities, where there is a pressing need to integrate decades of work at the single-neuron level with contemporary ideas about network function.

      Using an abstract model where ensembles of neurons project randomly to a postsynaptic population, back-of-envelope calculations are presented that predict the probability of finding clustered synapses and spatiotemporal sequences. Using data-constrained parameters, the authors conclude that clustering and sequences are indeed likely to occur by chance (for large enough ensembles), but require strong dendritic nonlinearities and low background noise to be useful.

      Strengths:

      (1) The back-of-envelope reasoning presented can provide fast and valuable intuition. The authors have also made the effort to connect the model parameters with measured values. Even an approximate understanding of cluster probability can direct theory and experiments towards promising directions, or away from lost causes.

      (2) I found the general approach to be refreshingly transparent and objective. Assumptions are stated clearly about the model and statistics of different circuits. Along with some positive results, many of the computed cluster probabilities are vanishingly small, and noise is found to be quite detrimental in several cases. This is important to know, and I was happy to see the authors take a balanced look at conditions that help/hinder clustering, rather than to just focus on a particular regime that works.

      (3) This paper is also a timely reminder that synaptic clusters and sequences can exist on multiple spatial and temporal scales. The authors present results pertaining to the standard `electrical' regime (~50-100 µm, <50 ms), as well as two modes of chemical signaling (~10 µm, 100-1000 ms). The senior author is indeed an authority on the latter, and the simulations in Figure 5, extending those from Bhalla (2017), are unique in this area. In my view, the role of chemical signaling in neural computation is understudied theoretically, but research will be increasingly important as experimental technologies continue to develop.

      Weaknesses:

      (1) The paper is mostly let down by the presentation. In the current form, some patience is needed to grasp the main questions and results, and it is hard to keep track of the many abbreviations and definitions. A paper like this can be impactful, but the writing needs to be crisp, and the logic of the derivation accessible to non-experts. See, for instance, Stepanyants, Hof & Chklovskii (2002) for a relevant example.

      It would be good to see a restructure that communicates the main points clearly and concisely, perhaps leaving other observations to an optional appendix. For the interested but time-pressed reader, I recommend starting with the last paragraph of the introduction, working through the main derivation on page 7, and writing out the full expression with key parameters exposed. Next, look at Table 1 and Figure 2J to see where different circuits and mechanisms fit in this scheme. Beyond this, the sequence derivation on page 15 and biophysical simulations in Figures 5 and 6 are also highlights.

      We appreciate the reviewers' suggestions. We have tightened the flow of the introduction. We understand that the abbreviations and definitions are challenging and have therefore provided intuitions and summaries of the equations discussed in the main text.

      Clusters calculations

      Our approach is to ask how likely it is that a given set of inputs lands on a short segment of dendrite, and then scale it up to all segments on the entire dendritic length of the cell.

      Thus, the probability of occurrence of groups that receive connections from each of the M ensembles (PcFMG) is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative zone-length with respect to the total dendritic arbor (Z/L) and the number of ensembles (M).

      Sequence calculations

      Here we estimate the likelihood of the first ensemble input arriving anywhere on the dendrite, and ask how likely it is that succeeding inputs of the sequence would arrive within a set spacing.

      Thus, the probability of occurrence of sequences that receive sequential connections (PcPOSS) from each of the M ensembles is a function of the connection probability (p) between the two layers, the number of neurons in an ensemble (N), the relative window size with respect to the total dendritic arbor (Δ/L) and the number of ensembles (M).

      (2) I wonder if the authors are being overly conservative at times. The result highlighted in the abstract is that 10/100000 postsynaptic neurons are expected to exhibit synaptic clustering. This seems like a very small number, especially if circuits are to rely on such a mechanism. However, this figure assumes the convergence of 3-5 distinct ensembles. Convergence of inputs from just 2 ense mbles would be much more prevalent, but still advantageous computationally. There has been excitement in the field about experiments showing the clustering of synapses encoding even a single feature.

      We agree that short clusters of two inputs would be far more likely. We focused our analysis on clusters with three of more ensembles because of the following reasons:

      (1) The signal to noise in these clusters was very poor as the likelihood of noise clusters is high.

      (2) It is difficult to trigger nonlinearities with very few synaptic inputs.

      (3) At the ensemble sizes we considered (100 for clusters, 1000 for sequences), clusters arising from just two ensembles would result in high probability of occurrence on all neurons in a network (~50% in cortex, see p_CMFG in figures below.). These dense neural representations make it difficult for downstream networks to decode (Foldiak 2003).

      However, in the presence of ensembles containing fewer neurons or when the connection probability between the layers is low, short clusters can result in sparse representations (Figure 2 - Supplement 2). Arguments 1 and 2 hold for short sequences as well.

      (3) The analysis supporting the claim that strong nonlinearities are needed for cluster/sequence detection is unconvincing. In the analysis, different synapse distributions on a single long dendrite are convolved with a sigmoid function and then the sum is taken to reflect the somatic response. In reality, dendritic nonlinearities influence the soma in a complex and dynamic manner. It may be that the abstract approach the authors use captures some of this, but it needs to be validated with simulations to be trusted (in line with previous work, e.g. Poirazi, Brannon & Mel, (2003)).

      We agree that multiple factors might affect the influence of nonlinearities on the soma. The key goal of our study was to understand the role played by random connectivity in giving rise to clustered computation. Since simulating a wide range of connectivity and activity patterns in a detailed biophysical model was computationally expensive, we analyzed the exemplar detailed models for nonlinearity separately (Figures 5, 6, and new figure 8), and then used our abstract models as a proxy for understanding population dynamics. A complete analysis of the role played by morphology, channel kinetics and the effect of branching requires an in-depth study of its own, and some of these questions have already been tackled by (Poirazi, Brannon, and Mel 2003; Branco, Clark, and Häusser 2010; Bhalla 2017). However, in the revision, we have implemented a single model which incorporates the range of ion-channel, synaptic and biochemical signaling nonlinearities which we discuss in the paper (Figure 8, and Figure 8 Supplement 1, 2,3). We use this to demonstrate all three forms of sequence and grouped computation we use in the study, where the only difference is in the stimulus pattern and the separation of time-scales inherent in the stimuli.

      (4) It is unclear whether some of the conclusions would hold in the presence of learning. In the signal-to-noise analysis, all synaptic strengths are assumed equal. But if synapses involved in salient clusters or sequences were potentiated, presumably detection would become easier? Similarly, if presynaptic tuning and/or timing were reorganized through learning, the conditions for synaptic arrangements to be useful could be relaxed. Answering these questions is beyond the scope of the study, but there is a caveat there nonetheless.

      We agree with the reviewer. If synapses receiving connectivity from ensembles had stronger weights, this would make detection easier. Dendritic spikes arising from clustered inputs have been implicated in local cooperative plasticity (Golding, Staff, and Spruston 2002; Losonczy, Makara, and Magee 2008). Further, plasticity related proteins synthesized at a synapse undergoing L-LTP can diffuse to neighboring weakly co-active synapses, and thereby mediate cooperative plasticity (Harvey et al. 2008; Govindarajan, Kelleher, and Tonegawa 2006; Govindarajan et al. 2011). Thus if clusters of synapses were likely to be co-active, they could further engage these local plasticity mechanisms which could potentiate them while not potentiating synapses that are activated by background activity. This would depend on the activity correlation between synapses receiving ensemble inputs within a cluster vs those activated by background activity. We have mentioned some of these ideas in a published opinion paper (Pulikkottil, Somashekar, and Bhalla 2021). In the current study, we wanted to understand whether even in the absence of specialized connection rules, interesting computations could still emerge. Thus, we focused on asking whether clustered or sequential convergence could arise even in a purely randomly connected network, with the most basic set of assumptions. We agree that an analysis of how selectivity evolves with learning would be an interesting topic for further work.

      References

      Bhalla, Upinder S. 2017. “Synaptic Input Sequence Discrimination on Behavioral Timescales Mediated by Reaction-Diffusion Chemistry in Dendrites.” Edited by Frances K Skinner. eLife 6 (April):e25827. https://doi.org/10.7554/eLife.25827.

      Branco, Tiago, Beverley A. Clark, and Michael Häusser. 2010. “Dendritic Discrimination of Temporal Input Sequences in Cortical Neurons.” Science (New York, N.Y.) 329 (5999): 1671–75. https://doi.org/10.1126/science.1189664.

      Foldiak, Peter. 2003. “Sparse Coding in the Primate Cortex.” The Handbook of Brain Theory and Neural Networks. https://research-repository.st-andrews.ac.uk/bitstream/handle/10023/2994/FoldiakSparse HBTNN2e02.pdf?sequence=1.

      Golding, Nace L., Nathan P. Staff, and Nelson Spruston. 2002. “Dendritic Spikes as a Mechanism for Cooperative Long-Term Potentiation.” Nature 418 (6895): 326–31. https://doi.org/10.1038/nature00854.

      Govindarajan, Arvind, Inbal Israely, Shu-Ying Huang, and Susumu Tonegawa. 2011. “The Dendritic Branch Is the Preferred Integrative Unit for Protein Synthesis-Dependent LTP.” Neuron 69 (1): 132–46. https://doi.org/10.1016/j.neuron.2010.12.008.

      Govindarajan, Arvind, Raymond J. Kelleher, and Susumu Tonegawa. 2006. “A Clustered Plasticity Model of Long-Term Memory Engrams.” Nature Reviews Neuroscience 7 (7): 575–83. https://doi.org/10.1038/nrn1937.

      Harvey, Christopher D., Ryohei Yasuda, Haining Zhong, and Karel Svoboda. 2008. “The Spread of Ras Activity Triggered by Activation of a Single Dendritic Spine.” Science (New York, N.Y.) 321 (5885): 136–40. https://doi.org/10.1126/science.1159675.

      Losonczy, Attila, Judit K. Makara, and Jeffrey C. Magee. 2008. “Compartmentalized Dendritic Plasticity and Input Feature Storage in Neurons.” Nature 452 (7186): 436–41. https://doi.org/10.1038/nature06725.

      Poirazi, Panayiota, Terrence Brannon, and Bartlett W. Mel. 2003. “Pyramidal Neuron as Two-Layer Neural Network.” Neuron 37 (6): 989–99. https://doi.org/10.1016/S0896-6273(03)00149-1.

      Pulikkottil, Vinu Varghese, Bhanu Priya Somashekar, and Upinder S. Bhalla. 2021. “Computation, Wiring, and Plasticity in Synaptic Clusters.” Current Opinion in Neurobiology, Computational Neuroscience, 70 (October):101–12. https://doi.org/10.1016/j.conb.2021.08.001.

    1. If you don’t observe those behaviors, something is wrong with your installation. Here’s how to proceed:

      Surely the first thing to do is to install the ocaml extension in vs code.

    1. We performed structural alignments between proteomes with the tool Foldseek

      Thanks for putting your code on github! I noticed on your github page that you masked low confidence ends of protein structures prior to alignment. This is an interesting consideration and I think is worth mentioning in the methods here.

    1. Note: it's okay to use some # pragma: no cover's to get make test coverage to pass if there are some parts of the code that are too awkward or not worthwhile to write tests for. Our approach to unittests is that we enforce 100% coverage but are pragmatic about using # pragma: no cover to get there if necessary. Note: the code in this project was hacked together without a lot of care. You might find that you have to refactor it in order to make it unit-testable. You might find it helpful to write functests first (there's a separate issue for those: #7) as these will enable large-scale refactorings with confidence.Create sub-issue

      Note: it's okay to use some # pragma: no cover's to get make test coverage to pass if there are some parts of the code that are too awkward or not worthwhile to write tests for. Our approach to unittests is that we enforce 100% coverage but are pragmatic about using # pragma: no cover to get there if necessary.

      Note: the code in this project was hacked together without a lot of care. You might find that you have to refactor it in order to make it unit-testable. You might find it helpful to write functests first (there's a separate issue for those: #7) as these will enable large-scale refactorings with confidence.

    1. Hutchins, la manœuvre d’un bateau dans un port nécessite le respect d’un code nautique afin que les bateaux puissent circuler avec fluidité tout en préservant un minimum de sécurité pour toutes les autres entités empruntant les chenaux. Ce code nautique définit les comportements à adopter vis-à-vis des autres bâtiments dans le port, ainsi que des autorités portuaires. En un sens, il s’agit là d’un protocole partagé par les marins afin que chacun puisse interpréter les comportements de l’autre et ainsi adapter les siens en conséquence (en suivant ce que le protocole prévoit).

      là on passe de la cognition distribuée au protocole... c'est pas la même chose... est-ce que tu critiques l'idée de H. et tu proposes une alternative? ou une manière de l'interpréter (ce qu'on dirait plus bas... tu dis: cogn distr -> nécessité du protocole. Mais est-ce vrai par ex pour des oiseaux en formation? Ont-ils un protocole??

    1. Statistical Modeling

      I see from your code page that you used other stats like Kolmogorov-Smirnov Test and Levene’s Test, for example. I think, these should be described somewhere here in this section as well as assumptions and potential violations to assumptions.

    2. DBI, RSQLite, dplyr, lubridate, sp, mapview, glmmTMB, DHARMa, performance, MuMIn, emmeans, car, ggplot2, and kableExtra.

      Add versions and relevant references to tese packages, if available. Add references to reference list.

      I assume that you will provide the full code as an appendix to your thesis? If so, please add link to repository here.

    Annotators

    1. Les recherches en psychologie sociale[modifier | modifier le code] Bien que de nombreuses études sur le comportement d’aide aient été menées depuis les années 1950, c’est l’affaire Kitty Genovese qui est considérée comme le point de départ des recherches sur l’effet du témoin. L’expérience de Darley et Latané en 1968 est à l’origine des travaux fondamentaux de cette discipline[6]. En effet, ces chercheurs ont entamé une série d'expériences qui ont permis de mettre en évidence un des effets les plus robustes et stables dans le domaine de la psychologie sociale.
    1. Statistical Modeling

      I see from your code page that you used other stats like Kolmogorov-Smirnov Test and Levene’s Test, for example. I think, these should be described somewhere here in this section as well as assumptions and potential violations to assumptions.

    2. DBI, RSQLite, dplyr, lubridate, sp, mapview, glmmTMB, DHARMa, performance, MuMIn, emmeans, car, ggplot2, and kableExtra.

      Add versions and relevant references to tese packages, if available. Add references to reference list.

      I assume that you will provide the full code as an appendix to your thesis? If so, please add link to repository here.

    Annotators

    1. Traditionally, complex code was required for running incremental refreshes, but you can now define a refresh policy within Power BI Desktop. The refresh policy is applied when you publish to Power BI service, which then does the work of managing partitions for optimized data loads, resulting in the following benefits: Quicker refreshes - Only data that needs to be changed gets refreshed. For example, if you have five years' worth of data, and you only need to refresh the last 10 days because that is the only data that has changed, the incremental refresh will refresh only those 10 days of data. Undoubtedly, the time it takes to refresh 10 days of data is much shorter than five years of data. More reliable refreshes - You no longer need to keep your long-running data connections open to schedule a refresh. Reduced resource consumption - Because you only need to refresh the smaller the amount of data, the overall consumption of memory and other resources is reduced.

      benefit

    1. Statistical Modeling

      I see from your code page that you used other stats like Kolmogorov-Smirnov Test and Levene’s Test, for example. I think, these should be described somewhere here in this section as well as assumptions and potential violations to assumptions.

    2. DBI, RSQLite, dplyr, lubridate, sp, mapview, glmmTMB, DHARMa, performance, MuMIn, emmeans, car, ggplot2, and kableExtra.

      Add versions and relevant references to tese packages, if available. Add references to reference list.

      I assume that you will provide the full code as an appendix to your thesis? If so, please add link to repository here.

    Annotators

    1. Reviewer #1 (Public review):

      Summary:

      Cesar, Santos & Cogni use a meta-analysis to report on the direction and magnitude of three fundamental fitness components in defensive symbioses. Specifically, the work focuses on interactions between three arthropod host families (Aphididae, Culicidae, Drosophilidae, and others) and common bacterial endosymbionts (Wolbachia, Serratia, Hamiltonella, Spiroplasma, Rickettsia, Regiella X-type and Arsenophonus). The results of the overall analysis confirm common assumptions and previous work on such fitness components, showing that defensive symbionts provide strong protection to hosts and cause detectable costs to both hosts and the enemy. The analysis provides insight into the extent of the cost/benefit tradeoff for hosts, reporting that the cost is six times lower than the protective effect. The confirmation that natural enemies attacking hosts infected with symbionts have a reduction in their fitness is also an interesting one, as this shows that the majority of defensive symbionts provide protection by resisting enemy infection, as opposed to tolerating it. This finding has important consequences for evolutionary counter-responses in the enemy species. Of course, this result has less relevance for certain types of enemies (such as parasitoids) where successful infection is dependent upon host killing.

      Interesting results also emerge from the subgroup analysis. For the full dataset, both natural and introduced symbionts were similarly effective in positively influencing the fitness of hosts. However, in the Wolbachia-specific analysis, the artificially introduced symbionts caused costs to the hosts where the natural strain did not. These findings have potentially important ramifications for schemes that use endosymbionts for biocontrol or vector competence, suggesting that (in some cases) natural strains may be the more stable choice for deploying (as they are associated with lower costs).

      The analysis draws from an impressively large dataset, but the interpretation of the full impact of the results would be helped by greater detail on the species/strain level systems included, the data extraction approach, and inclusion criteria. Accounting for phylogenetic nonindependence and alternative coding of one of the moderator variables could also strengthen the biological relevance of the models. Suggestions and thoughts are outlined below.

      Strengths & Potential Improvements:

      An impressively large number of effect sizes (3000) from only 226 studies is collected, robustly confirming common assumptions on the magnitude of fundamental fitness components. However the paper would benefit from a clear breakdown in the main text of the specificities of each system included (e.g. a table at the host species/symbiont strain level, where it is possible). Currently, there is not enough detail for those who want a deep dive to understand what data was extracted for the analysis from these 226 studies, or those who want to understand the underlying diversity in the dataset.

      Currently, when the 'natural enemy group' is tested as a moderator it is coded broadly by type of organism (e.g. virus, bacterium, fungi, parasitoid). But this doesn't adequately capture the mode of killing/fitness reduction by the enemy, which would be the much more biologically relevant categorisation for your questions. For example, parasitoid infection is dependent upon host death (thus host fecundity is not relevant, because the host either survived or did not). Among bacterial and viral pathogens antagonists there is scope for both fecundity and survival to be affected. This in turn may be a very influential factor for the outcome. You could consider recoding this enemy moderator.

      The analysis is restricted to arthropod hosts and defensive symbionts that are also classed as endosymbionts. This focus should be made clear early on in the paper, as there are many systems (that are classed by many as defensive symbioses) that are not part of the analysis.

      There is fairly minimalistic testing of moderators/sub-groups (which probably has its statistical strengths) but perhaps there are also some missed opportunities for testing other ecological contributors to variance, including coinfection (although perhaps limited by power) and other approaches to coding enemy group (as detail above).

      Looking at the overview of systems included, there's likely a high degree of phylogenetic non-independence in the dataset. Where it is possible, using phylogenetically controlled models could strengthen this analysis.

      Looking at your included systems (Table S5), you might be able to test the effect of coinfection on the 3 variables of interest. For example, it would be particularly important to see if the effects of two symbionts are additive or not.

      No code for the analysis is provided for review at this stage and full details of the dataset are also not available. This slightly limits the ability to assess the full scope and robustness of the study. It would be helpful to have an extensive table in the supplementary detailing (minimum) the reference, study, experiment, host species, symbiont strain, and a description of the exact data extraction source (e.g.table/figure/in text), and method of extraction.

    2. Author response:

      Reviewer #1 (Public review):

      Summary:

      Cesar, Santos & Cogni use a meta-analysis to report on the direction and magnitude of three fundamental fitness components in defensive symbioses. Specifically, the work focuses on interactions between three arthropod host families (Aphididae, Culicidae, Drosophilidae, and others) and common bacterial endosymbionts (Wolbachia, Serratia, Hamiltonella, Spiroplasma, Rickettsia, Regiella X-type and Arsenophonus). The results of the overall analysis confirm common assumptions and previous work on such fitness components, showing that defensive symbionts provide strong protection to hosts and cause detectable costs to both hosts and the enemy. The analysis provides insight into the extent of the cost/benefit tradeoff for hosts, reporting that the cost is six times lower than the protective effect. The confirmation that natural enemies attacking hosts infected with symbionts have a reduction in their fitness is also an interesting one, as this shows that the majority of defensive symbionts provide protection by resisting enemy infection, as opposed to tolerating it. This finding has important consequences for evolutionary counter-responses in the enemy species. Of course, this result has less relevance for certain types of enemies (such as parasitoids) where successful infection is dependent upon host killing.

      Interesting results also emerge from the subgroup analysis. For the full dataset, both natural and introduced symbionts were similarly effective in positively influencing the fitness of hosts. However, in the Wolbachia-specific analysis, the artificially introduced symbionts caused costs to the hosts where the natural strain did not. These findings have potentially important ramifications for schemes that use endosymbionts for biocontrol or vector competence, suggesting that (in some cases) natural strains may be the more stable choice for deploying (as they are associated with lower costs).

      The analysis draws from an impressively large dataset, but the interpretation of the full impact of the results would be helped by greater detail on the species/strain level systems included, the data extraction approach, and inclusion criteria. Accounting for phylogenetic nonindependence and alternative coding of one of the moderator variables could also strengthen the biological relevance of the models. Suggestions and thoughts are outlined below.

      We sincerely thank Reviewer #1 for the time and effort dedicated to reviewing our manuscript. The suggestions provided are highly constructive and will greatly assist us in improving both our analyses and the manuscript overall.

      Strengths & Potential Improvements:

      An impressively large number of effect sizes (3000) from only 226 studies is collected, robustly confirming common assumptions on the magnitude of fundamental fitness components. However the paper would benefit from a clear breakdown in the main text of the specificities of each system included (e.g. a table at the host species/symbiont strain level, where it is possible). Currently, there is not enough detail for those who want a deep dive to understand what data was extracted for the analysis from these 226 studies, or those who want to understand the underlying diversity in the dataset.

      We thank the reviewer for the suggestion, and we will add this information to our revised manuscript.

      Currently, when the 'natural enemy group' is tested as a moderator it is coded broadly by type of organism (e.g. virus, bacterium, fungi, parasitoid). But this doesn't adequately capture the mode of killing/fitness reduction by the enemy, which would be the much more biologically relevant categorisation for your questions. For example, parasitoid infection is dependent upon host death (thus host fecundity is not relevant, because the host either survived or did not). Among bacterial and viral pathogens antagonists there is scope for both fecundity and survival to be affected. This in turn may be a very influential factor for the outcome. You could consider recoding this enemy moderator.

      We agree, and we will implement this in the analysis to our revised manuscript.

      The analysis is restricted to arthropod hosts and defensive symbionts that are also classed as endosymbionts. This focus should be made clear early on in the paper, as there are many systems (that are classed by many as defensive symbioses) that are not part of the analysis.

      We agree, and we will implement this to our revised manuscript.

      There is fairly minimalistic testing of moderators/sub-groups (which probably has its statistical strengths) but perhaps there are also some missed opportunities for testing other ecological contributors to variance, including coinfection (although perhaps limited by power) and other approaches to coding enemy group (as detail above).

      We agree, and we will implement this in the analysis to our revised manuscript.

      Looking at the overview of systems included, there's likely a high degree of phylogenetic non-independence in the dataset. Where it is possible, using phylogenetically controlled models could strengthen this analysis.

      We thank the reviewer for the suggestion. We will explore the possibility of using phylogenetically controlled models in our analyses, although we recognize the challenges associated with their implementation, particularly in the case of the natural enemies, given the great diversity of distant related groups included in our study - viruses, bacteria, fungi, protozoans, nematodes and parasitoids wasps.

      Looking at your included systems (Table S5), you might be able to test the effect of coinfection on the 3 variables of interest. For example, it would be particularly important to see if the effects of two symbionts are additive or not.

      We agree, and we will implement this in the analysis to our revised manuscript.

      No code for the analysis is provided for review at this stage and full details of the dataset are also not available. This slightly limits the ability to assess the full scope and robustness of the study. It would be helpful to have an extensive table in the supplementary detailing (minimum) the reference, study, experiment, host species, symbiont strain, and a description of the exact data extraction source (e.g.table/figure/in text), and method of extraction.

      The code for the analysis and the full raw data with the suggested information are available at https://github.com/cassiasqr/MetaSymbiont (The link is available at the end of the manuscript).

      Reviewer #2 (Public review):

      Summary:

      In this exciting study, Cesar and co-authors perform a meta-analysis on the influence of arthropod symbionts on the fitness of their hosts when they are exposed or not to natural enemies. These so-called defensive symbionts are increasingly recognized as key elements in arthropod survival against natural enemies, with effects that ripple through entire terrestrial ecosystems. The topic is timely, the approach is sound, and the manuscript is well-written. I believe this manuscript will attract the attention of entomologists and of microbiologists interested in symbiosis. This study builds on a previous meta-analysis that I was involved in, which was based on phloem-feeding insects. This novel data set is much larger and includes flies (including the model system Drosophila) and mosquitoes (a group of high medical interest). While the previous metaanalysis considered only parasitoids as natural enemies, this study also includes fungi, bacteria, and viruses.

      Strengths:

      The authors compile a very large dataset and provide a broad quantitative overview of the effects of defensive symbionts in insects. By measuring symbiont effects in the presence and absence of natural enemies, the authors are able to infer whether a trade-off between defense and the costs of mutualism in the absence of enemy pressure exists. Defensive symbioses are an important research topic that had its initial "momentum" a decade ago, so the timing for such a systematic review is very appropriate.

      We sincerely thank Reviewer #2 for dedicating their time and effort to reviewing our manuscript. The suggestions are very insightful and will significantly contribute to improving our manuscript.

      Weaknesses:

      I think the manuscript could be improved by clarifying several sections, particularly the introduction and methods. The introduction section is too specific and heavily reliant on particular examples. In my view, the theoretical background of the study could be made clearer, and the knowledge gap identified more explicitly. A focus on how widespread defensive symbioses are, along with a brief, up-to-date review of the groups possessing such symbionts, would help. This lack of focus is also observed in the methods section, where more details are needed in many instances to better understand how data was collected and analyzed. Regarding the analyses, the multi-level analysis contains many moderators, but it's unclear why these moderators were included. While this may seem a minor issue, it highlights a disconnection between the analyses, the conceptual background, and the hypotheses tested. 

      We thank the reviewer for the suggestions, and we will try to make the introduction and the methods section clearer. 

      Another important weakness is that the analyses are too general, and much-hidden information is not immediately apparent. For instance, readers cannot easily identify which species of symbionts are studied (and the effects they have), or which natural enemies are involved. Although this information is found in the supplementary material, including it in the main body would significantly improve the manuscript.

      We agree, and we will implement this to our   revised manuscript.

    1. Ethical codes often emerge after a crisis event. The Common Rule developed out of a rule-making process initiated in response to a series of breaches to the public trust, especially those committed by physician-researchers. Following the Nazi-era medical atrocities, the Nuremberg Code (Nuremberg Code, 1949) and the Declaration of Helsinki (World Medical Association, 1964) established ethical norms for human-subjects research, while building on the 1931 Guidelines for Human Experimentation (Ghooi, 2011).

      assuming this happens in order for the individuals behind the process of collecting data do this in order to see where the common affects that the crisis caused?

    1. There is a tremendous power in thinking about everything as a single kind of thing, because then you don’t have to juggle lots of different ideas about different kinds of things; you can just think about your problem.

      In my experience this is also the main benefit of using node.js as your backend. Being able to write your front and backend code in the same language (javascript) removes a switching cost I didn't fully realize existed until I tried node the first time.

    1. Statistical Modeling

      I see from your code page that you used other stats like Kolmogorov-Smirnov Test and Levene’s Test, for example. I think, these should be described somewhere here in this section as well as assumptions and potential violations to assumptions.

    2. DBI, RSQLite, dplyr, lubridate, sp, mapview, glmmTMB, DHARMa, performance, MuMIn, emmeans, car, ggplot2, and kableExtra.

      Add versions and relevant references to tese packages, if available. Add references to reference list.

      I assume that you will provide the full code as an appendix to your thesis? If so, please add link to repository here.

    Annotators

    1. Reviewer #1 (Public review):

      Summary:

      This paper presents a data processing pipeline to discover causal interactions from time-lapse imaging data and convincingly illustrates it on a challenging application for the analysis of tumor-on-chip ecosystem data.

      The core of the discovery module is the original tMIIC method of the authors, which is shown in supplementary material to compare favourably to two state-of-the-art methods on synthetic temporal data on a 15 nodes network.

      Strengths:

      This paper tackles the problem of learning causal interactions from temporal data which is an open problem in presence of latent variables.

      The core of the method tMIIC of the authors is nicely presented in connection to Granger-Schreiber causality and to the novel graphical conditions used to infer latent variables and based on a theorem about transfer entropy.

      tMIIC compares favourably to PC and PCMCI+ methods using different kernels on synthetic datasets generated from a network of 15 nodes.

      A full application to tumor-on-chip cellular ecosystems data including cancer cells, immune cells, cancer-associated fibroblasts, endothelial cells and anti cancer drugs, with convincing inference results with respect to both known and novel effects between those components and their contact.

      The code and dataset are available online for the reproducibility of the results.

      Weaknesses:

      The references to "state-of-the-art methods" concerning the inference of causal networks should be more precise by giving citations in the main text, and better discussed in general terms, both in the first section and in the section of presentation of CausalXtract. It is only in the legend of the figures of the supplementary material that we get information.

      Of course, comparison on our own synthetic datasets can always be criticized but this is rather due to the absence of a common benchmark in this domain compared to other domains. I recommend the authors to explicitly propose their datasets made accessible in supplementary material as benchmark for the community.

      Comments on revisions:

      This is a very nice paper.

    2. Reviewer #2 (Public review):

      Summary:

      The authors propose a methodology to perform causal (temporal) discovery. The approach appears to be robust and is tested in the different scenarios: one related to live-cell imaging data, and another one using synthetic (mathematically defined) time series data. They compare the performance of their findings against another well-known method by using metrics like F-score, precision and recall,

      Strengths:

      --Performance, robustness, the text is clear and concise, The authors provide the code to review.

      Comments on revisions:

      The authors have addressed my concerns properly providing the needed explanations.

    3. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer #1 (Public review):

      Summary:

      This paper presents a data processing pipeline to discover causal interactions from time-lapse imaging data, and convicingly illustrates it on a challenging application for the analysis of tumor-on-chip ecosystem data. The core of the discovery module is the original tMIIC method of the authors, which is shown in supplementary material to compare favourably to two state-of-the-art methods on synthetic temporal data on a 15 nodes network.

      Strengths:

      This paper tackles the problem of learning causal interactions from temporal data which is an open problem in presence of latent variables. The core of the method tMIIC of the authors is nicely presented in connection to Granger- Schreiber causality and to the novel graphical conditions used to infer latent variables and based on a theorem about transfer entropy. tMIIC compares favourably to PC and PCMCI+ methods using different kernels on synthetic datasets generated from a network of 15 nodes. A full application to tumor-onchip cellular ecosystems data including cancer cells, immune cells, cancer-associated fibroblasts, endothelial cells and anti cancer drugs, with convincing inference results with respect to both known and novel effects between those components and their contact.

      The code and dataset are available online for the reproducibility of the results.

      We thank Reviewer #1 for highlighting the main results and strengths of our paper, as well as, for his/her recommendations below to further improve the manuscript.

      Weaknesses:

      The references to ”state-of-the-art methods” concerning the inference of causal networks should be more precise by giving citations in the main text, and better discussed in general terms, both in the first section and in the section of presentation of CausalXtract. It is only in the legend of the figures of the supplementary material that we get information. Of course, comparison on our own synthetic datasets can always be criticized but this is rather due to the absence of common benchmark and I would recommend the authors to explicitly propose their datasets as benchmark to the community.

      Following Reviewer #1’s suggestion, we now compare tMIIC’s performance to other state-of-the-art causal discovery methods for time series data in the main text and in a new Figure 2. This Figure 2 also highlights the relation between graph-based causal discovery methods for time series data and Granger-Schreiber temporal causality, as discussed in more details in Methods (Theorem 1).

      We also agree about the importance of sharing benchmark datasets with the community. This is the reason why we provide the dynamical equations of the 15-node benchmarks in Supplementary Tables 1 & 2, so that anyone can generate equivalent time series datasets of any desired length.

      Reviewer #2 (Public review):

      Summary:

      The authors propose a methodology to perform causal (temporal) discovery. The approach appears to be robust and is tested in the different scenarios: one related with live-cell imaging data, and another one using synthetic (mathematically defined) time series data. They compare the performance of their findings against another well-know method by using metrics like F-score, precision and recall,

      Strengths:

      Performance, robustness, the text is clear and concise, The authors provide the code to review.

      We thank Reviewer #2 for his/her positive assessment of our work and the suggestions below to improve the manuscript.

      Weaknesses:

      One concern could be the applicability of the method in other areas like climate, economy. For those areas, public data are available and might be interesting to test how the method performs with this kind of data.

      While our main expertise concerns the analysis of biological and biomedical data, we agree that tMIIC (which is included in MIIC R package) could in principle be applied to other areas, like climate, economy.

      We have not included benchmarks on such diverse types of datasets in the present manuscript, which focuses on CausalXtract’s pipeline for the analysis and causal interpretation of live-cell time-lapse imaging data from complex cellular systems.

    1. Reviewer #3 (Public review):

      This work brings a computational approach to the study of promoters and transcription. The paper is improved but there are still factual errors and implausible explanations. I am not convinced by the response from the authors, concerning the promoter -35 element, in their rebuttal.

      Comments on author rebuttal:

      - We respectfully but strongly disagree that our analysis has misrepresented the true nature of -35 boxes. First, accounting for more A's at position 5 in the PWM is not going to lead to a "critical error." This is because positions 4-6 of the motif barely have any information content (bits) compared to positions 1-3 (see Fig 1A).

      The analysis does misrepresent the consensus -35 element, which is, unequivocally, TTGACA. I agree that positions 4-6 of the element are less well-conserved.

      - This assertion is not just based on our own PWM, but based on ample precedent in the literature. In PMID 14529615, TTG is present in 38% of all -35 boxes, but ACA only in 8%.

      This does not mean that TTGACA is not the consensus, or that "ACA" is not important at promoters where it's present.

      - In PMID 29388765, with the -10 instance TATAAT, the -35 instance TTGCAA yields stronger promoters compared to the -35 instance TTGACA (See their Figure 3B).

      This is a known phenomenon and results from "perfect" promoters being limited at the point of RNA polymerase promoter escape (because the RNAP struggles to "let go" of perfect promoters). This does not mean the TTGACA is not the consensus. Indeed, and this is a key point, it is evident in the figure the authors refer to that TTGACA stimulates more transcription than alternative -35 sequences when -10 elements are not perfect.

      - In PMID 29745856 (Figure 2), the most information content lies in positions 1-3, with the A and C at position 5 both nearly equally represented, as in our PWM.

      The motif shown in this paper suffers from exactly the same issue as the paper under review; the variable spacing between the -35 hexamer and -10 element isn't taken into account by MEME.

      - In PMID 33958766 (Figure 1) an experimentally-derived -35 box is even reduced to a "partial" -35 box which only includes positions 1 and 2, with consensus: TTnnnn.

      This paper does not show an "experimentally-derived -35 box" in Figure 1 (or anywhere else, as far as I can see).

      - In addition, we did not derive the PWMs as the reviewer describes. The PWMs we use are based on computational predictions that are in excellent agreement with experimental results. Specifically, the PWMs we use are from PMID 29728462, which acquired 145 -10 and -35 box sequences from the top 3.3% of computationally predicted boxes from Regulon DB.

      The paper mentioned states "for the genomic RNAP logo, sequences were taken from computationally predicted RNAP binding sites on RegulonDB" so these are not experimentally defined promoters? It's not obvious from the paper, or regulon DB, which sequences these are or how they were predicted.

      - Thank you for pointing out that our original submission was incomplete in this regard. We address these concerns by new analyses, including some new experiments. First, Rho dependent termination is associated with the RUT motif, which is very rich in Cytosines (PMID: 30845912). Given that our sequences confer between 65%-78% of AT-content, canonical rho dependent termination is unlikely. However, we computationally searched for rho-dependent terminators using the available code from PMID: 30845912, but the algorithm did not identify any putative RUTs. Because this analysis was not informative, we did not include it in the paper.

      I don't believe it is the case that Rho absolutely requires a RUT sequence. My understanding is that, if an RNA is not translated, Rho will intervene (e.g. see PMID: 18487194).

      - We respectfully disagree that the reviewer's point is pertinent because what the reviewer is referring to is the likelihood that the sequence is a promoter, which indeed increases with AT content, but we are focused on the likelihood that a sequence becomes a promoter through DNA mutation

      I disagree that this distinction is relevant. An AT-rich sequence will much more closely resemble a promoter by chance than a GC rich sequence. As an extreme example, the sequence TTTTTT can be converted into a reasonable -10 element by one change (to TATTTT) but the sequence GGGGGG can't.

    1. Reusing code instead of repeating code: When we find ourselves repeating a set of actions in our program, we end up writing (or copying) the same code multiple times. If we put that repeated code in a function, then we only have to write it once and then use that function in all the places we were repeating the code.

      This explanation underscores the importance of using functions to promote efficiency and maintainability in programming. By reusing code, we reduce redundancy, make our programs easier to debug, and set a foundation for more scalable software development practices.

    1. You can tell how capitalism was inspired or infnluenced by colonialism which was older. This is shown in how one group or person has the msot pwoer and sets the rules and laws in a place. In colonialism, the country imposes laws, culture, and language on the group and in capitalism, the same can be said in how there might be a language policy or code of conduct in the company that everyone has to follow

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      The work from Petazzi et al. aimed at identifying novel factors supporting the differentiation of human hematopoietic progenitors from induced pluripotent stem cells (iPSCs). The authors developed an inducible CRISPR-mediated activation strategy (iCRISPRa) to test the impact of newly identified candidate factors on the generation of hematopoietic progenitors in vitro. They first compared previously published transcriptomic data of iPSCderived hemato-endothelial populations with cells isolated ex vivo from the aorta-gonadmesonephros (AGM) region of the human embryo and they identified 9 transcription factors expressed in the aortic hemogenic endothelium that were poorly expressed in the in vitro differentiated cells. They then tested the activation of these candidate factors in an iPSCbased culture system supporting the differentiation of hematopoietic progenitors in vitro. They found that the IGF binding protein 2 (IGFBP2) was the most upregulated gene in arterial endothelium after activation and they demonstrated that IGFBP2 promotes the generation of functional hematopoietic progenitors in vitro.

      Strengths:

      The authors developed an extremely useful doxycycline-inducible system to activate the expression of specific candidate genes in human iPSC. This approach allows us to simultaneously test the impact of 9 different transcription factors on in vitro differentiation of hematopoietic cells, and the system appears to be very versatile and applicable to a broad variety of studies.

      The system was extensively validated for the expression of 1 transcription factor (RUNX1) in both HeLa cells and human iPSC, and a detailed characterization of this test experiment was provided.

      The authors exhaustively demonstrated the role of IGFBP2 in promoting the generation of functional hematopoietic progenitors in vitro from iPSCs. Even though the use of IGFBP2interacting proteins IGF1 and IGF2 have been previously reported in human iPSC-derived hematopoietic differentiation in vitro (Ditadi and Sturgeon, Methods 2016; Ng et al., Nature Biotechnology 2016), and IGFBP-2 itself has been shown to promote adult HSC expansion ex vivo (Zhang et al., Blood 2008), its role on supporting in vitro hematopoiesis was demonstrated here for the first time.

      Weaknesses:

      Although the authors performed a very thorough characterization of the system in proof-ofprinciple experiments activating a single transcription factor, the data provided when 9 independent factors were used is not sufficient to fully validate the experimental strategy. Indeed, in the current version of the manuscript, it is not clear whether the results presented in both the scRNAseq analysis and the functional assays are the consequence of the simultaneous activation of all 9 TF or just a subset of them. This is essential to establish whether all the proposed factors play a role during embryonic hematopoiesis, and a more complete analysis of the scRNAseq dataset could help clarify this aspect.

      Similarly, the data presented in the manuscript are not sufficient to clarify at what stage of the endothelial-to-hematopoietic transition (EHT) the TF activation has an impact. Indeed, even though the overall increase of functional hematopoietic progenitors is fully demonstrated, the assays proposed in the manuscript do not clarify whether this is due to a specific effect at the endothelial level or to an increased proliferation rate of the generated hematopoietic progenitors. Similar conclusions can be applied to the functional validation of IGFBP2 in vitro.

      The overall conclusions are sometimes vague and not always supported by the data. For instance, the authors state that the CRISPR activation strategy resulted in transcriptional remodeling and a steer in cell identity, but they do not specify which cell types are involved and at what level of the EHT process this is happening. In the discussion, the authors also claim that they provided evidence to support that RUNX1T1 could regulate IGFBP2 expression. However, this is exclusively based on the enrichment of RUNX1T1 gRNA in cells expressing higher levels of IGFBP2 and it does not demonstrate any direct or indirect association of the two factors.

      We thank the reviewer for the positive comments about the importance of our work and have now addressed the points raised as weaknesses by performing additional analysis and experiments, adding a new schematic of the mechanism, and rewording our claims.

      We have clarified the different effects mediated by the activation and the IGFBP2 addition in a summary section at the end of the results and added Figure 6, showing this in visual form. We have also clearly stated the limitations related to the correlation between RUNX1T1 and IGFBP2 in the discussion and toned down our claims regarding this throughout the entire paper. We have also reworded the text to clarify the specific cell types identified in the sequencing data that we refer to.

      Reviewer #2 (Public Review):

      To enable robust production of hematopoietic progenitors in-vitro, Petazzi et al examined the role of transcription factors in the arterial hemogenic endothelium. They use IGFBP2 as a candidate gene to increase the directed differentiation of iPSCs into hematopoietic progenitors. They have established a novel induced-CRISPR mediated activation strategy to drive the expression of multiple endogenous transcription factors and show enhanced production of hematopoietic progenitors through expansion of the arterial endothelial cells. Further, upregulation of IGFBP2 in the arterial cells facilitates the metabolic switch from glycolysis to oxidative phosphorylation, inducing hematopoietic differentiation. While the overall study and resources generated are good, assertions in the manuscript are not entirely supported by the experimental data and some claims need further experimental validation.

      We thank the reviewer for the positive comments, and we have provided new data and analysis to make sure that all our assertations are clearly supported and also reworded those where limitations were identified by the reviewers.

      Recommendations for the authors:

      Reviewing Editor (Recommendations For The Authors):

      The assessment could change from "incomplete" to "solid" if the authors: i) improve data analysis (for both scRNAseq and functional assays) by providing additional information that could strengthen their conclusions, as suggested in the specific comments by both reviewers; ii) either provide new functional evidence supporting their mechanistic conclusion or alternatively tone down the claims that are not fully supported by data and acknowledge the limitations raised by reviewers in the discussion; (iii) the issue of paracrine signaling to expand only hematopoietic progenitors needs to be addressed.

      We have now improved the data analysis and provided additional functional tests to strengthen our conclusions and toned down those that were identified by the reviewers as not supported enough and included a discussion on these limitations. We have also reworded the section about the paracrine signaling throughout the paper.

      Reviewer #1 (Recommendations For The Authors):

      Figure 1 contains exclusively published data. It might be more appropriate to use it as a supplementary figure or as part of a more exhaustive figure (maybe combining Figures 1 and 2 together?).

      Figure 1 contained novel bioinformatic analyses that represent the base of our research and it has a different content and focus to figure 2, which is already a large figure. We therefore believe it is better to keep it as a separate figure, containing a new panel now too. 

      It seems there is an issue with Figure S3 labelling:

      • In line 112, Figure S2A-B does not display genomic PCR and sequencing results;

      • In line 123, Figure S3D-E does not show viability and proliferation data;

      • In line 127, Figure S3G does not show mCherry expression in response to DOX;

      We apologies for the confusion with the numbers, we have now correctly labelled the figures.

      It would be more informative to include gates and frequency on flow cytometry plots in Figure S3, to be able to evaluate the extent of the reduction in mCherry expression.

      We have now included the gating and frequency of mCherry-expressing cells in Supplementary Figure 3D.

      It is not clear from the text and figures whether the SB treatment was maintained throughout the hematopoietic differentiation protocol (line 122):

      • If so, it would be important to confirm that HDAC treatment does not affect EHT cultures

      • If not, can the authors provide some evidence that transgene silencing is not occurring during hematopoietic differentiation?

      We have clarified that we decided to treat the cells with SB exclusively in maintenance condihons because HDACs have been shown to be essenhal for the EHT (lines 138-142). We have now also included addihonal data showing the high expression of the mCherry tag reporhng the iSAM expression on day 8 (Supplementary Figure 4F).

      Can the authors provide a simple diagram summarizing the experimental strategy for each differentiation experiment in the respective supplementary figure? For instance, at what stage of the protocol was DOX added in Figure 3? Or at what stage IGFBP2 was added in Figure 5? It would be a very useful addition to the interpretation of the results.

      We have now included three schemahcs for all the experiments in the manuscript in supplementary figure 4 A-C.

      In Figure 3, the authors should provide more detailed information about the data filtering of the scRNAseq experiment, and more specifically:

      • How many cells were included in the analysis for each library after QC and filtering?

      • How "cells in which the gRNAs expression was detected" were selected? Do they include only cells showing expression of gRNAs for all 9 TF?

      This informahon is now included in the method sechon lines 773-781; the detailed code is available on the GitHub link provided in the same sechon. We have filtered the cells expressing one gRNA for the non-targehng gRNA (iSAM_NT) control and more than one for the iSAM_AGM sample. 

      In Figure 3A, it is not clear whether the expression of the 9 factors is consistently detected in all cells or just a subset of them, and the heatmap in Figure 3A does not provide this information. It would be more accurate to provide expression on a per-cell basis, for instance, as a violin plot displaying single dots representing each cell. 

      We have now included this violin plot in Supplementary Figure 4G as requested. However, this visualisation is difficult to interpret because some of the target genes’ expression seems variable in both experimental and control conditions. We had envisaged that this could have been the case and so this is why we had included the three different controls.  For this reason we chose to show the normalised expression which takes all the different variables into account (Figure 3A). 

      In Figure 3B-C, it seems that clusters EHT1 and EHT2 do not express endothelial markers anymore. Are these fully differentiated hematopoietic cells rather than cells undergoing EHT? In general, it would be quite important to provide evidence of expressed marker genes characterizing each cluster (eg. heatmap summarizing top DEG in the supplementary figure?). 

      We have now provided a spreadsheet containing the clusters’ markers that we used in

      Supplementary Table 1) a heatmap in Figure 3E. Furthermor,e we have now edited Figure 3C to include Pan Endothelial markers (PECAM1 and CDH5). These data show that the EHT1 and EHT2 cluster both express endothelial markers but are progressively downregulated as expected during endothelial to hematopoietic transition. We have also included and discussed this in the manuscript lines 192-195 and a schematic for the mechanism in Figure 6.

      In Figure 3E, displaying the proportion of clusters within each sample/library would be a more accurate way of comparing the cell types present in each library (removing potential bias introduced by loading different numbers of cells in each sample).

      We have now included the requested data in Supplementary Figure 4I and it confirms again the expansion of arterial cells in the activated cells.    

      In Figure 3G, by plating 20,000 total CD34+, the assay does not account for potential differences in sample composition. It is then hard to discriminate between the increased number of progenitors in the input or an enhanced ability of HE to undergo EHT. This is an important aspect to consider to precisely identify at what level the activation of the 9 factors is acting. A proper quantification of flow cytometry data summarizing the % of progenitors, arterial cells, etc. would be useful to interpret these results.

      Lines 204-205 reworded. We are very much aware of the fact that the CD34+ cell population consists of a range of cells across the EHT process and this is precisely why we carried out this single cell sequencing analyses.  We purposely tested the effect of the observed changes in composition by colony assays

      In Figure 3G, it seems that NT cells w/o DOX have very little CFU potential (if any). Can the authors provide an explanation for this?

      We think that the limited CFU potential is due to the extensive genetic manipulation and selection that the cells underwent for the derivation of all the iSAM lines but this did not impede us from observing an effect of gene activation on CFU numbers. This is one of the primary reasons that we then validated our overall findings using the parental iPSC line in control condition and with the addition of IGFBP2. We show that the parental iPSC line gives rise to hematopoietic progenitor, both immunophenotypically (Figure 4D) and functionally, at expected levels (Figure 4B left column).

      Figure 4A shows an upregulation of IGFBP2 in arterial cells as a result of TF activation. However, from the data presented here, it is not possible to evaluate whether this is specific to the arterial cluster, or it is a common effect shared by all cell types regardless of their identity. 

      Data has now been included in Supplementary Figure 4H, which shows that all the cells show an increase in IGFBP2, but arterial cells show the highest increase. We have now edited the text to reflect this, in lines 228-230.

      In Figure 5A-B only a minority of arterial cells express RUNX1 in response to IGFBP2 treatment. Is this sufficient to explain the very significant increase in the generation of functional hematopoietic progenitors described in Figure 4? Quantification and statistical analysis of RUNX1 upregulation would strengthen this conclusion.

      We have now provided the statistical analysis showing significant upregulation of RUNX1 upon IGFBP2 addition. The p values are now provided in the figure 5 legend.

      In Figure 5 the authors conclude that IGFBP2 remodels the metabolic profile of endothelial cells. However, it is not clear which cell types and clusters were included in the analysis of Figure 5C-G. Is the switch from Glycolysis to Oxidative Phosphorylation specific to endothelial cells? Or it is a more general effect on the entire culture, including hematopoietic cells? 

      We based this conclusion on the fact that the single-cell RNAseq allows to verify that the metabolic differences are obtained in the endothelial cells. Given that we sorted the adherent cells, the majority of these are endothelial cells as shown in Figure 5A. The Seahorse pipeline includes a number of washing steps resulting in the analyses being performed on the adherent compartment which we know consists primarily of endothelial cells. We cannot exclude some contamination from non-endothelial cells but we highlight to this reviewer that the initial observation of the metabolic changes was identified in endothelial cells in the single cell sequencing data. Taken together, we believe that this implies that metabolic changes are specific to this population. We have clarified this in the line 317.

      In the discussion, the authors conclude that they "provide evidence to support the hypothesis that RUNX1T1 could regulate IGFBP2 expression". To further support this conclusion, the authors could provide a correlation analysis of the expression of the two genes in the cell type of interest. 

      Following the observation of the IGFBP2 high expression across clusters, we have now reworded this sentence in lines 382-385  We have tried to perform the correlation analysis but we believe this not to be appropriate due to the detection level of the gRNA, we have now included this as a limitation point in the discussion lines 416-427, and also toned down the conclusion we did draw about RUNX1T1 throughout the whole manuscript.

      As mentioned by the authors, IGFBP2 binds IGF1 and IGF2 modulating their function. Both IGF1 (http://dx.doi.org/10.1016/j.ymeth.2015.10.001) and IGF2 (doi:10.1038/nbt.3702) have been used in iPSC differentiation into definitive hematopoietic cells. It would be relevant to discuss/reference this in the discussion.

      We have now included the suggested reference in the section where we discuss the role of IGFBP2 in binding IGF1 and IGF2.

      Reviewer #2 (Recommendations For The Authors):

      (1) Figure 1 compares the transcriptome of human AGM and in-vitro derived hemogenic endothelial cells (HECs). It is not clear why only the genes downregulated in the latter were chosen. Are there any significantly upregulated genes, knockdown/knockout which could also serve a similar purpose? Single-cell transcriptome database analysis is very preliminary. A detailed panel with differences in cluster properties of HECs between the two systems should be provided. A heatmap of all differentially expressed genes between the two samples must be generated, along with a logical explanation for choosing the given set of genes. 

      We have now included another panel in figure 1 to better clarify the logic behind the strategy used to identify our target genes (Figure 1A).

      (2) Figure 2 - a panel describing the workflow of gRNA design and targeting for the 9 candidate genes, along with lentiviral packaging and transduction would make it easier to follow. 

      We have now included three schematics for all the experiments in the manuscript in supplementary figure 4 A-C. 

      (3) Figure 3- to assess the effect of arterial cell expansion on the emergence of hematopoietic progenitors, CD34+ Dll4+ cells should be sorted for OP9 co-culture assay.

      Using only CD34+ cells does not answer the question raised. Also, the CFU assay performed does not fully support the claim of enhanced hematopoietic differentiation since only CFU-E and CFU-GM colonies are increased in Dox-treated samples, with no effect on other colony types. OP9 co-culture assay with these cells would be required to strengthen this claim. 

      We wanted to clarify that the effect on the methylcellulose coming from the activated cells was not limited to CFU-E, as the reviewer reported; instead, it also affected CFU-GM and CFU-M. 

      We have now performed additional experiments where we sorted the CD34+ compartment into DLL4- and DLL4+ in Supplementary Figure 5D-E, which we discussed in lines 250-258. 

      (4) In Figure 3F, there appears to be a lot of variation in the DLL4% fold change values for

      DOX treated iSAM_AGM sample, which weakens the claim of increased arterial expansion.

      Can the authors explain the probable reason? It is suggested that the two other controls (iSAM_+DOX and iSAM_-DOX) should be included in this analysis. It is imperative to also show % populations rather than just fold change to gain confidence.

      We agree that there is a lot of variability. That is because differentiation happens in 3D in embryoid bodies, which contain many different cell types that differentiate in different proportions across independent experiments. We have now included the raw data in Supplementary Figure 4 D, with additional statistical analysis to show the expansion of arterial cells including also the suggested additional controls.

      (5) How does activation of these target genes cause increased arterialization? Is the emergence of non-HE populations suppressed? Or is it specific to the HE? The data on this should be clarified and also discussed. ANTO/Lesley text

      We have provided additional data clarifying the connection between increased arterialisation and hemogenic potential. We showed that the activation induces increased arterialisation and that IGFBP2 acts by supporting the acquisition of hemogenic potential. We have discussed this in lines 326-348 and provided a new figure to explain this in detail (figure 6)

      (6) Considering that IGFBP2 was chosen from the activated target gene(s) cluster, can the authors explain why the reduced CFU-M phenomenon observed in Figure 3G does not appear in the MethoCult assay for IGFBP2 treated cells (Figure 4B)?

      The difference could be explained by the fact that in Figure 3G, the cells underwent activation of multiple genes, while in Figure 4B, they were only exposed to IGFBP2. Our results show that IGFBP2 could at least partially explain the phenotype that we see with the activation, but we believe that during the activation experiments, there might be other signals available that might not be induced by IGFBP2 alone. We have also added a summary section and a figure to clarify the different mechanisms of action of the gene activation and IGFBP2.

      (7) Figure 4- while the experiments conducted support the role of IGFBP2 in increasing hematopoietic output, there is no experimental evidence to prove its function through paracrine signalling in HECs. The authors need to provide some evidence of how IGFBP2 supplementation specifically expands only the hematopoietic progenitors. Experimental strategies involving specifically targeting IGFBP2 in hemogenic/arterial endothelial cells are required to prove its cell type specific function. Additionally, assessing the in vivo functional potential of the hematopoietic cells generated in the presence of IGFBP2, by bone-marrow transplantation of CD34+ CD43+ cells, is essential. 

      The role of IGFBP2 in the context of HSC production and expansion was not the topic of our research, and we have not claimed that IGFBP2  affects the long-term repopulating capacity of HSPCs. Therefore, we believe that the requested experiments are not required to support the specific claims that we do make. We have now provided more experiments and bioinformatic analysis that support the role of IGFBP2 in inducing the progression of EHT from arterial cells to hemogenic endothelium, and to avoid misunderstandings, we have toned down our claims by editing the text regarding its paracrine effect s. 

      (8) Figure 4C-D -It is recommended to plot % populations along with fold change value. As this is a key finding, it is important to perform flow cytometry for additional hematopoietic markers- CD144, CD235a and CD41a to demonstrate whether this strategy can also expand erythroid-megakaryocyte progenitors. Telma

      Figure 4C already shows the percentage values; we have now added the percentage for Figure 4D in SF5C. We have also performed additional analysis as requested and added the data obtained to Supplementary Figure 5D.

      (9) In Figure 5, analysis showing the frequency of cells constituting different clusters, between untreated and IGFBP2-treated samples in the single-cell transcriptome analysis is essential. Additional experiments are required to validate the function of IGFBP2 through modulation of metabolic activity. Inhibition of oxidative phosphorylation in the IGFBP2treated cells should reduce the hematopoietic output. Authors should consider doing these experiments to provide a stronger mechanistic insight into IGFBP2-mediated regulation of hematopoietic emergence.

      We have now included the requested cluster composition in Supplementary Figure 5F. We decided not to include further tests on the metabolic profile of IGFBP2 as we already discussed in other papers that showed, using selective inhibitors, that the EHT coincides with a glycol to OxPhos switch. 

      (10) It is very striking to see that IGFBP2 supplementation changes the transcriptional profile of developing hematopoietic cells by increasing transcription of OXPHOS-related genes with concomitant reduction of glycolytic signatures, particularly at Day 13. However, the mitochondrial ATP rate measurements do not seem convincing. The bioenergetic profiles show that when mitochondrial inhibitors are added, both groups exhibit decreased OCR values and, on the other hand, higher ECAR. This indicates that both groups have the capability to utilize OXPHOS or glycolysis and may only differ in their basal respiration rates.

      Differences in proliferation rate can cause basal respiration to change. There is no information on how the bioenergetic profile was normalized (cell no./protein amount). Given that IGFBP2 has been shown to increase proliferation, it is very likely that the cells treated with IGFBP2 proliferated faster and therefore have higher OCR. The data needs to be normalized appropriately to negate this possibility.

      We have previously tested whether IGFBP2 causes an increase in proliferation by analysing the cell cycle of cells treated with it, as we initially thought this could be a mechanism of action. We have now provided the quantification of the cell cycle in the cells treated with IGFBP2, showing no effect was observed in cell cycle Supplementary Figure 4E. Following this analysis, we decided to plate the same number of cells and test their density under the microscope before running the experiment; each experiment was done in triplicate for each condition. We have now added this info to the method sections lines 806-813.  We did not comment on the basal difference, which we agree might be due to several factors, but we only compared the difference in response to the inhibitors, which isn’t affected by the basal level but exclusively by their D values. We have also included the formulas used to calculate the ATP production rate.

      Overall, it appears that IGFBP2 does not seem to primarily cause metabolic changes, but simply accelerates the metabolic dependency on OXPHOS. Hence, the term 'metabolic remodelling' must be avoided unless IGFBP2 depletion/loss of function analysis is shown.

      We thank the reviewer for suggesting how to interpret the data about the dependency on OXPHOS. We have now changed the conclusions and claims about the effect of IGFBP2. We have also included a cell cycle analysis of the hematopoietic cells derived upon IGFBP2 addition to show that they don’t show differences in proliferation that could cause the increase in colony formation we observed. Regarding the assay, we have plated the same number of cells for each group to make sure we were comparing the same number of cells, which we also assessed in the microscope before the test, and we eliminated the suspension cells during the washes that preceded the measurement. The review is correct in indicating that there is a basal difference in the value of OCR and ECAR where the IGFBP2 is lower at the start and not higher, which would not conceal higher proliferation. Finally, the ATP production rate is calculated on the variation of OCR and ECAR upon the addition of inhibitors, which normalizes for the basal differences.

    1. Reference #18.aa2d3e17.1733239827.ea4e98df

      Explanation:

      The annotated text "Reference #18.aa2d3e17.1733239827.ea4e98df" appears to be a reference code or link to an external resource, likely pertaining to the Spanish bill mentioned in the user question. The significance of this annotation lies in its function as a placeholder or identifier for additional information that is not directly included in the provided text. This reference could potentially lead to a detailed document, database, or error page that contains the key provisions of the bill or outlines its relationship to other bills.

      The implications of this annotation are multifaceted:

      1. Identification and Retrieval: The reference code is crucial for locating the specific document or webpage that contains the relevant details about the Spanish bill. This ensures that users can access comprehensive information that may not be immediately visible in the main text.

      2. Contextual Linking: By referencing an external source, the annotation suggests that the full understanding of the bill's provisions and related legislation requires consulting the linked material. This highlights the interconnected nature of legal documents and the importance of cross-referencing for thorough legal analysis.

      3. Potential Error: The URL provided in the reference indicates an "errors.edgesuite.net" domain, which might imply that the link is broken or leads to an error page. This could signify issues with accessing the necessary information, suggesting the need for alternative methods to obtain the details about the bill.

      In summary, the annotated text serves as a critical reference point for accessing detailed information regarding the Spanish bill. Its significance is rooted in its role as a connector to external resources, which are essential for a full understanding of the bill's key provisions and its relationship to other legislation.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Reviewer 1:

      - The manuscript needs comprehensive proofreading for language and formatting. In many instances, spaces are missing or not required.

      Thank you for your comments. The manuscript has been thoroughly proofread for errors in language and formatting.

      - Could the authors explore correlation network analyses to get additional insights into the structure of different clusters? 

      We have added a co-occurrence analysis (at species taxonomic level) based on SparCC to the manuscript (Figure 2).

      This is described on Page 9 line 141-148

      - The GitHub link is not correct. 

      The github repository has now been made public.

      - It is not possible to access the dataset on ENA. 

      We have changed the ENA study PRJEB57401 status to open.

      - Add the graphs obtained with decontam analysis as a supplementary figure. 

      We have added the outputs of decontam (.csv files with feature lists of ASVs that were filtered based on the prevalence and frequency tests) to the github repository.

      - There is nothing about the RPL group in the results section, while the authors discuss this issue in the introduction. What about the controls with proven fertility? 

      Thank you. We have amended the manuscript to compare characteristics between the RPL, unexplained subfertility and controls groups.

      Line 1279-130 page 8:  

      “The study group represented 85% of samples with high sperm DNA fragmentation, 85% of samples with elevated ROS and 79% of samples with oligospermia. Rates of abnormal seminal parameters including low sperm concentration, reduced progressive motility and ROS concentrations were found to be highest in the MFI group (Supplementary Figure 1). Baseline characteristics between the RPL, unexplained subfertility and controls groups were similar.

      Line 150-154 Page 9: 

      “Bacterial richness, diversity and load were similar between all patient groups examined in the study (Supplementary Figure 4).

      - While correctly stated in the title, the term microbiota should be used throughout the manuscript instead of "microbiome" 

      Thank you. This misnomer has been amended throughout the manuscript.

      Minor corrections:

      Line 25: provoke is not a good term here. 

      Thank you. The term ‘provoke’ has been removed

      Line 26: why does semen culture have a limited scope? 

      Thank you. Line 40-41 Page 3 has been amended:

      “It is therefore plausible that asymptomatic seminal infections may be associated with impaired reproductive function in some men. Since semen culture has a limited scope for studying the seminal microbiota due to its inability to identify all present microbiota next generation sequencing (NGS) approaches have been reported recently by a growing number of investigators (13, 14, 15, 16, 17, 18, 19)”.

      Line 68: write μl correctly

      Thank you. This has been corrected

      Line 131: several organisms at the genus level. 

      Thank you. This has been corrected

      Line 136: what are the relative abundances of these genera? Is this relevant? 

      The mean relative abundances for the key taxa mention in each cluster are all above 20%. This information has been added to the manuscript text on page 9, line 153.

      Line 173: Molina et al. 

      Thank you. This has been corrected

      Line 173: the contaminations are referred to the low biomass nature of testicular samples. If present, bacteria of accessory gland secretions are an integral part of the seminal microbiota itself. Please review these sentences. 

      Thank you. This had been reworked to highlight the important of urethral contamination, which you later allude to as a limitation of our study is the failure to provide paired urine and semen samples.

      Page 11 line 194-196

      “Molina et al report that 50%-70% of detected bacterial reads may be environmental contaminants in a sample from extracted testicular spermatozoa (35); with the addition of passage along the urethra it is likely that contamination of ejaculated semen would be much higher.”

      Table 1: remove results interpretation from table caption. 

      Thank you this has been acted upon.

      Table 1: why in some cases, like in DNA fragmentation index, the total is not equal to n=223? 

      This is due to missing data/ analysis not possible for some men due to the requirement of a minimum number of sperm in the ejaculate to perform DNA fragmentation testing.

      Table 1: "frag" is not defined. 

      Thank you, this has been amended

      Tables 2, 3 & 4: bacterial genera in italics. 

      Thank you, this has been amended

      Figure 1A: add the fertility status information above the cluster colors. 

      Thank you, this has been amended in Figure 1.

      Figure 1C: the color code is confusing. Use different colors for each cluster. 

      Figure 1 legend: bacterial genera in italics. 

      Figures 1 & 2: the authors should use similar chart formatting in the two tables. 

      Thank you, this has been amended

      Reviewer 2:

      (1) The patient groups have different diagnoses and should be handled as different groups, and not fused into one 'patient' group in analyses. <br /> Why are the data in tables presented as controls and cases? I would consider men from couples with recurrent pregnancy loss, unexplained infertility, and male factor infertility to have different seminal parameters (not to fuse them into one group). This means, that the statistical analyses should be performed considering each group separately, and not to fuse 3 different infertility diagnoses into one patient group. 

      We have conducted detailed analyses, requested by the reviewer, comparing seminal DNA, ROS and microbiota characteristics between each individual patient groups (Supplimental figures 1 and 4). No specific taxa (at either genera or species-level) were found to differ in relative abundance between the diagnostic groups. However, we expect associations between parameters such as reactive oxygen species, or DNA fragmentation, and relative abundance of bacterial species, to be general and not restricted to or specific to each diagnostic group. Therefore, we also conducted further analyses aggregating data from all patient groups to investigate relationships common to these different forms of male reproductive dysfunction.

      (2) Were any covariables included in the statistical analyses, e.g. age, BMI, smoking, time of sexual abstinence, etc? 

      Covariates were not included in the statistical analyses. This has been added in the manuscript to the limitations.

      Page 14 line 267-268

      “Additionally, we did not have other covariables such as smoking status with which to include in further analyses”.

      (3) Furthermore, it is known that 16S rRNA gene analysis does not provide sensitive enough detection of bacteria on the species level. How much do the authors trust their results on the species level? 

      The limitations of taxonomic assignment using 16S rRNA gene metataxonomics are well documented. However, the capacity to assign sequence amplicons at species level depends on the sequence variability of the 16S rRNA gene for each of the taxa reported and the specific gene region chosen. In this study, amplification of the V1-V2 region was performed using a mixed 28f primer set (see methods for details) that enables resolution and assignment of several bacterial species highly relevant to the reproductive tract including Lactobacillus spp., such as L. crispatus and L. iners, (e.g. https://doi.org/10.3389/fcell.2021.641921, https://doi.org/10.1128/msystems.01039-23, https://doi.org/10.1186/s12915-023-01702-2). In this study, we report the presence of L. iners, but not L. crispatus in semen samples, and we have also identified a specific association/co-occurrence between Gardnerella vaginalis and Lactobacillus iners, similar to that observed in vaginal bacterial communities.

      (4) Were the analyses of bacterial genera and species abundances with seminal quality parameters controlled for diagnosis and other confounders? 

      As stated in point 2, no adjustment was made for co-variates. No differences in microbiome composition were observed among the three diagnostic groups, so no adjustments were made to our analysis.

      (5) The authors stress that their study is the biggest on the microbiome in semen. However, when considering that the study consists of 4 groups (with n=46-63), it does not stand out from previous studies. 

      Our study is overall the largest investigating interactions between the seminal microbiome and male reproductive dysfunction. Other studies have included greater numbers of men with infertility.

      (6) Weaknesses: There is a lack of paired seminal/urinal samples. 

      Thank you. This limitation has been added.

      Page 14 line 266-267

      “A further limitation of this study, and others, is the lack of reciprocal genital tract microbiota testing of the female partners, or paired seminal and urinary samples from male participants”.

      Recommendation for authors to consider:

      Including previous classical reviews in the introduction: DOI:10.1097/MOU.0000000000000742 <br /> DOI: 10.1038/s41585-019-0250-y 

      Thank you. This has been added.

      Mentioning in the M&M section that there is a supplementary text with a more detailed M&M part. 

      Thank you. This has been added. Further methodological detail can be found in supplementary text.

      Revising the use of 'microbiota' and 'microbiome', they are not synonyms. When talking of 16S rRNA gene analysis, we consider 'microbiome' analysis. 

      Thank you. This misnomer has been amended throughout the manuscript.

      Revising the text, there are several erratas (e.g. verb missing, etc). 

      Thank you for your comments. The manuscript has been thoroughly proofread for errors in language and formatting.

    1. Google Wave is still only a glimmer in Google engineers’ eyes; the company just made source code available to select developers today.

      Wave glimmer in enginners' eye

    1. Total: 75/100

      Original Content 30/30

      Technical Understanding 25/30: * Use of Required Tools (6/6) * The project effectively integrates tools such as the image classifier, video input, and audio playback. * Integration of Technology (4/5) * The use of classification, video display, and label-based progression is seamless and functional. * The classification results are used effectively to trigger specific actions and audio playback. However, there is no fallback logic if classification fails or if the environment lacks recognized objects. * Functionality of Classifier Models (3/5) * The classifier operates with sufficient functionality, identifying objects and triggering associated instructions. * The model is not retrained, as indicated in the code comments, which could reduce its accuracy in recognizing project-specific elements. * Interactivity and Feedback (4/5) * The user is guided through the experience with dynamic audio and text feedback based on classified labels. * Feedback is engaging and humorous but could be enhanced with more visual or interactive cues for the detected elements. * Complexity of Decision Tree or Flowchart (3/4) * The flow is linear but has some branching logic tied to specific labels. This creates a guided but somewhat limited exploration experience. * Increasing the number of paths or conditions could enrich the overall structure and align more closely with the dérive concept. * Creative Application of Rules (5/5) * The rules are really creative, with witty and engaging text prompts that enhance the psychogeographic aspect of the project.

      Timely Submission 20/40

    1. no-code way to create custom feeds? Try out SkyFeed or Bluesky Feed Creator

      This seems interesting. Do I still consume the feeds on the Bluesky application?

    1. On this page, I share my attempt at achieving a beautiful glass effect, along with sample code and assets for anyone who wants to explore this technique themselves.

      via mikael

      I wish that MySpace profiles hadn't been lost. I wasn't technical back then (n.b. this is nostalgia about when I was a literal child) but I used all kinds of tools and generators to try to get the visual look I wanted, and the look of translucency above a large photo background was a big one. Why did I like it? What was the influence? For the life of me I can't identify one.

  3. Nov 2024
    1. State management is a complex topic in Flutter with various approaches to choose from.

      "State management is a complex topic."

      Provider: A commonly used state management solution in Flutter.

      "Provider package"

      Riverpod offers compile safety and testing without depending on the Flutter SDK, similar to Provider.

      "Riverpod works in a similar fashion to Provider. It offers compile safety and testing without depending on the Flutter SDK."

      setState is the low-level approach for widget-specific, ephemeral state.

      "The low-level approach to use for widget-specific, ephemeral state."

      ValueNotifier & InheritedNotifier use Flutter's built-in tools to update state and notify the UI of changes.

      "An approach using only Flutter provided tooling to update state and notify the UI of changes."

      InheritedWidget & InheritedModel facilitate communication between ancestors and children in the widget tree and are the foundation for other state management solutions.

      "The low-level approach used to communicate between ancestors and children in the widget tree. This is what provider and many other approaches use under the hood."

      June is a lightweight and modern library focusing on a pattern similar to Flutter's built-in state management.

      "A lightweight and modern state management library that focuses on providing a pattern similar to Flutter's built-in state management."

      Redux is a state container approach familiar to web developers, suitable for managing application state.

      "A state container approach familiar to many web developers."

      Fish Redux is an assembled Flutter application framework based on Redux, suitable for medium and large applications.

      "Fish Redux is an assembled flutter application framework based on Redux state management. It is suitable for building medium and large applications."

      BLoC / Rx comprise a family of stream/observable-based patterns for state management.

      "A family of stream/observable based patterns."

      GetIt is a service locator approach that doesn't require a BuildContext for state management.

      "A service locator based state management approach that doesn't need a BuildContext."

      MobX is a popular library based on observables and reactions for state management.

      "A popular library based on observables and reactions."

      Dart Board is a modular feature management framework designed to encapsulate and isolate features in Flutter applications.

      "A modular feature management framework for Flutter. Dart Board is designed to help encapsulate and isolate features, including examples/frameworks, small kernel, and many ready-to-use decoupled features such as debugging, logging, auth, redux, locator, particle system and more."

      Flutter Commands uses the Command Pattern and ValueNotifiers for reactive state management.

      "Reactive state management that uses the Command Pattern and is based on ValueNotifiers."

      Binder is a state management package using InheritedWidget at its core, promoting separation of concerns.

      "A state management package that uses InheritedWidget at its core. Inspired in part by recoil. This package promotes the separation of concerns."

      GetX is a simplified and powerful reactive state management solution.

      "A simplified reactive state management solution."

      states_rebuilder combines state management with dependency injection and an integrated router.

      "An approach that combines state management with a dependency injection solution and an integrated router."

      Triple Pattern (Segmented State Pattern) uses Streams or ValueNotifier, focusing on three core values: Error, Loading, and State.

      "Triple is a pattern for state management that uses Streams or ValueNotifier. This mechanism (nicknamed triple because the stream always uses three values: Error, Loading, and State), is based on the Segmented State pattern."

      solidart is a simple yet powerful state management solution inspired by SolidJS.

      "A simple but powerful state management solution inspired by SolidJS."

      flutter_reactive_value offers a minimalistic solution, allowing newcomers to add reactivity without complexity.

      "The flutter_reactive_value library might offer the least complex solution for state management in Flutter. It might help Flutter newcomers add reactivity to their UI, without the complexity of the mechanisms described before."

      Elementary provides a straightforward way to build Flutter applications with MVVM, enhancing productivity and testability.

      "Elementary is a simple and reliable way to build applications with MVVM in Flutter. It offers a pure Flutter experience with clear code separation by responsibilities, efficient rebuilds, easy testability, and enhancing team productivity."

      Developers are encouraged to review these options to select an approach that best fits their use case.

      "If you feel that some of your questions haven't been answered, or that the approach described on these pages is not viable for your use cases, you are probably right."

    1. Bayesian neural network on the supervised regression task

      The annotated text, "Bayesian neural network on the supervised regression task," is highlighted to emphasize a key methodological innovation in the research presented in the paper. Here’s a comprehensive explanation of its significance:

      Explanation

      The highlighted phrase "Bayesian neural network on the supervised regression task" is significant because it underscores a novel approach in the study of cosmological density fields and warm dark matter (WDM) models. By leveraging a Bayesian neural network, the authors introduce a machine learning technique that can perform a supervised regression task to predict the optical depth-weighted density field (Δτ) from the Lyman-α forest flux field. This method offers several transformational benefits:

      1. Enhanced Predictive Accuracy: The Bayesian neural network is trained to predict not only the density field but also the uncertainty in its reconstruction. This dual prediction capability allows for more robust and reliable inferences about the underlying cosmological structures.

      2. Efficient Data Utilization: The machine learning approach enables the extraction of meaningful constraints on WDM particle masses using significantly less observational data compared to traditional methods like Markov Chain Monte Carlo (MCMC) techniques. This efficiency is crucial for advancing our understanding of dark matter with limited data resources.

      3. Innovative Statistical Inference: The implementation of a Bayesian framework allows the researchers to quantify and propagate uncertainties through the statistical analysis pipeline. This results in more precise and credible constraints on the properties of dark matter particles.

      4. Validation on Diverse Datasets: The neural network's performance is validated against both the Sherwood-Relics simulation suite and the Nyx simulation code, demonstrating its generalization capability across different hydrodynamical solvers and physical prescriptions.

      Overall, the use of a Bayesian neural network for supervised regression in this context represents a significant methodological advancement in astrophysics and cosmology. It enables more accurate reconstructions of the intergalactic medium density field and provides tighter constraints on warm dark matter models, thus contributing to our broader understanding of the universe's structure and composition.

    Annotators

    1. Des Signes were commissioned by the Élysée together with the Musée national de l’histoire de l’immigration to design a commemorative plaque and a visual identity to mark all Algerian-French memorial sites.

      So clearly this is meant to let us admire the monogram and wordmark, but actually what's grabbing me is the use of the repeated QR code. QR codes vibrate with potential in bringing hypertextuality to printed (or knitted or etc. etc.) materials, but something about how they look stamped onto posters and stickers and such has always turned me off. (If someone were out there getting good results with a vegan diffusion model, this kind of nonsense wouldn't be entirely unappealing to me, pursued with a bit more taste...) But the simple repeat as a border changes how this reads, so profoundly I feel stupid for not having ever tried it myself.

      Sometimes it's fun to live in the time you live in!

    1. Author response:

      Reviewer #1 (Recommendations for the authors):

      (1) Storyline and Narrative Flow:

      Consider revising the manuscript to create a more coherent and consistent narrative. Clarify how each section of the study-particularly the transition from multi-omics data integration to single-cell RNA-seq validation-contributes to the overall research question. This will help readers better understand the logical flow of the study.

      In the upcoming revisions, we will optimize the logical connections between sections of the manuscript to clarify the role each part plays in the overall research question, making it easier for readers to follow.

      (2) Immune Cell Activity Analysis:

      Reevaluate the methods used to assess immune cell activities within the context of the tumor microenvironment. Consider providing additional justification for the relevance of using the cancer cell model for this analysis. If necessary, explore alternative methods or models that might offer more meaningful insights into immune-tumor interactions.

      We fully recognize the importance of using tumor models to analyze and validate immune activity results, and we are considering experimental research in this area in future projects.

      (3) Single-Cell RNA-Seq Validation:

      Expand the validation of your findings using single-cell RNA-seq data. This could include more in-depth analyses that explore the heterogeneity within the subtypes and confirm the robustness of your classification method at the single-cell level. This would strengthen the support for your claims about the relevance of the identified subtypes.

      In the current study, we have applied the obtained multi-omics profiling features to single-cell sequencing data to classify malignant cells. We analyzed the metabolic and cell communication differences between different subtypes of malignant cells and explored potential reasons for these differences. Next, we plan to conduct further analysis of the differences between malignant cell subtypes to identify additional clues and mechanisms underlying these variations.

      (4) Methodological Justification:

      Provide a more detailed rationale for the selection of machine learning algorithms and integration strategies used in the study. Explain why the chosen methods are particularly well-suited for this research, and discuss any potential limitations they might have.

      In the revised manuscript, we will include descriptions of the principles of these analytical methods, as well as examples of their application in other studies, to discuss the rationale and limitations of applying these methods in this research.

      (5) Figures and Visualizations:

      Improve the clarity of your figures by addressing the following:

      a) Figure 3A: Cluster the pathways to make the comparisons clearer and more meaningful.

      b) Figure 4A: Clearly explain the significance of the blue bar.

      c) Figure 4B: Ensure this figure is discussed in the main text to justify its inclusion.

      d) Figure 7C: Enhance the figure legend to provide more informative details.

      Additionally, ensure that figure descriptions go beyond the captions and provide detailed explanations that help the reader understand the significance of each figure.

      We fully agree with the reviewer’s suggestions regarding these figures, and we will make the necessary revisions in the revised manuscript.

      (6) Supplementary Materials:

      Consider including more detailed supplementary materials that provide additional validation data, extended methodological descriptions, and any other information that would support the robustness of your findings.

      When we submission the revised manuscript, we will include supplementary materials such as figures or tables that may enhance the presentation of the manuscript's completeness.

      (7) Recent Literature:

      a) Incorporate more recent studies in your discussion, especially those related to HCC subtypes and the application of machine learning in oncology. This will provide a more current context for your work and help position your findings within the broader field.

      We appreciate the reviewer's suggestion. We will incorporate more recent studies into the discussion section and optimize its content.

      (8) Data and Code Availability:

      Ensure that all data, code, and materials used in your study are made available in line with eLife's policies. Provide clear links to repositories where readers can access the data and code used in your analyses.

      We have indicated the sources of the data and tools used in the analysis process within the text, and these data and tools can be accessed through the websites or literature we have cited.

      Reviewer #2 (Recommendations for the authors):

      (1) While the computational findings are robust, further experimental validation of the two subtypes, particularly the role of the MIF signaling pathway, would strengthen the biological relevance of the findings. In vitro or in vivo validation could confirm the proposed mechanisms and their influence on patient prognosis.

      We fully recognize the importance of using tumor models to analyze and validate immune activity results, and we are considering experimental research in this area in future projects.

      (2) Consider testing the model on additional independent cohorts beyond the TCGA and ICGC datasets to further demonstrate its generalizability and applicability across different patient populations.

      We are considering looking for independent external datasets in the GEO database or other databases to validate our model.

      (3) Review the manuscript for long or complex sentences, which can be broken down into shorter, more readable parts.

      In the revised manuscript, we will address any grammatical issues present in the manuscript and modify long and complex sentences that may hinder reader comprehension.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Thank you for your assessment and constructive critique, which helped us to improve the manuscript and its clarity. Upon carefully reading through the comments, we noticed that, based on the Reviewer's questions, some of our answers were already available but “hidden” as supplementary data. Thus, we changed the following two figures and text accordingly to showcase our results to the reader better:

      A) To highlight how mobile service data can indicate the spread of highly prevalent variants, we added a high-prevalence subcluster to Figure 2 (previously shown in Supplementary Figures S4 and S5) and, in exchange, moved one low-prevalence subcluster from Figure 2 back into the supplement. The figure is now showing a low and a high prevalent subcluster instead of two low prevalent subclusters.

      B) Based on Reviewer 1’s question about where samples were taken in regards to the mobility data from the community of the first identification (negative controls), we now highlight all the mobility data that was available to us in Figure 3 (as triangles) instead of just a few top mobility hits for both - mobility guided and random surveillance (serving as a negative control for the former). This way, we think, it is clearer how random sampling was also performed in some regions where mobility was coming from the community of origin (as asked by Reviewer 1) - the detailed trips and sampling are now part of the supplement for data transparency reasons. We also noticed a typo in the GPS coordinates, aligning one of the arrows falsely, which is corrected in the improved Figure 3.

      We have also included the R-Scripts used to generate all the figures in the manuscript in an OSF repository (we updated the “Data sharing statement”). We also updated Figure 1 slightly and extended the supplemental material. The remaining comments to reviewers are addressed point-by-point below.

      Reviewer 1 (Public Review):

      In "1 Exploring the Spatial Distribution of Persistent SARS-CoV-2 Mutations -Leveraging mobility data for targeted sampling" Spott et al. combine SARS-CoV-2 genomic data alongside granular mobility data to retrospectively evaluate the spread of SARS-CoV-2 alpha lineages throughout Germany and specifically Thuringia. They further prospectively identified districts with strong mobility links to the first district in which BQ.1.1 was observed to direct additional surveillance efforts to these districts. The additional surveillance effort resulted in the earlier identification of BQ.1.1 in districts with strong links to the district in which BQ.1.1 was first observed.

      Thank you for taking the time to review our work.

      (1) It seems the mobility-guided increased surveillance included only districts with significant mobility links to the origin district and did not include any "control" districts (those without strong mobility links). As such, you can only conclude that increasing sampling depth increased the rate of detection for BQ.1.1., not necessarily that doing so in a mobility-guided fashion provided an additional benefit. I absolutely understand the challenges of doing this in a real-world setting and think that the work remains valuable even with this limitation, but I would like the lack of control districts to be more explicitly discussed.

      Thank you for the critical assessment of our work. We agree that a control is essential for interpreting the results. In our case, randomized surveillance (“the gold standard”) served as a control with a total sampling depth seven times higher than the mobility-guided sampling. To better reflect the sampling in regards to the available mobility data, we revisited Figure 3 and added all the mobility information from the origin that was available to us. We also added this information to the random surveillance to provide a clearer picture to the reader. This now clearly shows how randomized surveillance covered communities with varying degrees of incoming mobility from the community of first occurrences, thereby underlining its role as a negative control. We updated the manuscript to reflect these changes and included the October 2020 and June 2021 mobility datasets in Supplementary Table S6. We agree that the sampling depth increases the detection, which is the point of guided sampling to increase sampling, specifically in areas where mobility points towards a possible spread. In regards to the negative control: Random surveillance (not Mobility-guided) in October covered 40 samples in the northwest region of Thuringia (Mobility-guided covered 19 samples). Thus, random surveillance also contained 31 out of 132 samples with a mobility link towards the first occurrence of BQ1.1 but with varying amounts of mobility (low to high).

      We added this information to the main text:

      Line 270 to 293:

      Following its first Thuringian identification, we utilized the latest available dataset of the past two years of mobile service data (October 2020 and June 2021) to investigate the residential movements for the community of first detection. Considering the highest incoming mobility from both datasets, we identified 18 communities with high (> 10,000), 34 with medium (2,001-10,000), and 82 with low (30-2,000) number of incoming one-way trips from the originating community (purple triangles in Figure 3a). As a result, we specifically requested all the available samples from the eight communities with the highest incoming mobility. Still, we were restricted to the submission of third parties over whom we had no influence. This led to the inclusion of the following eight communities with the most residential movement from the originating community: four in central and three in NW of Thuringia, one in NW-neighboring state Saxony-Anhalt. The samples requested from central Thuringia were also due to their geographic arrangement as a “belt” in central Thuringia, linking three major cities (see Supplementary Figure S1). Subsequently, we collected 19 additional samples (isolated between the 17th and 25th of October 2022; see “Guided Sampling” for October 2022, Figure 3a) besides the randomized sampling strategy. Thus, the sampling depth was increased in communities with high incoming mobility from the first origin.

      As part of the general Thuringian surveillance, we collected 132 samples for October (covering dates between the 5th and 31st) and 69 samples in November (covering dates between the 1st and 25th; see Figure 3b and c). Randomized sampling was not influenced or adjusted based on the mobility-guided sample collection. Thus, it also contains samples from communities with a mobility link towards the first occurrence of BQ.1.1, as they were part of the regular random collection (see gray triangles in Figure 3b). A complete overview of all samples is provided in Supplementary Table S5. The mobility datasets from October 2020 and June 2021 for all sampled communities are provided in Supplementary Table S6.

      Line 305 to 313:

      Among the 19 samples specifically collected based on mobile service data, we identified one additional sample of the specific Omicron sublineage BQ.1.1 in a community with high incoming mobility (n = 14, number of trips = 37,499) with a distance of approximately 16 km between both towns. Our randomly sampled routine surveillance strategy did not detect another sample during the same period. This was despite a seven times higher overall sample rate, which included 31 samples from communities with an identified incoming mobility from the community of the first occurrence (October 2022, Figure 3b). Only in the one-month follow-up were four other samples identified across Thuringia through routine surveillance (November 2022, Figure 3c).

      Line 325 to 333:

      In summary, increasing the sampling depth in the suspected regions successfully identified the specified lineage using only a fraction of the samples from the randomized sampling. Conversely, randomized surveillance, the “gold standard” acting as our negative control, did not identify additional samples with similar sampling depths in regions with no or low incoming mobility or even in high mobility regions with less sampling depth. Implementing such an approach effectively under pandemic conditions poses difficult challenges due to the fluctuating sampling sizes. Although the finding of the sample may have been coincidental, our proof of concept demonstrated how we can leverage the potential of mobile service data for targeted surveillance sampling.

      (2) Line 313: While this work has reliably shown that the spread of Alpha was slower in Thuringia, I don't think there have been sufficient analyses to conclude that this is due to the lack of transportation hubs. My understanding is that only mobility within Thuringia has been evaluated here and not between Thuringia and other parts of Germany.

      Thank you for pointing this out. We noticed that the original sentence lacked the necessary clarity. The statement in line 313 was based on the observation that Alpha first occurred in federal states with major transport hubs, such as international airports and ports, which Thuringia lacks, as demonstrated in the Microreact dataset. For clarification, we adjusted the sentence as follows:

      Line 340 and following:

      A plausible explanation for the delayed spread of the Alpha lineage in Thuringia is the lack of major transport hubs, as Alpha first occurred in federal states with such hubs. Previous studies have already highlighted the impact of major transportation hubs in the spread of Sars-CoV-2.

      (3) Line 333 (and elsewhere): I'm not convinced, based on the results presented in Figure 2, that the authors have reliably identified a sampling bias here. This is only true if you assume (as in line 235) that the variant was in these districts, but that hasn't actually been demonstrated here. While I recognize that for high-prevalence variants, there is a strong correlation between inflow and variant prevalence, low-prevalence variants by definition spread less and may genuinely be missing from some districts. To support this conclusion that they identified a bias, I'd like to see some type of statistical model that is based e.g. on the number of sequences, prevalence of a given variant in other districts, etc. Alternatively, the language can be softened ("putative sampling bias").

      Thank you for addressing this legitimate point of criticism in our interpretation. Due to the retrospective nature of the analysis and the fact that we found no additional samples of the clusters after the specified timeframes, we were limited to the samples in our dataset. Therefore, it is impossible to demonstrate if a variant was present in the relevant districts afterward. We agree that the variant’s low prevalence means they may genuinely not have spread to some districts. For clarification, we added the following statements and changed the wording accordingly:

      Additional statement in line 248:

      However, due to their low prevalence, it is also possible that these subclusters have not spread to the indicated districts.

      Adjusted wording in line 361:

      We exemplified this approach with the Alpha lineage, where mobile service data indicated a putative sampling bias and partially predicted the spread of our Thuringian subclusters.

      Recommendations:

      (1) I applaud the use of the microreact page to make the data public, however, I don't see any reference to a GitHub or Zenodo repository with the analysis code. The NextStrain code is certainly appreciated but there is presumably additional code used to identify the clusters, generate figures, etc. I generally prefer this code be made public and it is recommended by eLife.

      Thank you for your appreciation. We have now included the R-scripts in the manuscript’s OSF repository. These were used to create the figures in the manuscript and supplement utilizing the supplementary tables 1-6, which are also stored in the repository. To clearly communicate which data is provided, we changed lines 513 and 514 of the “Data sharing statement” as follows:

      Line 513 and following:

      Supplementary tables and the R-scripts used to generate all figures are also provided in the repository under https://osf.io/n5qj6/. These include the mobile service data used in this study, which is available in processed and anonymized form.

      The subcluster identification was performed manually. By adding each sample's mutation profile to the Microreact metadata file, we visually screened the phylogenetic time tree for all non-Alpha specific mutations present in at least 20 Thuringian genomes. We then applied the criteria described in the Methods section to identify the nine Alpha subclusters. For clarification, we changed line 436:

      Line 436:

      We then manually screened for mutations present in at least 20 genomes with a small phylogenetic distance and a time occurrence of at least two months.

      Reviewer 2 (Public Review):

      In the manuscript, the authors combine SARS-CoV-2 sequence data from a state in Germany and mobility data to help in understanding the movement of the virus and the potential to help decide where to focus sequencing. The global expansion in sequencing capability is a key outcome of the public health response. However, there remains uncertainty about how to maximise the insights the sequence data can give. Improved ability to predict the movement of emergent variants would be a useful public health outcome. Also knowing where to focus sequencing to maximising insights is also key. The presented case study from one State in Germany is therefore a useful addition to the literature. Nevertheless, I have a few comments.

      Thank you for taking the time to review our work.

      (1) One of the key goals of the paper is to explore whether mobile phone data can help predict the spread of lineages. However, it appears unclear whether this was actually addressed in the analyses. To do this, the authors could hold out data from a period of time, and see whether they can predict where the variants end up being found.

      Based on your feedback, we noticed that the results of the other seven clusters presented in the supplement were not appropriately highlighted, causing them to be overlooked. We indeed demonstrated that predicting viral spread based on mobility data is possible, as shown for the high-prevalence subcluster 7 (Cluster “ORF1b:A520V”, 811 samples). This was briefly mentioned in lines 240-242, but the cluster was only shown in Supplementary Figures S4 and S5. Instead, we focused more on the putative sampling bias that the mobility for low-prevalence subclusters could indicate as an interesting use case of mobility data. This addresses a concrete problem of every surveillance: successfully identifying low-prevalence targets. However, based on your feedback, we revisited Figure 2, adding the plots of the high-prevalence subcluster: “ORF1b:A520V” from Supplementary Figures S4 and S5 while moving the low-prevalence subcluster “S:N185D” from Figure 2 into the Supplementary Figures S4 and S5. Additionally, we changed line 229 to highlight this result properly.

      line 229 and following:

      The mobile service data-based prediction of a subcluster’s spread aligned well with the subsequent regional coverage of fast-spreading, highly prevalent subclusters, such as subcluster 7, which covered 811 samples (see Figure 2). In contrast, the predicted spread for the low-prevalence subclusters did not correspond well with the actual occurrence.

      (2) The abstract presents the mobility-guided sampling as a success, however, the results provide a much more mixed result. Ultimately, it's unclear what having this strategy really achieved. In a quickly moving pandemic, it is unclear what hunting for extra sequences of a specific, already identified, variant really does. I'm not sure what public health action would result, especially given the variant has already been identified.

      Thank you for your critical assessment of the presented results and their interpretation.

      Here, we aimed to provide an alternative to the standard randomized surveillance strategy. Through mobility-guided sampling, we sought to increase identification chances while necessitating fewer samples and decreasing costs, ultimately enhancing surveillance efficiency. The Omicron-lineage BQ.1.1 was the perfect example to prove this concept under actual pandemic conditions. Yet, the strategy is not limited to low-prevalence sublineages but can be applied to virtually any surveillance case. However, from your question, we recognize that this conclusion was unclear from the text. Therefore, we adapted the conclusion to better communicate the real implications of our proof of concept. Additionally, we altered line 42 in the abstract for clarification.

      However, we did not assess the benefits of surveillance itself, as the German Robert Koch Institute (RKI) already had outlined its importance for tracking different viral variants. This tracking served several reasons, like monitoring vaccine escapism, mutational progress, and assessing available antibodies for treatment.

      Line 42:

      The latter concept was successfully implemented as a proof-of-concept for a mobility-guided sampling strategy in response to the surveillance of Omicron sublineage BQ.1.1.

      Line 364 to 374:

      Another approach is actively guiding the sampling process through mobile service data, which we demonstrated with our proof of principle focusing on the Omicron-lineage BQ.1.1 as a real-life example. This approach could allow for a flexible allocation of surveillance resources, enabling adaptation to specific circumstances and increasing sampling depth in regions where a variant is anticipated. By incorporating guided sampling, much fewer resources may be needed for unguided or random sampling, thereby reducing overall surveillance costs.

      Additionally, while this approach is particularly useful for identifying low-prevalence variants, it is not limited to such variants. Still, it can provide a guided, more cost-efficient, low-sampling alternative to general randomized surveillance that can also be applied to other viruses or lineages.

      (3) Relatedly, it is unclear to me whether simply relying on spatial distance would not be an alternative simpler approach than mobile phone data. From Figure 2, it seems clear that a simple proximity matrix would work well at reconstructing viral flow. The authors could compare the correlation of spatial, spatial proximity, and CDR data.

      Thank you for pointing this out. While proximity data might appear to be an obvious choice, it has significant limitations compared to mobility data, especially in the context of our study. Proximity data assumes that spatial distance alone can accurately represent movement patterns, which would only be true in a normally distributed traffic network. Geographic features such as mountains, cities, and highways affect traffic flows, leading to variability over distance and time, which are beyond the scope of spatial proximity but efficiently captured by mobility data. In Figure 2, we presented a simplified view of the mobility data. Hence, proximity and mobility data appear to provide the same insights. However, as shown in the updated Figure 3, a detailed overview of the available mobility data reveals obvious and non-obvious spatial connections that proximity data can not capture. Incorporating such a level of detail in Figure 2 would have cluttered the figure and reduced its clarity (e.g., adding triangles for each Thuringian community).

      While a comparison between proximity data and mobility data would indeed be informative, it is beyond the scope of our current study, as our primary focus was to examine the useability of mobility data in explaining our subcluster’s spread in the first place. However, we agree it would be a valuable direction for future research. We summarized our thoughts from above in the following additional sentence:

      Line 374:

      Pre-generated mobility networks automatically tailored to each state's unique infrastructure and population dynamics could provide better-targeted sampling guidance rather than simple geographical proximity.

      Recommendations:

      (1) Line 128: What do these percentages mean - the proportion of States with at least one Alpha variant? Please clarify.

      We clarified the values at their first appearance in the text:

      Line 127:

      By March, Alpha had spread to nearly all states and districts (districts are similar to counties or provinces) in Germany (Median: 76·47 % Alpha samples among a federal states total sequenced samples compared to 36·03 % in February, excluding Thuringia) and Thuringia (Median: 85·29 %, up from 50·00 % in February).

      (2) Line 134: It's a little strange to compare the dynamics of a state with that of the whole country. For it lagged as compared to all other States?

      Line 134: “In summary, the spread of the Alpha lineage in Thuringia lagged roughly two weeks behind the general spread in the rest of Germany but showed similar proportions.”

      Thank you for the feedback. The statement refers to the comparison of Alpha-lineage proportions across federal states, excluding Thuringia, in lines 118 to 130. To simplify, we collectively referred to these federal states as “Germany” in the text. However, we recognize that this formulation is misleading, so we adjusted line 135 for clarification:

      Line 135:

      In summary, the spread of the Alpha lineage in Thuringia lagged roughly two weeks behind the general spread of other German federal states but showed similar proportions.

    1. project path by creating a new project folder:

      It is forbidden to re-use a project directory, which is especially annoying if you are testing the code. Would it be possible to allow this with a warning that you are replacing the last version?

    1. Sometime later, however, some of these patients are likely to have had a code included in their healthcare record that was associated with unintended weight loss and cancer

      Not sure I follow this sentence

    1. avec la commande \overrightarrow lorsqu’elle concerne plusieurs caractères.

      Le code présenté ci-dessous n'est pas optimal. La pointe de la flèche se superpose sur le haut de la lettre B (ceci est corrigé en chargeant amsmath, mais ça fait doublon avec le point 1.2 qui justement propose une solution en chargeant amsmath). De plus, il faudrait éviter l'emploi simultané de \vec et de \overrightarrow car le dessin de leurs pointes diffère sensiblement.

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public Review): 

      Syngnathid fishes (seahorses, pipefishes, and seadragons) present very particular and elaborated features among teleosts and a major challenge is to understand the cellular and molecular mechanisms that permitted such innovations and adaptations. The study provides a valuable new resource to investigate the morphogenetic basis of four main traits characterizing syngnathids, including the elongated snout, toothlessness, dermal armor, and male pregnancy. More particularly, the authors have focused on a late stage of pipefish organogenesis to perform single-cell RNA-sequencing (scRNA-seq) completed by in situ hybridization analyses to identify molecular pathways implicated in the formation of the different specific traits. 

      The first set of data explores the scRNA-seq atlas composed of 35,785 cells from two samples of gulf pipefish embryos that authors have been able to classify into major cell types characterizing vertebrate organogenesis, including epithelial, connective, neural, and muscle progenitors. To affirm identities and discover potential properties of clusters, authors primarily use KEGG analysis that reveals enriched genetic pathways in each cell types. While the analysis is informative and could be useful for the community, some interpretations appear superficial and data must be completed to confirm identities and properties. Notably, supplementary information should be provided to show quality control data corresponding to the final cell atlas including the UMAP showing the sample source of the cells, violin plots of gene count, UMI count, and mitochondrial fraction for the overall

      dataset and by cluster, and expression profiles on UMAP of selected markers characterizing cluster identities. 

      We thank the reviewer for these suggestions, and have added several figures and supplemental files in response. We added a supplemental UMAP showing the sample that each cell originated (S1). We also added supplemental violin plots for each sample showing the gene count, unique molecular identifier (UMI) count, mitochondrial fraction, and the doublet scores (S2). We added feature plots of zebrafish marker genes for these major cell types and marker genes identified from our dataset to the supplement (S3:S57). We also provided two supplemental files with marker genes. These changes should clarify the work that went into labeling the clusters. Although some of the cluster labels are general, we decided it would be unwise to label clusters with speculated specific annotations. We only gave specific annotations to clusters with concrete markers and/or in situ hybridization (ISH) results that cemented an annotation.  As shown in the new supplemental figures and files, certain clusters had clear, specific markers while others did not. Therefore, we used caution when we annotated clusters without distinct markers. 

      The second set of data aims to correlate the scRNA-seq analysis with in situ hybridizations (ISH) in two different pipefish (gulf and bay) species to identify and characterize markers spatially, and validate cell types and signaling pathways active in them. While the approach is rational, the authors must complete the data and optimize labeling protocols to support their statements. One major concern is the quality of ISH stainings and images; embryos show a high degree of pigmentation that could hide part of the expression profile, and only subparts and hardly detectable tissues/stainings are presented. The authors should provide clear and good-quality images of ISH labeling on whole-mount specimens, highlighting the magnification regions and all other organs/structures (positive controls) expressing the marker of interest along the axis. Moreover, ISH probes have been designed and produced on gulf pipefish genome and cDNA respectively, while ISH labeling has been performed indifferently on bay or gulf pipefish embryos and larvae. The authors should specify stages and species on figure panels and should ensure sequence alignment of the probe-targeted sequences in the two species to validate ISH stainings in the bay pipefish. Moreover, spatiotemporal gene expression being a very dynamic process during embryogenesis, interpretations based on undefined embryonic and larval stages of pipefish development and compared to 3dpf zebrafish are insufficient to hypothesize on developmental specificities of pipefish features, such as on the absence of tooth primordia that could represent a very discrete and transient cell population. The ISH analyses would require a clean and precise spatiotemporal expression comparison of markers at the level of the entire pipefish and zebrafish specimens at well-defined stages, otherwise, the arguments proposed on teleost innovations and adaptations turn out to be very speculative. 

      We are appreciative of the reviewer’s feedback. We primarily used the in situ hybridization (ISH) data as supplementary to the scRNAseq library and we are aware that further evidence is necessary to identify origins of syngnathid’s evolutionary novelties. Our goal was to provide clues for the developmental genetic basis of syngnathid derived features.  We hope that our study will inspire future investigations and are excited for the prospect that future research could include this reviewer’s ideas. 

      All of the developmental stages and species information for the embryos used were in the figure captions as well as in supplemental file 6. Because we primarily used wild caught embryos, we did not have specific ages of most embryos. Syngnathid species are challenging to culture in the laboratory, and extracting embryos requires euthanizing the father which makes it difficult to obtain enough embryos for ISH. In addition, embryos do not survive long when removed from the brood pouch prematurely. We supplemented our ISH with bay pipefish caught off the Oregon coast because these fish have large broods. Wild caught pregnant male bay pipefish were immediately euthanized, and their broods were fixed. Because we did not have their age, we classified them based on developmental markers such as presence of somites and the extent of craniofacial elongation. Although these classification methods are not ideal, they are consistent with the syngnathid literature (Sommer et al. 2012). Since the embryos used for the ISH were primarily wild caught, we had a few different developmental stages represented in our ISH data. For our tooth primordia search, we used embryos from the same brood (therefore, same stage) for these experiments.

      We understand the concern for the degree of pigmentation in the samples. We completed numerous bleach trials before embarking on the in situ hybridization experiments. After completing a bleach trial with a probe created from the gene tnmd for ISH_,_ we noticed that the bleached embryos were missing expression domains found in the unbleached embryos. We were, therefore, concerned that using bleached embryos for our experiments would result incorrect conclusions about the expression domains of these genes. We sparingly used bleaching at older stages, hatched larvae, where it was fundamentally necessary to see staining. As stated above, the primary goal of this manuscript was to generate and annotate the first scRNA-seq atlas in a syngnathid, and the ISHs were utilized to support inferred cluster annotations only through a positive identification of marker gene expression in expected tissues/cells. Therefore, the obscuring of gene expression by pigmentation would have resulted in the absence of evidence for a possible cluster annotation, not an incorrect annotation.

      For the ease of viewing the ISHs, we improved annotations and clarity. We increased the brightness and contrast of images. In the original submission, we had to lower the image resolution to make the submission file smaller. We hope that these improvements plus the true image quality improves clarity of ISH results. We also included alignments in our supplementary files of bay pipefish sequences to the Gulf pipefish probes to showcase the high degree of sequence similarity. 

      Sommer, S., Whittington, C. M., & Wilson, A. B. (2012). Standardised classification of pre-release development in male-brooding pipefish, seahorses, and seadragons (Family Syngnathidae). BMC Developmental Biology, 12, 12–15. 

      To conclude, whereas the scRNA-seq dataset in this unconventional model organism will be useful for the community, the spatiotemporal and comparative expression analyses have to be thoroughly pushed forward to support the claims. Addressing these points is absolutely necessary to validate the data and to give new insights to understand the extraordinary evolution of the Syngnathidae family. 

      We really appreciate the reviewer’s enthusiasm for syngnathid research, and hope that the additional files and explanation of the supporting role of the ISHs have adequately addressed their concerns. We share the reviewer’s enthusiasm and are excited for future work that can extend this study. 

      Reviewer #2 (Public Review):

      Summary: 

      The authors present the first single-cell atlas for syngnathid fishes, providing a resource for future evolution & development studies in this group. 

      Strengths: 

      The concept here is simple and I find the manuscript to be well written. I like the in situ hybridization of marker genes - this is really nice. I also appreciate the gene co-expression analysis to identify modules of expression. There are no explicit hypotheses tested in the manuscript, but the discovery of these cell types should have value in this organism and in the determination of morphological novelties in seahorses and their relatives.  

      We are grateful for this reviewer’s appreciation of the huge amount of work that went into this study, and we agree that the in situ hybridizations (ISHs) support the scRNAseq study as we intended. We appreciate that the reviewer thinks that this work will add value to the syngnathid field.

      Weaknesses: 

      I think there are a few computational analyses that might improve the generality of the results. 

      (1) The cell types: The authors use marker gene analysis and KEGG pathways to identify cell types. I'd suggest a tool like SAMap (https://elifesciences.org/articles/66747) which compares single-cell data sets from distinct organisms to identify 'homologous' cell types - I imagine the zebrafish developmental atlases could serve as a reasonable comparative reference. 

      We appreciate the reviewer’s request, and in fact we would have loved to integrate our dataset with zebrafish. However, syngnathid’s unique craniofacial development makes it challenging to determine the appropriate stage for comparison. While 3 days post fertilization (dpf) zebrafish data were appropriate for comparisons of certain cell types (e.g. epidermal cells), it would have been problematic for other cell types (e.g. osteoblasts) that are not easily detectable until older zebrafish stages. Therefore, determining equivalent stages between these species is difficult and contains potential for error. Future research should focus on trying to better match stages across syngnathids and zebrafish (and other fish species such as stickleback). Studies of this nature promise to uncover the role of heterochrony in the evo-devo of syngnathid’s unique snouts.

      (2) Trajectory analyses: The authors suggest that their analyses might identify progenitor cell states and perhaps related differentiated states. They might explore cytoTRACE and/or pseudotime-based trajectory analyses to more fully delineate these ideas.

      We thank the reviewer for this suggestion! We added a trajectory analysis using cytoTRACE to the manuscript. It complemented our KEGG analysis well (L172-175; S73) and has improved the manuscript.

      (3) Cell-cell communication: I think it's very difficult to identify 'tooth primordium' cell types, because cell types won't be defined by an organ in this way. For instance, dental glia will cluster with other glia, and dental mesenchyme will likely cluster with other mesenchymal cell types. So the histology and ISH is most convincing in this regard. Having said this, given the known signaling interactions in the developing tooth (and in development generally) the authors might explore cell-cell communication analysis (e.g., CellChat) to identify cell types that may be interacting. 

      We agree! It would have been a wonderful addition to the paper to include a cell-cell communication analysis. One limitation of CellChat is that it only includes mouse and human orthologs. Given concerns of reviewer #3 for mouse-syngnathid comparisons, we decided to not pursue CellChat for this study. We are looking forward to future cell communication resources that include teleost fishes.

      Reviewer #3 (Public Review): 

      Summary: 

      This study established a single-cell RNA sequencing atlas of pipefish embryos. The results obtained identified unique gene expression patterns for pipefish-specific characteristics, such as fgf22 in the tip of the palatoquadrate and Meckel's cartilage, broadly informing the genetic mechanisms underlying morphological novelty in teleost fishes. The data obtained are unique and novel, potentially important in understanding fish diversity. Thus, I would enthusiastically support this manuscript if the authors improve it to generate stronger and more convincing conclusions than the current forms. 

      Thank you, we appreciate the reviewer’s enthusiasm!

      Weaknesses: 

      Regarding the expression of sfrp1a and bmp4 dorsal to the elongating ethmoid plate and surrounding the ceratohyal: are their expression patterns spatially extended or broader compared to the pipefish ancestor? Is there a much closer species available to compare gene expression patterns with pipefish? Did the authors consider using other species closely related to pipefish for ISH? Sfrp1a and bmp4 may be expressed in the same regions of much more closely related species without face elongation. I understand that embryos of such species are not always accessible, but it is also hard to argue responsible genes for a specific phenotype by only comparing gene expression patterns between distantly related species (e.g., pipefish vs. zebrafish). Due to the same reason, I would not directly compare/argue gene expression patterns between pipefish and mice, although I should admit that mice gene expression patterns are sometimes helpful to make a hypothesis of fish evolution. Alternatively, can the authors conduct ISH in other species of pipefish? If the expression patterns of sfrp1a and bmp4 are common among fishes with face elongation, the conclusion would become more solid. If these embryos are not available, is it possible to reduce the amount of Wnt and BMP signal using Crispr/Cas, MO, or chemical inhibitor? I do think that there are several ways to test the Wnt and/or BMP hypothesis in face elongation. 

      We appreciate the reviewer’s suggestion, and their recognition for challenges within this system. In response to this comment, we completed further in situ hybridization experiments in threespine stickleback, a short snouted fish that is much more closely related to syngnathids than is zebrafish, to make comparisons with pipefish craniofacial expression patterns (S76-S79). We added ISH data for the signaling genes (fgf22, bmp4, and sfrp1a) as well as prdm16. Through adding this additional ISH results, we speculated that craniofacial expression of bmp4, sfrp1a, and prdm16 is conserved across species. However, compared to the specific ceratohyal/ethmoid staining seen in pipefish, stickleback had broad staining throughout the jaws and gills. These data suggest that pipefish have co-opted existing developmental gene networks in the development of their derived snouts. We added this interpretation to the results and discussion of the manuscript (L244-L248; L262-277; L444-470).

      Recommendations for the authors:  

      Reviewing Editor (Recommendations for the Authors)

      We hope that the eLife assessment, as well as the revisions specified here, prove helpful to you for further revisions of your manuscript. 

      Revisions considered essential: 

      (1) Marker genes and single-cell dataset analyses. While these analyses have been performed to a good standard in broad terms, there is a majority view here that cell type annotations and trajectory analyses can be improved. In particular, there is question about the choice of marker genes for the current annotation. For one it can depend on the use of single marker genes (see tnnti1 example for clusters 17 and 31). Here, we recommend incorporating results from SAMap and trajectory analysis (e.g., cytoTRACE or standard pseudotime).

      Because of the reviewer comments, we became aware that we insufficiently communicated how cell clusters were annotated. We did mention in the manuscript that we did not use single marker genes to annotate clusters, but instead we used multiple marker genes for each cluster for the annotation process. We used both marker genes derived from our dataset and marker genes identified from zebrafish resources for cluster annotation. We chose single marker genes for each cluster for visualization purposes and for in situ hybridizations. However, it is clear from the reviewers’ comments that we needed to make more clear how the annotations were performed. To make this effort more clear in our revision, we included two new supplementary files – one with Seurat derived marker genes and one with marker genes derived from our DotPlot method. We also included extensive supplementary figures highlighting different markers. Using Daniocell, we identified 6 zebrafish markers per major cell type and showed their expression patterns in our atlas with FeaturePlots. We also included feature plots of the top 6 marker genes for each cluster. We hope that the addition of these 40+ plots (S3:S57) to the supplement fully addresses these concerns. 

      We appreciated the suggestion of cytotrace from reviewer #2! We ran cytotrace on three major cell lineages (neural, muscle, and connective; S73) which complemented our KEGG analysis in suggesting an undifferentiated fate for clusters 8, 10, and 16. We chose to not run SAMap because it is a scRNA-seq library integration tool. Although we compared our lectin epidermal findings to 3 dpf zebrafish scRNA-seq data, we did not integrate the datasets out of concern that we could draw erroneous conclusions for other cell types.  Future work that explores this technical challenge may uncover the role of heterochrony in syngnathid craniofacial development. We detail these changes more fully in our responses to reviewers.

      (2) The claims regarding evolutionary novelty and/or the genes involved are considered speculative. In part, this comes from relying too heavily on comparisons against zebrafish, as opposed to more closely related species. For example, the discussion regarding C-type lectin expression in the epidermis and KEGG enrichment (lines 358 - 364) seems confusing. Another good example here is the discussion on sfrp1a (lines 258 - 261). Here, the text seems to suggest craniofacial sfrp1a expression (or specifically ethmoid expression?) is connected to the development of the elongated snout in pipefish. However, craniofacial expression of sfrp1a is also reported in the arctic charr, which the authors grouped into fishes with derived craniofacial structures. Separately, sfrp2 expression was also reported in stickleback fish, for example. Do these different discussions truly support the notion that sfrp1a expression is all that unique in pipefish, rather than that pipefish and zebrafish are only distantly related and that sfrp1a was a marker gene first, and co-opted gene second? The authors should respond to the comments in the public review related to this aspect, and include more informative comparison and discussion. 

      A much more nuanced discussion with appropriate comparisons and caveats would be strongly recommended here.  

      We appreciate this insight and used it as a motivator to complete and add select comparative ISH data to this manuscript. We added in situ hybridization experiments from stickleback fish for craniofacial development genes (sfrp_1a, prdm16, bmp4_, and fgf22; S76-S79).  After adding stickleback ISH to the manuscript, we were able to make comparisons between pipefish and stickleback patterns and draw more informed conclusions (L244-L248; L262-277; L444-470). We added additional nuance to the discussion of the head, tooth (L485-489), and male pregnancy (L358-L391) sections to address concerns of study limitations. We describe in more detail these additional data in response to reviewers.

      (3) In situ hybridization results: as already included above, there is generally weak labeling of species, developmental stages, and other markings that can provide context. The collective feeling here is that as it is currently presented, the ISH results do not go too far beyond simply illustrative purposes. To take these results further, more detailed comparison may be needed. At a minimum, far better labeling can help avoid making the wrong impression. 

      Based on the reviewers’ comments, we made changes to improve ISH clarity and add select comparative ISH findings. ISH was used to further interpretation of the scRNAseq atlas. All the developmental stages and species information for the embryos used were in the figure captions as well as in supplemental file 4. Since we primarily used wild caught embryos, we did not have specific ages of most embryos. The technical challenges of acquiring and staging Syngnathus embryos are detailed above. Because we did not have their age, we classified them based on developmental markers (such as presence of somites and the extent of craniofacial elongation). Although these classification methods are not ideal, they are consistent with the syngnathid literature (Sommer et al. 2012).  

      We followed reviewer #1’s recommendations by adding an annotated graphic of a pipefish head, aligning bay and Gulf pipefish sequences for the probe regions, expanding out our supplemental figures for ISH into a figure for each probe, and improving labeling. These changes improved the description of the ISH experiments and have increased the quality of the manuscript.

      We would have loved to complete detailed comparative studies as suggested, but doing such a complete analysis was not feasible for this study. Therefore, we completed an additional focused analysis. We followed reviewer #3’s idea and added ISHs from threespine stickleback, a short snouted fish, for 4 genes (sfrp1a, prdm16, fgf22, and bmp4). While more extensive ISHs tracking all marker genes through a variety of developmental stages in pipefish and stickleback would have provided crucial insights, we feel that it is beyond the scope of this study and would require a significant amount of additional work. We, thus, primarily interpreted the ISH results as illustrative data points in our discussion. As we state in the response to reviewer 1, the generation and annotation of the first scRNA-seq atlas in a syngnathid is the primary goal of this manuscript.  The ISHs were utilized primarily to support inferred cluster annotations if a positive identification of marker gene expression in expected tissues/cells occurred. 

      Reviewer #1 (Recommendations For The Authors): 

      While the scRNA-seq dataset offers a valuable resource for evo-devo analyses in fish and the hypotheses are of interest, critical aspects should be strengthened to support the claims of the study. 

      Concerning the scRNA-seq dataset, the major points to be addressed are listed below: 

      - Supplementary file 3 reports the single markers used to validate cluster annotations. To confirm cluster identities, more markers specific to each cluster should be highlighted and presented on the UMAP. 

      We recognize the reviewer’s concern and had in reality used numerous markers to annotate the clusters. Based upon the reviewer’s comment we decided to make this clear by creating feature plots for every cluster with the top 6 marker genes. These plots showcase gene specificity in UMAP space. We also added feature plots for zebrafish marker genes for key cell types. Through these changes and the addition of 54 supplementary figures (S3:S57), we hope that it is clear that numerous markers validated cluster identity.

      For example, as clusters 17 and 37 share the same tnnti1 marker, which other markers permit to differentiate their respective identity. 

      This is a fair point. Cluster 17 and 37 both are marked by a tnni1 ortholog.

      Different paralogous co-orthologs mark each cluster (cluster 17: LOC125989146; cluster 37: LOC125970863). In our revision to the above comment, additional (6) markers per cluster were highlighted which should remedy this concern. 

      - L146: the low number of identified cartilaginous cells (only 2% of total connective tissue cells) appears aberrant compared to bone cell number, while Figure 1 presents a welldeveloped cartilaginous skeleton with poor or no signs of ossification. Please discuss this point. 

      We also found this to be interesting and added a brief discussion on this subject to the results section (L147-L149). Single cell dissociations can have variable success for certain cell types. It is possible that the cartilaginous cells were more difficult to dissociate than the osteoblast cells.

      - L162: pax3a/b are not specific to muscle progenitors as the genes are also expressed in the neural tube and neural crest derivatives during organogenesis. Please confirm cluster 10 identity.  

      Thank you for the reminder, we added numerous feature plots that explored zebrafish (from Daniocell) and pipefish markers (identified in our dataset). Examining zebrafish satellite muscle markers (myog, pabpc4, and jam2a) shows a strong correspondence with cluster #10.

      - L198: please specify in the text the pigment cell cluster number. 

      We completed this change.

      - L199: it is not clear why considering module 38 correlated to cluster 20 while modules 2/24 appear more correlated according to the p-value color code. 

      We thank the reviewer for pointing this confusing element out! Although the t-statistic value for module 38 (3.75) is lower than the t-statistics for modules 2 and 24 (5.6 and 5.2, respectively), we chose to highlight module 38 for its ‘connectivity dependence’ score. In our connectivity test, we examined whether removing cells from a specific cell cluster reduced the connectivity of a gene network. We found that removing cluster 20 led to a decrease in module 38’s connectivity (-.13, p=0) while it led to an increase in modules 2 and 24’s connectivity (.145, p=1; .145, p=9.14; our original supplemental files 9-10). Therefore, the connectivity analysis showed that module 38’s structure was more dependent on cluster 20 than in comparison with modules 2 and 24. Although you highlighted an interesting quandary, we decided that this is tangential to the paper and did not add this discussion to the manuscript. 

      - Please describe in the text Figure 4A. 

      Completed, we thank the reviewer for catching this! 

      Concerning embryo stainings, the major points to be addressed are listed below: 

      - Figure 1: please enhance the light/contrast of figures to highlight or show the absence of alcian/alizarin staining. Mineralized structures are hardly detectable in the head and slight differences can be seen between the two samples. The developmental stage should be added. Please homogenize the scale bar format (remove the unit on panels E and, G as the information is already in the text legend). It would be useful to illustrate the data with a schematic view of the structures presented in panels B, and E, and please annotate structures in the other panels.  

      We thank the reviewer for these suggestions to improve our figure. We increased the brightness and contrast for all our images. We also added an illustration of the head with labels of elements. As discussed, we used wild caught pregnant males and, therefore, do not know the exact age of the specimens. However, we described the developmental stage based on morphological observations. Slight differences in morphology between samples is expected. We and others have noticed that

      developmental rate varies, even within the same brood pouch, for syngnathid embryos. We observed several mineralization zones including in the embryos including the upper and lower jaws, the mes(ethmoid), and the pectoral fin. We recognize the cartilage staining is more apparent than the bone staining, though increasing image brightness and contrast did improve the visibility of the mineralization front.

      - All ISH stainings and images presented in Figures 4-6/ Figures S2-3 should be revised according to comments provided in the public review. 

      We thank the reviewer for providing thorough comments, we provided an in-depth response to the public review. We made several improvements to the manuscript to address their concerns. 

      - Figure 4: Figure 4B should be described before 4C in the text or inverse panels / L222 the Meckel's cartilage is not shown on Figure 4C. The schematic views in H should be annotated and the color code described / the ISH data must be completed to correlate spatially clusters to head structures. 

      We thank the reviewer for pointing this out, we fixed the issues with this figure and added annotations to the head schematics.

      - Figure 5: typo on panels 'alician' = alcian. 

      We completed this change. 

      - Figures S2-3: data must be better presented, polished / typo in captions 'relavant'= relevant. 

      Thank you for this critique, we created new supplementary figures to enhance interpretation of the data (S59-S71). In these new figures, we included a feature plot for each gene and respective ISHs.

      - Figure S3: soat2 = no evidence of muscle marker neither by ISH presented nor in the literature. 

      We realized this staining was not clear with the previous S2/S3 figures. Our new changes in these supplementary figures based on the reviewer’s ideas made these ISH results clearer. We observed soat2 staining in the sternohyoideus muscle (panel B in S71).

      Other points: 

      - The cartilage/bone developmental state (Alcian/alizarin staining) and/or ISH for classical markers of muscle development (such as pax3/myf5) could be used to clarify the This could permit the completion of a comparative analysis between the two species and the interpretation of novel and adaptative characters.  

      We appreciate this idea! We thought deeply about a well characterized comparative analysis between pipefish and zebrafish for this study. We discussed our concerns in our public response to reviewer 2. We found that it was challenging to stage match all cell types, and were concerned that we could make erroneous conclusions. For example, our pipefish samples were still inside the male brood pouch and possessed yolk sacs. However, we found osteoblast cells in our scRNAseq atlas, and in alizarin staining. Although zebrafish literature notes that the first zebrafish bone appears at 3 dpf (Kimmel et al. 1995), osteoblasts were not recognized until 5 dpf in two scRNAseq datasets (Fabian et al. 2022; Lange et al. 2023). A 5dpf zebrafish is considered larval and has begun hunting. Therefore, we chose to not integrate our data out of concern that osteoblast development may occur at different timelines between the fishes. 

      Fabian, P., Tseng, K.-C., Thiruppathy, M., Arata, C., Chen, H.-J., Smeeton, J., Nelson, N., & Crump, J. G. (2022). Lifelong single-cell profiling of cranial neural crest diversification in zebrafish. Nature Communications 2022 13:1, 13(1), 1–13. 

      Lange, M., Granados, A., VijayKumar, S., Bragantini, J., Ancheta, S., Santhosh, S., Borja, M., Kobayashi, H., McGeever, E., Solak, A. C., Yang, B., Zhao, X., Liu, Y., Detweiler, A. M., Paul,

      S., Mekonen, H., Lao, T., Banks, R., Kim, Y.-J., … Royer, L. A. (2023). Zebrahub – Multimodal Zebrafish Developmental Atlas Reveals the State-Transition Dynamics of Late-Vertebrate Pluripotent Axial Progenitors. BioRxiv, 2023.03.06.531398. 

      Kimmel, C., Ballard, S., Kimmel, S., Ullmann, B., Schilling, T. (1995). Stages of Embryonic Development of the Zebrafish. Developmental Dynamics 203:253:-310.

      'in situs' in the text should be replaced by 'in situ experiments'.  

      We made this change (L395, L663, L666, L762).

      - Lines 562-565: information on samples should be added at the start of the result section to better apprehend the following scRNA-seq data.

      We thank the reviewer for pointing out this issue. Although we had a few sentences on the samples in the first paragraph of the result section, we understand that it was missing some critical pieces of information. Therefore, we added these additional details to the beginning of the results section (L126-L132). 

      - Lines 629-665: PCR with primers designed on gulf pipefish genome could be performed in parallel on bay and gulf cDNA libraries, and amplification products could be sequenced to analyze alignment and validate the use of gulf pipefish ISH probes in bay pipefish embryos. Probe production could also be performed using gulf primers on bay pipefish cDNA pools. 

      After the submission of this manuscript, a bay pipefish genome was prepared by our laboratory. We used this genome to align our probes, these alignments demonstrate strong sequence conservation between the species. We included these alignments in our supplemental files.

      - L663: the bleaching step must be optimized on pipefish embryos. 

      We understand this concern and had completed several bleach optimization experiments prior to publication. Although we found that bleaching improved visibility of staining, we noticed with the probe tnmd that bleached embryos did not have complete staining of tendons and ligaments. The unbleached embryos had more extensive staining than the bleached embryos. We were concerned that bleaching would lead to failures to detect expression domains (false negatives) important for our analysis. Therefore, we did not use bleaching with our in situs experiments (except with hatched fish with a high degree of pigmentation). 

      - Indicate the number of specimens analyzed for each labeling condition.  

      We thank the reviewer for noticing this issue. We added this information to the methods (L766-767).

      - Describe the fixation and pre-treatment methods previous to ISH and skeleton stainings

      We thank the reviewer for pointing out this issue, we added these descriptions (L765-766; L772-774). 

      Reviewer #3 (Recommendations For The Authors): 

      (1) If sfrp1a expression is observed also in other fish species with derived craniofacial structures, it's important to discuss this more in the Discussion. This could be a common mechanism to modify craniofacial structures, although functional tests are ultimately required (but not in this paper, for sure). Can lines 421-428 involve the statement "a prolonged period of chondrocyte differentiation" underlies craniofacial diversity?

      This is a great idea, and we added a sentence that captures this ethos (L451-452).

      (2) Lines 334-346 need to be rephrased. It's hard to understand which genes are expressed or not in pipefish and zebrafish. Did "23 endocytosis genes" show significant enrichment in zebrafish epidermis, or are they expressed in zebrafish epidermis? 

      We thank the reviewer for this comment, we re-phrased this section for clarity (L365-368).

      (3) Figure 4 is missing the "D" panel and two "E" panels. 

      We thank the reviewer for noticing this, we fixed this figure.

      (4) Line 302: "whole-mount" or "whole mount"

      We thank the reviewer for the catch!

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer 1 (Public Review):

      The authors explain that an action potential that reaches an axon terminal emits a small electrical field as it ”annihilates”. This happens even though there is no gap junction, at chemical synapses. The generated electrical field is simulated to show that it can affect a nearby, disconnected target membrane by tens of microvolts for tenths of a microsecond. Longer effects are simulated for target locations a few microns away.

      To simulate action potentials (APs), the paper does not use the standard Hodgkin-Huxley formalism because it fails to explain AP collision. Instead, it uses the Tasaki and Matsumoto (TM) model which is simplified to only model APs with three parameters and as a membrane transition between two states of resting versus excited. The authors expand the strictly binary, discrete TM method to a Relaxing Tasaki Model (RTM) that models the relaxation of the membrane potential after an AP. They find that the membrane leak can be neglected in determining AP propagation and that the capacitive currents dominate the process.

      The strength of the work is that the authors identified an important interaction between neurons that is neglected by the standard models. A weakness of the proposed approach is the assumptions that it makes. For instance, the external medium is modeled as a homogeneous conductive medium, which may be further explored to properly account for biological processes.

      The authors provide convincing evidence by performing experiments to record action potential propagation and collision properties and then developing a theoretical framework to simulate the effect of their annihilation on nearby membranes. They provide both experimental evidence and rigorous mathematical and computer simulation findings to support their claims. The work has the potential of explaining significant electrical interaction between nerve centers that are connected via a large number of parallel fibers.

      We thank the reviewer for the distinct analysis of our work and the assessment that we ’identified an important interaction between neurons that is neglected by standard models’.

      Indeed, we modeled the external (extracellular) medium as homogeneous conductive medium and, compared to real biological systems, this is a simplification. Our intention is to keep our formal model as general as possible, however, it can be extended to account for specific properties. Accessory structures at axon terminals (such as the pinceau at Purkinje cells) most likely evolved to shape ephaptic coupling. In addition, the extracellular medium is neither homogeneous nor isotropic, and to fully mimic a particular neural connection this has to be implemented in a model as well. We agree and look forward to see how specific modification of the external medium in biological systems will affect ephaptic coupling. We hope to facilitate progress on this question by providing our source code for further exploration. Using the tools that have been developed by the BRIAN community one can generate or import arbitrary complex cell morphologies (e.g. NeuroML files). Our source code adds the TM- and RTM model, which allows exploring the direct impact of extracellular properties on target neurons.

      Reviewer 2 (Public Review):

      In this study, the authors measured extracellular electrical features of colliding APs travelling in different directions down an isolated earthworm axon. They then used these features to build a model of the potential ephaptic effects of AP annihilation, i.e. the electrical signals produced by colliding/annihilating APs that may influence neighbouring tissue. The model was then applied to some different hypothetical scenarios involving synaptic connections. The conclusion was that an annihilating AP at a presynaptic terminal can ephaptically influence the voltage of a postsynaptic cell (this is, presumably, the ’electrical coupling between neurons’ of the title), and that the nature of this influence depends on the physical configuration of the synapse.

      As an experimental neuroscientist who has never used computational approaches, I am unable to comment on the rigour of the analytical approaches that form the bulk of this paper. The experimental approaches appear very well carried out, and here I just have one query - an important assumption made is that the conduction velocity of anti- and orthodromically propagating APs is identical in every preparation, but this is never empirically/statistically demonstrated.

      My major concern is with the conclusions drawn from the synaptic modelling, which, disappointingly, is never benchmarked against any synaptic data. The authors state in their Introduction that a ’quantitative physical description’ of ephaptic coupling is ’missing’, however, they do not provide such a description in this manuscript. Instead, modelled predictions are presented of possible ephaptic interactions at different types of synapses, and these are then partially and qualitatively compared to previous published results in the Discussion. To support the authors’ assertion that AP annihilation induces electrical coupling between neurons, I think they need to show that their model of ephaptic effects can quantitatively explain key features of experimental data pertaining to synaptic function. Without this, the paper contains some useful high-precision quantitative measurements of axonal AP collisions, some (I assume) high-quality modelling of these collisions, and some interesting theoretical predictions pertaining to synaptic interactions, but it does not support the highly significant implications suggested for synaptic function.

      We thank the reviewer for highlighting the potential and the limitation of our model. We demonstrated with empirical data that measured conduction velocities of anti- and orthodromic propagating APs are indeed very similar and values are provided in Appendix 3 – table 1.

      In order to address how our model ’of ephaptic effects can quantitatively explain key features of experimental data’, we used the measured modulation of AP rates in Purkinje fibers by Blot and Babour (2014) and our results are now included in the manuscript. In our model, we implemented the ephaptic coupling of the Basket cell (with an annihilating AP) and predicted the modulation of AP rate in the Purkinje cell. Our model predictions are compared to the measured modulation of AP-rates in Purkinje cells and is added as Fig. 5 to the main manuscript (line 264 to 284 ). With this example, we show that ephaptic coupling as described with our RTM model can quantitatively describe key features of experimental data. Both, the rapid inhibition and the rebound activity is described by our model with implementation of non-excitable parts at the pinceau of the Basket cell. Future, experimental research can use the provided formalism to investigate in more detail the ephaptic coupling in systems like the Mauthner cell and the Purkinje cell by exploring how accessory structures and concomitant physical parameters, e.g. the extracellular properties impact ephaptic coupling.

      Reviewer 3 (Public Review):

      This manuscript aims to exploit experimental measurements of the extracellular voltages produced by colliding action potentials to adjust a simplified model of action potential propagation that is then used to predict the extracellular fields at axon terminals. The overall rationale is that when solving the cable equation (which forms the substrate for models of action potential propagation in axons), the solution for a cable with a closed end can be obtained by a technique of superposition: a spatially reflected solution is added to that for an infinite cable and this ensures by symmetry that no axial current flows at the closed boundary. By this method, the authors calculate the expected extracellular fields for axon terminals in different situations. These fields are of potential interest because, according to the authors, their magnitude can be larger than that of a propagating action potential and may be involved in ephaptic signalling. The authors perform direct measurements of colliding action potentials, in the earthworm giant axon, to parameterise and test their model.

      Although simplified models can be useful and the trick of exploiting the collision condition is interesting, I believe there are several significant problems with the rationale, presentation, and application, such that the validity and potential utility of the approach is not established.

      Simplified model vs. Hogdkin and Huxley

      The authors employ a simplified model that incorporates a two-state membrane (in essence resting and excited states) and adds a recovery mechanism. This generates a propagating wave of excitation and key observables such as propagation speed and action potential width (in space) can be adjusted using a small number of parameters. However, even if a Hodgkin-Huxley model does contain a much larger number of parameters that may be less easy to adjust directly, the basic formalism is known to be accurate and typical modifications of the kinetic parameters are very well understood, even if no direct characterisations already exist or cannot be obtained. I am therefore unconvinced by the utility of abandoning the HodgkinHuxley version.

      In several places in the manuscript, the simplified model fits the data well whereas the Hodgkin-Huxley model deviates strongly (e.g. Fig. 3CD). This is unsatisfying because it seems unlikely that the phenomenon could not be modelled accurately using the HH formulation. If the authors really wish to assert that it is ”not suitable to predict the effects caused by AP [collision]” (p9) they need to provide a good deal more analysis to establish the mechanism of failure.

      We are not as convinced as the reviewer that, at the current state of parameter estimation, the HH model is suited for predicting ephaptic coupling after ’adjusting’ parameters. There are strong arguments against such an approach. A major function of a model is to make testable predictions rather than to just mimic a biological phenomenon. The predictive power of a model heavily depends on how reasonable model parameters can be estimated or measured. As the reviewer correctly points out in the specific comments (”... the parameters adjusted to fit the model are the membrane capacitance and intracellular resistance. These have a physical reality and could easily be measured or estimated quite accurately...”), our model contains only parameters that can be assessed experimentally, thus it has a better predictive power compared to the HH model with a multitude of parameters for which ”no direct characterisations already exist or cannot be obtained” (citing reviewer from above).

      Already the founders of the HH model were well aware of the limitations, as stated by Hodgkin and Huxley in 1952 (J Physiol 117:500–544):

      An equally satisfactory description of the voltage clamp data could no doubt have been achieved with equations of very different form ... The success of the equations is no evidence in favour of the mechanism of permeability change that we tentatively had in mind when formulating them.

      A catchy but sloppy description for the problem of overfitting with too many parameters is given by the quote of John von Neumann: With four parameters I can fit an elephant, and with five I can make him wiggle his trunk.

      We do not rule out the possibility that the HH model eventually can be used to predict ephaptic coupling. However, at the moment, parameter estimation for the HH model prevents its usability for predicting ephaptic coupling.

      (In)applicability of the superposition principle

      The reflecting boundary at the terminal is implemented using the symmetry of the collision of action potentials. However, at a closed cable there is no reflecting boundary in the extracellular space and this implied assumption is particularly inappropriate where the extracellular field is one objective of the modelling, as here. I believe this assumption is not problematic for the calculation of the intracellular voltage, because extracellular voltage gradients can usually be neglected1, but the authors need to explain how the issue was dealt with for the calculation of the extracellular fields of terminals. I assume they were calculated from the membrane currents of one-half of the collision solution, but this does not seem to be explained. It might be worth showing a spatial profile of the calculated field.

      We disagree with the reviewer’s statement ’...at a closed cable there is no reflecting boundary in the extracellular space and this implied assumption is particularly inappropriate...’. We do not imply this assumption in our model! We do not assume any symmetry or boundary condition in the extracellular space. Instead, the extracellular field is calculated for an infinite homogeneous volume conductor (Eq.

      6).

      We conduct separate calculations for (1) source membrane current, (2) resulting extracellular field, and (3) impact upon a target neuron. The boundary condition used for our calculations only refers to the axial current being zero at the axon terminal. Consequently all the internal current that enters the last compartment must leave the last compartment as membrane current and contributes to the extracellular current and field.

      The extracellular field around the axon terminal is not symmetric, as can be seen by it’s impact upon a target in Figure 4—figure supplement 1 which is also not symmetric. The symmetry of the extracellular field when APs are colliding (Cf. symmetry in Fig 1C) is merly the result of the symmetric stimulation and counterpropagation of two APs. We now are describing more specifically the bounday condition for colliding and terminating APs already in the introduction: ’A suitable boundary condition (intracellular, axial current equals zero) can be generated experimentally by a collision of two counter-propagating APs ... Within any cable model, the very same boundary condition also exists within the axon at the synaptic terminal due to the broken translation symmetry for the current loops ...’ Later, at the result section (Discharge of colliding APs), we continue with ’AP propagation is blocked when the axial current is shut down at a boundary condition, e.g. by reaching the axon terminal or by AP collision....’ and implement this condition in our calculations for the axon terminals.

      Missing demonstrations

      Central analytical results are stated rather brusquely, notably equations (3) and (4) and the relation between them. These merit an expanded explanation at the least. A better explanation of the need for the collision measurements in parameterising the models should also be provided.

      We thank the reviewer for pointing out the insufficient explanation of the equations 3 and 4. We rephrased the paragraph ’Discharge of colliding APs’ in order to clarify the origin and the function of the two equations (eq. 3: how much charge is expelled and eq. 4: the resulting extracellular potential that is used for model validation).

      Later, in the Discussion, we rephrased the paragraph where we describe the annihilation process and explain further that one term of eq. 4 sometimes is refered to ’activating function’ when using microelectrodes for stimulation.

      With respect to the ’explanation of the need for the collision measurement’, we think that the explanations we give at several locations in the manuscript are sufficient as is. We explain and elaborate in the introduction: ’We explore the behaviour of APs at boundaries ... In this study, we first focus on collisions of APs. Our experimental observation of colliding APs provides unique access to the spatial profile of the extracellular potential around APs that are blocked by collisions and thus annihilate..... Recording propagating APs allows to determine both the propagation velocity and the amplitude of the extracellular electric potentials. The collision experiment provides additional information ... In the results we recall: ’The width of the collision is a measure of the characteristic length λ⋆ of the AP and is uniquely revealed by a collision sweep experiment.’

      Adjusted parameters

      I am uncomfortable that the parameters adjusted to fit the model are the membrane capacitance and intracellular resistance. These have a physical reality and could easily be measured or estimated quite accurately. With a variation of more than 20-fold reported between the different models in Appendix 2 we can be sure that some of the models are based upon quite unrealistic physical assumptions, which in turn undermines confidence in their generality.

      The fact that the parameters of our model have physical realities is clearly in favor of our models. We rephrased the legend of the table, now explaining the procedure for the model fitting and the rational behind. Although the values of g⋆ can differ by a factor of 15 and the resulting amplitude is very different, the relationship ri cm \= vpλ⋆ is very similar, independently of the model used and this confirms our analytical framework.

      p8 - the values of both the extracellular (100 Ohm m) and intracellular resistivity (1 Ohm m) appear to be in error, especially the former.

      We have the following justification for the resistivity values we used. For the intracellular resistivity, literature values range from 0.4 - 1.5 Ohm m, and therefore we selected 1 Ohm m. See: Carpenter et al (1975) doi: 10.1085/jgp.66.2.139; Cole et al (1975) doi: 10.1085/jgp.66.2.133; Bekkers (2014) doi: 10.1007/978-1-46147320-6 35-2.

      Estimating extracellular resistivity is less straight forward, since it depends crucially on the structure around the synapse which consists of conducting saline and insulating fatty tissue. Ranges from 3 to 600 Ohm m are reported (Linden et al (2011) doi: 10.1016/j.neuron.2011.11.006) and Bakiri et al (2011) doi: 10.1113/jphysiol.2010.201376). Weiss et al (2008; doi: 10.1073/pnas.0806145105) report extracellular resistivities in the Mauthner Cap between 50-600 Ohm m in SI. Since the pinceau is structurally similar to the Mauthner cells axon cap, we argue that a value of 100 Ohm m is a reasonable choice for our calculations. Additionally, we derived a value from Blot and Barbour (doi:c10.1038/nn.3624), rephrased the paragraph in the main text and added our calculation to the supplementary material (Appendix 1).

      (In)applicability to axon terminals

      The rationale of the application of the collision formalism to axon terminals is somewhat undermined by the fact that they tend not to be excitable. There is experimental evidence for this in the Calyx of Held and the cerebellar pinceau.

      The solution found via collision is therefore not directly applicable in these cases.

      We do not agree with the reviewer’s statement that ’the solution found via collision is (therefore) not directly applicable...’. Our model is well suited for application on axon terminals that are not excitable, e.g. the pinceau of the basket cell, as the reviewer points out. We have included a calculation for this case and present the results in the new Fig. 5 (main text line 264 to 284 ).

      Comparison with experimental data

      More effort should be made to compare the modelling with the extracellular terminal fields that have been reported in the literature.

      As outlined above (see: Reponse to reviewer 2), we now compare directly the predictions of our models with measured modulation of AP rates in Purkinje fibers (Blot and Babour 2014) and our results are included in the manuscript (Fig. 5 and main text line 264 to 284). See also our response to reviewer 2 in which we address how our model ’of ephaptic effects can quantitatively explain key features of experimental data’.

      Choice of term ”annihilation”

      The term annihilation does not seem wholly appropriate to me. The dictionary definitions are something along the lines of complete destruction by an external force or mutual destruction, for example of an electron and a positron. I don’t think either applies exactly here. I suggest retaining the notion of collision which is well understood in this context.

      Experimentally, we generated a collision of APs and showed that colliding APs dissapear and do not pass each other. For this process the term annihilation is used in our and in other studies (see e.g. Berg et al (2017) doi: 10.1103/PhysRevX.7.028001; Johnson et al (2018) doi: 10.3389/fphys.2018.00779; Follmann (2015) doi: 10.1103/PhysRevE.92.032707; Shrivastava et al (2018) doi: 10.1098/rsif.2017.0803). The physical processes involved in the termination of an AP at a closed end are essentially identical to those of two colliding APs. This we think justifies using the term annihilation for those processes.

      Recommendations for the authors:

      We believe the work is of high quality and should motivate future experimental work. We are including the review comments here for your information. The main piece of feedback we are offering is that the broad claims need to be adjusted to the strength of evidence provided: as is, the manuscript provides compelling predictions but the claim that these predictions are in full agreement with data remains to be substantiated. A technical concern raised by the reviewers is that the reflecting boundary condition may need further justification. The authors may wish to respond to this issue in a rebuttal and/or adjust the manuscript as necessary.

      We substantiated our claim that our predictions are in full agreement with experimental data. We added to the manuscript a section in which we compare our models’ predictions to published, experimental data. To this aim, we extracted date from the publication of Blot and Babour (2014), we elaborated on the parameters used and run our model accordingly. We added to the Results/Model of ephaptic coupling a paragraph on ’The modulation of activity in Purkinje cells...’ (line 264), where we describe our results and we also included another figure to the main text for illustration (Fig. 5).

      We clarified the term ’boundary condition’ by rephrasing parts of the introduction and we explain the rational behind in ’Discharge of colliding APs (...AP propagation is blocked when axial current is shut down...) and in ’Model of ephaptic coupling (Within any cable model, the same boundary...). See also our response to the general comments of reviewer 3 above.

      Reviewer 1 (Recommendations For The Authors):

      Major:

      Accessing data and code requires signing in, which should not be required. The link provided also seems to be not accessible yet - could be pending review.

      The repository is now publicly availible. We did provide an access code within the letter to the editor, this code is no longer required.

      Line 74: how about morphology? Authors should clarify and emphasize in the introduction that the TM model is a spatially continuous model with partial differential equations as opposed to discrete morphological models to simulate HH equations.

      The reviewer is correct that the TM model is continous. However, so is the HH model. The difference between HH and TM is only that the TM model can be solved analytically, which yields a spatially homogeneous analytical solution. It should be noted that this analytical solution can only be valid for a homogeneous (therefore infinite) nerve. Every numerical computation, be it HH or TM, requires a finite number of discrete compartments. In our calculations, we used identical compartment models for HH, TM and RTM model. In each compartment, the differential equations are solved numerically. Since there is no fundamental difference between these models, we obstain from changing the text.

      Minor:

      Major typo: ventral nerve cord, not ”chord”. Repeated in several places.

      Thank you for indicating this typo to us.

      Line 25: inhibition, excitation, and modulation?

      We changed the line to: ... leads to modulation, e.g. excitation or inhibition

      Line 70: better term for ”length” of AP would be ”duration”. Also, the sentence could be simplified to use either ”its” or ”of the AP”

      Space and time are not interchangable. Thus, the term lenght can not be replaced by duration. We simplified the structure of the sentence as suggested.

      Fig 1A/B: it’s strange that panel B precedes panel A.

      Exchanged

      Fig 1C: don’t see the ”horizontal line”; also regarding ”The recording was at a medial position”, the caption is not clear until one reads the main text.

      We changed the legend to: ... The collision is captured in the recording line at y-position 0 mm, while orthodromic propagation is at the top and antidromic propagation is at the bottom. (D) The peak amplitude as a function of the distance to the collision. Examples of four sweeps at three positions along the nerve cord....

      Line 127: the per distance measures could be named as ”specific” conductivity, etc.

      We explicitly provide the units thereby defining the quantities unambigously.

      Line 176: typo ”ad-hoc”.

      Thank you.

      Fig 4B: should clarify that the circle in the schematic is not the soma but a synaptic bouton.

      We rephrased to ’...(B,C) when the AP is annihilating at a bouton of a neuron terminal (upper neuron in end-to-shaft geometry, similar to the Basket cell–Purkinje cell synapse)...’, and we added a label to Fig 4B.

      Reviewer 2 (Recommendations For The Authors):

      Can the authors’ model be quantitatively compared with experimental data of ephaptic interactions at synapses (e.g. the Blot & Barbour study described in the Discussion)?

      We did so as outlined in our response to the reviewer above.

      Can statistical evidence be provided that the velocities of anti- and orthodromic APs are indeed identical in the earthworm nerve recordings?

      These data and statistics are available in Appendix 2, now 3 – table 1

      Why not reorder ABCD in Fig1 so the subpanels run from left to right?

      We adjusted the labels accordingly.

    1. Reviewer #1 (Public review):

      Summary:

      This work proposes a new approach to analyse cell-count data from multiple brain regions. Collecting such data can be expensive and time-intensive, so, more often than not, the dimensionality of the data is larger than the number of samples. The authors argue that Bayesian methods are much better suited to correctly analyse such data compared to classical (frequentist) statistical methods. They define a hierarchical structure, partial pooling, in which each observation contributes to the population estimate to more accurately explain the variance in the data. They present two case studies in which their method proves more sensitive in identifying regions where there are significant differences between conditions, which otherwise would be hidden.

      Strengths:

      The model is presented clearly, and the advantages of the hierarchical structure are strongly justified. Two alternative ways are presented to account for the presence of zero counts. The first involves the use of a horseshoe prior, which is the more flexible option, while the second involves a modified Poisson likelihood, which is better suited to datasets with a large number of zero counts, perhaps due to experimental artifacts. The results show a clear advantage of the Bayesian method for both case studies.

      The code is freely available, and it does not require a high-performance cluster to execute for smaller datasets. As Bayesian statistical methods become more accessible in various scientific fields, the whole scientific community will benefit from the transition away from p-values. Hierarchical Bayesian models are an especially useful tool that can be applied to many different experimental designs. However, while conceptually intuitive, their implementation can be difficult. The authors provide a good framework with room for improvement.

      Weaknesses:

      Alternative possibilities are discussed regarding the prior and likelihood of the model. Given that the second case study inspired the introduction of the zero-inflation likelihood, it is not clear how applicable the general methodology is to various datasets. If every unique dataset requires a tailored prior or likelihood to produce the best results, the methodology will not easily replace more traditional statistical analyses that can be applied in a straightforward manner. Furthermore, the differences between the results produced by the two Bayesian models in case study 2 are not discussed. In specific regions, the models provide conflicting results (e.g., regions MH, VPMpc, RCH, SCH, etc.), which are not addressed by the authors. A third case study would have provided further evidence for the generalizability of the methodology.

    2. Reviewer #2 (Public review):

      Summary:

      This is a well-written methodology paper applying a Bayesian framework to the statistics of cell counts in brain slices. A sharpening of the bounds on measured quantities is demonstrated over existing frequentist methods and therefore the work is a contribution to the field.

      Strengths:

      As well as a mathematical description of the approach, the code used is provided in a linked repository.

      Weaknesses:

      A clearer link between the experimental data and model-structure terminology would be a benefit to the non-expert reader.

    1. Welcome back to stage two of this advanced demo lesson and again have included full instructions attached to this lesson.

      And this stage of the demo will be another one where you're entering lots of commands because you're going to automate the build of the WordPress application instance.

      So again, I would recommend opening the instructions for this demo lesson and copy and pasting the commands rather than typing them out by hand.

      Now at this point in the advanced demo series, you're going to have a leftover instance that you used to manually install WordPress in the previous stage.

      It should be called WordPress - Manual.

      So I'm going to want you to go ahead and right click on that and select terminate instance and confirm that process to remove this instance from your AWS account.

      We're going to be setting up exactly the same single instance deployment of WordPress, so both the database and the application on the same instance.

      But instead of manually building this, we're going to be using a launch template.

      So from the EC2 console, just go ahead and click on launch templates under instances.

      The first step is to create a launch template for our WordPress application.

      So go ahead and click on create launch template.

      Now launch templates are actually a new version of launch configurations that were previously used with auto scaling groups.

      Launch templates allow you to either launch instances manually using the template or they can be part of auto scaling groups.

      But what a launch template allows you to do is to specify all of the configuration in advance to launch an instance and that template can be used to launch one or many instances.

      So we're going to create a launch template which will automate the installation of WordPress, MariaDB and perform all of the configuration.

      And a launch template can actually have many different versions, which is a feature we'll use throughout this demo series as we evolve the design.

      So the first step is to name this template and we're going to call it WordPress.

      Under template version description, go ahead and enter single server DB and app.

      And then check this box which says provide guidance to help me set up a template that I can use with EC2 auto scaling.

      We're not immediately going to set it up as part of an auto scaling group, but it will help us highlight any options which are required if we want to use it with an auto scaling group.

      Now launch templates can actually be created from scratch or they can be based on a previous template version.

      If we expand source template, you're able to specify a template which this template is based on.

      But in this case, we're creating one from scratch so we won't set any of those options.

      Now just scroll down.

      So the next thing we're going to define in this launch template is the AMI that we're going to use.

      So go ahead and click on Quickstart.

      And once this has changed, we're going to use the same AMI we've been using previously.

      So I want you to go ahead and click on Amazon Linux, specifically Amazon Linux 2023.

      It should be the SSD volume type.

      It should be listed as free tier eligible and just make sure that you've got 64 bit x86 selected.

      And then scroll down further still and in the instance type drop down, we're looking for the T series of instances.

      And then you need to select the one that's free tier eligible.

      In most cases, this will be T2.micro, but select whichever is free tier eligible.

      We want to keep this advanced demo as much as possible within the free tier.

      Scroll down again and for key pair, just make sure that it says don't include in the launch template.

      Move down further still to network settings.

      Then make sure select existing security group is selected.

      And then in the security groups drop down, click in that and make sure that you select the A4L.

      VPC - SG WordPress.

      So this is the security group which will automatically be associated with any instances launched using this launch template.

      So select A4L.

      VPC - SG WordPress and there will be some randomness after this.

      That's fine.

      Just make sure you select the SG WordPress group and then we can scroll down further still.

      Now we can leave storage volumes as default.

      We won't set any resource tags.

      We won't do any configuration of network interfaces, but I will want you to expand advanced details.

      There are a few things that we need to set within advanced details.

      The first is an IAM instance profile.

      So click in this drop down and then make sure that you pick A4L.

      VPC - SG WordPress instance profile.

      Again, there will be some randomness.

      That's fine.

      What this is doing is creating the configuration which will attach an instance role to this EC2 instance.

      And this instance role is going to provide all the permissions required to interact with the parameter store and the elastic file system and anything else that this instance requires.

      And this was pre-created on your behalf using the cloud formation template.

      Next, scroll down further still and look for credit specification.

      Remember, this is the same option that you set when launching an instance manually.

      Now, as before, it's always best to set this to unlimited.

      But if you are using a brand new AWS account, then it's possible that AWS won't allow you to use this option.

      So you should probably go ahead and pick standard.

      It won't make that much of a difference.

      I'm going to pick unlimited, but I do suggest if you are using a fairly new account, you go ahead and select standard.

      So that's the configuration for the instance, the base level configuration.

      What I want you to do now though is to scroll all the way down to the bottom and there's a user data box.

      This user data allows us to specify bootstrapping information to automatically configure our EC2 instances.

      So into this user data box, I want you to paste the entire code snippet within stage 2B of this stages instructions.

      And again, they're attached to this lesson.

      The top line should be hash bang forward slash bin forward slash bash and then a space hyphen XE.

      And then if you scroll all the way down to the bottom, the last line should be RM space forward slash TMP forward slash DB dot setup.

      And now we can see we've pasted this entire user data.

      Once you've done that, go ahead and click on create launch template.

      Now that user data that you just pasted in is essentially all of the commands that you ran in the previous stage of the demo.

      Only instead of pasting them one by one, you've defined them within the user data.

      So this simply automates the process end to end.

      So to test this, go ahead and click on launch templates towards the top of the screen.

      It should show that you have a single launch template.

      It's called WordPress.

      The default version is one and the latest version is one.

      And as we move throughout this demo series, the latest version and the default version will change.

      So just keep an eye on those as we go.

      For now, though, I want you to click in the checkbox next to this launch template, click on actions and then launch instance from template.

      So this is going to launch an EC2 instance using this launch template.

      We're asked to choose a launch template and a version and define the number of instances and we can leave all of these as the defaults.

      If we just scroll down, you'll see how it's pre-populating all of these values with the configuration from the launch template.

      And that's what we want.

      Under key pair name, just select to proceed without a key pair not recommended.

      And that's the default value.

      Scroll down further still.

      Even the networking configuration is partially pre-populated.

      The only thing we need to do is specify a subnet that this instance will be launched into.

      And when we configure auto scaling groups to use this launch template, the auto scaling group will configure the subnets on our behalf.

      Because we're launching an instance directly from the launch template, we have to specify this subnet.

      So click in the subnet dropdown and then look for SN-PUB-A.

      Because we're going to deploy this WordPress instance into the public subnet in Availability Zone A.

      So select that.

      Scroll down.

      Look for the resource tag section and click on add tag.

      We're going to add a tag to the instance launched by this template.

      So into key, just type name and then for value, use WordPress-LT.

      And this will just tell us that this is an instance launched using the launch template.

      Once you've entered those, just scroll all the way down to the bottom and click launch instance.

      And this will launch an EC2 instance using this template.

      And this will automate everything that we had to do in the previous stage manually.

      So this saves us significant time and it enables us to use automation in later stages of this demo series.

      So now go ahead and click on the instance ID in this success box and this will take you to the EC2 console.

      Just give this instance a couple of minutes to finish its build process.

      Even though we're automating the process, it does still take some time to perform the installation and the configuration of all of those different components.

      So go ahead and just copy the public IP version for address of this instance into your clipboard.

      And then after you've waited a few minutes, open that in a new tab.

      If you get an error or it opens with a blank page, then you just need to give it a few minutes longer.

      But when it's finished, it should show the same WordPress installation screen.

      Once it does load the installation screen, we're going to follow the same process.

      So site title is Categorum, username is Admin.

      Enter the same password and then enter the fake test at test.com email address.

      Then click on install WordPress.

      Then click on login.

      Enter admin again.

      Enter the password.

      Click on login.

      It looks as though our automated WordPress build has worked because the dashboard has loaded.

      Click on posts.

      Delete the default post.

      Click on add new.

      For the title, the best animals again, click on the plus, select gallery, click on upload.

      And again, pick a selection of animal pictures and click on open.

      Remember, this is a new EC2 instance.

      So the one we previously terminated will have also deleted the data on that previous instance.

      Once these images have uploaded, click on publish and then publish again to upload the images to the EC2 instance and store the data within the database.

      So remember two components, the data stored in the database and the images or media stored locally on the EC2 instance.

      Click on view post to make sure that this loads correctly.

      It does.

      So that means the automatic build has worked okay.

      Everything's functioning as we expect.

      This has been an automatic build of a functional WordPress application.

      Now, the only thing that's changed from the previous stage of this advanced demo series is we've automated the build of this instance.

      It still has much the same limitations as the previous stage.

      So while we can improve the build time and we can use launch templates to support further automation, the database and application are still on the same instance.

      So neither can scale without the other.

      The database of the application is still located on that instance, meaning scale in or out operations risk this data.

      The WordPress content store is also stored locally on the instance.

      So again, any scale in or out operations risk the media that's stored locally as well as the database.

      Customers still connect directly to the instance, which means we can't perform health checks or automatically heal any failed instances.

      For this, we need a load balancer which we'll be looking at in later stages of this demo series.

      And of course, the IP address of the instance is still hard coded into the database.

      So this is something else we need to resolve as we move through the demo series.

      With that being said, though, that is everything that you needed to do in stage two of this demo series.

      So in this stage, you've automated the build of the WordPress instance using a launch template.

      Now, in stage three, you're going to migrate the data from the local database on EC2 into RDS.

      And this will move the data out of the lifecycle of the EC2 instance.

      And this makes it easier to scale.

      So in stage three, you're going to perform that migration and then update the launch template to take account of that configuration change.

      So go ahead and complete this stage of the demo lesson.

      And when you're ready, I'll look forward to you joining me in the next.

    1. Tax Reform Act of 1986. The bill lowered the top corporate tax rate from 46 percent to 34 percent and reduced the highest marginal income tax rate from 50 percent to 28 percent, while also simplifying the tax code and eliminating numerous loopholes.

      ended up benefitting upper class more than anyone else

    1. AbstractBackground Predicting phenotypes from genetic variation is foundational for fields as diverse as bioengineering and global change biology, highlighting the importance of efficient methods to predict gene functions. Linking genetic changes to phenotypic changes has been a goal of decades of experimental work, especially for some model gene families including light-sensitive opsin proteins. Opsins can be expressed in vitro to measure light absorption parameters, including λmax - the wavelength of maximum absorbance - which strongly affects organismal phenotypes like color vision. Despite extensive research on opsins, the data remain dispersed, uncompiled, and often challenging to access, thereby precluding systematic and comprehensive analyses of the intricate relationships between genotype and phenotype.Results Here, we report a newly compiled database of all heterologously expressed opsin genes with λmax phenotypes called the Visual Physiology Opsin Database (VPOD). VPOD_1.0 contains 864 unique opsin genotypes and corresponding λmax phenotypes collected across all animals from 73 separate publications. We use VPOD data and deepBreaks to show regression-based machine learning (ML) models often reliably predict λmax, account for non-additive effects of mutations on function, and identify functionally critical amino acid sites.Conclusion The ability to reliably predict functions from gene sequences alone using ML will allow robust exploration of molecular-evolutionary patterns governing phenotype, will inform functional and evolutionary connections to an organism’s ecological niche, and may be used more broadly for de-novo protein design. Together, our database, phenotype predictions, and model comparisons lay the groundwork for future research applicable to families of genes with quantifiable and comparable phenotypes.Key PointsWe introduce the Visual Physiology Opsin Database (VPOD_1.0), which includes 864 unique animal opsin genotypes and corresponding λmax phenotypes from 73 separate publications.We demonstrate that regression-based ML models can reliably predict λmax from gene sequence alone, predict non-additive effects of mutations on function, and identify functionally critical amino acid sites.We provide an approach that lays the groundwork for future robust exploration of molecular-evolutionary patterns governing phenotype, with potential broader applications to any family of genes with quantifiable and comparable phenotypes.Competing Interest StatementThe authors have declared no competing interest.

      Reviewer 3. Fabio Cortesi

      In their manuscript, Frazer et al. developed a machine-learning approach to predict the spectral sensitivity of a visual pigment based on the gene/amino acid sequence of the opsin protein. First, they created a visual opsin database based on heterologously expressed genes from the literature. They then used deepBreaks, an ML tool developed to explore genotype-phenotype associations, to run several different models and test how well ML could predict spectral sensitivity. Their main findings are that the larger the dataset for training and the more diverse (both in opsin sequences themselves and phylogenetic breadth they were derived from) the dataset, the better the predictions will become. However, there is a plateau for the number of training sequences that should be used as a minimum (~ 200), with a slight gain afterwards. As such, the suggested ML approach works well for larger datasets but needs refining for smaller datasets. There are also several drawbacks to the approach that need to be carefully considered when interpreting the results, including the fact that ML cannot accurately predict the effect on phenotype if confronted with a new mutation or a new combination of mutations not used during training.

      I found the study to be well-written and easy to follow. The results support the conclusions, and as far as I can tell, the ML and associated analysis were performed accurately. All the code and the database are readily accessible, too. It is great to see that we are at a point now where computational power has reached a level that can be used to predict gene-phenotype relationships accurately. The use of ML to study the function of (visual) opsins, i.e., spectral sensitivity, especially if additional parameters can be included, will undoubtedly be of great help to the field and welcomed by the community. As such, I have no major concerns and only a few minor comments I recommend addressing before publication.

      Minor comments

      Introduction - Please add a sentence to explain that a visual pigment consists of an opsin protein bound to a chromophore/retinal and that the two units together lead to the 'spectral sensitivity' phenotype. You cover it in the discussion, but it would be helpful for the reader to have this information upfront.

      • Please provide a reference for the following statement: '[…], and purification of heterologously expressed opsins followed by spectrophotometry [REF]'.

      • You say, 'Despite opsins being a well-studied system with an extensive backlog of published literature, previous authors expressed doubts that sequence data alone can provide reliable computational predictions of λmax phenotypes [37-40]'.

      I agree that the spectral sensitivity predictions from sequences have been criticised in the past as they were sometimes oversimplified (including some of our work). However, spectral sensitivity predictions based on computational modelling, albeit not using ML, have previously been attempted successfully several times, e.g., by Jagdish Suresh Patel and colleagues, and should be mentioned here.

      • You say that: 'The extensive data on animal opsin genotype-phenotype associations remains disorganized, decentralized, often in non-computer readable formats in older literature, and under-analyzed computationally'.

      Again, I agree that the opsin data can profit from a centralised databank like the one you created. However, there have been several previous attempts at summarizing opsin data in recent years (although not specific for heterologously expressed opsins), for vertebrates at least. For example, work by Schweikert and colleagues on fish visual opsins and recent work on frog opsins by Schott et al. These studies should be mentioned and cited appropriately here, as tremendous work went into collating the datasets in the first place.

      Results

      • The use of MWS opsin is somewhat confusing. I presume this refers to vertebrate lws genes that are mid-wavelength shifted? Why have these as a separate group? Ancestrally, there are five sub-families of visual opsin genes in vertebrates: sws1 & sws2 (SWS), rh1, rh2 and lws (MWS & LWS). The MWS range in Figure 1 should be part of a larger lws derived grouping.

      • This part reads like a discussion. It also needs a reference for the age of T1 opsins: 'The similar levels of performances between T1 and invertebrate models were unexpected, especially considering it has a training dataset five times larger than the invertebrate model. One possible explanation is that the very old age of T1 opsins [REF] might have led to a higher complexity of genotype-phenotype associations that are not yet well sampled enough to allow good predictions.'

      • These two sentences could also be weaved into the discussion rather than the results section: 'These equations do not account directly for taxonomic, genetic, or phenotypic diversity, as the number of genes is on the x-axis. Therefore, one should be cautious about applying them to predict model performance based on training data size alone.'

      • Table 1: What do MAPE and RMSE stand for, and what do those numbers mean? Maybe also include a short explanation of the acronyms and their meaning in the main body of the text.

      • This should also be mentioned in the discussion: 'Until the models are trained with more invertebrate (r-opsin) data, we do not put high confidence in the estimates of λmax.'

      • Figure 2 legend: Third line, why 'Mutant predictions …'? Aren't the predictions for all sequences?

      • Figure 3 legend: It says 547 mutant sequences here and 546 sequences in Table 1.

      • Provide a reference for the following sentence: 'The WT SWS/UVS model similarly highlighted p113, a site functionally characterized as the counterion in the retinal-opsin Schiff base interaction for all vertebrate opsins.

      • Figure 4 legend: Please provide references for the following sentence: 'Positions 181, 261 and 308 are highlighted because they are among the highest scoring sites and have all been previously characterized as functionally important to opsin phenotype and function.'

      Discussion

      • Please simplify and do not overstate the first sentence. I suggest: 'To better understand methods to connect genes and their functions, we initiated VPOD, a database of opsin genes and corresponding spectral sensitivity phenotypes.'

      • Section: The important relationship between data availability and predictive power.

      You mention that ML could not accurately predict spectral sensitivity if mutant genes were excluded, especially if smaller datasets are used. This was to be expected since ML is not per-se 'smart' but learns from patterns in the underlying dataset. However, it is a significant drawback of the approach, and I encourage you to state this more clearly. My main concern is that future users will take the ML predictions as absolute truth instead of verifying or experimentally verifying the predictions.

      • Provide a reference for the following sentence: 'One consequence of leaf-based tree construction is that due to its faster convergence/training time, it can be more prone to overfitting, as it constructs trees on a 'best-first basis' with a fixed number of n-terminal nodes.'

      • You should include some information regarding the assumptions in the Introduction and the Methods section. For example, information about what chromophore interaction was modelled should be in the methods, and the basic information about how visual pigments are formed and what different chromophore types are being used by which species should be in the Introduction: 'We also assume the photopigment uses 11-cis-retinal, as all heterologously expressed opsins in VPOD were reconstituted using this chromophore. However, this assumption is violated in some organisms because they use 13-cis-retinal as the in-vivo chromophore [71-73], which is associated with a red-shift in λmax [32,71].'

      Conclusion

      • I recommend being more cautious about the predictive power for epistatic effects since you tested it only on three samples and the predictions were severely restricted by the training dataset containing the single mutant samples.
    1. Competing Interest StatementThe authors have declared no competing interest.

      Reviewer 3. Jose Fernandez Navarro

      The authors present a novel computational method to integrate SRT datasets claiming that the method adjusts for batch effects while retaining the biological differences. The method provides the possibility to adjust the gene expression counts to be used for downstream analysis. The method was benchmarked against other methods that are available for integration of single cell and spatial transcriptomics datasets obtaining positive results. The manuscript is well structured and clear, it provides a robust motivation and the comparisons with other methods are clear and well defined. The method has the potential to make a contribution to the field, specially considering that it has been developed to be compatible with scanpy and that an open-source library has been made available on GitHub.

      Introduction:- In the following sentence: "batch effects caused by nonbiological factors such as technology differences and different experimental batches." the authors could have elaborated more and perhaps included some references.- In the following sentence: "In contrast, popular MNN-based methods such as Seurat v3[16] efficiently address batch effects in gene expression, but their limitation lies in the ability to align only two batches at a time, and they become impractical when dealing with many batches" I do not think the MNN-based term is correct in that context. Also, I do not entirely agree in the claim. One generally does not have many batches to correct for and the referred methods can perform batch correction in datasets with more than 2 batches.- In the following statement: "However, PRECAST only returns the corrected embedding space, and GraphST requires registering the spatial coordinates of samples first to ensure its integration performance; thus, their applications are limited in certain scenarios. "I'm not in total agreement, I understand PRECAST provides a module to obtain corrected gene expression counts for downstream analysis. Results:- I find the introduction to spatiAlign a bit long. It could perhaps be simplified and then leave the implementation details to the Methods section.- In the following sentence: "..spatial neighbouring graphs between cells/spots (e.g., cell‒cell adjacent matrix A), where the connective relationships of cells/spots are negatively associated with Euclidean distance." I find it a bit misleading, are the authors building the spatial graph using a fixed radius? Or euclidean distances in a manifold?- I could not find a detailed description on how the different datasets were processed with the others methods that they used to benchmark.- I believe to measure the power of the methods to retain biological differences, comparing consecutive sections of the same tissueis not enough. I would also include a comparison using sections from different individuals (same region).- In the MOB datasets comparison, by looking at the UMAP figures, the differences in performance it is not so clear in the cases of SCALED and BBKNN.In the Hippocampus dataset, I did not see information on how the clusters were annotated. It would have been nice to include the ABA figures of the same region. I found it difficult to understand the basis and interpretation of the spatial autocorrelation analysis with Moran's I. In the MOB embryo dataset, did the authors consider include a comparison with the other methods? Figures:I observed some of the supplementary figures are out of order or the labels do not match the panels, I encourage the authors to revise this. I also noticed some of the panels showing expression plots do not have a bar with the range of expression. The labels in some of the panels are hard to read and I miss some labels (f.e. the section/dataset in some of the panels).Some figures make reference to the ABA and/or the tissue morphology. For these, I could suggest including the HE images and/or IF images from the ABA. Figure 2a-c: the fonts are hard to read. Figure 2d is hard to read, perhaps the layout would be better by making it one column per method?. Figure 3g would be easier to read if the 3 datasets were arranged side by side. Figure S4, I find the clusters hard to see clearly.

      Datasets and documentation: The authors provide links to the original datasets but they do not provide access to the processed and annotated datasets, this makes it difficult to replicate the results and the examples provided in the documentation. The manuscript would benefit if the authors would provide better documentation and means to reproduce/replicate the analyses.

      Software: I was able to install the package with PyPy in a Conda environment but I had to manually install some dependencies to make it work.Major comments:- I would like to suggest the authors to revise the figures. The supplementary figures descriptions do not seem to match the content of the figures. Some of the figures are missing labels and color bars.- I would like to suggest the authors to correct for grammar and misspelling errors and perform a throughout proof reading of the manuscript for consistency.- I would like the authors to provide links to access the processed/annotated datasets.- I would like the authors to provide more details on how the datasets were processed with their method and the others method (hyperparameters, versions, etc..). This could be complemented greatly if the authors could provide notebooks or step-by-step documentation.- I would like to suggest the authors to include a comparison with true biological differences such as different phenotypes and/or genotypes.- I would like to suggest the authors to include some of other methods in the MOB (stereo-seq) comparison.- I would like to suggest the authors to check their claim that PRECAST does not provide "corrected" gene counts or that the other methods do not provide means to perform downstream analyses (DEG, trajectory inference, etc…).- I would like to suggest the authors to include normalized counts as well as raw counts in some of the comparisons (for example when performing the trajectory analysis or showing the spatial distribution of features). Minor comments:- I would like to suggest the authors to not use the term "expression enhacenment", to me the gene expression is corrected or adjusted but not enhanced.- I would like to suggest the authors to improve the documentation of the open-source package to provide more information on the different arguments and options. It would also be nice to provide documentation and/or notebooks to reproduce the analysis (or some) presented in the manuscript.- I would like to suggest the authors to improve the installation of the PyPy package since some dependencies seem to be missing.- I would like to suggest the authors to improve the layouts and font size of some of the for clarity and readability.

      Re-review: I acknowledge the efforts made by the authors to address the comments and provide answers. However, I still find the manuscript not ready for publication. These are my comments: Major:- The authors have included a new analysis (sup. figure 7) using a dataset (tumor liver) that lacks a stereotypical structure. While this is a good addition to the manuscript, I would still like to see the performance of spatiAlign in correcting technicaleffects while retaining true biological differences (f.e. disease and control). In addiction to this, a comparison using a imaging-based technology (f.e Merfish or CosMx) would make the manuscript stronger.- The authors have made an effort to provide Jupyter notebooks with code to reproduce the analyses. Unfortunately, this is uncompleted. None of the notebooks contain code to reproduce the spatiAlign analyses and only the notebook with the tumor liver dataset (sup. figure 7)includes the processing steps. For the other datasets they authors use hard-coded values. Moreover, I was unable to run some of the notebooks due to errors and missing files and/or dependencies. The authors should provide one notebook for each dataset including the processing and analysis and provide means to run the notebooks (environment files and/or docker files) in an easy way that enables reproduciblity. Ideally, these notebooks should also include the spatiAlign analysis.- I observed a strange effect in figure 2 where the UMAP manifolds of the BBKNN, Harmony and Combat are similar. I could identify the error causing this in one of the notebooks. I strongly suggest the authors to revise all the analyses and figures and to provide notebooks to reproduce these in an easy way as I mentioned before.- I find the MNN performance surprisingly bad. I wonder if this could be due to how the data was processed with this method. Did the authorstry to disable cosine normalization for the output?.

      Minor:- I think the manuscript would be stronger if the authors would include the normalized counts in the figures where they show the raw counts.- I still find inconstancies in the text (typos, grammatical and syntactical errors). The authors are still using the term enhanced (specially in figure legends).- In the MOB dataset, the authors claim that the Visium spots are 100mm but that cannot be true, visium spots are 50mm.- In figure 3 (panel f) use the same layout as figure 2 for consistency.- In figure 4 (panel g) the color bar and labels are missing.- In Sup. figure 3 (panel c) the color bar is out of place and the legend is missing.

      Re-review: The authors have made a great effort to improve the manuscript. The improvements on the documentation and open-source package will be appreciated by the community. I only have minor comments:- The grammar has improved but I could still see some errors (to cite a few):- line 96 "dimensional reduction"- line 346 "structure and MERFISH"- I still think that the authors have not been able to fully demonstrate the performance of their method to integrate datasets with true biological/phenotypical differences (f.e. disease and healthy). Supplementary figures 7 and 8 add value of the manuscript by integrating tumor cells from different patients but this is not exactly what reviewer 1 and Isuggested. I acknowledge the explanations that the authors provide in their response but I'm not in total agreementwith the statements. There are publicly available datasets that could suit this analysis. I will not request to amend such analysis to the manuscript but I could at least suggest to mention this in the manuscript as a limitation or future work.

    1. Thepowerful way in which The Bombe was able to break the Enigma code, a task pre-viously impossible to even the best human mathematicians, made Turing wonder

      This was Alan Turing who broke the Engima code with this machine

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      In this study, the authors describe the construction of an extremely large-scale anatomical model of juvenile rat somatosensory cortex (excluding the barrel region), which extends earlier iterations of these models by expanding across multiple interconnected cortical areas. The models are constructed in such a way as to maintain biological detail from a granular scale - for example, individual cell morphologies are maintained, and synaptic connectivity is founded on anatomical contacts. The authors use this model to investigate a variety of properties, from cell-type specific targeting (where the model results are compared to findings from recent large-scale electron microscopy studies) to network metrics. The model is also intended to serve as a platform and resource for the community by being a foundation for simulations of neuronal circuit activity and for additional anatomical studies that rely on the detailed knowledge of cellular identity and connectivity.

      Strengths:

      As the authors point out, the combination of scale and granularity of their model is what makes this study valuable and unique. The comparisons with recent electron microscopy findings are some of the most compelling results presented in the study, showing that certain connectivity patterns can arise directly from the anatomical configuration, while other discrepancies highlight where more selective targeting rules (perhaps based on molecular cues) are likely employed. They also describe intriguing effects of cortical thickness and curvature on circuit connectivity and characterize the magnitude of those effects on different cortical layers.

      The detailed construction of the model is drawn on a wide range of data sources (cellular and synaptic density measures, neuronal morphologies, cellular composition measures, brain geometry, etc.) that are integrated together; other data sources are used for comparison and validation. This consolidation and comparison also represent a valuable contribution to the overall understanding of the modeled system.

      We thank the reviewer for the kind comments.

      Weaknesses:

      The scale of the model, which is a primary strength, also can carry some drawbacks. In order to integrate all the diverse data sources together, many specific decisions must be made about, for example, translating findings from different species or regions to the modeled system, or deciding which aspects of the system can be assumed to be the same and which should vary. All these decisions will have effects on the predicted results from the model, which could limit the types of conclusions that can be made (both by the others and by others in the community who may wish to use the model for their own work).

      We agree that this is a downside of the principle of biophysically detailed modeling that is best addressed by continuous refinement in collaboration with the community. We would like to once again invite any interested party to participate in this process.

      As an example, while it is interesting that broad brain geometry has effects on network structure (Figure 7), it is not clear how those effects are actually manifested. I am not sure if some of the effects could be due to the way the model is constructed - perhaps there may be limited sets of morphologies that fit into columns of particular thicknesses, and those morphologies may have certain idiosyncrasies that could produce different statistics of connectivities where they are heavily used. That may be true to biology, but it may also be somewhat artifactual if, for example, the only neurons in the library that fit into that particular part of the cortex differ from the typical neurons that are actually found in that region (but may not have been part of the morphological sampling).

      We agree that the limited pool of morphological reconstructions can lead to artifactual results in the way the reviewer pointed out. To investigate that hypothesis, we added a supplementary figure (S14) where we characterize (1): to what degree the morphological composition of a columnar subvolume reflects the overall composition of the model; and (2): The level of morphological diversity in each columnar subvolume. We discuss the results at the end of section 2.6. Briefly, while we cannot fully rule out the possibility of an artificial result, we found a high and virtually uniform level of morphological diversity in all columns and layers. This makes it unlikely that individual idiosyncratic morphologies strongly affect the local connectivity. However, we acknowledge that the minimum level of morphological diversity required is unknown. We believe that at this stage all we can do is characterize this and leave final interpretation to the reader.

      I also wonder how much the assumption that the layers have the same relative thicknesses everywhere in the cortex affects these findings, since layer thicknesses do in fact vary across the cortex.

      We agree that layer thickness variation would affect circuit properties. Variability of layer thickness can be split into two components: variability stemming from differences in total thickness, which our model covers, and variability of relative, i.e., normalized layer thickness, which we miss. In this region of cortex, though, data on the relative thickness of cortical layers is sparse. The Waxholm Atlas does not distinguish somatosensory cortical layers in its labels [Kleven et al, 2023]. Yusufoğulları (2015) compares layer thicknesses of rat hindlimb and barrel field regions. After normalization against total thickness, the relative difference increased towards the superficial layers from 0 in L6 to 33% in L1. Variability of normalized thicknesses within developed rat barrel cortex, based on layer boundaries reported in Narayanan et al. (2017) vary by 2% to 5% over approximately 2 mm. One major effect of such variability would be to scale the number of neurons in a given layer locally by the corresponding factors. For comparison, the resulting variability in neuron counts due to differences in conicality (Fig. 7D1) was around +-25%. A further effect of variable relative layer thickness would be its impact on the selection of suitable morphologies to be placed in the volume.

      In summary, adjustment of layer thickness is a refinement which should be done in future versions of the model, once more data is available. The discussion section has been updated to acknowledge this limitation. However, as outlined at the beginning of this point-by-point reply, we will not conduct such updates to the model in the context of this manuscript, as it describes the version of the model used for a number of follow-up studies.

      In addition, the complexity of the model means that some complicated analyses and decisions are only presented in this manuscript with perhaps a single panel and not much textual explanation. I find, for example, that the panels of Figure S2 seem to abstract or simplify many details to the point where I am not clear about what they are actually illustrating - how does Figure S2D represent the results of "the process illustrated in B"? Why are there abrupt changes in connectivity at region borders (shown as discontinuous colors), when dendrites and axons span those borders and so would imply interconnectivity across the borders? What do the histograms in E1 and E2 portray, and how are they related to each other?

      We apologize for the confusion. We have updated the figure caption of Figure S2 to better explain its contents.

      Overall, the model presented in this study represents an enormous amount of work and stands as a unique resource for the community, but also is made somewhat unwieldy for the community to employ due to the weight of its manifold specific construction decisions, size, and complexity.

      Reviewer #2 (Public Review):

      Summary:

      The authors build a colossal anatomical model of juvenile rat non-barrel primary somatosensory cortex, including inputs from the thalamus. This enhances past models by incorporating information on the shape of the cortex and estimated densities of various types of excitatory and inhibitory neurons across layers. This is intended to enable an analysis of the micro- and mesoscopic organisation of cortical connectivity and to be a base anatomical model for large-scale simulations of physiology.

      Strengths:

      • The authors incorporate many diverse data sources on morphology and connectivity.

      • This paper takes on the challenging task of linking micro- and mesoscale connectivity.

      • By building in the shape of the cortex, the authors were able to link cortical geometry to connectivity. In particular, they make an unexpected prediction that cortical conicality affects the modularity of local connectivity, which should be testable.

      • The author's analysis of the model led to the interesting prediction that layer 5 neurons connect local modules, which may be testable in the future, and provide a basis to link from detailed anatomy to functional computations.

      • The visualisation of the anatomy in various forms is excellent.

      • A subnetwork of the model is openly shared (but see question below).

      We thank the reviewer for their kind comments.

      Weaknesses:

      • Why was non-barrel S1 of the juvenile rat cortex selected as the target for this huge modelling effort? This is not explained.

      We have added an explanation of this decision to the third paragraph of the introduction.

      • There is no effort to determine how specific or generalisable the findings here are to other parts of the cortex. Although there is a link to physiological modelling in another paper, there is no clear pathway to go from this type of model to understand how the specific function of the modelled areas may emerge here (and not in other cortical areas).

      With respect to generality against specific findings, our philosophy is as follows: Despite the fact that most of our source data comes from juvenile rat somatosensory cortex, we also had to generalize many data sources across organisms, ages or regions. Hence, in this iteration we focused on investigating the general features of the (multi-region) mammalian cortex, e.g., high-order motifs, connected by L5 neurons across subregions or the effect of curvature on the connectivity. In the future, more specific data sources can be used to build diverging versions of the model, e.g. one for adult vs. juvenile rat. They can then be used to contrast the ages and focus on more specific findings. We already defined a number of structural metrics that can be used to contrast more specific versions of the model quantitatively.

      We now clarify this pathway to understanding more specific function in the last paragraph of the discussion.

      • In a few places the manuscript could be improved by being more specific in the language, for example:

      - "our anatomy-based approach has been shown to be powerful", I would prefer instead to read about specific contributions of past papers to the field, and how this builds on them.

      - similarly: "ensuring that the total number of synapses in a region-to-region pathway matches biology." Biology here is a loose term and implies too much confidence in the matching to some ground truth. Please instead describe the source of the data, including the type of experiment.

      We have removed or rewritten the mentioned parts. We now clarify that we work based on biological estimates from experiments and cite the experiment sources. We also provide brief descriptions of the types of data and how they were derived.

      • Some of the decisions seem a little ad-hoc, and the means to assess those decisions are not always available to the reader e.g.

      - pg. 10. "Based on these results, we decided that the local connectome sufficed to model connectivity within a region.". What is the basis for this decision? Can it be formalised?

      - "In the remaining layers the results of the objective classification were used to validate the class assignments of individual pyramidal cells. We found the objective classification to match the expert classification closely (i.e., for 80-90% of the morphologies). Consequently, we considered the expert classification to be sufficiently accurate to build the model." The description of the validation is a little informal. How many experts were there? What are their initials? Was inter-rater or intra-rater reliability assessed? What are these numbers? The match with Kanari's classification accuracy should be reported exactly. There are clearly experts among the author list, but we are all fallible without good controls in place, and they should be more explicit about those controls here, in my opinion.

      - "Morphology selection was then performed as previously (Markram et al., 2015), that is, a morphology was selected randomly from the top 10% scorers for a given position." A lot of the decisions seem a little ad-hoc, without justification other than this group had previously done the same thing. For example, why 10% here? Shouldn't this be based on selecting from all of the reasonable morphologies?

      We have clarified that the density of local connectivity is verified against the validation datasets by comparing the diagonals in Figure 4B, in addition to the quantification of Figure 4C.

      For the classification, we have now published a detailed preprint describing the objective confirmation of expert classification by a variety of methods (see Kanari et al. 2024 https://www.biorxiv.org/content/10.1101/2024.09.13.612635v1). We cannot include the full methodology in the current paper, due to its large extent. For the benefit of the reader, we have included the appropriate citation and extended the short description of the methodology. As described in this paper, the classification accuracy varies per layer, cell type, etc. We have now described in more details these results, that can be accessed in details in out preprint.

      • I would like to know if one of the key results relating to modularity and cortical geometry can be further explored. In particular, there seem to be sharp changes in the data at the end of the modelled cortical regions, which need to be explored or explained further.

      We now explore these results further in supplementary figure S15, which we discuss in the results Section 2.6.

      • The shape of the juvenile cortex - a key novelty of this work - was based on merely a scalar reduction of the adult cortex. This is very surprising, and surely an oversimplification. Huge efforts have gone into modelling the complex nonlinear development of the cortex, by teams including the developing Human Connectome Project. For such a fundamental aspect of this work, why isn't it possible to reconstruct the shape of this relatively small part of the juvenile rat cortex?

      We agree that a more complex approach should be used in the future. However, as outlined at the beginning of this point-by-point reply, we will not conduct such updates to the model in the context of this manuscript, as it describes the version of the model used for a number of follow-up studies.

      • The same relative laminar depths are used for all subregions. This will have a large impact on the model. However, relative laminar depths can change drastically across the cortex (see e.g. many papers by Palomero-Gallagher, Zilles, and colleagues). The authors should incorporate the real laminar depths, or, failing that, show evidence to show that the laminar depth differences across the subregions included in the model are negligible.

      This point has also been raised by reviewer #1 above. For convenience, we repeat our reply below.

      We agree that layer thickness variation would affect circuit properties. Variability of layer thickness can be split into two components: variability stemming from differences in total thickness, which our model covers, and variability of relative, i.e., normalized layer thickness, which we miss. In this region of cortex, though, data on the relative thickness of cortical layers is sparse. The Waxholm Atlas does not distinguish somatosensory cortical layers in its labels [Kleven et al, 2023]. Yusufoğulları (2015) compares layer thicknesses of rat hindlimb and barrel field regions. After normalization against total thickness, the relative difference increased towards the superficial layers from 0 in L6 to 33% in L1. Variability of normalized thicknesses within developed rat barrel cortex, based on layer boundaries reported in Narayanan et al. (2017) vary by 2% to 5% over approximately 2 mm. One major effect of such variability would be to scale the number of neurons in a given layer locally by the corresponding factors. For comparison, the resulting variability in neuron counts due to differences in conicality (Fig. 7D1) was around +-25%. A further effect of variable relative layer thickness would be its impact on the selection of suitable morphologies to be placed in the volume.

      In summary, adjustment of layer thickness is a refinement which should be done in future versions of the model, once more data is available. The discussion section has been updated to acknowledge this limitation. However, as outlined at the beginning of this point-by-point reply, we will not conduct such updates to the model in the context of this manuscript, as it describes the version of the model used for a number of follow-up studies.

      • The authors perform an affine mapping between mouse and rat cortex. This is again surprising. In human imaging, affine mappings are insufficient to map between two individual brains of the same species and nonlinear transformations are instead used. That an affine transformation should be considered sufficient to map between two different species is then very surprising. For some models, this may be fine, but there is a supposed emphasis here on biological precision in terms of anatomical location.

      We agree that this is a weakness that we will address in future revisions of the model.

      • One of the most interesting conclusions, that the connectivity pattern observed is in part due to cooperative synapse formation, is based on analyses that are unfortunately not shown.

      We originally decided not to show this part as we underestimated the interest in this particular result. We have now included the result in supplementary figure S10 and discuss the figure in the results.

      • Open code:

      - Why is only a subvolume available to the community?

      We have now made the entire model available under doi.org/10.7910/DVN/HISHXN. The Data and Code availability section has been updated to clarify this.

      - Live nature of the model. This is such a colossal model, and effort, that I worry that it may be quite difficult to update in light of new data. For example, how much person and computer time would it take to update the model to account for different layer sizes across subregions? Or to more precisely account for the shape of the juvenile rat cortex?

      To provide more information to people interested in participating in model refinements, we have added a new Figure 9. We discuss potential opportunities for refinement at the end of the discussion section.

      Reviewer #3 (Public Review):

      This manuscript reports a detailed model of the rat non-barrel somatosensory cortex, consisting of 4.2 million morphologically and biophysically detailed neuron models, arranged in space and connected according to highly sophisticated rules informed by diverse experimental data. Due to its breadth and sophistication, the model will undoubtedly be of interest to the community, and the reporting of anatomical details of modeling in this paper is important for understanding all the assumptions and procedures involved in constructing the model. While a useful contribution to this field, the model and the manuscript could be improved by employing data more directly and comparing simple features of the model's connectivity - in particular, connection probabilities - with relevant experimental data.

      The manuscript is well-written overall but contains a substantial number of confusing or unclear statements, and some important information is not provided.

      Below, major concerns are listed, followed by more specific but still important issues.

      Major issues

      (1) Cortical connectivity.

      Section 2.3, "Local, mid-range and extrinsic connectivity modeled separately", and Figure 4: I am confused about what is done here and why. The authors have target data for connectivity (Figure 4B1). But then they use an apposition-based algorithm that results in connectivity that is quite different from the data (Figure 4B2, C). They then use a correction based on the data (Figure 4E) to arrive at a more realistic connectivity. Why not set the connectivity based on the data right away then? That would seem like a more straightforward approach.

      We have completely re-written our description and discussion of connectivity in the model. We now more explicitly motivate our connectivity modeling choices in the first paragraph of section 2.3 of the results and in the second paragraph of the discussion.

      The same comment applies to Section 2.4., "Specificity of axonal targeting": the distributions of synapses on different types of target cell compartments were not well captured by the original model based on axon-dendrite overlap and pruning, so the authors introduced further pruning to match data specificity. While details of this process and what worked and what didn't may be interesting to some, overall it is not surprising, as it has been well known that cell types exhibit connectivity that is much more specific than "Peters rule" or its simple variations. The question is, since one has the data, why not use the data in the first place to set up the connectivity, instead of using the convoluted process of employing axon-dendrite overlap followed by multiple corrections?

      We would like to point out that we are not employing “Peters rule”, we now make this explicit in the revision in the first paragraph of section 2.3 of the results. Furthermore, we would argue that the match to the Motta et al. data indicates that our approach is more than just a “simple variation”. Finally, we believe that there is important insight in: 1. The specific ways in which the algorithm had to be changed to match the Schneider-Mizell data, e.g. that the connectivity of SST positive neurons did not have to be adapted at all. 2. That the specificity of the other two types could still be matched by a selection of a subset of axonal appositions (i.e., of potential synapses).

      Most importantly, what is missing from the whole paper is the characterization of connection probabilities, at least for the local circuit within one area. Such connection probabilities can be obtained from the data that the authors already use here, such as the MICRONS dataset. Another good source of such data is Campagnola et al., Science, 2022. Both datasets are for mouse V1, but they provide a comprehensive characterization across all cortical layers, thus offering a good benchmark for comparison of the model with the data. It would be important for the authors to show how connection probabilities realized in their model for different cell types compared to these data.

      We now report connection probabilities in the reworked figure 4 and compare them to reported connection probabilities from many different sources and labs in supplementary figure S8. We prefer a comparison to a wide range of sources to relying on a single report.

      (2) Section 2.5, "Structure of thalamic inputs" and Figure 6.

      The text in section 2.5 should provide more details on what was done - namely, that the thalamic axons were generated based on the axon density profiles and then synapses were established based on their overall with cortical dendrites. Figure S10 where the target axon densities from data and the model axon densities are compared is not even mentioned here. Now, Figure S10 only shows that the axon densities were generated in a way that matches the data reasonably well. However, how can we know that it results in connectivity that agrees with data? Are there data sources that can be used for that purpose? For example, the authors show that in their model "the peaks of the mean number of thalamic inputs per neuron occur at lower depths than the peaks of the synaptic density". Is this prediction of the model consistent with any available data?

      Most importantly, the authors should show how the different cell types in their model are targeted by the thalamic inputs in each layer. Experimental studies have been done suggesting specificity in targeting of interneuron types by thalamic axons, such as PV cells being targeted strongly whereas SST and VIP cells being targeted less.

      We have updated the Results section to provide context for the thalamic axon placement, and referred the reader to the methods for more detail. A reference to Figure S10 has now been added to this section as well.

      As for validations of the structure of the thalamo-cortical inputs: We found that the existing literature on the topic, such as Cruikshank et al., 2007, 2010 and more recently Sermet et al., 2019, is predominately on the physiological strengths of the pathways. We acknowledge that the authors provide compelling arguments that their findings are likely partially due to differences in the anatomical innervation strengths. On the other hand, Sporns, 2013 cautioned against mixing up structural and functional connectivity. Overall, we believe that it is simply cleaner to perform this validation in the accompanying manuscript (“Part II: Physiology and Experimentation”), using the full physiological model. Note that we have actually performed that validation in the manuscript (see preprint under the following doi: 10.1101/2023.05.17.541168, Figure 3H1).

      Note that a higher physiological strength onto PV+ neurons is observed.

      (3) "We have therefore made not only the model but also most of our tool chain openly available to the public (Figure 1; step 7)."

      In fact it is not the whole model that is made publicly available, but only about 5% of it (211,000 out of 4,200,000 neurons). Also, why is "most" of the tool chain made openly available, and not the whole tool chain?

      We have now made the entire model available under doi.org/10.7910/DVN/HISHXN. This has also been added to the Key resource table.

      With regard to the tool chain, everything is on our public github (https://github.com/BlueBrain/) except for the algorithm for detecting axonal appositions. For that tool there are currently unresolved potential copyright issues with former collaboration partners. We are working to resolve them.

      Other issues

      "At each soma location, a reconstruction of the corresponding m-type was chosen based on the size and shape of its dendritic and axonal trees (Figure S6). Additionally, it was rotated to according to the orientation towards the cortical surface at that point."

      After this procedure, were cells additionally rotated around the white matter-pia axis? If yes, then how much and randomly or not? If not, then why not? Such rotations would seem important because otherwise additional order potentially not present in the real cortex is introduced in the model affecting connectivity and possibly also in vivo physiology (such as the dynamics of the extracellular electric field).

      They are indeed additionally randomly rotated. We have clarified this in the revision.

      The term "new in vivo reconstructions" for the 58 neurons used in this paper in addition to "in vitro reconstructions" is a misnomer. It is not straightforward to see where the procedure is described, but then one finds that the part of Methods that describes experimental manipulations is mostly about that (so, a clearer pointer to that part of Methods could be useful). However, the description in Methods makes it clear that it is only labeling that is done in vivo; the microscopy and reconstruction are done subsequently in vitro. I would recommend changing the terminology here, as it is confusing. Also, can the authors show reconstructions of these neurons in the supplementary figures? Is the reconstruction shown in Figure 4A representative?

      The term is used because the staining is done in vivo. To the best of our knowledge, the reconstruction process cannot be performed in vivo. However, to avoid any confusion we modified the text to clarify this distinction to in-vivo stained.

      With respect to the reconstruction in Figure 4: The intent of the panel is to demonstrate the concept of targeted long-range axons that our morphologies are missing, necessitating the use of a second algorithm for longer-range connectivity. As such, it is not one of the reconstructions we used, but one of Janelia MouseLight. While we mentioned MouseLight in the figure caption, we formulated it in a way that could be misunderstood to mean that we merely used the MouseLight browser to render one of our morphologies. We apologize for the confusion, and we have fixed the figure caption.

      In this revision we have added exemplars of representative morphology reconstructions (in slice stained and in vivo stained) in a new supplementary figure, as requested (Figure S5). It is referenced in the last paragraph of section 2.1.

      In the Discussion, "This was taken into account during the modeling of the anatomical composition, e.g. by using three-dimensional, layer-specific neuron density profiles that match biological measurements, and by ensuring the biologically correct orientation of model neurons with respect to the orientation towards the cortical surface. As local connectivity was derived from axo-dendritic appositions in the anatomical model, it was strongly affected by these aspects.

      However, this approach alone was insufficient at the large spatial scale of the model, as it was limited to connections at distances below 1000μm."

      As mentioned above, it is not clear that this approach was sufficient for local connectivity either. It would be great if the authors showed a systematic comparison of local connection probabilities between different cell types in their model with experimental data and commented here in the Discussion about how well the model agrees with the data.

      As mentioned in the reply to a previous comment, we now report connection probabilities.

      In the Discussion: "The combined connectome therefore captures important correlations at that level, such as slender-tufted layer 5 PCs sending strong non-local cortico-cortical connections, but thick-tufted layer 5 PCs not." (Also the corresponding findings in Results.)

      If I understand this statement correctly, it may not agree with biological data. See analysis from MICRONS dataset in Bodor et al., https://www.biorxiv.org/content/10.1101/2023.10.18.562531v1.

      Our statement was indeed misleading and formulated too strongly. While thick-tufted pyramidal cells do form long-range intra-cortical connections, the structural strength of these pathways is weaker than for slender-tufted PCs, which are associated with the IT (intra-telencephalic) projection type. We have made this clear in the revision.

      Table 2 is confusing. What do pluses and minuses mean? What does it mean that some entries have two pluses? This table is not mentioned anywhere else in the text. If pluses mean some meaningful predictions of the model, then their distribution in the table seems quite liberal and arbitrary. It is not clear to me that the model makes that many predictions, especially for type-specificity and plasticity. Also, why is the hippocampus mentioned in this table? I don't see anything about the hippocampus anywhere else in the paper.

      We have clarified the description of the table in its caption and removed references to hippocampus, which were left from an earlier draft of the paper.

      In the Discussion, "Thus, we made the tools to improve our model also openly available (see Data and Code availability section)."

      As mentioned before, the authors themselves write that they made "most of our tool chain openly available to the public", but not all of it.

      With regard to the tool chain, everything is on our public github (https://github.com/BlueBrain/) except for the algorithm for detecting axonal appositions. For that tool there are currently unresolved potential copyright issues with former collaboration partners. We are working to resolve them.

      Table S2 has multiple question marks. It is not clear whether the "predictions" listed in that table are truly well-thought-out and/or whether experimental confirmations are real.

      Some of the citations in that table were broken due to technical difficulties with the citation manager used. We apologize and have fixed this in the revision.

      Introduction: It would be quite appropriate to cite here Einevoll et al., Neuron, 2019 ("The Scientific Case for Brain Simulations").

      We now reference this important work.

      Recommendations for the authors:

      Reviewing Editor's note:

      Consultation with the reviewers highlighted three main issues: the integration of connection probability profiles, non-uniform cortical thickness, and the overall organization of the manuscript.

      Reviewer #1 (Recommendations For The Authors):

      Apart from the points discussed in the public review, my main concern is that the manuscript itself is not as tightly constructed as it should be, to the detriment of the reader's ability to understand the model itself and the conclusions from the presented analyses.

      There are places where the text references seemingly incorrect figure panels or refers to panels that don't exist:

      - Section 2.2, first paragraph - refers to Figure 2D, E but those panels do not exist in Figure 2.

      - Section 2.2, second paragraph - refers to Figure 3D3 - perhaps it should be 3B3?

      - Section 2.8, first paragraph - has no figure references but seems like it should be referring to parts of Figure 8 (perhaps Figure 8B1 specifically?)

      - Is the reference to Figure S11A on page 16 supposed to be to S12A?

      In other places, figure labels and descriptions are not clear, and terminology is not always well-defined or explained.

      - Figure 8 and the associated section 2.8 are very difficult to draw conclusions from as presented - several of the terms used are opaque and not clearly defined in the text or legends. I could not easily infer how the normalization works for the "normalized node participation per layer", or what "position in simplex" means for "unique neurons in core", and what their "relative counts" are relative to.

      - Are "targets" in Figure S12A the same as "sinks"? If so, it would be better to use a single term consistently throughout.

      - Figure S12 - figures in part B do not have enough labels to interpret - what is the y-axis of the "rich-club analysis" graph? Also, the figures in part B bottom are labeled "long-range" rather than "mid-range" connections.

      In general, I found the use of both letters and numbers for figure panels (e.g. Figure 7E1) more confusing than helpful - it didn't seem like panels with the same letter were visually grouped consistently, and it sometimes made it more difficult to follow the flow of a figure. I would recommend using only letters in nearly every case here.

      We thank the reviewer for directing our attention to these issues. We have fixed them in the revision. However, we have decided to keep our original panel numbering scheme. Panels with the same letter are meant to be conceptually grouped as they address related or similar measures.

      Other minor points:

      - Section 2.4 - paragraph 2 - sentence 5 "inhbititory" -> "inhibitory".

      - Figure 5B figure legend - references Schneider-Mizell et al. 2023 but probably should be Motta et al. 2019?

      - Figure 5C - figure key "expcected" -> "expected".

      - The lower part of Figure 7C looks like it belongs to panel D2 instead of panel C due to relative spacing.

      We once again thank the reviewer, and we have fixed the listed issues.

      Reviewer #2 (Recommendations For The Authors):

      (1) Abstract:

      - Is it really 'integrating whole brain-scale data'? This seems a bit misleading.

      - "We delineated the limits of determining connectivity from anatomy" - here I think you mean determining connectivity from morphology, or dendrite/axon appositions. Electron microscopy is still anatomy and presumably would be much closer to function.

      We originally used the term “anatomy” as connectivity depends on the correct placement of neurons in addition to their morphology. However, as the reviewer points out, this term is misleading as it would encompass electron microscopy, which can go beyond what we do with the model. We have updated the text to read “morphology and placement”.

      (2) Introduction:

      "Investigating the multi-scale interactions that shape perception requires a model of multiple cortical subregions with inter-region connectivity, but it also requires the subcellular resolution provided by a morphologically detailed model." - This statement, as written, is not true in my opinion. You can argue for the value of morphologically-detailed neuron models to the study of perception, but they are not required for the investigation of perception.

      We have updated the text to be clearer: subcellular resolution is only required for certain aspects that are related to perception.

      (3) Results:

      - Pg. 9/10. There are three sentences in a row that are of the style: "ensuring that the total number of synapses in a region-to-region pathway matches biology." Biology here is a loose term and implies too much confidence in the matching to some ground truth. Please instead describe the source of the data, including the type of experiment here already. o Pg. 10. On the first read, I found it quite hard to follow what exactly was done in Figure 4.

      What are the target values adapted from Reimann et al., 2019, for example?

      - Pg. 10. "Based on these results, we decided that the local connectome sufficed to model connectivity within a region.". What is the basis for this decision? Can it be formalised? o Pg. 16, Figure 7 B-C. The apparent effect of geometry on modularity is potentially very interesting. However, are the sharp drop-offs in values for modularity (but also conicality and height) true, or are some artefacts due to columns at the edges of the sampled area?

      We have discussed these points above in the general comments and strengths and weaknesses.

      - Pg. 18. Simplicial cores define central subnetworks, tied together by mid-range connections. This work, in particular leading to the conclusion of the layer 5 highway hubs, stands out as being a successful attempt to simplify the highly detailed model to a degree that it generates useable new understanding.

      We thank the reviewer for the kind comment.

      (4) Figures:

      Figure 2: The caption doesn't seem to match the Figure (e.g. there are no brain regions depicted in A). o Figure 4f. This is a key panel, but is squished into a small corner of Figure 4, and therefore hard-to-read.

      We have fixed this in the revision.

      Reviewer #3 (Recommendations For The Authors):

      In Major comments, point (1) discusses the issue of connectivity known from data. For all the aspects of connectivity mentioned there, I would recommend the authors re-build their model using the connectivity data directly. It would be interesting to test whether a model constructed in such a way would have any difference in simulated neural activity relative to the model they have constructed.

      This is indeed a very interesting avenue of research. However, we believe that it is best conducted in separate manuscripts. First, in Pokorny et al., 2024 (https://doi.org/10.1101/2024.05.24.593860) we conduct this investigation, comparing the emerging activity in the model to the one for simpler connectivity models. Additionally, in Egas-Santander et al., 2024 (https://www.biorxiv.org/content/10.1101/2024.03.15.585196v3) we found that simpler connectomes lead to less reliable spiking activity globally. Finally, in the accompanying manuscript (https://www.biorxiv.org/content/10.1101/2023.05.17.541168v5) we compare activity with and without the targeting specificity of Schneider-Mizell et al.

      In Major comments, point (2) discusses thalamic inputs. I would recommend the authors to address the issues mentioned there.

      We have replied to those comments above.

      In addition, panels F and G of Figure 6 are mentioned in the caption but are not shown in the figure. In panel B, the choice of visualization is strange. It would make sense to show box plots for all the data instead of bars for mean values and points for randomly selected 50 cells. Panels E1 and E2 lack units.

      We have removed mentions of panels F and G and changed the style of plot. Units for E1 and E2 are now explained in the figure caption.

      In Major comments, point (3) touches upon model and tool sharing. I would recommend making such statements more accurate and reflecting what exactly is provided to the community since not everything is shared.

      We have now made the entire model available under doi.org/10.7910/DVN/HISHXN.

      With regard to the tool chain, everything is on our public github (https://github.com/BlueBrain/) except for the algorithm for detecting axonal appositions. For that tool there are currently unresolved potential copyright issues with former collaboration partners. We are working to resolve them.

      I would recommend the authors address all the other points mentioned in the public review as well. In addition, below are some smaller issues that should be fixed.

      Figure 2: the caption appears to be partially wrong and partially misassigned to the figure panels.

      We fixed the issue.

      Also, note that in L6 the types L6_TPC:A and L6_TPC:C are listed in the figure, but L6_TPC:B is not mentioned.

      There is indeed no TPC:B type in layer 6. The distinction between TPC:A and TPC:B is based on early or late bifurcations of the apical dendrite and is only observed in layer 5.

      Figure 3, panel B2: the caption refers to colors in panel (C), but the authors probably meant to refer to panel (A).

      We fixed the issue.

      "The placement of morphological reconstructions matched expectation, showing an appropriately layered structure with only small parts of neurites leaving the modeled volume (Figure 2D, E)."

      Figure 2 does not have panels D and E.

      "The volume was clearly dominated by dendrites, filling between 23% and 47% of the space, compared to 2% to 11% for axons (Figure 3D3)." There is no panel D or D3 in Figure 3.

      "Recently, the MICrONS dataset (MICrONS-Consortium et al., 2021) has been analyzed with respect to the axonal targeting of inhibitory subtypes in a 100 x 100 μm subvolume spanning all layers (Schneider-Mizell et al., 2023)."

      100 x 100 μm is an area (and should be 100 x 100 μm^2), not a volume.

      Figure S11B requires a legend for the color map.

      We fixed the issues.

      Table S1: What is the difference between L6_BP and L6_BPC? They both are referred to as L6 bipolar cells.

      We have changed the description of L6_BPC to “Layer 6 bitufted pyramidal cell”.

    1. Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.

      Learn more at Review Commons


      Referee #2

      Evidence, reproducibility and clarity

      The authors present a comparative statistical analysis of five RNA velocity methods using two datasets and a single performance metric. Using the selected statistical metric, they describe the variable performance of RNA velocity methods, their variable robustness across different cell states, and the discrepancy of sets of identified lineage-specific driver genes.

      At this point, the scientific community extensively documented limitations and lack of stability of RNA velocity performance across methods and datasets. In that context, the manuscript lacks clear theoretical and practical conclusions that would be beneficial to the scientific community.

      The choice to focus on only a subset of RNA velocity methods is not discussed. Recent and important extensions, such as VeloVAE, VeloVI, LatentVelo, and Pyro-Velocity, are omitted, which limits the generality of the analysis. The statistical properties of the chosen consistency metric are not explored. The authors do not provide a justification for why this metric is appropriate for comparative analysis. Additionally, the authors do not present how the consistency score can be utilized to evaluate RNA velocity performance on user datasets. Overall, a discussion of the pressing issue of choosing statistical metrics to interpret RNA velocity results is lacking. The pros and cons of different RNA velocity methods, especially in light of the various statistical metrics, are not discussed. The manuscript does not present conclusions from the sampling analysis of sequencing depth. For instance, formalizing these findings with code that users can employ for their datasets would enhance the manuscript's practical utility. Overall, the manuscript would benefit from a thorough benchmark of the methodological approaches in the RNA velocity field, and testing various methods to evaluate RNA veloicty performance.

      Significance

      At this point, the scientific community extensively documented limitations and lack of stability of RNA velocity performance across methods and datasets. In that context, the manuscript lacks clear theoretical and practical conclusions that would be beneficial to the scientific community.

    1. In my brag document, I like to do this by making a section for areas that I’ve been focused on (like “security”) and listing all the work I’ve done in that area there. This is especially good if you’re working on something fuzzy like “building a stronger culture of code review” where all the individual actions you do towards that might be relatively small and there isn’t a big shiny ship.

      This is such a clever way to create a container that otherwise might not have existed for that work. I wonder if this would be a good way to highlight glue work?

    1. AbstractsThe rapid advancements in sequencing length necessitate the adoption of increasingly efficient sequence alignment algorithms. The Needleman-Wunsch method introduces the foundational dynamic programming (DP) matrix calculation for global alignment, which evaluates the overall alignment of sequences. However, this method is known to be highly time-consuming. The proposed TSTA algorithm leverages both vector-level and thread-level parallelism to accelerate pairwise and multiple sequence alignments.

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.141). These reviews are as follows.

      Reviewer 1. Xingbao Song and Baoxing Song

      Zong et al. implemented a TSTA package that integrated the difference method, the stripe method, SIMD, and multiple threading approaches to perform efficient sequence alignments. The TSTA toolkit could conduct pairwise and multiple sequence alignments. The memory cost of TSTA is comparable with the most efficient one. Overall, TSTA is a good package, and the manuscript is well-written. While I have a few suggestions: 1) The minimap2 should be mentioned in the section on "difference recurrence relation." It has a much broader range of users and implemented an algorithm that is slightly different from the one by Suzuki, etc. 2) The striped SIMD is also implemented in reads mappers, such as BWA. 3) Page 14, line 215 "1k bps", line 227 "1000 kbps", line 230 and table1 "100k". They should be consistent. 4) In Table 4, I am not sure I understood the second and third lines correctly. Please clarify. 5) I tried to compile TSTA from the source code. To compile the package, I had to copy 'seqio.h' into the 'msa' and 'psa' folders. Please fix it.

      Reviewer 2. Yuansheng Liu

      The article explores strategies for accelerating sequence alignment using multithreading and SIMD (Single Instruction, Multiple Data) techniques, and introduces a new algorithm called TSTA. The paper provides a detailed description of TSTA's performance in pairwise sequence alignment (PSA) and multiple sequence alignment (MSA), and compares it with various existing alignment algorithms. Experimental results indicate that TSTA demonstrates significant speed advantages, particularly when handling long sequences and in the no-backtracking mode. However, the experiments on MSA are limited by the experimental environment, which does not fully address the needs of current sequencing technologies concerning long reads and depth. Specifically, the low number of sequences in MSA does not meet the requirements for downstream genomic analysis applications. While the algorithm is highly innovative, its performance on short sequences and during the backtracking phase still requires optimization. 1. In line 7, the TSTA algorithm utilizes vector-level and thread-level parallelism to accelerate pairwise and multiple sequence alignment. Why are there no experiments designed specifically to evaluate the global alignment performance of TSTA with vector-level parallelism? Or are there any other experimental designs that demonstrate the improved performance of TSTA when vector-level parallelism is employed? 2. In line 149, is the Active-F method used by the TSTA algorithm contributing to the excessive memory usage and access time overhead observed during the iterative process of PSA? Are there better optimization strategies from this perspective? If not, why does TSTA incur higher time costs in traceback as shown in Table 1? Why does bsalign result in lower time consumption? 3. Can you provide the time breakdown for each part of the parallel computation in TSTA for PSA (including at least CPU computation overhead, communication overhead, and I/O overhead) to clarify if there will be significant communication overhead issues with larger datasets and more threads? 4. Table 2 shows that both real and simulated datasets have issues with insufficient depth and short reads. In real MSA processes, it is common to encounter comparisons with depth over 60X and lengths exceeding 100 kbps for long reads. The results under the current experimental conditions seem to perform poorly for such data scenarios. Can you address this? 5. Gene data often includes repetitive regions that affect the accuracy of alignment algorithms. Can you design experiments to verify how TSTA performs in aligning long repetitive regions? Specifically, how accurately does TSTA align sequences in such regions compared to other methods? 6. Besides repetitive regions, sequencing errors produced by ONT R10 chips can also impact alignment accuracy. Alignment algorithms used in genome correction often struggle to detect such errors. How does TSTA handle such issues during MSA? Can the algorithm be designed to address these sequencing errors more effectively? Re-review: After thoroughly reviewing the revised manuscript and testing the TSTA tool, I cannot endorse the manuscript for publication in its current form. I encourage the authors to address the following issues thoroughly and consider re-submitting after significant improvements. Efficiency Concerns: In the context of multiple sequence alignment (MSA), I find that TSTA does not demonstrate a significant advantage in terms of efficiency. I conducted a test with approximately 2G of homologous diploid reads (not too large data), and the tool has been running for around 29 hours. Despite this extensive runtime, the process remains incomplete. This is far from the efficiency one would expect from a tool designed for large-scale sequence alignment. Functionality Issues: There are still unresolved issues with the tool's functionality. The -f parameter does not appear to work as intended, and there are also problems with the -o parameter. Such issues need to be addressed to ensure the tool's reliability and usability.

    1. Editors Assessment:

      This paper presents the SMARTER database, a collection of tools and scripts to gather, standardize, and share with the scientific community a comprehensive dataset of genomic data and metadata information on worldwide small ruminant populations. Which has come out of the EU multi-actor (12 country) H2020 project called SMARTER: SMAll RuminanTs breeding for Efficiency and Resilience. This bringing together genotypes for about 12,000 sheep and 6,000 goats, alongside phenotypic and geographic information. The paper providing insight into how the database was put together, presenting the code for the SMARTER—frontend, backend and API, alongside instructions for users. Peer review tested the platform and provided suggestions on improving the metadata. Demonstrating the project provides valuable information on sheep and goat populations around the world, that can be an essential tool for ruminant researchers. Enabling them to generate new insights and offer the possibility to store new genotypes and drive progress in the field.

      This evaluation refers to version 1 of the preprint

    2. AbstractUnderutilized sheep and goat breeds have the ability to adapt to challenging environments due to their genetic composition. Integrating publicly available genomic datasets with new data will facilitate genetic diversity analyses; however, this process is complicated by important data discrepancies, such as outdated assembly versions or different data formats. Here we present the SMARTER-database, a collection of tools and scripts to standardize genomic data and metadata mainly from SNP chips arrays on global small ruminant populations with a focus on reproducibility. SMARTER-database harmonizes genotypes for about 12,000 sheep and 6,000 goats to a uniform coding and assembly version. Users can access the genotype data via FTP and interact with the metadata through a web interface or programmatically using their custom scripts, enabling efficient filtering and selection of samples. These tools will empower researchers to focus on the crucial aspects of adaptation and contribute to livestock sustainability, leveraging the rich dataset provided by the SMARTER-database.

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.139). These reviews are as follows.

      Reviewer 1. Ran Li

      The authors presented an online SMARTER-database, which collected a large number of genotype data for sheep and goats. The resources are of great importance for the community.

      My major concerns: 1) The below link is not accessible: webserver.ibba.cnr.it 2) For sheep, the database support reference genome assembly of Oar3 and Oar4, but actually Oar 3 is rarely used. Instead, the current ovine reference genome assembly (ARS-UI_Ramb_v3.0) is not supported. 3) For the presentation of metadata (https://webserver.ibba.cnr.it/smarter/breeds?species=Sheep), I suggest additional columns indicating the region and country should be provided. 4) For the datasets (https://webserver.ibba.cnr.it/smarter/datasets), references are needed to know where the data are from.

      Re-review:

      My comments have been properly addressed. The manuscript is acceptable for publication.

      Reviewer 2. Hans Lenstra and Johannes A. Lenstra

      Is there a clear statement of need explaining what problems the software is designed to solve and who the target audience is? Yes. This is implicitly clear and does not need to elaborate upon.

      As Open Source Software are there guidelines on how to contribute, report issues or seek support on the code? No. This does not to seem necessary.

      Is the code executable? unable_to_test Is installation/deployment sufficiently outlined in the paper and documentation, and does it proceed as outlined? unable_to_test Is the documentation provided clear and user friendly? Yes. I did not test this.

      Is there a clearly-stated list of dependencies, and is the core functionality of the software documented to a satisfactory level? No. I did not see such a list, but I would not be able to assess this.

      Have any claims of performance been sufficiently tested and compared to other commonly-used packages? not_applicable

      Is automated testing used or are there manual steps described so that the functionality of the software can be verified? No. I did not find any of this but it does not seem to be essential.

      Additional Comments: This manuscript describes a highly useful database of sheep and goat genome-wide SNP genotypes from several sources, supplemented with phenotypes and geographic locations. I recommend this manuscript for publication in Gigascience after a revision. There is some missing information, whereas the presentation should become less cryptic to readers who are less familiar with the bioinformatic terminology. Missing info. 1. The title and abstract do not mention that SMARTER focuses on SNPs that are genotyped on bead arrays or related technologies. The focus on the genome-wide (GW) SNP genotypes, which only partially represents the total genomic diversity, should already be clear from the Title and the Abstract. 2. Nowadays there are more publications on WGS data, T2T sequences and pangenomes than on GW SNP genotypes, so people may wonder if the GW SNP genotypes still are useful. It may be emphasized that bead-arrays allow an affordable analysis of many animals and that genotypes derived from WGS data contain many false homozygote scores if not sequenced at a very high coverage. 3. Figures 2 and 3 give an idea of the geographic coverage, but what is the unit of the numbers that are visualized in the heat map (0 to 2300 for sheep, 0 to 1100 for goats)? 4. It is not clear which published data have been used or not. We recommend presenting a supplemental table describing the current contents: country, breed, number of animals, number of SNPs (at least 50K or HD), reference. 5. Is there an organized effort to update the database, which ideally should contain all published GW SNP databases? 6. To my experience for most HW SNP datasets only the filtered data after quality control (typically 45 to 49K, less than 42K if sheep 50K and HD genotypes are combined) are available. How is this handled? 7. It may be mentioned that after omission of A/T and G/C SNPs the TOP strand consists only of A/C and A/G SNPs. 8. The problematic SNPs are mentioned twice within the last paragraph of the section Data Composition. 9. Does SMARTER allow to store phased datasets and show the variant haplotypes? These can now be generated by long-read sequencing and are needed for several downstream analysis options. 10. Table 1: OAR3 = Oar_v3.1 and OAR4 = Oar_v4.0? Please use the official codes. 11. Are there options to convert the data to newer assemblies? For instance, the sheep ARS-UI_Ramb_v3.0 is superior to Oar_v4.0. I have used an NCBI tool for conversion of Oar_v1.0 (most popular for 50K datasets) and Oar_3.1 (used often for sheep HD datasets) to Oar_v4.0, but this tool has probably been discontinued and was not available for goat assemblies. 12. I repeatedly found that most published or unpublished databases contain several errors such as duplicates and outliers by mislabeling or crossbreeding. Because these are better removed prior to downstream analysis, data curation would be desirable, for instance by an inspection of a NJ tree of individuals. This also shows the degree of breed-level differentiation, for instance the relationships of different populations of a transboundary breed. These caveats should at least be mentioned. 13. Another caveat: is there a systematic check on the validity of the merging of datasets by testing if breeds sampled independently by different institutes cluster closely together? Presentation. 14. Abbreviations should not be used in abstract. What is “REST API”? These abbreviations of course are in the list, but what is “Representational State Transfer”? And “JSON Web Token”? 15. Figure 1 needs more guidance via the legend. The boxes show alternative formats? What are “str”, “dict “? 16. Figure 5 is useful and seems to retrieve data for the goat Alpine and Bionda dell'Adamello breeds. It would also be useful to show other “API-URL” (this is user input?) while describing in plain language what is being accomplished. 17. Figure 6: bold indicates the user input? What is exactly a “array [string]” (give an example)? A few other examples may be most instructive and familiarize the reader with the logic of SMARTER. 18. In the section “The SMARTER-database project”: what is a mongoengine? 19. In the same section: “Finally the VariantSpecie abstract class is inherited by . . .”: this sentence is difficult to understand. 20. In the section Reproducibility: please give a short description of what is the use of the Conda and Docker programs. 21. Same section: “Raw data undergoes initial exploration”, “structure and potential issues”: can you be more specific? The last part of this section is also difficult to follow.

      Re-review: This paper presents the SMARTER database, a collection of tools and scripts to gather, standardize, and share with the scientific community a comprehensive dataset of genomic data and metadata information on worldwide small ruminant populations. Which has come out of the EU multi-actor (12 country) H2020 project called SMARTER: SMAll RuminanTs breeding for Efficiency and Resilience. This bringing together genotypes for about 12,000 sheep and 6,000 goats, alongside phenotypic and geographic information. The paper providing insight into how the database was put together, presenting the code for the SMARTER—frontend, backend and API, alongside instructions for users. Peer review tested the platform and provided suggestions on improving the metadata. Demonstrating the project provides valuable information on sheep and goat populations around the world, that can be an essential tool for ruminant researchers. Enabling them to generate new insights and offer the possibility to store new genotypes and drive progress in the field.

    1. Editors Assessment:

      This paper presents NucBalancer, a R-pipeline and Shiny app designed for the optimal selection of barcode sequences for sample multiplexing in sequencing. Providing a user-friendly interface aiming to make this process accessible to both bioinformaticians and experimental researchers, enhancing its utility in adapting libraries prepared for one sequencing platform to be compatible with others. Important now with the introduction of additional sequencing platforms by Element Biosciences (AVITI System) and Ultima Genomics (UG100) increasing the diversity and capability of genomic research tools available. NucBalancer’s incorporation of dynamic parameters, including customizable red flag thresholds, allows for precise and practical barcode sequencing strategies. This adaptability is key in ensuring uniform nucleotide distribution, particularly in MGI sequencing and single-cell genomic studies, leading to more reliable and cost-effective sequencing outcomes across various experimental conditions. All the code is available under an open source license, and upon review the authors have also shared the code for the Shiny app.

      This evaluation refers to version 1 of the preprint

    2. AbstractRecent advancements in next-generation sequencing (NGS) technologies have brought to the forefront the necessity for versatile, cost-effective tools capable of adapting to a rapidly evolving landscape. The emergence of numerous new sequencing platforms, each with unique sample preparation and sequencing requirements, underscores the importance of efficient barcode balancing for successful pooling and accurate demultiplexing of samples. Recently launched new sequencing systems claim better affordability comparable to more established platforms further exemplifies these challenges, especially when libraries originally prepared for one platform need conversion to another. In response to this dynamic environment, we introduce NucBalancer, a Shiny app developed for the optimal selection of barcode sequences. While initially tailored to meet the nucleotide, composition challenges specific to G400 and T7 series sequencers, NucBalancer’s utility significantly broadens to accommodate the varied demands of these new sequencing technologies. Its application is particularly crucial in single-cell genomics, enabling the adaptation of libraries, such as those prepared for 10x technology, to various sequencers including G400 and T7 series sequencers. By facilitating the efficient balancing of nucleotide composition and the accommodation of differing sample concentrations, NucBalancer plays a pivotal role in reducing biases in nucleotide composition. This enhances the fidelity and reliability of NGS data across multiple platforms. As the NGS field continues to expand with the introduction of new sequencing technologies, the adaptability and wide-ranging applicability of NucBalancer render it an invaluable asset in genomic research. This tool addresses the current sequencing challenges ensuring that researchers can effectively balance barcodes for sample pooling regardless of the sequencing platform used.

      This work has been published in GigaByte Journal under a CC-BY 4.0 license (https://doi.org/10.46471/gigabyte.138). These reviews are as follows.

      Reviewer 1. Aamir Khan

      Is there a clear statement of need explaining what problems the software is designed to solve and who the target audience is?

      Yes. The tool has novel features not reported in previous tools for barcoding.

      Is the source code available, and has an appropriate Open Source Initiative license been assigned to the code?

      Yes. The tool is available as an R script as well as a shiny app.

      Is installation/deployment sufficiently outlined in the paper and documentation, and does it proceed as outlined? Yes. I would suggest mentioning a few features that are novel or superior to other tools. Perhaps adding a table specifying these novel features that are not part of existing tools will add value to MS.

      Is the documentation provided clear and user friendly?

      Yes. The documentation is provided in a clear and user-friendly way. The input file formats are given in the GitHub page. It would be better to add an example to the shiny app page.

      Yes. Is there a clearly-stated list of dependencies, and is the core functionality of the software documented to a satisfactory level? Yes. Dependencies are mentioned on the tool documentation page and can be installed if R is already installed.

      Additional Comments: The authors have a well-written MS describing the NucBalancer tool. The tool adds value for sequencing by pooling samples and will be useful as we make technological advancements in the sequencing space.

      Reviewer 2. Hugo Varet

      Is there a clear statement of need explaining what problems the software is designed to solve and who the target audience is?

      Yes. The manuscript explains the constraints to be satisfied when looking for barcodes but more details about the context (Illumina chemistry for instance) would be appreciated. Moreover, is the software compatible with dual-indexing?

      Is the source code available, and has an appropriate Open Source Initiative license been assigned to the code?

      Yes. The source code of the program is available on GitHub as a R script, but the source code of the Shiny application is not available.

      As Open Source Software are there guidelines on how to contribute, report issues or seek support on the code?

      Yes. Support can be asked by email to the authors as stated at the end of the README on GitHub.

      Is installation/deployment sufficiently outlined in the paper and documentation, and does it proceed as outlined?

      Yes. The example command line works well. However, the R script needs shiny and xtable packages to be loaded even if none of their functions is actually called in the script.

      Is the documentation provided clear and user friendly?

      No. A detailed documentation would improve the application proposed. In particular, more details about the different chemistries used by Illumina, MGI... and the constraints to find compatible barcodes.

      Is there a clearly-stated list of dependencies, and is the core functionality of the software documented to a satisfactory level?

      No. The strategy used to find barcodes seems very simple, but more details would improve the manuscript.

      Have any claims of performance been sufficiently tested and compared to other commonly-used packages?

      No. The manuscript cites several packages developed to find compatibles sequencing barcodes but the performances are not compared. Moreover, we do not know if NucBalancer still work with a high number of samples/barcodes.

      Are there (ideally real world) examples demonstrating use of the software?

      No. A real world example would be appreciated to illustrate the software, especially in a scenario where the other cited solutions were not able to find compatible barcodes.

      Is automated testing used or are there manual steps described so that the functionality of the software can be verified?

      No.

      Additional Comments: I would suggest the authors to improve the design of the Shiny app as (at the moment) it only runs a R script and prints the result. Moreover, I think the quality of the R code could be easily improved (e.g. loops with strange counters or comparisons with booleans).

      Re-review: I thank the authors for the improvements they made on this new version of the manuscript. At this stage, I'm not totally satisfied for the following reasons: - authors tell the source code of the Shiny app is now available on GitHub, but I have not been able to find it. - in the manuscript, the sentence "The tool does not have any dependency other than the utilities from the base R package" is no longer true as the tool now uses optparse. - in table 1, checkMyIndex is referenced with no web interface available white it actually exists (https://checkmyindex.pasteur.fr/). Moreover, the proposed web interface could still be improved. For instance: - it would be great to add something to show the algorithm is currently looking for a solution. - check the input files have a valid structure to be used. - display the input files when they are loaded to make sure the user uploaded the correct file.

      Reviewer 3. Wen Yao

      The authors reported a new tool for barcode sequences design. This tool is developed using R/Shiny and is available for using online. Below are my comments for further improvement of the manuscript and the tool. 1. Please provide a “load example data” button in the Shiny app. With this button, the example data can be easily loaded by the users for testing NucBalancer. 2. This URL (http://146.118.68.98:8888/) for NucBalancer should also be given in the manuscript. 3. The “Download Table” button is not working. 4. Format of the input data should be checked, as input data in wrong format caused the NucBalancer to crash. 5. The authors should compare NucBalancer with published similar tools in this field. More details are required.

      Re-review: The authors have addressed all my concerns.

    1. Two-factor authentication, or two-step authentication, is a login process where the user is asked to provide two authentication points, such as a password and a code shared through a text message. Two-factor authentication enhances login security.

      This works, but we use this at nsu and it only helped after the fact. and it is a hassle for users.

    1. The code below simulates n=10000 paths with m=1000 time steps. There are some features of the simulation which will prove useful late

      before we used m as the number of paths. Inconsistency here @mark

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews: 

      Reviewer #1 (Public Review): 

      Summary: 

      The manuscript by Rowell et al aims to identify differences in TCR recombination and selection between foetal and adult thymus in mice. Authors sequenced the unpaired bulk TCR repertoire in foetal and adult mice thymi and studied both TCRB and TCRa characteristics in the double positive (DP, CD4+CD8+) and single positive (SP4 CD4+CD8CD3+ and SP8 CD4-CD8+CD3+) populations. They identified age-related differences in TCRa and TCRB segment usage, including a preferential bias toward 3'TRAV and 5' TRAJ rearrangements in foetal cells compared to adults who had a larger perveance for 5'TRAV segments. By depleting the thymocyte population in adult thymi using hydrocortisone, the authors demonstrated that the repertoire became more foetal like, they therefore argue that the preferential 5'TRAV rearrangements in adults may be resulting from prolonged/progressive TCRa rearrangements in the adult thymocytes. In line with previous studies, Authors demonstrate that the foetal TCR repertoire was less diverse, less evenly distributed and had fewer non-template insertions while containing more clonal expansions. In addition, the authors claim that changes in V-J usage and CDR1 and CDR2 in the DP vs SP repertoires indicated that positive selection of foetal thymocytes are less dependent on interactions with the MHC. 

      Strengths: 

      Overall, the manuscript provides an extensive analysis of the foetal and adult TCR repertoire in the thymus, resulting in new insights in T cell development in foetal and adult thymi. 

      Weaknesses: 

      Three major concerns arise:

      (1) the authors have analysed TCR repertoires of only 4 foetal and 4 adult mice, considering the high spread the study may have been underpowered. 

      Given the concerns of the reviewer we have sequenced more libraries and added more data to include repertoires from 7 embryos and 6 young adults (biological replicates from different sorts). We believe that including more replicates has indeed strengthened our study. 

      Our experimental approach was to sequence TCR transcripts, and in studies using RNA-sequencing of inbred mice, often only 3 individuals (biological replicates) are sequenced.

      Our study sequenced from 7 foetal thymuses (generating TCRα and TCRβ repertoires from 4 FACS-sorted cell populations); 6 adult thymuses (generating TCRα and TCRβ repertoires from 4 FACS-sorted cell populations); and 5 adult thymuses from hydrocortisone-treated mice (generating TCRα and TCRβ repertoires from FACS-sorted CD3lo and CD3hi DP populations). We thus analysed 124 distinct repertoires from different populations and libraries, and many tens of thousands of unique sequences.  

      (2) Gating strategies are missing and 

      We have included gating strategies for cell-sorting as SFig7 and SFig8.

      (3) the manuscript is very technical and clearly aimed for a highly specialised audience with expertise in both thymocyte development and TCR analysis. Authors are recommended to provide schematics of the TCR rearrangements/their findings and include a summary conclusions/implications of their findings at the end of each results section rather than waiting till the discussion. This will help the reader to interpret their findings while reading the results. 

      We have modified the manuscript to include a more general introductory paragraph (page 3) to introduce the reader to the topic and we have included brief summaries of the findings at the end of each result section (pages 7,9,10,12,13,15).

      Reviewer #2 (Public Review): 

      Summary: 

      The authors comprehensively assess differences in the TCRB and TCRA repertoires in the fetal and adult mouse thymus by deep sequencing of sorted cell populations. For TCRB and

      TCRA they observed biased gene segment usage and less diversity in fetal thymocytes. The TCRB repertoire was less evenly distributed and displayed more evidence of clonal expansions and repertoire sharing among individuals in fetal thymocytes. In both fetal and adult thymocytes they show skewing of V segment (CDR1-2) repertoires in CD4 and CD8 as compared to DP thymocytes, which they attribute to MHC-I vs MHC-II restriction during positive selection. However the authors assess these effects to be weaker in fetal thymocytes, suggesting weaker MHC-restriction. They conclude that in multiple respects fetal repertoires are distinct from and more innate-like than adult. 

      Strengths: 

      The analyses of the F18.5 and adult thymic repertoires are comprehensive with respect to the cell populations analyzed and the diversity of approaches used to characterize the repertoires. Because repertoires were analyzed in pre- and post-selection thymocyte subsets, the data offer the potential to assess repertoire selection at different developmental stages. The analysis of repertoire selection in fetal thymocytes may be unique. 

      Weaknesses: 

      (1) Problematic experimental design and some lack of familiarity with prior work have resulted in highly problematic interpretations of the data, particularly for TCRA repertoire development. 

      The authors note fetal but not adult thymocytes to be biased towards usage of 3' V segments and 5'J segments. It should be noted that these basic observations were made 20 years ago using PCR approaches (Pasqual et al., J.Exp.Med. 196:1163 (2002)), and even earlier by others.

      We have cited this manuscript (Introduction, page 5) which used PCR of genomic DNA to investigate some TCRα VJ rearrangements in foetal and adult thymus. In contrast, our study uses next generation sequencing of transcripts to investigate all possible combinations of TCRα and TCRβ VJ combinations in different sorted thymocyte populations ex vivo. The greater sensitivity of this more modern technology has thus enabled us to detect many more TCRαVJ rearrangements than the 2002 study, and to conclude on basis of stringent statistical testing that the foetal repertoire is enriched for 3’V to 5’J combinations (Fig. 4). 

      The authors also note that in fetal thymus this bias persists after positive selection, and it can be reproduced in adults during recovery from hydrocortisone treatment. The authors conclude that there are fewer rounds of sequential TCRA rearrangements in the fetal thymus, perhaps due to less time spent in the DP compartment in fetus versus adult. However, the repertoire difference noted by the authors does not require such an explanation. What the authors are analyzing in the fetus is the leading edge of a synchronous wave of TCRA rearrangements, whereas what they are analyzing in adults is the unsynchronized steady state distribution. It is certainly true, as has been shown previously, that the earliest TCRA rearrangements use 3' TRAV and 5'TRAJ segments. But analysis of adult thymocytes has shown that the progression from use of 3' TRAV and 5' TRAJ to use of 5' TRAV and 3' TRAJ takes several days (Carico et al., Cell Rep. 19:2157 (2017)). The same kinetics, imposed on fetal development, would put development of a more complete TCRA repertoire at or shortly after birth. In fact, Pasqual showed exactly this type of progression from F18 through D1 after birth, and could reproduce the progression by placing F16 thymic lobes in FTOC. It is not appropriate to compare a single snapshot of a synchronized process in early fetal thymocytes to the unsynchronized steady state situation in adults. In fact, the authors' own data support this contention, because when they synchronize adult thymocytes by using hydroxycortisone, they can replicate the fetal distribution. Along these lines, the fact that positive selection of fetal thymocytes using 3' TRAV and 5' TRAJ segments occurs within 2 days of thymocyte entry into the DP compartment does not mean that DP development in the fetus is intrinsically rapid and restricted to 2 days. It simply means that thymocytes bearing an early rearranging TCR can be positively selected shortly after TCR expression. The expectation would be that those DP thymocytes that had not undergone early positive selection using a 3' TRAV and a 5' TRAJ would remain longer in the DP compartment and continue the progression of TCRA rearrangements, with the potential for selection several days later using more 5'TRAV and 3'TRAJ. 

      We agree with this summary provided by the reviewer which corresponds closely to the points we made ourselves in the manuscript. Indeed, we discuss the synchronization and kinetics of first wave of T-cell development in Results page 13 and Discussion page 17, which was the rationale for the hydrocortisone experiment.  We have also discussed findings from Carico et al 2017 in this context (see pages 13, 16, 17).  

      (2) The authors note 3' V and 5'J biases for TCRB in fetal thymocytes. The previously outlined concerns about interpreting TCRA repertoire development do not directly apply here. But it would be appropriate to note that by deep sequencing, Sethna (PNAS 114:2253 (2017)) identified skewed usage of some of the same TRBV gene segments in fetal versus adult.  It should also be noted that Sethna did not detect significantly skewed usage of TRBJ  segments. Regardless, one might question whether the skewed usage of TRBJ segments detected here should be characterized as relating to chromosomal location. There are two logical ways one can think about chromosomal location of TRBJ segments - one being TRBJ1 cluster vs TRBJ2 cluster, the other being 5' to 3' within each cluster. The variation reported here does not obviously fit either pattern. Is there a statistically significant difference in aggregate use of the two clusters? There is certainly no clear pattern of use 5' to 3' across each cluster. 

      We have included a statistical comparison of the aggregate TRBJ use between the J1 cluster and the J2 cluster (see SFig5) and Results page 9. 

      (3) The authors show that biases in TCRA and TCRB V and J gene usage between fetal and adult thymocytes are mostly conserved between pre- and post-selection thymocytes (Fig 2). In striking contrast, TCRA and TCRB combinatorial repertoires show strong biases preselection that are largely erased in post-selection thymocytes (Fig 3). This apparent discrepancy is not addressed, but interpretation is challenging. 

      I think the reviewer is referring to heatmaps for individual gene segment usage shown in Figure 2 in comparison to combinatorial usage shown in Figure 4. There is not a discrepancy in the data, but rather the differences between these two figures lie in the way in which the comparisons are made and visualised.  The heatmaps in Figure 2A-D show mean proportional usage of each individual gene segment for each cell type in the two life stages, clustered by Euclidian distance. This visualisation clearly shows bias in foetal 3’ TRAV usage and 5’TRAJ usage (looking at areas of red, which have higher usage), with less pronounced enrichment for TRBV and TRBJ.  The heatmaps also show differences in intensity between different cell populations in each life-stage. 

      In contrast, in Figure 4 the tiles show combinations with statistically significant (P<0.05) differences in mean counts for each VJ combination in each cell type between 7 foetal and 6 adult repertoires by Student’s t-test, after correcting for False discovery rate (FDR) due to multiple combinations.  It is the case, that there are fewer significant differences in proportional combinatorial VxJ use between foetal and adult repertoires after selection. We find this an interesting finding and have expanded our discussion of this aspect of the data (page 10).  More than half of the significant differences persist after repertoire selection, and the reduction in each individual SP population, of course in part reflects the lineage divergence.

      (4) The observation that there is a higher proportion of nonproductive TCRB rearrangements in fetal thymus compared to adult is challenging to interpret, given that the results are based upon RNA sequencing so are unlikely to reflect the ratio in genomic DNA due to processes like NMD.

      We have added two sentences to explain that transcripts of non-productive rearrangements are eliminated by nonsense-mediated decay (NMD), but some non-productive transcripts are detected in many studies of TCR repertoire sequencing, and we have cited three studies from different groups that document this (see Results, page 10-11). We have not commented on how the increase in non-productive TCR rearrangements in the foetal populations (in comparison to adult) relates to rearrangements in genomic DNA or NMD.   We have likewise not commented on the possible significance or biological role of nonproductive TCR transcripts, but simply reported our findings.

      (5) An intriguing and paradoxical finding is that fetal DP, CD4 and CD8 thymocytes all display greater sharing of TCRB CDR3 sequences among individuals than do adults (Fig 5DE), whereas DP and CD8 thymocytes are shown to display greater CDR3 amino acid triplet motif sharing in adults (with a similar trend in CD4). 

      As foetal DP, CD4SP and CD8SP TCRbeta repertoires have fewer non-template insertions and lower means CDR3 length, they are expected to share more CDR3 repertoires than their adult counterparts.  However, in the case of CDR3 amino acid triplet motifs (k-mers) what is being analysed is the sharing of each possible individual k-mer. If k-mers are shared more in the adult for some populations, but CDR3 repertoires are shared more in the foetus, we think it means that some k-mers appear in many different CDR3 sequences in the adult, so that they are over-represented in multiple different CDR3s (presumably due to selection processes, although we agree that this is just an assumption).  

      The authors attribute high amino acid triplet sharing to the result of selection of recurrent motifs by contact with pMHC during positive selection. But this interpretation seems highly problematic because the difference between fetal and adult thymocytes is dramatic even in unfractionated DP thymocytes, the vast majority of which have not yet undergone positive selection. How then to explain the differences in CDR3 sharing visualized by the different approaches? 

      The TCRβ repertoire has been selected in the adult DP population through the process of β-selection, which is believed to involve immune synapse formation and MHC-interactions (Allam et al 2021,10.1083/jcb.201908108). We have now included this reference in the introduction to make this clear (page 4). However, we agree with the reviewer’s comments that it is challenging to explain the k-mer analysis and that we have not been able to actually show that increased k-mer sharing in the adult is a direct consequence of increased positive selection: it was our interpretation of this seemingly paradoxical finding.  For clarity, we have therefore removed the k-mer analyses from the manuscript.

      (6) The authors conclude that there is less MHC restriction in fetal thymocytes, based on measures of repertoire divergence from DP to CD4 and CD8 populations (Fig. 6). But the authors point to no evidence of this in analysis of TRBV usage, either by PC or heatmap analyses (A,B,D). The argument seems to rest on PC analysis of TRAV usage (Fig S6), despite the fact that dramatic differences in the SP4 and SP8 repertoires are readily apparent in the fetal thymocyte heatmaps. The data do not appear to be robust enough to provide strong support for the authors' conclusion. 

      We have written the text very carefully so as not to make the claim too strong, stating in the abstract: “In foetus we identified less influence of MHC-restriction on α-chain and β-chain combinatorial VxJ usage and CDR1xCDR2 (V region) usage in SP compared to adult, indicating weaker impact of MHC-restriction on the foetal TCR repertoire.” We are not saying that MHC-restriction does not impact VJ gene usage in foetal repertoires, but rather that it has less influence (particularly when compared to life-stage).  Evidence for this comes from:  [1] Heatmaps in Fig2A-D which show that all repertoires cluster first by life-stage ahead of cell type; [2] Fig3A and B: PCA of adult and foetal TCRβ VXJ combinations: All repertoires cluster by life-stage on PC1.  PC2 separates adult repertoires by cell type (adult SP8 are positive on PC2 while adult SP4 are negative on PC2, and DP cells are between them) but for foetal repertoires the SP8 and SP4 are highly dispersed with some SP4 cells falling on positive side of PC2.  Only foetal DP repertoires cluster tightly. [3] Fig6A-C: PCA of β−chain CDR1xCDR2 (corresponding to Vβ gene segment usage) again shows the same pattern.  Adult repertoires separate by cell type on PC2, (SP8 positive on PC2, SP4 negative on PC2, with DP in between), but foetal SP8 repertoires are much more dispersed.  [5] SFig6J-K: PCA of α−chain CDR1xCDR2 (Vα usage) frequency distributions: adult repertoires cluster together and are separated by cell type on PC2 (SP4 positive, SP8 negative), but foetal populations are highly dispersed and fail to cluster by cell type on either axis. [6] We have additionally added new PCA analyses to explore differences in MHC-restriction between foetal and adult SP populations.  This is shown in the new Figure 7. We reasoned that in a PCA that included foetal and adult repertoires together, the foetal repertoires might not segregate by SP cell type (MHC-restriction) because of their overall bias towards particular VJ combinations, which would mean that effectively the PCA would be imposing adult MHC restriction on the foetal repertoires.  We therefore carried out PCA in which we analysed the adult repertoires separately from the foetal repertoires.  As expected for adult repertoires, PCA separated SP4 repertoires from SP8 repertoires on PC1 in each comparison (β-chain VxJ (Fig. 7B), α-chain VxJ (Fig. 7F), β-chain CDR1xCDR2 (V region) (Fig. 7H) and α-chain CDR1xCDR2 (V region) (Fig. 7L)). In contrast, for foetal TCRα repertoires (α-chain VxJ and α-chain CDR1xCDR2 (V region)), PCA failed to separate SP4 from SP8 repertoires on PC1 or PC2, so we did not detect impact of MHC-restriction on foetal TCRβ repertoires (Fig. 7E and K).  For foetal TCRβ repertoires, PCA separated SP4 β-chain VxJ from SP8 on PC2, accounting for only 11.1% of variance (Fig. 7A) (in contrast to the 44.2% of variance accounted for by MHC-restriction in adult β-chain VxJ PCA (Fig. 7B)). Thus, in adult repertoires ~4-fold more of the variance in β-chain VxJ usage can be accounted for by MHC-restriction than in foetal repertoires. PCA of foetal β-chain CDR1xCDR2 (V region) separated SP4 from SP8 on PC1, accounting for 28.8% of variance, whereas in PCA of adult β-chain CDR1xCDR2, MHCrestriction accounted for 56.1% (>2-foldmore than in foetus).  Thus, even when we  considered only V-region usage alone, we detected a stronger influence of MHC-restriction on the TCRβ repertoire in adult compared to foetal thymus.  

      Reviewer #3 (Public Review): 

      Summary:

      This study provides a comparison of TCR gene segment usage between foetal and adult thymus.

      Strengths:

      Interesting computational analyses was performed to find interesting differences in TCR gene usage within unpaired TCRa and TCRb chains between foetal and adult thymus.  

      Weaknesses:

      This study was significantly lacking insight and interpretation into what the data analysed actually means for the biology. The dataset discussed in the paper is from only two experiments. One comparing foetal and adult thymi from 4 mice per group and another which involved hydrocortisone treatment. The paper uses TCR sequencing methodology that sequences each TCR alpha and beta chains in an unpaired way, meaning that the true identity of the TCR heterodimer is lost. This also has the added problem of overestimating clonality, and underestimating diversity.

      We have discussed the limitations and benefits of our approach of sequencing TCRβ and TCRα repertoires separately in the Discussion (page 19).  This approach allows the analysis of thousands of sequences from different cell types and different individuals at relatively low cost. We have made no claims in our manuscript about overall diversity or pairing, and given that each chain’s gene locus rearranges at a different time point in development, we believe it is of interest to consider the repertoires individually within this context.

      Limited detail in the methods sections also limits the ability for readers to properly interpret the dataset. What sex of mice were used? Are there any sex differences? What were the animal ethics approvals for the study?

      We have included this information in the Methods (page 19).  Both sexes were used and we found no sex differences, although that was not the focus of our study. All animal experimentation in the UK is carried out under UK Home Office Regulations (following ethical review). This is included in the Methods (page 19).  

      Recommendations for the authors:  

      Reviewer #1 (Recommendations For The Authors): 

      Major points: 

      - Group sizes are very small (4 foetal and 4 adult mice). Considering the spread in TCR analysis (eg fig 1 B-H, Sup figures 2-4), the study is likely underpowered as it often looks like one mouse prevents or supports a statistical difference. Authors should therefore consider increasing the group size. 

      We have sequenced more libraries and included more data, from 7 foetal and 6 young adult animals (biological replicates).  

      - The authors should include a gating strategy for their sorted cells. This is essential to verify the quality of their findings. 

      We have added this to the Methods and SFig7 and SFig8.

      Authors should include a summary sentence at the end of each result section which interprets the main finding. Furthermore, the manuscript would greatly benefit from a schematic figure of their main findings, particularly with regards to the rearrangements and selection differences in foetal and adult thymi. 

      We have added a summary sentence to the end of each results section.

      - Authors should be more careful with their claim that MHC has less of an effect foetal TCR selection. Authors demonstrated that there is a difference in VJ recombination between the foetal and adult TCR repertoire, skewing the foetal TCR repertoire to certain variable and junctional segments. Since both CDR1 and CDR2 are encoded by the variable gene, this is likely to affect their ability to interact with the MHC during positive selection. Have Authors considered whether the selection process is actually a bystander effect of the differences in the rearrangement process? One way to support the authors claim is to demonstrate that mice with an alternative MHC background, have similar foetal/adult gene rearrangements but a different TCR repertoire in the SP populations. 

      Time and resources have prevented us from repeating our experiments in another strain of inbred mice.  However, we note that a previous PCR study that showed 3’TRAV to 5’TRAJ bias in foetal repertoires was carried out in BALB/c mice (Pasqual JEM 2002). We have added this point to the Discussion (page 17). 

      - (supplementary) tables have not been provided. 

      Supplementary Tables were uploaded with the submission.  STables 1 and 2 show antibodies used for cell sorts and STable 3 primers used.

      Moderate points: 

      - The loading plots in Figure 3 onward are visually strong. Authors could consider including an V and J (separate) loading plots for Figure 3 E, F and G to demonstrate preferential V and J usage. 

      We have included additional loading plots in Figure 7 for the new PCA we have added (see Fig. 7C, D,I and J).

      - "the proportion of non-productive rearrangements was higher in the foetal SP8 population than adults (Fig 5A)" Authors should explain how non-productive TCRs end up in SP populations as they need to pass positive and negative selection which both require interactions between the TCR and the MHC. 

      As we used RNA sequencing in our study, we did not comment on how the increase in nonproductive TCRbeta rearrangements in the foetal populations (in comparison to adult) relates to rearrangements in genomic DNA or to nonsense-mediated decay (NMD) that is believed to down-regulate transcripts of non-productively rearranged TCR.  We have not commented on the possible significance or biological role of non-productive TCR transcripts, but simply reported our findings. 

      - Authors have studied CDR3 sequential amino acid triplets (k-mers). However, CDR3 regions are longer than 3 amino acids in length, hence authors should provide 1) an overview/comparison of the identified k-mers in foetal or adult thymocytes 2) explain how different k-mers relate to each other, eg whether they are expressed in the same TCR. Have authors considered using alternative programs to identify CDR3 motifs that are based on the full CDR3amino acid sequence, eg TCRdist provides motifs and indicated which amino acids are germline encoded or inserted. 

      In light of this comment from this reviewer and also comments from Reviewer 2, we have removed the comparison of k-mers from the manuscript.  Please see response to point 5 of Reviewer 2.  

      - The term "innate-like" is confusing as it implies that foetal cells are not antigen specific.

      However, once in the circulation, foetal cells will respond in an antigen-specific manner.

      Hence authors should use another term. 

      We have removed the term “innate-like” from the abstract and the first time we used it in the first paragraph of the Discussion. However, the second time we used the term, we are actually taking it from the manuscript we cited (Beaudin et al 2016) and in this case we left it in. We agree that foetal cells are likely to respond in an antigen-specific manner. 

      - To support their hypothesis in the discussion "However, as TCRd gene segments are nested.... so that 5' TRAV segments are not favoured" can authors confirm that there are indeed less yd T cells in the foetal repertoire? 

      We have removed this section from the discussion, because although it is interesting, it is highly speculative, and the manuscript is already quite complicated to interpret.

      Minor points: 

      - The authors may find the publication by De Greef 2021 PNAS of interest to identify TRBD segments 

      - Authors need to clarify that they mean CDR3-beta in the sentence "The mean predicted CDR3 length.... compared to young adult" 

      We have included new data in the manuscript to show that mean CDR3 length is lower in all foetal populations of beta (Fig5C) and alpha (SFig5C) and clarified which we are referring to in the text. 

      - Authors should bring the section "During TCRb gene rearrangement, these segments.... Initiating the sequence of rearrangements" forward and include a schematic." Forward to figure 2 and provide the reader with a visual schematic of the foetal vs adult recombination events. 

      - Discussion: "The first wave of foetal abT-cells that leave the thymus... tolerant to both self and maternal MHC/antigens". Have Authors considered the alternative hypothesis published by Thomas 2019 in Curr Opin System Biol that the observed bias could potentially provide better protection against childhood pathogens? 

      We have indeed considered this, as stated in the first paragraph of the Discussion “The first wave of foetal αβT-cells that leave the thymus must provide early protection against infection in the neonatal animal”. We have now cited the Thomas 2019 study.

      - Discussion: Authors should rephrase the sentence "The transition from DP to SP cell in the foetus.... From DN3 to SP cell may be slower" as it is unclear what the authors mean. 

      We have rephrased this (see page 17)

      - Discussion "TRAV and TRAJ Array" do authors mean "TRAV and TRAJ area"? 

      We did indeed mean array (as in series of gene segments) but we have changed the wording for clarity (page 14).

      - Methods, Fluorescence activated cell sorting: can authors clarify whether they stained, sorted and sequenced the full thymus and /or specify how many cells were included. Can authors also explain why foetal and adult cells were treated differently (eg the volume of master mix)? 

      - Methods Fluorescence activated cell sorting authors should specify what they mean with "mastermix of either 1:50 (foetal thymus) or 1:100 (adult thymus)". Does this mean all antibodies in the foetal mastermix were 1:50 and all antibodies in the adult master mix were 1:100? If so, why were different concentrations used and why were antibodies not individually titrated before use?  

      We have clarified the methods and antibodies used are listed with clones in supplementary tables.

      Figures: 

      - Several figures did not fit on the page and therefore missed the top or side 

      - Figure 1A: missing a label on the Y axis

      This is visible

      - Figure 2A-D: please indicate the 5' and 3' terminus in each graph. The cell type legend should include two separate colours for the two DP populations. 

      We have added 5’ and 3’ labels.  The two DP populations are clearly labelled.

      - Figure 4: please indicate the 5' and 3' terminus in each graph. 

      We have added 5’ and 3’ labels.   

      - Figure 5C: y axis should read mean CDR3B length (aa), Figure 5D and E: y axis should read Jaccard Index CDR3B, Figure 5 F and G: y axis should read Jaccard index CDR3B k-mers. Same comment for Sup Fig 5 but then CDR3a. 

      We have added these labels for both Figure 5 and Supplementary Figure 6 (was SFig5 previously).

      - Figure 6C top label should read CDR1B x CDR2B with highest contribution 

      We have added this label.

      - Figure 7: please indicate the 5' and 3' terminus in each graph. 

      We have added 5’ and 3’ labels.  This is now Figure 8, as we have added new analyses (new Figure 7).

      - Supplementary Figure 1-4 are missing a colour legend next to the graphs.

      We have added the legends in.  

      Reviewer #2 (Recommendations For The Authors): 

      (1) The authors need to provide better support for the notion that the fetal thymus produces ab T cells with properties and functions that are distinct from adult T cells. There are several  ways they might provide a more meaningful assessment: (1) They could analyze the fetal repertoire at multiple time points. (2) They could compare instead the steady state distributions in early postnatal and adult thymus samples. (3) They could compare the peripheral T cell repertoires in the first week of life versus adult. This last approach would allow them to draw the most impactful conclusion. 

      We appreciate these suggestions.  Sadly, it is beyond our budget for the current manuscript and beyond the scope of our current study that we believe provides interesting new information.

      (2) Fig S2D shows TRBJ1-4 in black lettering meant to indicate no significant difference whereas the figure shows use of this gene segment to be elevated in adult. I believe TRBJ1-4 should be in blue lettering.

      This is now coloured correctly.

      (3) The figure call out on p11 (Fig5I-J) should be H-I.

      This is now corrected.

      (4) Please indicate in the main text that Jaccard analysis in Fig 5 D-E is for TCRB.

      This is now corrected.

      (5) The analysis of usage of TCRB CDR1xCDR2 combinations in Fig6D is said to "reflect the bias observed in their TRBV gene usage (Fig 2C)". Isn't it the case that every TRBV gene presents a distinct CDR1xCDR2 combination, meaning that there is no difference between TRBV usage and TRBV CDR1xCDR2 usage? If so, please make this clearer.

      Yes, this is the case, we have made this clearer in the text.

      Reviewer #3 (Recommendations For The Authors): 

      In general, although there is lots of interesting analyses that can be done with these large datasets, I feel as though the authors did not fully interpret the real meaning and significance of many of these results. Whilst there were some speculation on why a foetal repertoire might be different to those of adults in the discussion sections, the rationale for each individual analyses was not clearly explained. I would suggest that the rationale and a thorough explanation of each analyses be added to the results section, including a finishing sentence on what it means. 

      We have added short summaries to each results section to make the points we are making clearer.

      The authors did not mention how many cells were sorted for from each thymus for sequencing. Was the cell number normalised between each population? As this might have an influence on various downstream measurements of diversity, evenness and clonality, if there is a sampling issue. 

      This is explained in the methods.  We used sampling to allow comparisons between repertoires of different sizes, and this is also explained in the methods.

      The authors should include the cell sorting profiles and example flow cytometry plots, including gating strategies and the post sort purity of each sorted population. 

      We have included sorting strategies in the methods (SFig7 and SFig8).

      I think the manuscript could also be improved if there were some basic characterisation of foetal vs. adult thymus development. How many thymocytes are in a foetal vs adult thymus at the timepoints chosen? 

      I think there were some interesting findings in this paper. Given that overall, the foetal thymus appeared to be less diverse than that of the adult, one question I thought would be interesting to discuss was the overlap between the two repertoires. Is the foetal thymus simply a sub-fraction of the adult repertoire or is it totally distinct with no overlapping sequences? 

      Our analyses indicate that the repertoires are actually different. This is evident in Fig4 and in PCA loading plots shown in Fig, 3C and new Fig. 7C, D, I and J.

      I think that some of the interpretation in the results section may be a bit vague. "When we compaired by thymocyte population, each adult population clustered together, with adult SP4 separating from adult SP8 on PC2 and DP cells scoring in between, suggesting that PC2 might correspond to MHC restriction of the adult populations." - whilst I think I know what the authors mean, I do believe that this could be explained in clearer detail and more explicit. SP4 and SP8 are known to be positively selected in the thymus on distinct MHC class I and MHC class II molecules for example. 

      We have tried to clarify the text describing that PCA and additionally added a new Figure (new Fig. &) to compare the influence of MHC-restriction on the TCR repertoire in foetal and adult thymus.

      In the methods section, the age and sex of mice used were not explained at all. What was used in the experiment? Are there any sex differences? 

      Age and sex of mice is given in the methods.  We have not detected sex differences.

      This is a huge omission from the manuscript. In general, I don't believe the methods section has described the analysis in sufficient detail for replication. All analysis code and data should be publicly accessible and be in a format that allows for the reader to replicate the figures in the paper upon running the code. Perhaps even allowing them to run their own TCR datasets.  Overall, I think the manuscript needs some rewriting to include additional details and deeper interpretation of each individual analyses. 

      Sequencing data files will be made publicly available on UCL Research Data Repository.

    Annotators

    1. Good handwriting, for example, presupposesa gymnastics - a whole routine whose rigorous code invests the bodyin its entirety, from the points of the feet to the tip of the indexfinge

      requiring of the whole body

    2. At the opposite pole one might imagine placing thebody of the condemned man; he, too, has his legal status; he givesrise to his own ceremonial and he calls forth a whole theoreticaldiscourse, not in order to ground the 'surplus power' possessed bythe person of the sovereign, but in order to code the 'lack of power'with which those subjected to punishment are marked.

      body of criminal is a double body- they are their own body but also a legal entity representing a limitation as a form of punishment

    1. Author response:

      Reviewer #1 (Public review):

      Summary:

      The paper by Tolossa et al. presents classification studies that aim to predict the anatomical location of a neuron from the statistics of its in-vivo firing pattern. They study two types of statistics (ISI distribution, PSTH) and try to predict the location at different resolutions (region, subregion, cortical layer).

      Strengths:

      This paper provides a systematic quantification of the single-neuron firing vs location relationship.

      The quality of the classification setup seems high.

      The paper uncovers that, at the single neuron level, the firing pattern of a neuron carries some information on the neuron's anatomical location, although the predictive accuracy is not high enough to rely on this relationship in most cases.

      Thank you for your thoughtful feedback. The level of predictive accuracy offered by our current approach, while far above chance, is insufficient for electrode localization in most cases. Although, we speculate that our results represent a lower limit on possible performance—future improvements are almost certain as larger datasets are generated, more diverse features of neural activity are employed, and more advanced ML tools are implemented. We note that the current performance indicates a far more reliable embedding of anatomy in spiking than precedented by the modest statistical significance previously described in the literature. It would have been impossible to achieve this without the tremendous resources provided by the Allen Institute. In our revision, we will clarify that major performance improvements are both possible and probable.

      Weaknesses:

      As the authors mention in the Discussion, it is not clear whether the observed differences in firing are epiphenomenal. If the anatomical location information is useful to the neuron, to what extent can this be inferred from the vicinity of the synaptic site, based on the neurotransmitter and neuromodulator identities? Why would the neuron need to dynamically update its prediction of the anatomical location of its pre-synaptic partner based on activity when that location is static, and if that information is genetically encoded in synaptic proteins, etc (e.g., the type of the synaptic site)? Note that the neuron does not need to classify all possible locations to guess the location of its pre-synaptic partner because it may only receive input from a subset of locations.  If an argument on activity-based estimation being more advantageous to the neuron than synaptic site-based estimation cannot be made, I believe limiting the scope of the paper (e.g., in the Introduction) to an epiphenomenal observation and its quantification will improve the scientific quality.

      Summarily, in response to the two reviewers, we will minimize our discussion of this question in the revision. However, given that our results are either epiphenomenal or functional, we feel that it is important to indicate these possibilities, even if this indication is succinct and conservative.

      In pursuit of a more concise revision, we will not expand our discussion to accommodate this interesting conversation with the reviewer, but we are excited to briefly offer our perspective here.

      Regarding the epiphenomenal nature of our observations: this is a complex question that would be challenging but not impossible to validate experimentally. It has been previously established that neurons, especially those that integrate inputs from a variety of regions and are involved in diverse functions, could benefit from mechanisms for dynamically parsing inputs (Gutig, Sompolinsky 2006). Neurotransmitter and neuromodulator identities may indeed convey some information about presynaptic neuron location (e.g., NE may originate from the locus coeruleus). However, hypothetically, the binding of a neurotransmitter only bears on the postsynaptic neuron via ionic current, or second messenger activity. Postsynaptic neurons do not consume or otherwise endocytose the neurotransmitter, thus the ability of a neuron to “know” the presynaptic identity is a function of induced postsynaptic activity. Certainly, there are multiple streams of information that can provide insight into anatomical location all taking the ultimate form of neural activity and membrane dynamics. This would be broadly consistent with (for example) reward prediction error which is evident in dopamine release, firing rates, spiking patterns, and oscillatory rhythms.

      We could imagine a possible role for the embedding of location in spiking patterns. It is important to note that many neurons in neighboring areas share common neurotransmitters (e.g., glutamate, GABA). Neurons receiving input from multiple regions with similar neurotransmitter profiles could benefit from additional information in the spiking patterns for distinguishing input sources, especially for multimodal integration. For instance, an inferior parietal lobule neuron or microcircuit could be downstream from both auditory cortex (listening) and Broca’s area (speaking). Imagine an individual is in a crowded coffee shop waiting for their drink order to be called while speaking to their friend. In this scenario, it may be important to recognize region-specific activity and thus selectively attend to it. Thus, it is unlikely that neurons actively update a “location prediction,” but rather that location-related information is passively embedded in spike patterning and this might be dynamically leveraged in computation. We emphasize that this is a simplified conceptual example and not a hypothesis that we test in the paper. This conversation, however, is a wonderful example of the thought experiments that we hope will grow from this type of work.

      Reviewer #2 (Public review):

      Summary:

      In this manuscript, Tolossa et al. analyze Inter-spike intervals from various freely available datasets from the Allen Institute and from a dataset from Steinmetz et al. They show that they can modestly decode between gross brain regions (Visual vs. Hippocampus vs. Thalamus), and modestly separate sub-areas within brain regions (DG vs. CA1 or various visual brain areas).

      Strengths:

      The paper is reasonably well written, and the definitions are quite well done. For example, the authors clearly explained transductive vs. inductive inference in their decoders. E.g., transductive learning allows the decoder to learn features from each animal, whereas inductive inference focuses on withheld animals and prioritizes the learning of generalizable features.

      Thank you!

      Weaknesses:

      However, even with some of these positive aspects, I still found the manuscript to be a laundry list of results, where some results are overly explained and not particularly compelling or interesting, whereas interesting results are not strongly described or emphasized. The overall problem is that the study is not cohesive, and the authors need to either come up with a tool or demonstrate a scientific finding. The current version attempts to split the middle and thus is not as impactful as it could be

      In our revision, we will endeavor to present our results in line with your suggestions. Thank you for the careful and thorough feedback that will improve the readability of our manuscript. We strove to be complete in establishing the logic leading to our ultimate finding—that a robust code for anatomical location can be extracted from single neuron spike trains, but not from more traditional descriptions of neural activity. Our detection of this code, albeit not perfect in performance, is, in most cases, both far above chance levels and is robust to animal identity and laboratory of origin. Our presentation of these results is cohesive in as much as we sequentially establish a series of results that build towards a concluding set of experiments. We start by establishing a baseline via standard measurements and then explore more challenging problems through more complex models that build toward our final test.  Based on your feedback, we will contract and expand elements of this sequence.

      While our findings raise the possibility of developing a computational tool for electrode localization, pending additional features and/or datasets, our current focus is on establishing the neurobiological principle of anatomical embedding in spike trains. The purpose of briefly mentioning a possible application is that we hope to encourage those engaged in machine-learning on multi-modal neural data that this problem is tractable, yet still open. Based on your feedback, we will clarify that the focus of our current work is not an introduction of a new tool.

    1. Reviewer #2 (Public review):

      Summary:

      The authors present an interesting paper where they test the antagonistic pleiotropy theory. Based on this theory they hypothesize that genetic variants associated with later onset of age at menarche and age at first birth have a positive causal effect on a multitude of health outcomes later in life, such as epigenetic aging and prevalence of chronic diseases. Using a mendelian randomization and colocalization approach, the authors show that SNPs associated with later age at menarche are associated with delayed aging measurements, such as slower epigenetic aging and reduced facial aging, and a lower risk of chronic diseases, such as type 2 diabetes and hypertension. Moreover, they identified 128 fertility-related SNPs that are associated with age-related outcomes and they identified BMI as a mediating factor for disease risk, discussing this finding in the context of evolutionary theory.

      Strengths:

      The major strength of this manuscript is that it addresses the antagonistic pleiotropy theory in aging. Aging theories are not frequently empirically tested although this is highly necessary. The work is therefore relevant for the aging field as well as beyond this field, as the antagonistic pleiotropy theory addresses the link between fitness (early life health and reproduction) and aging.

      Points that have to be clarified/addressed:

      (1) The antagonistic pleiotropy is an evolutionary theory pointing to the possibility that mutations that are beneficial for fitness (early life health and reproduction) may be detrimental later in life. As it concerns an evolutionary process and the authors focus on contemporary data from a single generation, more context is necessary on how this theory is accurately testable. For example, why and how much natural variation is there for fitness outcomes in humans? How do genetic risk score distributions of the exposure data look like? Also, how can the authors distinguish in their data between the antagonistic pleiotropy theory and the disposable soma theory, which considers a trade-off between investment in reproduction and somatic maintenance and can be used to derive similar hypotheses? There is just a very brief mention of the disposable soma theory in lines 196-198.

      (2) The antagonistic pleiotropy theory, used to derive the hypothesis, does not necessarily distinguish between male and female fitness. Would the authors expect that their results extrapolate to males as well? And can they test that?

      (3) There is no statistical analyses section providing the exact equations that are tested. Hence it's not clear how many tests were performed and if correction for multiple testing is necessary. It is also not clear what type of analyses have been done and why they have been done. For example in the section starting at line 47, Odds Ratios are presented, indicating that logistic regression analyses have been performed. As it's not clear how the outcomes are defined (genotype or phenotype, cross-sectional or longitudinal, etc.) it's also not clear why logistic regression analysis was used for the analyses.

      (4) Mendelian Randomization is an important part of the analyses done in the manuscript. It is not clear to what extent the MR assumptions are met, how the assumptions were tested, and if/what sensitivity analyses are performed; e.g. reverse MR, biological knowledge of the studied traits, etc. Can the authors explain to what extent the genetic instruments represent their targets (applicable expression/protein levels) well?

      (5) It is not clear what reference genome is used and if or what imputation panel is used. It is also not clear what QC steps are applied to the genotype data in order to construct the genetic instruments of MR.

      (6) A code availability statement is missing. It is understandable that data cannot always be shared, but code should be openly accessible.

    1. Les algorithmes dits fermés, qui sont desoutils classiques : l’intégralité des instructionsa été pensée et écrite par un être humaindans un code source. Ils brassent les seulesdonnées ad hoc préalablement déterminées,c’est-à-dire celles que le responsable del’algorithme a identifiées en amont, et que lapersonne a volontairement et ponctuellementfournies pour un usage précis (par exempleParcoursup, où la liste des donnéespersonnelles traitées par l’algorithme estétablie dans un texte réglementaire) ;
    1. Author response:

      The following is the authors’ response to the previous reviews.

      Public Reviews: 

      Reviewer #1 (Public Review): 

      Summary: 

      In this work, the authors examine the activity and function of D1 and D2 MSNs in dorsomedial striatum (DMS) during an interval timing task. In this task, animals must first nose poke into a cued port on the left or right; if not rewarded after 6 seconds, they must switch to the other port. Thus, this task requires animals to estimate if at least 6 seconds have passed after the first nose poke. After verifying that animals estimate the passage of 6 seconds, the authors examine striatal activity during this interval. They report that D1-MSNs tend to decrease activity, while D2MSNs increase activity, throughout this interval. They suggest that this activity follows a driftdiffusion model, in which activity increases (or decreases) to a threshold after which a decision is made. The authors next report that optogenetically inhibiting D1 or D2 MSNs, or pharmacologically blocking D1 and D2 receptors, increased the average wait time. This suggests that both D1 and D2 neurons contribute to the estimate of time, with a decrease in their activity corresponding to a decrease in the rate of 'drift' in their drift-diffusion model. Lastly, the authors examine MSN activity while pharmacologically inhibiting D1 or D2 receptors. The authors observe most recorded MSNs neurons decrease their activity over the interval, with the rate decreasing with D1/D2 receptor inhibition. 

      We appreciate the careful read by this reviewer. 

      Major strengths: 

      The study employs a wide range of techniques - including animal behavioral training, electrophysiology, optogenetic manipulation, pharmacological manipulations, and computational modeling. The question posed by the authors - how striatal activity contributes to interval timing - is of importance to the field and has been the focus of many studies and labs. This paper contributes to that line of work by investigating whether D1 and D2 neurons have similar activity patterns during the timed interval, as might be expected based on prior work based on striatal manipulations. However, the authors find that D1 and D2 neurons have distinct activity patterns. They then provide a decision-making model that is consistent with all results. The data within the paper is presented very clearly, and the authors have done a nice job presenting the data in a transparent manner (e.g., showing individual cells and animals). Overall, the manuscript is relatively easy to read and clear, with sufficient detail given in most places regarding the experimental paradigm or analyses used. 

      We are glad that our main points come clearly through.

      Major weaknesses: 

      One weakness to me is the impact of identifying whether D1 and D2 had similar or different activity patterns. Does observing increasing/decreasing activity in D2 versus D1, or different activity patterns in D1 and D2, support one model of interval timing over another, or does it further support a more specific idea of how DMS contributes to interval timing? 

      This is a great point - we were not clear.  We observe distinct patterns of D2 and D1-MSN activity, but that disrupting either D2-MSNs or D1-MSNs led to increased response time.  The model that this supports is that D2-MSNs and D1-MSN ensemble activity represents temporal evidence.  This is a very specific model that can be rigorously tested in future work.  We have now made this very clear in the abstract (Page 2). 

      “We found that D2-MSNs and D1-MSNs exhibited distinct dynamics over temporal intervals as quantified by principal component analyses and trial-by-trial generalized linear models. MSN recordings helped construct and constrain a fourparameter drift-diffusion computational model in which MSN ensemble activity represented the accumulation of temporal evidence. This model predicted that disrupting either D2-MSNs or D1-MSNs would increase interval timing response times and alter MSN firing. In line with this prediction, we found that optogenetic inhibition or pharmacological disruption of either D2-MSNs or D1-MSNs increased interval timing response times.”

      And in the results on Page 18:  

      “Because both D2-MSNs and D1-MSNs accumulate temporal evidence, disrupting either MSN type in the model changed the slope. The results were obtained by simultaneously decreasing the drift rate D (equivalent to lengthening the neurons’ integration time constant) and lowering the level of network noise 𝝈: D = 𝟎. 𝟏𝟐𝟗, 𝝈 = 𝟎. 𝟎𝟒𝟑 for D2-MSNs in Fig 4A (in red; changes in noise had to accompany changes in drift rate to preserve switch response time variance. See Methods); and 𝑫 = 𝟎. 𝟏𝟐𝟐, 𝝈 = 𝟎. 𝟎𝟒𝟑 for D1-MSNs in Fig 4B (in blue). The model predicted that disrupting either D2-MSNs or D1-MSNs would increase switch response times (Fig 4C and Fig 4D) and would shift MSN dynamics.” 

      And in the discussion (Page 30): 

      “Striatal MSNs are critical for temporal control of action (Emmons et al., 2017; Gouvea et al., 2015; Mello et al., 2015). Three broad models have been proposed for how striatal MSN ensembles represent time: 1) the striatal beat frequency model, in which MSNs encode temporal information based on neuronal synchrony (Matell and Meck, 2004); 2) the distributed coding model, in which time is represented by the state of the network (Paton and Buonomano, 2018); and 3) the DDM, in which neuronal activity monotonically drifts toward a threshold after which responses are initiated (Emmons et al., 2017; Simen et al., 2011; Wang et al., 2018). While our data do not formally resolve these possibilities, our results show that D2-MSNs and D1MSNs exhibit opposing changes in firing rate dynamics in PC1 over the interval. Past work by our group and others has demonstrated that PC1 dynamics can scale over multiple intervals to represent time (Emmons et al., 2020, 2017; Gouvea et al., 2015; Mello et al., 2015; Wang et al., 2018). We find that low-parameter DDMs account for interval timing behavior with both intact and disrupted striatal D2- and D1-MSNs. While other models can capture interval timing behavior and account for MSN neuronal activity, our model does so parsimoniously with relatively few parameters (Matell and Meck, 2004; Paton and Buonomano, 2018; Simen et al., 2011). We and others have shown previously that ramping activity scales to multiple intervals, and DDMs can be readily adapted by changing the drift rate (Emmons et al., 2017; Gouvea et al., 2015; Mello et al., 2015; Simen et al., 2011). Interestingly, decoding performance was high early in the interval; indeed, animals may have been focused on this initial interval (Balci and Gallistel, 2006) in making temporal comparisons and deciding whether to switch response nosepokes.”

      Regarding the reviewer’s specific question – it is not clear why D1-MSNs and D2-MSNs have opposing patterns of activity, as integration of temporal evidence can certainly be achieved increasing or decreasing firing rates alone. These patterns have been seen in motor control. Prefrontal neurons, which control striatal ramping, also ramp up and down. We have now included a paragraph on Page 30 explicitly discussing these ideas; however, future experiments will be required to investigate the source of the divergent patterns of activity among D2-MSNs and D1-MSNs.   

      “D2-MSNs and D1-MSNs play complementary roles in movement. For instance, stimulating D1-MSNs facilitates movement, whereas stimulating D2-MSNs impairs movement (Kravitz et al., 2010). Both populations have been shown to have complementary patterns of activity during movements with MSNs firing at different phases of action initiation and selection (Tecuapetla et al., 2016). Further dissection of action selection programs reveals that opposing patterns of activation among D2MSNs and D1-MSNs suppress and guide actions, respectively, in the dorsolateral striatum (Cruz et al., 2022). A particular advantage of interval timing is that it captures a cognitive behavior within a single dimension — time. When projected along the temporal dimension, it was surprising that D2-MSNs and D1-MSNs had opposing patterns of activity. Ramping activity in the prefrontal cortex can increase or decrease; and prefrontal neurons project to and control striatal ramping activity (Emmons et al., 2020, 2017; Wang et al., 2018).  It is possible that differences in D2MSNs and D1-MSNs reflect differences in cortical ramping, which may themselves reflect more complex integrative or accumulatory processes. Further experiments are required to investigate these differences. Past pharmacological work from our group and others has shown that disrupting D2- or D1-MSNs slows timing (De Corte et al., 2019b; Drew et al., 2007, 2003; Stutt et al., 2024) and are in agreement with pharmacological and optogenetic results in this manuscript. Computational modeling predicted that disrupting either D2-MSNs or D1-MSNs increased selfreported estimates of time, which was supported by both optogenetic and pharmacological experiments.”

      I found the results presented in Figures 2 and 3 to be a little confusing or misleading. In Figure 2, the authors appear to claim that D1 neurons decrease their activity over the time interval while D2 neurons increase activity. The authors use this result to suggest that D1/D2 activity patterns are different. In Figure 3, a different analysis is done, and this time D2 neurons do not significantly increase their activity with time, conflicting with Figure 2. While in both figures, there is a significant difference between the mean slopes across the population, the secondary effect of positive/negative slope for D2/D1 neurons changes. I find this especially confusing as the authors refer back to the positive/negative slope for D2/D1 neurons result throughout the rest of the text.  

      We were not clear.  First, we attempted to quantify these differences based on PCA and slope.  We have rephrased our characterization of these differences by changing text on (Page 9) to: 

      “These PETHs revealed that for the 6-second interval immediately after trial start, many putative D2-MSN neurons appeared to ramp up while many putative D1-MSNs appeared to ramp down. For 32 putative D2-MSNs average PETH activity increased over the 6-second interval immediately after trial start, whereas for 41 putative D1-MSNs, average PETH activity decreased. Accordingly, D2-MSNs and D1-MSNs had differences in activity early in the interval (0-5 seconds; F = 4.5, p = 0.04 accounting for variance between mice) but not late in the interval (5-6 seconds; F = 1.9, p = 0.17 accounting for variance between mice). Examination of a longer interval of 10 seconds before to 18 seconds after trial start revealed the greatest separation in D2-MSN and D1-MSN dynamics during the 6-second interval after trial start (Fig S2). Strikingly, these data suggest that D2-MSNs and D1-MSNs might display distinct dynamics during interval timing.” 

      We have rephrased our discussion on PCA to quantify differences in Fig 2G-H using data-driven methods (Page 12): 

      “To quantify differences between D2-MSNs vs D1-MSNs in Fig 2G-H, we turned to principal component analysis (PCA), a data-driven tool to capture the diversity of neuronal activity (Kim et al., 2017a). Work by our group and others has uniformly identified PC1 as a linear component among corticostriatal neuronal ensembles during interval timing (Bruce et al., 2021; Emmons et al., 2020, 2019, 2017; Kim et al., 2017a; Narayanan et al., 2013; Narayanan and Laubach, 2009; Parker et al., 2014; Wang et al., 2018). We analyzed PCA calculated from all D2-MSN and D1MSN PETHs over the 6-second interval immediately after trial start. PCA identified time-dependent ramping activity as PC1 (Fig 3A), a key temporal signal that explained 54% of variance among tagged MSNs (Fig 3B; variance for PC1 p = 0.009 vs 46 (44-49)% for any pattern of PC1 variance derived from random data; Narayanan, 2016). Consistent with population averages from Fig 2G&H, D2-MSNs and D1-MSNs had opposite patterns of activity with negative PC1 scores for D2MSNs and positive PC1 scores for D1-MSNs (Fig 3C; PC1 for D2-MSNs: -3.4 (-4.6 – 2.5); PC1 for D1-MSNs: 2.8 (-2.8 – 4.9); F = 8.8, p = 0.004 accounting for variance between mice (Fig S3A); Cohen’s d = 0.7; power = 0.80; no reliable effect of sex (F = 0.44, p = 0.51) or switching direction (F = 1.73, p = 0.19)).”

      And finally, we directly investigate the heart of the reviewer’s question by explicitly comparing PC1 scores – a data-driven analysis of neuronal patterns that explain the least variance – and show that they are less than 0 for D2-MSNs (i.e., negatively correlated with a down-ramping pattern, or ramping up), and greater than 0 for D1MSNs (i.e., positively correlated with an up-ramping pattern): 

      “Importantly, PC1 scores for D2-MSNs were significantly less than 0 (signrank D2MSN PC1 scores vs 0: p = 0.02), implying that because PC1 ramps down, D2-MSNs tended to ramp up. Conversely, PC1 scores for D1-MSNs were significantly greater than 0 (signrank D1-MSN PC1 scores vs 0: p = 0.05), implying that D1-MSNs tended to ramp down.  Thus, analysis of PC1 in Fig 3A-C suggested that D2-MSNs (Fig 2G) and D1-MSNs (Fig 2H) had opposing ramping dynamics.”

      We interpret these data on Page 16: 

      “Our analysis of average activity (Fig 2G-H) and PC1 (Fig 3A-C) suggested that D2MSNs and D1-MSNs might have opposing dynamics. However, past computational models of interval timing have relied on drift-diffusion dynamics that increases over the interval and accumulates evidence over time (Nguyen et al., 2020; Simen et al., 2011).”

      The reviewer mentions our analysis of ‘mean slopes across the population’ -which we clarify as trial-by-trial slope analysis, which is distinct from the population averages in 2G-H and 3A-C.  We have now made this clear (Page 12). 

      “To interrogate these dynamics at a trial-by-trial level, we calculated the linear slope of D2-MSN and D1-MSN activity over the first 6 seconds of each trial using generalized linear modeling (GLM) of effects of time in the interval vs trial-by-trial firing rate (Latimer et al., 2015).  Note that this analysis focuses on each trial rather than population averages in Fig 2G-H and Fig 3A-C.”

      Finally, as the reviewer suggests, we have removed the term ‘slope’ from the rest of the paper, as the increasing/decreasing comes from averages and analyses of PC1.  We have removed all discussion of ‘opposing’ slope or ‘increasing/decreasing’ slope. 

      It is a bit unclear to me how the authors chose the parameters for the model, and how well the model explains behavior is quantified. It seems that the authors didn't perform cross-validation across trials (i.e., they chose parameters that explained behavior across all trials combined, rather than choosing parameters from a subset of trials and determining whether those parameters are robust enough to explain behavior on held-out trials). I think this would increase the robustness of the result. 

      In addition, it remains a bit unclear to me how the authors changed the specific parameters they did to model the optogenetic manipulation. It seems these parameters were chosen because they fit the manipulation data. This makes me wonder if this model is flexible enough that there is almost always a set of parameters that would explain any experimental result; in other words, I'm not sure this model has high explanatory power. 

      We are glad the reviewer raised these points.  First, we have now included a complete exploration of the parameter space, exactly as the reviewer recommends.  These are described in the methods (Page 41): 

      “Selection of DDMs parameters. Our goal was to build DDMs with dynamics that produce “response times” according to the observed distribution of mice switch times. The selection of parameter values in Fig 4 was done in three steps. First, we fit the distribution of the mice behavioral data with a Gamma distribution and found its fitting values for shape 𝜶𝑴 and rate 𝜷𝑴 (Table S2 and Fig S8; R2 Data vs Gamma ≥ 𝟎. 𝟗𝟒). We recognized that the mean 𝝁𝑴 and the coefficient of variation 𝑪𝑽𝑴 are directly related to the shape and rate of the Gamma distribution by formulas 𝝁𝑴 \= 𝜶𝑴/𝜷𝑴 and 𝑪𝑽𝑴 \= 𝟏/√𝜶𝑴.  Next, we fixed parameters 𝑭 and 𝒃 in DDM (e.g., for D2-MSNs: 𝑭 = 𝟏, 𝒃 = 𝟎. 𝟓𝟐) and simulated the DDM for a range of values for 𝑫 and 𝝈. For each pair (𝑫, 𝝈), one computational “experiment” generated 500 response times with mean 𝝁 and coefficient of variation 𝑪𝑽. We repeated the “experiment” 10 times and took the group median of 𝝁 and 𝑪𝑽 to obtain the simulation-based statistical measures 𝝁𝑺 and 𝑪𝑽𝑺. Last, we plotted 𝑬𝝁 \= |(𝝁𝑺 − 𝝁𝑴)/𝝁𝑴| and 𝑬𝒄𝒗 \= |𝑪𝑽𝑺 − 𝑪𝑽𝑴|, the respective relative error and the absolute error to data (Fig S7). We considered that parameter values (𝑫, 𝝈) provided a good DDM fit of mice behavioral data whenever  𝑬𝝁 ≤ 𝟎. 𝟎𝟓    and 𝑬𝒄𝒗

      And included a new Fig S7 which shows the parameter space: 

      These new data clearly comment on the parameter space of our model. 

      Finally, the reviewer mentions cross-validation.  We did this at length on our model and data fits.  We used 10-fold cross-validation as fitlm needs enough data for the individual fits.  We found that the fit was extremely stable – i.e, we ended up with standard deviations in R2<0.004 for all comparisons.  Thus, we added the following point to the methods on Page 41:  

      “10-fold cross-validation revealed highly stable fits between gamma, models and data.”

      Lastly, the results are based on a relatively small dataset (tens of cells). 

      This is an important point.  Although it is a small optogenetically-tagged dataset, we have adequate statistical power and large effect sizes, which we now detail in the text on Page 12:

      “Consistent with population averages from Fig 2G&H, D2-MSNs and D1-MSNs had opposite patterns of activity with negative PC1 scores for D2-MSNs and positive PC1 scores for D1-MSNs (Fig 3C; PC1 for D2-MSNs: -3.4 (-4.6 – 2.5); PC1 for D1MSNs: 2.8 (-2.8 – 4.9); F = 8.8, p = 0.004 accounting for variance between mice (Fig S3A); Cohen’s d = 0.7; power = 0.80; no reliable effect of sex (F = 0.44, p = 0.51) or switching direction (F = 1.73, p = 0.19)).”

      And:  

      “GLM analysis also demonstrated that D2-MSNs had significantly different slopes (0.01 spikes/second (-0.10 – 0.10)), which were distinct from D1-MSNs (-0.20 (-0.47– 0.06; Fig 3D; F = 8.9, p = 0.004 accounting for variance between mice (Fig S3B); Cohen’s d = 0.8; power = 0.98; no reliable effect of sex (F = 0.02, p = 0.88) or switching direction (F = 1.72, p = 0.19)).”

      And we have included the reviewers point as a limitation on Page 33:  

      “Second, although we had adequate statistical power and medium-to-large effect sizes, optogenetic tagging is low-yield, and it is possible that recording more of these neurons would afford greater opportunity to identify more robust results and alternative coding schemes, such as neuronal synchrony.”

      Impact: 

      The task and data presented by the authors are very intriguing, and there are many groups interested in how striatal activity contributes to the neural perception of time. The authors perform a wide variety of experiments and analysis to examine how DMS activity influences time perception during an interval-timing task, allowing for insight into this process. However, the significance of the key finding -- that D1 and D2 activity is distinct across time -- remains somewhat ambiguous to me. 

      Again, we are glad that the reviewer appreciated our main point, and we very much appreciate the additional points about interpretation, model parameters, and statistical power. If there is any way we can clarify the text further we are happy to do so.  

      Reviewer #2 (Public Review):  

      (1) Regarding the results in Figure 2 and Figure 5: for the heatmaps in Fig.2F and Fig.2E, the overall activity pattern of D1 and D2 MSNs looks very similar, both D1 and D2 MSNs contains neurons showing decreasing or increasing activity during interval timing. And the optogenetic and pharmacologic inhibition of either D1 or D2 MSNs resulted in similar behavior outcomes. To me, the D1 and D2 MSN activities were more complementary than opposing. 

      This is a great point. In our last revision, R3 suggested that complementary means opposing – and suggested we change the title to reflect this.  Our original title was ‘Complementary cognitive roles for D2-MSNs and D1-MSNs during interval timing’ – and we have changed the title back to this. We have clarified what we meant by complementary in the abstract (Page 2):

      “Together, our findings demonstrate that D2-MSNs and D1-MSNs had opposing dynamics yet played complementary cognitive roles, implying that striatal direct and indirect pathways work together to shape temporal control of action.”

      And on Page 30: 

      “These data, when combined with our model predictions, demonstrate that despite opposing dynamics,  D2-MSNs and D1-MSN contribute complementary temporal evidence to controlling actions in time.”

      If the authors want to emphasize the opposing side of D1 and D2 MSNs, then the manipulation experiments need to be re-designed, since the average activity of D2 MSNs increased, while D1 MSNs decreased during interval timing, instead of using inhibitory manipulations in both pathways, the authors should use inhibitory manipulation in D2-MSNs, while using optogenetic or pharmacology to activate D1-MSNs. In this way, the authors can demonstrate the opposing role of D1 and D2 MSNs and the functions of increased activity in D2-MSNs and decreased activity in D1-MSNs. 

      These are great ideas, which we agree with.  We would like to emphasize the complementary nature as noted in our original title, and not the opposing side of D1/D2 MSNs. The experiments proposed by reviewer are certainly worth doing, but would likely be quite complex to find the right stimulation parameters to affect timing without affecting movement – and we have now included them as an important limitation / future direction (Page 33):

      “Fifth, we did not deliver stimulation to the striatum because our pilot experiments triggered movement artifacts or task-specific dyskinesias (Kravitz et al., 2010). Future stimulation approaches carefully titrated to striatal physiology may affect interval timing without affecting movement.”

      (2) Regarding the results in Figure 3 C and D, Figure 6 H and Figure 7 D, what is the sample size? From the single data points in the figures, it seems that the authors were using the number of cells to do statistical tests and plot the figures. For example, Figure 3 C, if the authors use n= 32 D2 MSNs and n= 41D1 MSNs to do the statistical test, it could make a small difference to be statistically significant. The authors should use the number of mice to do the statistical tests. 

      These are important points that were discussed at length in the prior review.  First, for the sample size, we now have detailed in our Table 1: 

      Second, we have detailed our statistical approach which explicitly deals with repeated observations of neurons across mice (Page 43):

      “Statistics. All data and statistical approaches were reviewed by the Biostatistics, Epidemiology, and Research Design Core (BERD) at the Institute for Clinical and Translational Sciences (ICTS) at the University of Iowa. All code and data are made available at http://narayanan.lab.uiowa.edu/article/datasets. We used the median to measure central tendency and the interquartile range to measure spread. We used Wilcoxon nonparametric tests to compare behavior between experimental conditions and Cohen’s d to calculate effect size. Analyses of putative single-unit activity and basic physiological properties were carried out using custom routines for MATLAB. For all neuronal analyses, variability between animals was accounted for using generalized linear-mixed effects models and incorporating a random effect for each mouse into the model, which allows us to account for inherent betweenmouse variability. We used fitglme in MATLAB and verified main effects using lmer in R. We accounted for variability between MSNs in pharmacological datasets in which we could match MSNs between saline, D2 blockade, and D1 blockade. P values < 0.05 were interpreted as significant.”   

      We have formally reviewed this approach with professional biostatisticians at the University of Iowa.

      Finally, we note that we do have adequate statistical power for analysis of Fig 3C and D:  we have adequate statistical power and large effect sizes, which we now detail in the text on Page 12:

      “Consistent with population averages from Fig 2G&H, D2-MSNs and D1-MSNs had opposite patterns of activity with negative PC1 scores for D2-MSNs and positive PC1 scores for D1-MSNs (Fig 3C; PC1 for D2-MSNs: -3.4 (-4.6 – 2.5); PC1 for D1MSNs: 2.8 (-2.8 – 4.9); F = 8.8, p = 0.004 accounting for variance between mice (Fig S3A); Cohen’s d = 0.7; power = 0.80; no reliable effect of sex (F = 0.44, p = 0.51) or switching direction (F = 1.73, p = 0.19)).”

      And, on Page 12:  

      “GLM analysis also demonstrated that D2-MSNs had significantly different slopes (0.01 spikes/second (-0.10 – 0.10)), which were distinct from D1-MSNs (-0.20 (-0.47– 0.06; Fig 3D; F = 8.9, p = 0.004 accounting for variance between mice (Fig S3B); Cohen’s d = 0.8; power = 0.98; no reliable effect of sex (F = 0.02, p = 0.88) or switching direction (F = 1.72, p = 0.19)).”

      And we have included the reviewers point as a limitation on Page 33: 

      “Second, although we had adequate statistical power and medium-to-large effect sizes, optogenetic tagging is low-yield, and it is possible that recording more of these neurons would afford greater opportunity to identify more robust results and alternative coding schemes, such as neuronal synchrony.”

      (3) Regarding the results in Figure 5, wly at is the reason for the increase in the response times? The authors should plot the position track during intervals (0-6 s) with or without optogenetic or pharmacologic inhibition. The authors can check Figures 3, 5, and 6 in the paper https://doi.org/10.1016/j.cell.2016.06.032 for reference to analyze the data. 

      These are key points, and we are glad the reviewer raised them.  Our interpretation is that response time increases – without reliable changes in other task-specific movements such as nosepoke reaction time or traversal time (Fig S9).  This was lacking in our prior manuscript, and we are glad the reviewer raised it.  We have now added this to Page 30

      “Our interpretation is that because the activity of D2-MSN and D1-MSN ensembles represents the accumulation evidence, pharmacological/optogenetic disruption of D2-MSN/D1-MSN activity slows this accumulation process, leading to slower interval timing-response times (Fig 5) without changing other task-specific movements (Fig S9).  These results provide new insight into how opposing patterns of striatal MSN activity control behavior in similar ways and show that they play a complementary role in elementary cognitive operations.”

      Regarding the tracking of velocity, we unfortunately do not have this information reliably across all conditions. This citation is a beautiful landmark paper, and we are working on collecting this information in our new datasets going forward.  We have included this as a major limitation (Page 34): 

      “Still, future work combining motion tracking/accelerometry with neuronal ensemble recording and optogenetics and including bisection tasks may further unravel timing vs. movement in MSN dynamics (Robbe, 2023; Tecuapetla et al., 2016).”

      Once again, we are appreciative of the thoughtful points raised by this reviewer.  

      Reviewer #3 (Public Review): 

      Summary: 

      The cognitive striatum, also known as the dorsomedial striatum, receives input from brain regions involved in high-level cognition and plays a crucial role in processing cognitive information. However, despite its importance, the extent to which different projection pathways of the striatum contribute to this information processing remains unclear. In this paper, Bruce et al. conducted a study using various causal and correlational techniques to investigate how these pathways collectively contribute to interval timing in mice. Their results were consistent with previous research, showing that the direct and indirect striatal pathways perform opposing roles in processing elapsed time. Based on their findings, the authors proposed a revised computational model in which two separate accumulators track evidence for elapsed time in opposing directions. These results have significant implications for understanding the neural mechanisms underlying cognitive impairment in neurological and psychiatric disorders, as disruptions in the balance between direct and indirect pathway activity are commonly observed in such conditions. 

      Strengths: 

      The authors employed a well-established approach to study interval timing and employed optogenetic tagging to observe the behavior of specific cell types in the striatum. Additionally, the authors utilized two complementary techniques to assess the impact of manipulating the activity of these pathways on behavior. Finally, the authors utilized their experimental findings to enhance the theoretical comprehension of interval timing using a computational model. 

      We very much appreciate the considered read and comments by the reviewer, and recognition of the breadth of techniques in this manuscript. 

      Weaknesses: 

      The behavioral task used in this study is best suited for investigating elapsed time perception, rather than interval timing. Timing bisection tasks are often employed to study interval timing in humans and animals. In the optogenetic experiment, the laser was kept on for too long (18 seconds) at high power (12 mW). This has been shown to cause adverse effects on population activity (for example, through heating the tissue) that are not necessarily related to their function during the task epochs. Given the systemic delivery of pharmacological interventions, it is difficult to conclude that the effects are specific to the dorsomedial striatum. Future studies should use the local infusion of drugs into the dorsomedial striatum. 

      These are important points.  We agree with them completely and have now included responses to them.  First, bisection tasks certainly have advantages – we have justified our approach in the discussion (Page 32):

      “Our task version has been used extensively to study interval timing in mice and humans (Balci et al., 2008; Bruce et al., 2021; Stutt et al., 2024; Tosun et al., 2016; Weber et al., 2023). However, temporal bisection tasks, in which animals hold during a temporal cue and respond at different locations depending on cue length, have advantages in studying how animals time an interval because animals are not moving while estimating cue duration (Paton and Buonomano, 2018; Robbe, 2023; Soares et al., 2016). Our interval timing task version – in which mice switch between two response nosepokes to indicate their interval estimate has elapsed – has been used extensively in rodent models of neurodegenerative disease (Larson et al., 2022; Weber et al., 2024, 2023; Zhang et al., 2021), as well as in humans (Stutt et al., 2024). This version of interval timing involves motor timing, which engages executive function and has more translational relevance for human diseases than perceptual timing or bisection tasks (Brown, 2006; Farajzadeh and Sanayei, 2024; Nombela et al., 2016; Singh et al., 2021).  Furthermore, because many therapeutics targeting dopamine receptors are used clinically, these findings help describe how dopaminergic drugs might affect cognitive function and dysfunction. Future studies of D2-MSNs and D1-MSNs in temporal bisection and other timing tasks may further clarify the relative roles of D2- and D1-MSNs in interval timing and time estimation.”

      Second – we have included an explicit control that has the same laser that is on for the same epoch as in the experimental animal – and find no effects.  This is now detailed in the methods: (Page 37): 

      “To control for heating and nonspecific effects of optogenetics, we performed control experiments in mice without opsins using identical laser parameters in D2-cre or D1-cre mice (Fig S6).”

      And in the results (Page 21): 

      “To control for heating and nonspecific effects of optogenetics, we performed control experiments in D2-cre mice without opsins using identical laser parameters; we found no reliable effects for opsin-negative controls (Fig S6).”

      And on Page 21:

      “As with D2-MSNs, we found no reliable effects with opsin-negative controls in D1MSNs (Fig S6).”

      We have now detailed these results in Figure S6:

      Regarding focal pharmacology, we performed this experiment with focal infusion of D1/D2 antagonists in our prior work, which we have now cited (Page 4):

      “Similar behavioral effects were found with systemic (Stutt et al., 2024) or focal infusion of D2 or D1 antagonists locally within the dorsomedial striatum (De Corte et al., 2019a).”

      Comments on revised version: 

      Thank you for the comprehensive revisions. Most of my (addressable) concerns were addressed. The current version of your manuscript appears significantly improved. 

      Once again, we appreciate the reviewer’s constructive and insightful comments and careful review of our manuscript.  Their comments have been extremely helpful.

    1. truly generic problems -- i.e., conditions that might apply to any resource on the Web -- are usually better expressed as plain status codes. For example, a "write access disallowed" problem is probably unnecessary, since a 403 Forbidden status code in response to a PUT request is self-explanatory.

      If there is no information in response other than the error status code, there is not need to return a Problem Details body in it. The response code is enough.

    2. When "about:blank" is used, the title SHOULD be the same as the recommended HTTP status phrase for that code (e.g., "Not Found" for 404, and so on), although it MAY be localized to suit client preferences (expressed with the Accept-Language request header).

      "title" of a Problem Details object whose type is about:blank should be the same as the recommended phrase for the HTTP response status code with which the object is being returned.

    3. New problem type definitions MUST document:¶ a type URI (typically, with the "http" or "https" scheme)¶ a title that appropriately describes it (think short)¶ the HTTP status code for it to be used with

      The wording implies that a new problem type must be documented.

      Moreover, these are the three fields that must be documented whenever a new problem type (identified by its type URI) is defined.

      Moreover,

    1. 1 (100 points)1. Choose a piece of text from your tech courses (a paragraph from a report / an assignment forexample) that you have written here at CMU-Africa (guide: between 100 – 150 words max).2. Cite the title of the assignment, the course name with the course code. (E.g. Assignment 1;Data Inference & Applied Machine Learning; Course code: 18785-R)3. Copy and paste the original text on a new document – do not change anything.4. Below the original piece of text rewrite / revise by making it more concise, with shortersentences; remove unnecessary words, whose / those / commas / and’s etc.5. Annotate the revised text (highlight) and describe the changes / improvement that you havemade.6. No Gen AI is permitted for this task.Grading Criteria:• 50 points - your ability to make the text more concise (See number 4 above).• 30 points – your ability to describe the changes / improvement that you have made withannotations (See number 5).• 10 points – your ability to use accurate / precise language (Grammar / Vocabulary /Punctuation etc.).• 10 points – your ability to follow the instructions above (1 – 6).Extra Credits (25 points)1. AFTER you have done Task 1 above – use Gen AI (ChatGPT / Copilot etc.) to revise yourORIGINAL piece of writing, copy and paste the Gen AI output.2. Annotate / highlight the changes made by Gen AI and describe what are the changes /improvement that have been made.3. Comment on / evaluate the quality of the Gen AI output.4. Share a link of your Gen AI prompts (share the entire prompt dialogue).Grading Criteria:• 10 points – your ability to annotate and describe the changes / improvement that Gen AI hasmade.• 10 points – your ability to evaluate the quality of the Gen AI output (You can comment on theusefulness of Gen AI in improving your writing).• 5 points - follow instructions (1-4) above.

      Checking if annotation works

    Annotators

    1. Summary of Ravi Chugh's Talk on "Programming with Direct Manipulation":

      Motivation to make programming languages more interactive, human-friendly, and accessible:

      Quote: "This talk is about research efforts to make programming languages and tools more interactive, more human friendly, and hopefully more accessible and useful to a wide variety of people."

      Tension between programming and direct manipulation interfaces:

      Quote: "On one hand, we want and need the full expressive power of our fancy general purpose programming languages that are equipped for abstract symbolic reasoning; at the same time, we also want and need tangible interactive user interfaces for understanding and manipulating the concrete things we are making."

      Desire for systems that blend programming languages with direct manipulation UIs:

      Quote: "So naturally, what we would like are systems that blend programming languages and concrete direct manipulation user interfaces, allowing us to smoothly move back and forth between these different modes as needed."

      Introduction of the concept "Programming with Direct Manipulation":

      Quote: "I'll refer to these goals as programming with direct manipulation—that is, in addition to unrestricted text-based editing of source code in whatever our favorite language happens to be, we would like the ability to inspect and interact with and change the output, and have the system help suggest changes to the code based on these interactions."

      Historical context of interactive programming systems:

      Quote: "Similar visions for interactive programming systems to augment human creativity and intelligence can be traced all the way back to the 1960s, from the constraint-oriented Sketchpad system by Ivan Sutherland to the work on graphical user interfaces and interactive computing by Doug Engelbart, Alan Kay, and many, many others."

      Recent interest and efforts in the intersection of PL and HCI:

      Quote: "In the past decade or so, there's been renewed interest in these challenges which lie at the intersection of PL and HCI."

      Introduction of Sketch-n-Sketch prototype system:

      Quote: "In my group, we've been exploring a few ideas in a prototype system called Sketch-n-Sketch—for sketching partial programs in the program synthesis sense, and sketching or drawing objects in the GUI editor sense."

      Three main ideas explored in Sketch-n-Sketch:

      Programming by demonstration in a pure lambda calculus:

      Quote: "The first idea is to explore programming by demonstration techniques for building programs in a pure lambda calculus, rather than in a lower-level imperative language as in most PBD work."

      Streamlined structure editing of abstract syntax trees in a text editor:

      Quote: "The second idea is to explore how structure editing of an abstract syntax tree might be streamlined into an ordinary existing text editor, as opposed to being a completely separate editing paradigm."

      Incorporating bidirectional programming techniques:

      Quote: "The third idea explores how to incorporate bidirectional programming techniques so that relatively small changes to the output can be mapped back to changes in the program."

      Demo Part 1: Programming by Demonstration—Every interaction is codified as a program change:

      Quote: "The key takeaway from this first part of the demo is that every direct manipulation interaction in the output pane is codified as a change to the program in the left pane."

      Demo Part 2: Structure Editing—Combining text and structure edits with GUI overlays:

      Quote: "The key takeaway from the following demo is that, in addition to regular text edits on the concrete syntax of the program, the left pane also supports certain program transformations by hovering, selecting, and clicking on the abstract syntax tree."

      Demo Part 3: Bidirectional Programming—Mapping output changes back to the program:

      Quote: "In the third and final part of the demo, we'll talk about the bidirectional programming features in Sketch-n-Sketch that support such changes and compare to the previous examples where Sketch-n-Sketch was configured for SVG programming, the following example will show a program that generates a simple HTML page."

      Exploration of programming by demonstration techniques in functional programming languages:

      Quote: "In contrast, our goal in Sketch-n-Sketch so far has not been to build the ultimate visual graphics editor, but rather to explore whether GUI interactions can be represented as ordinary text-based programs, as a way to bridge rather than replace a full-featured programming language."

      Discussion on structure editing and the use of GUI elements to manipulate ASTs:

      Quote: "There are many aspects to consider, both in the user interface side as well as the semantics of the transformations."

      Integration of text and visual editing in programming environments:

      Quote: "All of these ideas help make progress toward the goal of integrating text and visual editing, but these user interfaces really only make edits at the leaves of the AST."

      Challenges in scaling up structure editing and program transformations:

      Quote: "It remains an open question how to scale up such edit languages to describe larger program transformations and refactorings in a way that preserves static and dynamic information across compiles and allows the editor to be extended with new and custom transformations."

      Importance of bidirectional programming in mapping output changes to code:

      Quote: "I think there's potential to develop this kind of bidirectional programming for a lot of practical settings."

      Potential application domains: Data science, web development, graphics design:

      Quote: "I think it's easy to imagine a workflow where programmers, designers, and end users with a variety of technical backgrounds and different permission levels can work together to suggest and commit changes in this kind of a bidirectional system."

      Advances in live programming interfaces and integrating program synthesis:

      Quote: "It's been great to see all these efforts to make synthesis techniques more usable, and we'll need a lot more of this work going forward."

      Discussion on the role of spreadsheets as live programming interfaces:

      Quote: "So although spreadsheets have always lacked many of the bread-and-butter features that we would expect in any real programming system, spreadsheets have proven extraordinarily flexible and useful, and especially with some of these recent language extensions, spreadsheets provide a lot of really compelling opportunities both for PL and user interface design."

      Bridging the gap between designers and developers in collaborative projects:

      Quote: "Here's one setting in which I'm personally interested in trying to bridge these gaps between the designer and developer, regardless of whether they are multiple different people or just an individual user."

      Conclusion emphasizing recurring themes and future challenges:

      Quote: "So that was a whirlwind tour of a bunch of ideas spanning PL and HCI that factor into this pursuit of more interactive programming systems that support direct manipulation."

      Summary of recurring themes:

      Quote: "One is, can we design every graphical user interface to be backed by text-based programs in a general programming language?"

      Encouraging collaboration and future work in PL plus HCI:

      Quote: "So if you're interested, if you're sympathetic to the cause, there are certainly missions out there for you."

      Acknowledgments and appreciation:

      Quote: "There are a lot of people I want to thank for encouraging me in this work."

    1. clarifier nos idées, les rendre non ambigües

      Oui. C'est ce qui me plaisait, au milieu des années 2000, quand je faisais un peu de code pour des sites web : l'expression univoque et sans ambiguïté du code (une simple virgule oubliée fait sauter tout le site), c'était tellement reposant par rapport à la philo. Mais la vraie question, ici, c'est la suivante : les progrès récents de l'IA amènent-ils à penser qu'il n'existe pas d'autre pensée que "non ambigüe et formelle". Hegel disait : "c'est dans le mot que nous pensons" (donc par la formalisation), j'ai trouvé ça juste et insuffisant à la fois (Bergson et l'ineffable).

    1. Altman was born on April 22, 1985, in Chicago, Illinois,[7][8] into a Jewish family,[9] and grew up in St. Louis, Missouri. His mother is a dermatologist, while his father was a real estate broker. Altman is the eldest of four siblings.[10] At the age of eight, he received his first computer, an Apple Macintosh, and began to learn how to code and take apart computer hardware.[11][12] He attended John Burroughs School, a private school in Ladue, Missouri.[13] In 2005, after two years at Stanford University studying computer science, he dropped out without earning a bachelor's degree.[14][15]

      I had my first "computer" (that wasnt an Atari or a ... ping pond padle kind of simpler than an Atari) much earlier, at least 4 years before .. but I had an older brother Seth, whose best friend and our near neighbor in Plantation was named "Andy."

      He's the first ... "Andy." (srne1)

      In any case he had "modified the code of some MMORP or "MUD" sort of text based choose your own adventure games to create one "like Spaceballs" of Star Wars; with "penis references" and Swchwartz nearly as big as ... Cough; Petreus, and nowhere near the size of a Big jMac.

      I also played a game called "Ali Baba and the Prince of Thieves" on the same Apple II or IIe; and can barely remember playing the GAIA game on "Geowindows" (which I think was an early IBM competitor to MSFT)

    1. https://web.archive.org/web/20241117122125/https://support.signal.org/hc/en-us/articles/6829998083994-Phone-Number-Privacy-and-Usernames-Deeper-Dive#:~:text=A%20username%20is%20a%20way,are%20chatting%20with%20in%20Signal

      Signal allows you to set usernames. They are unique but temporary (and you can have only 1 at a time). User names can be used to connect to you without sharing your phone number. Set an optional username in Settings Profile. They have two numbers at the end (you can set them).

      User names can be shared in three ways: - tell someone (and then change it so they cannot communicate it further) - share a QR code - share a unique URL (which does not contain your username in clear text)

      Signal can't 'easily' see which phone number has which username. But given a username it can find the associated phonenumber. 'easily' means it can be done though, and thus both ways.

      An old username will become available to others after a week, meaning imo they should not contain any identifiable or associative information.

      Found this through someone suggesting that sharing your Signal username through Mastodon would allow private msgs. Yes, but the world will know your username, so you're open to all people who might think it fun to msg you.

    1. As one scrolls through any social media platform, a funky alien-like instrumental akin to Morse code is bound to be heard.

      This is a great introduction sentence that relates to the playful tone you were going with in the writing before! I noticed that this is written in passive voice, and I wonder how it would sound if you changed this into active voice. For example, "As one scrolls through any social media platform, they are bound to hear a funky alien-like instrumental akin to Morse code."

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Comments:

      (1) We find it interesting that the reshaped model showed decreased firing rates of the projection neurons. We note that maximizing the entropy <-ln p(x)> with a regularizing term -\lambda <\sum _i f(x_i)>, which reflects the mean firing rate, results in \lambda _i = \lambda for all i in the Boltzmann distribution. In other words, in addition to the homeostatic effect of synaptic normalization which is shown in Figures 3B-D, setting all \lambda_i = 1 itself might have a homeostatic effect on the firing rates. It would be better if the contribution of these two homeostatic effects be separated. One suggestion is to verify the homeostatic effect of synaptic normalization by changing the value of \lambda.

      This is an interesting question and we, therefore, explored the effects of different values of $\lambda$ on the performance of unconstrained reshaped RP models and their firing rates. The new supp. Figure 2B presents the results of this exploration: We found that for models with a small set of projections, a high value of $\lambda$ results in better performance than models with low ones, while for models with a large set of projections we find the opposite relation. The mean firing rates of the projection neurons for models with different values of $\lambda$ show a clear trend, where higher $\lambda$ values results in lower mean firing rates.

      Thus, these results suggest an interplay between the optimal size of the projection set and the value of $\lambda$ one should pick. For the population sizes and projection sets we have used here, $\lambda=1$ is a good choice, but, for different population sizes or data sets a different value of $\lambda$ might be better.

      Thus, in addition to supp. Figure 2B, we therefore added the following to the main text:

      “An additional set of parameters that might affect the Reshaped RP models are the coefficients $\lambda$, that weigh each of the projections. Above, we used $\lambda=1$ for all projections, here we investigated the effect of the value of $\lambda$ on the performance of the Reshaped RP models (supp. Figure 2B). We find that for models with a small projection set, high $\lambda$ values result in better performance than models with low values. We find an opposite relation for models with large number projection sets. (We submit that the performance decrease of Reshaped RP models with high value of $\lambda$, as the number of projections grows, is a reflection of the non-convex nature of the Reshaped RP optimization problem).

      The mean firing rates of the projection neurons for models with different values of $\lambda$ show a clear trend, higher $\lambda$ values results in lower mean firing rates. Thus, we conclude that there is an interplay between the number of projections and the value of $\lambda$ we should pick. For the sizes of projection sets we have used here, $\lambda=1$ is a good choice, but, we note that in general, one should probably seek the appropriate value of $\lambda$ for different population sizes or data sets.”

      In addition, we explored the effect of synaptic normalization on models with different values of $\lambda$ (supp. Figure 3). We found that homeostatic Reshaped RP models are superior to the non-homeostatic Reshaped RP models: For low values of $\lambda$, the homeostatic and Reshaped RP models show similar performance in terms of log-likelihood, whereas the homeostatic models are more efficient. For high values of $\lambda_i$ homeostatic models are not only more efficient but also show better performance. These results indicate that the benefit of the homeostatic model is insensitive to the specific choice of $\lambda$.

      In addition to supp. Figure 3, we added the following to the main text:

      “Exploring the effect of synaptic normalization on models with different values of $\lambda$ (supp. Figure 3), we find that homeostatic Reshaped RP models are superior to the non-homeostatic Reshaped RP models: For low values of $\lambda$, the homeostatic and Reshaped RP models show similar performance in terms of log-likelihood, whereas the homeostatic models are more efficient. Importantly, for high values of $\lambda_i$ homeostatic models are not only more efficient but also show better performance. We conclude that the benefit of the homeostatic model is insensitive to the specific choice of $\lambda$.”

      (2) As far as we understand, \theta_i (thresholds of the neurons) are fixed to 1 in the article. Optimizing the neural threshold as well as synaptic weights is a natural procedure (both biologically and engineeringly), and can easily be computed by a similar expression to that of a_ij (equation 3). Do the results still hold when changing \theta _i is allowed as well? For example,

      a. If \theta _i becomes larger, the mean firing rates will decrease. Does the backprop model still have higher firing rates than the reshaped model when \theta _i are also optimized?

      b. Changing \theta _i affects the dynamic range of the projection neurons, thus could modify the effect of synaptic constraints. In particular, does it affect the performance of the bounded model (relative to the homeostatic input models)?

      We followed the referee’s suggestion, and extended our current analysis, and added threshold optimization to the Reshape and Backpropagation models, which is now shown in supp. Figure 2A.  Comparing the performance and properties of these models to ones with fixed thresholds, we found that this addition had a small effect on the performance of the models in terms of their likelihood. (supp. Figure 2A). We further find that backpropagation models with tuned thresholds show lower firing rates compared to backpropagation models with fixed threshold, while reshaped RP models with optimized thresholds show higher firing rates compared to models with fixed threshold. These differences are, again, rather small, and both versions of the reshaped RP models show lower firing rates compared to both versions of the backpropagation models.

      In addition to supp. Figure 2A, we added the following to the main text:

      “The projections' threshold $\theta_i$, which is analogous to the spiking threshold of the projection neurons, strongly affects the projections' firing rates. We asked how, in addition to reshaping the coefficients of each projection, we can also change $\theta_i$ to optimize the reshaped RP and backpropagation models.

      We find that this addition has a small effect on the performance of the models in terms of their likelihood (supp. Figure 2A).

      We also find that this has a small effect on the firing rates of the projection neurons: backpropagation models with tuned thresholds show lower firing rates compared to backpropagation models with fixed threshold, whereas reshaped RP models with optimized thresholds show higher firing rates compared to models with fixed threshold. Yet, both versions of the reshaped RP models show lower firing rates compared to both versions of the backpropagation models. Given the small effect of tuning threshold on models' performance and their internal properties, we will, henceforth, focus on Reshaped RP models with fixed thresholds.”

      (3) In Figure 1, the authors claim that the reshaped RP model outperforms the RP model. This improved performance might be partly because the reshaped RP model has more parameters to be optimized than the RP model. Indeed, let the number of projections N and the in-degree of the projections K, then the RP model and the reshaped RP model have N and KN parameters, respectively. Does the reshaped model still outperform the original one when only (randomly chosen) N weights (out of a_ij) are allowed to be optimized and the rest is fixed? (or, does it still outperform the original model with the same number of optimized parameters (i.e. N/K neurons)?)

      Indeed, the number of tuned parameters in the reshaped RP model is much larger compared to the number of tuned parameters in an RP model with the same projection set size. Yet, we submit that the larger number of tuned parameters is not the reason for the improved performance of the reshaped RP model: Maoz et al [30] have already shown that by optimizing an RP model with a small projection set using the pruning and replacement of projections (P&R), one can reach high accuracy with an almost order of magnitude fewer projections. Thus, we argue that the improved performance stems from the properties of the projections in the model.

      Accordingly, we therefore added supp. Figure 2B that shows the performance of P&R sigmoid RP model compared to RP and reshaped RP models. We added the following to the main text:

      “Because reshaping may change all the existing synapses of each projection, the number of parameters is the number of projections times the projections in-degree. While this is much larger than the number of parameters that we learn for the RP model (one for each projection), we suggest that the performance of the reshaped models is not a naive result of having more parameters. In particular, we have seen that RP models that use a small set of projections can be very accurate when the projections are optimized using the pruning and replacement process [30] (see also supp. Figure 1B). Thus, it is really the nature of the projections that shapes the performance. Indeed, our results here show that a small fixed connectivity projection set with weight tuning is enough for accurate performance which is on par or better than an RP model with more projections.”

      (4) In Figure 2, the authors have demonstrated that the homeostatic synaptic normalization outperforms the bounded model when the allowed synaptic cost is small. One possible hypothesis for explaining this fact is that the optimal solution lies in the region where only a small number of |a_ij| is large and the rest is near 0. If it is possible to verify this idea by, for example, exhibiting the distribution of a_ij after optimization, it would help the readers to better understand the mechanism behind the superiority of the homeostatic input model.

      We modified supp. Figure 4 and made the following change in the relevant part in the main text to address the reviewer comment about the distribution of the $a_{ij}$ values:

      “Figure 5E shows the mean rotation angle over 100 homeostatic models as a function of synaptic cost -- reflecting that the different forms of homeostatic regulation results in different reshaped projections. We show in Supp. Figure 4C the histogram of the rotation angles of several different homeostatic models, as well as the unconstrained Reshape model.

      Analyzing the distribution of the synaptic weights $a_{ij}$ after learning leads to a similar conclusion (supp. Figure 4D): The peak of the histograms is at $a_{ij} = 0$, implying that during reshaping most synapses are effectively pruned. While the distribution is broader for models with higher synaptic budget, it is asymmetric, showing local maxima at different values of $a_{ij}$.

      The diversity of solutions that the different model classes and parameters show imply a form of redundancy in model choice or learning procedure. This reflects a multiplicity of ways to learn or optimize such networks that biology could use to shape or tune neural population codes.“

      (5) In Figures 5D and 5E, the authors present how different reshaping constraints result in different learning processes ("rotation"). We find these results quite intriguing, but it would help the readers understand them if there is more explanation or interpretation. For example,

      a. In the "Reshape - Hom. circuit 4.0" plot (Fig 5D, upper-left), the rotation angle between the two models is almost always the same. This is reasonable since the Homeostatic Circuit model is the least constrained model and could be almost irrelevant to the optimization process. Is there any similar interpretation to the other 3 plots of Figure 5D?

      We added a short discussion of this difference to the main text, but do not have a geometric or other intuitive explanation for the nature of these differences.

      b. In Figure 5E, is there any intuitive explanation for why the three models take minimum rotation angle at similar global synaptic cost (~0.3)?

      We added discussion of this issue to the main text, and the histogram of the rotation angles in Supp Figure 4c shows that they are not identical. But, we don’t have an intuitive explanation for why the mean values are so similar.

      Recommendations for the authors:

      (1) Some claims on the effect of synaptic normalization on the reshaped model sound a little overstated since the presented evidence does not clearly show the improvement of the computational performance (in comparison to the vanilla reshaped model) in terms of maximizing the likelihood of the inputs. Here are some examples of such claims: "Incorporating more biological features and utilizing synaptic normalization in the learning process, results in even more efficient and accurate models." (in Abstract), "Thus, our new scalable, efficient, and highly accurate population code models are not only biologically-plausible but are actually optimized due to their biological features." (in Abstract), or "in our Reshaped RP models, homeostatic plasticity optimizes the performance of network models" (in Discussion).

      We changed the wording according to the reviewers’ suggestions.

      (2) In equation (1) and the following sentence, \theta _j (threshold) should be \theta _i.

      Fixed

      (3) While the authors mention that "reshaping with normalization or without it drives the projection neurons to converge to similar average firing rate values (Figure 3B)", they also claim that "reshaping with normalization implies lower firing rates as well as... (Figure 3E)". These two claims look a little inconsistent to us. Besides, it is not very clear from Figure 3E that the normalization decreases the firing rate (it is clear from Figure 3B, though). How about just deleting "lower firing rates as well as"?

      We changed the wording according to the reviewers’ suggestion.

      (4) The captions of Figures 4D and 4E should be exchanged.

      Fixed

      (5) Typo in In Figure 4F: "normalized in-dgreree".

      Fixed

      (6) In Figure 5D (upper left plot) the choice of "Reshape" and "Bounded3.0" looks a bit weird. Is this the typo of "Hom. cicruit 4.0"?

      There is no typo in the figure labels. We discussed the results of figure 5D in our response to point (5) in the public comments list and addressed the upper left panel of figure 5D in the main text.

      (7) In the paper, the letter \theta represents (1) the threshold of the projection neurons (eq. 1), (2) the "ceiling" value of the bounded model, and (3) the rotation angle of projections (Figure 5). We find this notation a bit confusing and recommend using different notations for different entities.

      Thanks for the suggestion, we changed the confusing notations: (1) The threshold of each projection neuron is still $\theta$, following the notation of the original RP model formulation [30]. (2) The notation of the “ceiling” value of the bounded model is now $\omega$. (3) The rotation angle of the projections during reshape is now marked by $\alpha$.

    1. Already, the listener is immersed in the cosmic story and sound.

      It could definitely be a style choice, but I think this sentence could be reworked since it is to brief and in a way disrupts the flow of your paragraph. Perhaps combining this idea with the previous sentence saying "the morse code like instrumental transition immerses listeners in cosmic story and mystical sounds" or transitioning more fluidly.

    2. the morse-code-like instrumental

      This is a great description and really plays into the Alien theme. However, I would personally describe the instrumental features a bit more. For example, cite the specific instrument that plays or the tempo or rhythm that really conveys this feeling.

    1. There’s an abyss to cross between using an app and modifying it with code by calling APIs. The user has to switch to a whole other paradigm including setting up a development environment. Consequently, few users take the step from using a tool to customizing or making their own tools.
    1. Code doesn’t have feelings, doesn’t develop complexes and certainly doesn’t exhibit the most important trait (the ability to reproduce) of that which carries for your genetic strains.

      No, the fact that code can't reproduce is not the most important trait here, it's lack of personhood is and that was already adequately described earlier in the sentence.

    1. talked about freedom metaphorically (so the slave masters would not understand it

      I don't think this needs to be in parenthesis---the idea of slave spirituals being in code is at the core of understanding them and so I don't think you need to make it a side note; you could even make it a focus.

    1. Informations Utiles pour des Parents Débutants au CA

      Voici quelques informations qui pourraient manquer aux parents débutants élus au conseil d'administration (CA) d'un établissement scolaire du second degré, en complément des informations déjà présentes dans les sources fournies:

      Comprendre le rôle du CA:

      ● Impact des décisions:

      Il est crucial de comprendre que le CA prend des décisions importantes qui affectent directement la vie scolaire de l'enfant, comme le budget de l'établissement, le règlement intérieur et les projets pédagogiques.

      ● Représentation des parents:

      Les parents élus au CA ont la responsabilité de représenter l'ensemble des parents d'élèves.

      Il est important de communiquer avec les autres parents, de recueillir leurs préoccupations et de les faire entendre lors des réunions du CA.

      ● Travail en équipe:

      Le CA est composé de différents membres de la communauté éducative.

      Il est essentiel de travailler en collaboration avec les autres membres, en respectant les différents points de vue et en cherchant des solutions consensuelles.

      Préparation aux réunions:

      ● Étude des documents préparatoires:

      Le chef d'établissement envoie des documents préparatoires avant chaque réunion.

      Il est important de les étudier attentivement afin de comprendre les sujets qui seront abordés et de pouvoir participer activement aux discussions.

      ● Formulation de questions:

      N'hésitez pas à formuler des questions par écrit au chef d'établissement avant la réunion si certains points ne sont pas clairs ou si vous souhaitez obtenir des informations complémentaires.

      ● Échange avec les autres parents élus:

      Il peut être utile de discuter avec les autres parents élus avant la réunion pour coordonner vos interventions et vos positions sur les différents sujets.

      Participation active aux réunions:

      ● Prise de parole: N'hésitez pas à prendre la parole lors des réunions pour exprimer votre point de vue, poser des questions et partager les préoccupations des parents d'élèves.

      ● Droit de vote:

      Les parents élus au CA ont le droit de voter sur les décisions qui sont prises. Il est important de voter en conscience, en tenant compte de l'intérêt général des élèves.

      ● Proposition de motions:

      Vous pouvez proposer des motions sur des sujets qui vous semblent importants pour la vie de l'établissement. Assurez-vous de respecter les procédures pour l'inscription d'une motion à l'ordre du jour.

      Confidentialité et communication:

      ● Devoir de réserve:

      Les membres du CA sont tenus à une obligation de réserve concernant les informations confidentielles dont ils ont connaissance lors des réunions.

      ● Diffusion d'un compte rendu: Il est important de communiquer avec les autres parents d'élèves en diffusant un compte rendu des discussions et des décisions prises lors du CA. Ce compte rendu doit respecter les règles de confidentialité.

      Ressources supplémentaires:

      ● Formation en ligne: La plateforme Jaimemonecolepublique.fr propose un module sur les élections scolaires qui peut être utile aux parents débutants.

      ● Guide "L'Incollable du parent d'élève":

      Ce guide fournit des informations complémentaires sur le fonctionnement des collèges et des lycées, ainsi que sur les droits et devoirs des parents d'élèves.

      En plus de ces informations,

      il est important pour les parents élus au CA de se familiariser avec le Code de l'éducation, notamment les articles R 421-14 à R 421-36 qui concernent le fonctionnement du CA.

      Les circulaires mentionnées dans les sources peuvent également apporter des précisions utiles.

    1. Author response:

      The following is the authors’ response to the original reviews.

      We thank the reviewers for their overall positive evaluation of the manuscript and finding MChIP-C to be a valuable technological advance. To address the reviewer’s helpful comments and recommendations, we performed several additional analyses and improved the text and figures.

      Briefly, we extended and clarified the main text and methods, added analyses of interactions at consensus and method-specific CTCF/DHS sites (Figure S3), added additional comparison tracks to other methods in specific loci (Figure 4), added examples of MChIP-C E-P interactions at previously-verified loci (Figure S2a) and added extensive MChIP-C downsampling analysis (Figure S6).

      Recommendations for authors:

      Reviewer #2 (Recommendations For The Authors:

      (1) Provide .HiC and .cool files for the community to explore the data.

      We thank the reviewer for this suggestion. We have uploaded both the raw and processed data to GEO. We note that .cool and .hic formats may be less useful for this type of data, since it includes only promoter-based interactions and thus the resulting interaction matrix is extremely sparse at the relevant resolutions. In addition, we provide an online genomic browser for our data.

      (2) Provide an R or bioconda package for future data processing.

      We thank the reviewer for this suggestion. We have organized and streamlined the relevant code for processing MChIP-C data and it is available as a github repository.

      (3) The authors should avoid using "mln" for "million".

      We thank the reviewer for this suggestion. We have corrected this in the text.

      Reviewer #3 (Recommendations For The Authors):

      (1) Figure 2- A handful of sites identified by MChIP-C should be verified by 3C or 4C to validate they are true interactions using an orthogonal approach.

      We thank the reviewer for this suggestion. As we show in the current manuscript (and supported by several papers using MNase-based C-methods), C-methods based on restriction enzymes are considerably less sensitive than those based on MNase, so using these methods for anecdotal validation may not be adequate. In addition, it is difficult to extract accurate quantitative measurements from 3C and 4C due to challenges in bias normalization. As a large-scale alternative, we analyzed a set of consensus promoter-CTCF and promoter-DHS interactions identified by all 3 methods (PLAC-seq/Micro-C/MChIP-C; Figure S3). We find that MChIP-C shows clearly superior resolution and sensitivity on these consensus sites. In fact, even for sites which were only called by one of the competing methods, we still see better signal in the MChIP-C data (suggesting that our simplistic MChIP-C peak-calling approach could be improved for further gain). However, as this analysis focuses on “easily detectable” consensus sites, we also emphasize the importance of inspecting interactions which are not detected clearly by alternative methods. To this end, we now show in our manuscript interaction profiles for 11 loci (MYC, PTGER3, CITED2, BTG1, ANTXR2, SEMA7A, LMO2, GATA1, HBG2, VEGFA, MYB), each showing high-resolution MChIP-C interactions which coincide with expected genomic features (p300, CTCF, H3K27ac, known enhancers) and are not clearly observable in Micro-C and PLAC-seq. We also note that the extended overlap of detected MChIP-C interactions with functionally validated enhancers (as measured by CRISPRi) provides an additional large-scale orthogonal validation.

      (2) A supplemental table indicating read pair depth, etc, similar to S02, should be added for the datasets used for comparison (HiChIP-etc). Given the age differences between some of the reference data used, it may represent simply an improvement by increasing sequencing depth rather than a true technical advantage.

      We thank the reviewer for this suggestion. We have added the sequencing depths of the relevant datasets in the methods section. We also performed extensive downsampling analyses as explained in response to the next point.

      (3) I would recommend performing a downsampling analysis to determine at what point the MChIP-C data reaches saturation in terms of the number of reads, with a comparison to the HiChIP reference data. This would allow a more objective measure of the sensitivity of the assays with reference to read depth.

      We thank the reviewer for this suggestion. First, we note that downsampling does not affect the high sensitivity and resolution results as shown in aggregate plots (e.g. Figure 2 and Figure S3). However, downsampling can affect individual peak calling. We thus downsampled our data to 50%, approximately matching the number of total informative reads of both PLAC-seq and Micro-C (i.e. ~20M). We also further downsampled our data to 25% and 10%. With respect to prediction of K562 functionally validated enhancer-promoter interactions (Figure S6b), even at 25% downsampling MChIP-C achieves both a higher recall and higher precision than the other methods, with a slightly higher false-positive rate. At 10% sampling, recall is slightly worse than Micro-C and PLAC-seq, but both the precision and false-positive rate are better than the alternatives. With respect to saturation, we plotted the number of unique distal cis read pairs versus the total number of reads (Figure S6c), and find that our MChIP-C data does not yet show saturation. We also show that downsampling our data to 50% maintains  ~80% of the called interactions (Figure S6d).

      (4) "our results suggest that MChIP-C achieves superior sensitivity and resolution compared to C-methods based on standard restriction enzymes." The sensitivity claims are supported by Figure 2, but not the resolution claims. This is particularly challenging when using histone marks since they can be broad. To directly compare the resolution of MChIP-C to other approaches such as ChIA-PET or HiChIP CTCF or a similar DNA binding protein is required.

      We thank the reviewer for this suggestion. We first note that actually both sensitivity and resolution are relevant for the results shown in Figure 2 and for the signal-to-noise calculations. This is because the low resolution of PLAC-seq peaks can result in very broad peaks that cover the entire area of the interrogated window (5kb on each side), which could seem like low sensitivity. However, we believe that the new Figure S3 may show the higher resolution of MChIP-C more clearly, as do the 11 locus interaction profiles tracks shown in Figure 2, Figure 4 and Figure S2.

      Public reviews:

      Reviewer #1:

      The authors presented a new MNase-based proximity ligation method called MChIP-C, allowing for the measurement of protein-mediated chromatin interactions at single-nucleosome resolution on a genome-wide scale. With improved resolution and sensitivity, they explored the spatial connectivity of active promoters and identified the potential candidates for establishing/maintaining E-P interactions. Finally, with published CRISPRi screens, they found that most functionally verified enhancers do physically interact with their cognate promoters, supporting the enhancer-promoter looping model.

      The study's experimental approach and findings are interesting. However, several issues need to be addressed.

      (1) The authors described that "the lack of interaction between experimentally-validated enhancers and their cognate promoters in some studies employing C-methods has raised doubts regarding the classical promoter-enhancer looping model", so it's intriguing to see whether the MChIP-C could indeed detect the E-P interactions which were not identified by C-methods as they mentioned (Benabdallah et al., 2019; Gupta et al., 2017). I agree that they identified more E-P interactions using MChIP-C, but specifically, they should show at least 2-3 cases. It's important since this is the main conclusion the authors want to draw.

      We thank the reviewer for this suggestion. As we show in the current manuscript (and supported by several papers using MNase-based C-methods), C-methods based on restriction enzymes are considerably less sensitive than those based on MNase, so using these methods for anecdotal validation may not be useful. In addition, it is difficult to extract accurate quantitative measurements from 3C and 4C due to challenges in bias normalization. As a large-scale alternative, we analyzed a set of consensus promoter-CTCF and promoter-DHS interactions identified by all 3 methods (PLAC-seq/Micro-C/MChIP-C; new Figure S3). We find that MChIP-C shows clearly superior resolution and sensitivity on these consensus sites. However, as this analysis focuses on “easily detectable” consensus sites, we also emphasize the importance of inspecting interactions which are not detected clearly by alternative methods. To this end, we now show in our manuscript interaction profiles for 11 loci (MYC, PTGER3, CITED2, BTG1, ANTXR2, SEMA7A, LMO2, GATA1, HBG2, VEGFA, MYB), each showing high-resolution MChIP-C interactions which coincide with expected genomic features (p300, CTCF, H3K27ac, known enhancers) and are not clearly observable in Micro-C and PLAC-seq. We also note that the extended overlap of detected MChIP-C interactions with functionally validated enhancers (as measured by CRISPRi) provides an additional large-scale orthogonal validation.

      (2) The authors compared their data to those of Chen et al. (Chen et al., 2022), who used PLAC-seq with anti-H3K4me3 antibodies in K562 cells and standard Micro-C data previously reported for K562, concluding that "MChIP-C achieves superior sensitivity and resolution compared to C-methods based on standard restriction enzymes.". This is not convincing since they only compared their data to one dataset. More datasets from other cell lines should be included.

      We thank the reviewer for this suggestion. We would like to clarify that all datasets in the paper are K562 datasets, and this cell line is unique in the availability of CRISPRi screens, PLAC-Seq, Micro-C, and hundreds of ChIP-Seq tracks for it. We would expect datasets from other cell types to have changes in their regulatory interactions, so they would be less adequate for direct comparison. In addition, the general resolution and sensitivity limitations (e.g. due to restriction fragment size) are not dependent on cell type and has been shown in other MNase-based method papers.

      (3) The reasons for choosing Chen's data (Chen et al., 2022) and CRISPRi screens (Fulco et al., 2019; Gasperini et al., 2019) should be provided since there are so many out there.

      We thank the reviewer for this comment. We selected these CRISPRi screen datasets since they match the cell type (K562) which we used for MChIP-C, and we selected the PLAC-seq data as it is the only PLAC-seq/HiChIP dataset which matches both the cell type (K562) and the antibody (H3K4me3).

      (4) The authors identify EP300 histone acetyltransferase and the SWI/SNF remodeling complex as potential candidates for establishing and/or maintaining enhancer-promoter interactions, but not RNA polymerase II, mediator complex, YY1, and BRD4. More explanation is needed for this point since they're previously suggested to be associated with E-P interactions.

      We thank the reviewer for this comment. We apologize for this point being unclear: as Figure S5 shows, we actually did identify Pol2, mediator YY1 and BRD4 as predictive features, but P300 and SWI/SNF show somewhat higher predictive power. We have now clarified this in the text.

      (5) The limitations of the method should be discussed.

      We thank the reviewer for this suggestion. We have now added to the text a discussion of what we view as the current main limitation of the method, namely its low fraction of informative reads.

      Reviewer #2:

      Summary:

      Golov et al performed the capture of MChIP-C using the H3K4me3 antibody. The new method significantly increases the resolution of Micro-C and can detect clear interactions which are not well described in the previous HiChIP/PLAC-seq method. Overall, the paper represents a significant technological advance that can be valuable to the 3D genomic field in the future.

      Strengths:

      (1) The authors established a novel method to profile the promoter center genomic interactions based on the Micro-C method. Such a method could be very useful to dissect the enhancer promoter interaction which has long been an issue for the popular HiC method.

      (2) With the MChIP-C method the authors are able to find new genomic interactions with promoter regions enriched in CTCF. The author has significantly increased the detection sensitivity of such methods as PLAC-seq, Micro-C, and HiChIP.

      (3) The authors identified a new type of interaction between the CTCF-less promoter and the CTCF binding site. This particular type of interaction could explain the CTCF's function in regulating gene transcription activity as observed in many studies. I personally think the second stripe model of P-CTCF interaction is more likely as this has been proposed for the super-enhancer stripe model before. The author should also discuss this part of the story more.

      Weaknesses:

      (1) The data presentation should include the contact heat map. The current data presentation makes it hard for the readers to have a comprehensive view of pair-wise interactions between promoters and the PIR. In particular, these maps may directly give answers to the proposed model of promoter-CTCF interactions by the authors in Figure 3a.

      We thank the reviewer for this suggestion. We note that since the data mainly includes promoter-based interactions, the resulting interaction matrix is extremely sparse at the relevant resolutions. Specifically with respect to promoter-CTCF interactions, without a good sampling of the entire interaction matrix it is difficult to confidently distinguish between the two models only based on MChIP-C data, as it would require data about interaction between non-promoter regions and CTCF.

      (2) In Fig 3D, there seems a very limited increase of power predicting MChIP-C signal for DHS-promoter pairs beyond the addition of CTCF. This figure could be simplified with fewer factors.

      We thank the reviewer for this suggestion. We agree that the last factors do not add predictive power, but we do not think this overly complicates the figure and we prefer to leave these for the reader to evaluate.

      (3) The current method seems to have a big fraction of unusable reads. How the authors process the data should be included to allow for future reproduction. Ideally, the authors should generate a package on R or Bioconda for this processing.

      We thank the reviewer for this suggestion. We agree that the fraction of informative reads is small with respect to some other methods, and expect future versions of MChIP-C to address this limitation. We have organized and streamlined the relevant code for processing MChIP-C data and it is available as a github repository.

      Reviewer #3:

      Summary:

      This manuscript represents a technological development- specifically a micrococcal nuclease chromatin capture approach, termed MChIP-C to identify promoter-centered chromatin interactions at single nucleosome resolution via a specific protein, similar to HiChIP, ChIA-PET, etc.. In general, the manuscript is technically well done. Two major issues raise concerns that need to be addressed. First, it does not appear that novel chromatin interactions identified by MChIP-C which were missed by other approaches such as HiChIP, were validated. This is central to the argument of "improved" sensitivity, which is one of the key factors to assess sensitivity. Second is the question of resolution. Because the authors focus on a histone mark (H3K4me3) it is unclear whether the resolution of the assay truly exceeds other approaches, especially microC. These two issues are not completely supported by the data provided.

      Strengths:

      The method appears to hold promise to improve both the sensitivity and resolution of protein-centered chromatin capture approaches.

      Weaknesses:

      (1) Specific validation experiments to demonstrate the identification of previously missed novel interactions are missing.

      We thank the reviewer for this suggestion. Given that such interactions are missed by Micro-C and PLAC-seq, it would not make sense to use these methods for validation. We thus propose that MChIP-C interactions can be validated by their overlap with expected genomic features. To this end, we now show in our manuscript interaction profiles for 11 loci (MYC, PTGER3, CITED2, BTG1, ANTXR2, SEMA7A, LMO2, GATA1, HBG2, VEGFA, MYB), each showing high-resolution MChIP-C interactions which coincide with expected genomic features (p300, CTCF, H3K27ac, known enhancers) and are not clearly observable in Micro-C and PLAC-seq. In addition, the higher overlap of MChIP-C interactions with functionally-validated K562 enhancer-promoter interactions (provided by CRISPRi screens) provides further functional validation for novel MChIP-C interactions.

      (2) It is unclear if the resolution is really superior based on the data provided.

      We thank the reviewer for this comment. We first note that actually both sensitivity and resolution are relevant for the results shown in Figure 2 and for the signal-to-noise calculations. This is because the low resolution of PLAC-seq peaks can result in very broad peaks that cover the entire area of the interrogated window (5kb on each side), which could seem like low sensitivity. However, we believe that the new Figure S3 may show the higher resolution of MChIP-C more clearly, as do the 11 locus interaction profiles tracks shown in Figure 2, Figure 4 and Figure S2.

      (3) It is unclear how much advantage the approach has, especially compared to existing approaches such as HiChIP since sequencing depth as a variable is not adequately addressed.

      We thank the reviewer for this comment. First, we note that downsampling does not affect the high sensitivity and resolution results as shown in aggregate plots (e.g. Figure 2 and Figure S3). However, downsampling can affect individual peak calling. We thus downsampled our data to 50%, approximately matching the number of total informative reads of both PLAC-seq and Micro-C (i.e. ~20M). We also further downsampled our data to 25% and 10%. With respect to prediction of K562 functionally validated enhancer-promoter interactions (Figure S6b), even at 25% downsampling MChIP-C achieves both a higher recall and higher precision than the other methods, with a slightly higher false-positive rate. At 10% sampling, recall is slightly worse than Micro-C but both the precision and false-positive rate are better than the alternatives.

    1. How to Read a Scientific Paper Add Favorite Print Email Share Menu Facebook Pinterest Twitter More Menu Report a Problem close Create Assignment Embed Link Google Classroom Microsoft Teams Canvas Schoology Other Link copied to clipboard. To create assignments or embed links, you must be logged in to your Science Buddies account. Please log in or join for free. Log In / Join var _____WB$wombat$assign$function_____ = function(name) {return (self._wb_wombat && self._wb_wombat.local_init && self._wb_wombat.local_init(name)) || self[name]; }; if (!self.__WB_pmw) { self.__WB_pmw = function(obj) { this.__WB_source = obj; return this; } } { let window = _____WB$wombat$assign$function_____("window"); let self = _____WB$wombat$assign$function_____("self"); let document = _____WB$wombat$assign$function_____("document"); let location = _____WB$wombat$assign$function_____("location"); let top = _____WB$wombat$assign$function_____("top"); let parent = _____WB$wombat$assign$function_____("parent"); let frames = _____WB$wombat$assign$function_____("frames"); let opener = _____WB$wombat$assign$function_____("opener"); let arguments; { var codes = []; showPopup = function() { openSurvey = function() { } $('.lms-selection').show(); $('.lms-embed').hide(); $('#lms-embed-code-copied').hide(); $('#lms-popup-background').show(); $('#lms-popup-outer').show(); $('html').css('overflow', 'hidden'); } openPopup = function(openCodes) { codes = openCodes; showPopup(); sb.track(false, 'share', 'popup', 'open'); } closePopup = function() { $('html').css('overflow', ''); $('#lms-popup-outer').hide(); $('#lms-popup-background').hide(); sb.track(false, 'share', 'popup', 'close'); } embed = function(code, title) { } }} var _____WB$wombat$assign$function_____ = function(name) {return (self._wb_wombat && self._wb_wombat.local_init && self._wb_wombat.local_init(name)) || self[name]; }; if (!self.__WB_pmw) { self.__WB_pmw = function(obj) { this.__WB_source = obj; return this; } } { let window = _____WB$wombat$assign$function_____("window"); let self = _____WB$wombat$assign$function_____("self"); let document = _____WB$wombat$assign$function_____("document"); let location = _____WB$wombat$assign$function_____("location"); let top = _____WB$wombat$assign$function_____("top"); let parent = _____WB$wombat$assign$function_____("parent"); let frames = _____WB$wombat$assign$function_____("frames"); let opener = _____WB$wombat$assign$function_____("opener"); let arguments; { sb.browser.getScript("https://assets.pinterest.com/js/pinit.js"); }}

      Sahira Haq

    1. Sometimes membersof the a u d ie n c e are referred to not even by a s lig h tin g namehut by a code ti t l e which a s s i m i l a t e s them fully to an a b s tr a c tca te g o ry . T h u s d o c to rs in the a b s e n c e of a p a tie n t may referto him a s ‘ the c a r d i a c ’ or ' t h e s t r e p ; ’ barb ers privately referto th e ir c u s to m e r s a s ' h e a d s of h a i r

      other is downplaying of politeness or name calling

    1. Author response:

      The following is the authors’ response to the original reviews.

      Public Reviews:

      Reviewer #1 (Public Review):

      Summary:

      It is evident that studying leukocyte extravasation in vitro is a challenge. One needs to include physiological flow, culture cells and isolate primary immune cells. Timing is of utmost Importance and a reproducible setup essential. Extra challenges are met when extravasation kinetics in different vascular beds is required, e.g., across the blood-brain barrier. In this study, the authors describe a reliable and reproducible method to analyze leukocyte TEM under physiological flow conditions, including this analysis. That the software can also detect reverse TEM is a plus.

      Strengths:

      It is quite a challenge to get this assay reproducible and stable, in particular as there is flow included. Also for the analysis, there is currently no clear software analysis program, and many labs have their own methods. This paper gives the opportunity to unify the data and results obtained with this assay under label-free conditions. This should eventually lead to more solid and reproducible results.

      Also, the comparison between manual and software analysis is appreciated.

      We thank the Reviewer for their positive evaluation of our manuscript and highlighting the value of obtaining more reproducible and unbiases results, as well as detection of forward and reverse transmigration with UFMTrack.

      Weaknesses:

      The authors stress that it can be done in BBB models, but I would argue that it is much more broadly applicable. This is not necessarily a weakness of the study but more an opportunity to strengthen the method. So I would encourage the authors to rewrite some parts and make it more broadly applicable.

      We thank the Reviewer for this suggestion. In the revised version of our manuscript, we have now emphasized the broader applicability of UFMTrack to analyze the interaction of immune cells with 2dimensional endothelial monolayers in various contexts in the abstract, introduction, and discussion sections.

      Reviewer #2 (Public Review):

      Summary:

      This paper develops an under-flow migration tracker to evaluate all the steps of the extravasation cascade of immune cells across the BBB. The algorithm is useful and has important applications.

      Strengths:

      Algorithm is almost as accurate as manual tracking and importantly saves time for researchers.

      We thank the Reviewer for this positive evaluation of our work.

      Weaknesses:

      Applicability can be questioned because the device used is 2D and physiological biology is in 3D. Comparisons to other automated tools was not performed by the authors.

      We thank the Reviewer for pointing our attention to these weaknesses in our manuscript.

      We have clarified in the revised manuscript that using 2D endothelial monolayer models in parallel laminar flow chambers is still a state-of-the-art methodology for studying the multi-step extravasation process of immune cells across endothelial monolayers under physiological flow by in vitro live cell imaging. These models provide excellent optical quality that is not yet achieved in 3D models. We have extended the introduction to emphasize the limitations of existing tools that motivated us to establish UFMTrack. We have furthermore extended the discussion section to highlight the features unique to our UFMTrack framework.

      Reviewer #3 (Public Review):

      Summary:

      The authors aimed to establish a faster and more efficient method of tracking steps of T-cell extravasation across the blood brain barrier. The authors developed a framework to visualize, recognize and track the movement of different immune cells across primary human and mouse brain microvascular endothelial cells without the need for fluorescence-based imaging. The authors succinctly describe the basic requirements for tracking in the introduction followed by an in-depth account of the execution.

      We thank the Reviewer for their positive evaluation of our manuscript and highlighting the value of label-free analysis of the multistep immune cell extravasation cascade with UFMTrack.

      Weaknesses and Strengths:

      Materials & methods and results:

      (1) The methods section also lacks details of the microfluidic device that the authors talk about in the paper. Under physiological sheer stress, the T-cells detach from the pMBMEC monolayer, and are hence unable to be detected; however, this observation requires an explanation pertaining to the reason of occurrence and potential solutions to circumvent it to ensure physiologically relevant experimental parameters.

      We thank the Reviewer for pointing out this oversight. We have used a custom-made microfluidic device that has been published and described in detail before. This information has now been included in the Methods Section under Point 7, and the two references describing the flow chamber in depth are mentioned below and have been included in the manuscript.  

      Coisne Caroline, Ruth Lyck and Britta Engelhardt. 2013. Live cell imaging techniques to study T cell trafficking across the blood-brain barrier in vitro and in vivo. Fluids and Barriers of the CNS 10:7 doi:10.1186/20458118-10-7; 21 January 2013

      Lyck R, Hideaki Nishihara, Sidar Aydin, Sasha Soldati and Britta Engelhardt. 2022. Modeling brain vasculature immune interactions in vitro. Angogenesis, 2nd edition. Editors PatriciaD’Amore and Diane Bielenberg Cold Spring Harb Perspect Med doi: 10.1101/cshperspect.a041185

      T cell detachment is a physiologically relevant parameter besides T cell arrest, polarization, crawling, probing, and transmigration during the interaction with an endothelial monolayer. T cell detachment means that post-arrest, the T cell cannot engage adhesion molecules required for subsequent polarization and, eventually, transmigration. 

      (2) The author describes a method for debris exclusion using UFMTrack that eliminates objects of <30 pixels in size from analysis based on a mean pixel size of 400 for T lymphocytes. However, this mean pixel size appears to stem from in-vitro activated CD8 T cells, which rapidly grow and proliferate upon stimulation. In line with this, activated lymphocytes exhibit increased cytoplasmic area, making them appear less dense or “brighter” by phase microscopy compared to naïve lymphocytes, which are relatively compact and subsequently appear dimmer. Given this, it is not clear whether UFMTrack is sufficiently trained to identify naïve human lymphocytes in circulating blood, nor smaller, murine lymphocytes. Analysis of each lymphocyte subtype in terms of pixel size and intensity would be beneficial to strengthen the claim that UFMTrack can identify each of these populations. Additionally, demonstrating that UFMTrack can correctly characterize the behavior of naïve versus activated lymphocytes isolated from murine and human sources would strengthen the claim that UFMTrack can be broadly applied to study lymphocyte dynamics in diverse models without additional training

      We thank the Reviewer for the suggestion to more precisely evaluate the range of cell sizes that can be analyzed by our framework. We have included a visualization of crawling cell sizes successfully analyzed by the UFMTrack in Supplementary Figure 7. It demonstrates that the human peripheral blood mononuclear cells, that are almost twice as small as the activated mouse CD4 T cells used in these assays, can be successfully segmented, tracked, and analyzed with the UFMTrack framework. Thus, our UFMTrack framework is suitable for a broad application to differentially sized immune cells during their interaction with the endothelial cell monolayer under flow. 

      (3) Average precision was compared to the analysis of UFMTrack but it is unclear how average precision was calculated. This information should have been included in the methods section

      We thank the Reviewer for pointing our attention to the missing information. We have added a subsection, “Performance Analysis”, to the Materials and Methods section, where we describe the statistical methods and the performance metrics used to evaluate the UFMTrack framework.

      (4) CD4 and CD8 T cells exhibit distinct biology and interaction kinetics driven in part by their MHC molecule affinity and distinct receptor expression profiles. Thus, it is unclear why two distinct mechanisms of endothelial cell activation are needed to see differences between the populations.

      We thank the Reviewer for pointing out that different cytokine stimulations of endothelial cells were used in the assays used here to test our UFMTrack to analyze CD4 and CD8 T cell interactions with the endothelial monolayer. While the Reviewer is correct that CD4 and CD8 T cells use different mechanism to cross the pMBMEC monolayer as show by us (doi: 10.1002/eji.201546251.) and others and that recognition of cognate antigen on MHC class I on pMBMECs will arrest CD8 T cells and lead to CD8 T-cell mediated apoptosis ( doi: 10.1038/s41467-023-38703-2.) the focus of the present study was not on comparing CD4 and CD8 T cell interactions with the pMBMEC monolayer but rather to test suitability of UFMTrack to study the different multi-step transmigration of these T cell subsets across the endothelial monolayer. 

      (5) The BMECs are barrier tissues but were cultured on µdishes in this study. To study the transmigration of T-cells across the endothelium, the model would have been more relevant on a semi-permeable membrane instead of a closed surface.

      We understand the critique of the Reviewer, but laminar flow chambers with endothelial monolayers still provide a state-of-the-art and established methodology to study immune cell migration across endothelial monolayers by in vitro live cell imaging including endothelial cells forming the blood-brain barrier.  

      (6) Methods are provided for the isolation and expansion of human effector and memory CD4+ T cells. However, there is no mention of specific CD4+ T cell populations used for analysis with UFMTrack, nor a clear breakdown of tracking efficiency for each subpopulation. Further, there is no similar method for the isolation of CD8+ T cell compartments. A clear breakdown of the performance efficiency of UFMTrack with each cell population investigated in this study would provide greater insight into the software’s performance with regard to tracking the behavior and movement of distinct immune populations.

      We thank the Reviewer for this comment. Since a fair performance evaluation requires collecting reliable and consistent manual annotations, in this work we have performed such analysis only for the mouse CD8 T-cell population migrating on the pMBMEC monolayer. We have chosen this as a reference since it is a different cell population than the one the segmentation model was trained on. This provides an insight into how high performance is expected when other immune cell types are studied than the ones used for model development.

      (7) The results section is quite extensive and discusses details of establishment of the framework while highlighting both the pros and cons of the different aspects of the process, for example the limitation of the two models, 2D and 2D+T were highlighted well. However, the results section includes details which may be more fitting in the methods section.

      We thank the Reviewer for highlighting the extensive work carried out in the development of our UFMTrack framework. We decided to include in the results section only the description of key elements and design decisions taken when developing the framework, such as the need to include a time series of images for successful segmentation of the transmigrated cells. At the same time, the majority of implementational details can be found in the Supplementary Material.

      (8) A few statements in the results section lacked literary support, which was not provided in the discussion either, such as support for increased variance of T-cell instantaneous speed on stimulated vs non-stimulated pMBMECs. Another example is the enhancement of cytokine stimulation directed T-cell movement on the pMBMECs that the authors observed but failed to relay the physiological relevance of it. The authors don’t provide enough references for developments in the field prior to their work which form the basis and need for this technology.

      We thank the Reviewer for this comment and for asking for literature references. However, we cannot provide such references as these are original observations we made by employing the UFMTrack framework.  This shows that UFMTrack observes T-cell behaviors that have previously been overlooked. Their physiological relevance will have to be explored in separate studies. We have extended the introduction section to include the details on the existing methods developed in the field, as well as their weaknesses that motivated the development of the UFMTrack framework.

      (9) The rationale for use of OT-1 and 2D2-derived murine lymphocytes is unclear here. The OT-1 model has been generated to study antigen-specific CD8+ T cell responses, while the 2D2 model has been generated to recapitulate CD4 T cell-specific myelin oligodendrocyte glycoprotein (MOG) responses.

      To establish and test the UFMTrack framework, we have made use of the specific T-cell subsets and endothelial cell models we generally use within our research context. Especially for animal work, this is according to the 3R rules requesting to reduce animal experimentation.  

      Figures and text:

      (1) There are certain discrepancies and misarrangement of figures and text. For example, discussion of the effect of sheer flow on T cell attachment as part of the introduction in figure 1 and then mentioning it in the text again in the results section as part of figure 4 is repetitive.

      We thank the Reviewer for pointing our attention to this misarrangement. We have adjusted the label of Figure 4 to emphasize that this effect is correctly captured by the UFMTrack.

      (2) Section IV, subsection 1 of the results section, refers to ‘data acquisition section above’ in line 279, however the said section is part of materials and methods which is provided towards the end of the manuscript.

      We thank the Reviewer for pointing our attention to this misarrangement. We have adjusted the text to reflect the correct chapter order.

      (3) There are figures in the manuscript that have not been referenced in the results section, for example, figure 3A and B. Figure 1 hasn’t been addressed until subsection 7 of materials and methods

      We thank the Reviewer for pointing our attention to this misarrangement. We have adjusted the text to refer to all figure panels and the clarification of the cell multiplicity estimation in the supplementary information section. References to Figure 1 were added in the introduction section to illustrate the in vitro under flow imaging setup as well as the typical T cell behaviors in such experiments.

      (4) A lack of significance but an observed trend of increased variance of T cell instantaneous speed is reported in line 296-298; however, the graph (figure 4G) shows a significant change in instantaneous speed between non-stimulated and TNFα-stimulated systems. This is misleading to the readers.

      We thank the Reviewer for pointing our attention to this discrepancy. We have expanded the text to indicate a low statistical significance for the TNF and no significance but just a trend for the IL1-beta conditions.

      (5) The authors talk about three beginner experimentors testing the manual T cell tracking process but figure 5 only showcases data from two experimentors without stating the reason for excluding experimentor 1.

      We thank the Reviewer for pointing our attention to this ambiguity. While both the migration analysis and the manual cell tracking were performed by all three beginner experimenters, the cell tracking data for the first one was unfortunately lost due to a hardware failure.

      Discussion:

      (1) While the discussion captures the major takeaways from the paper, it lacks relevant supporting references to relate the observation to physiological conditions and applicability.

      This study is not about the physiological relevance of the microfluidic devices and immune cells used but rather about advancing methodology to analyze dynamic immune cell behavior on endothelial monolayers under physiological flow. Therefore, the discussion does not extend to comparing the physiological relevance of the specific in vitro models employed in this study.   

      (2) The discussion lacks connection to the results since the figures were not referenced while discussing an observed trend

      We thank the Reviewer for pointing our attention to this misarrangement. We have included the references to the relevant figures as well as supporting references.

      (3) The authors briefly looked into mouse and human BMECs and their individual interaction with Tcells, but don’t discuss the differences between the two, if any, that challenged their framework.

      We thank the Reviewer for pointing our attention to this weakness. We have added to the discussion section clarifications on the challenges of analyzing the T cell interactions with the HBMEC and the BMDM interactions with the pMBMEC monolayer.

      (4) Even though though the imaging tool relies on difference in appearance for detection, the authors talk about lack of feasibility in detecting transmigration of BMDMs due to their significantly different appearance. The statement lacks a problem solving approach to discuss how and why this was the case.

      We thank the Reviewer for pointing our attention to this weakness and apologize for the misleading explanation of the problem of analyzing the BMDM sample. Since the transmigrated part of the macrophages differs in appearance from a transmigrated part of a T cell, its detection by a Deep Neural Network trained on the T cell data is worse than that for the T cells. At the same time, the detection performance before the transmigration is sufficient for the BMDM migration analysis. The potential approaches to alleviate this are added to the discussion section.

      Relevance to the field:

      Utilizing the framework provided by the authors, the application can be adapted and/or utilized for visualizing a range of different cell types, provided they are different in appearance. However, this would require extensive changes to the script and won’t be adaptable in its current form.

      Recommendations for the authors:

      Reviewer #1 (Recommendations For The Authors):

      The authors should announce in the abstract that the software analysis Track is downloadable and free to use for all researchers. They may consider providing some sort of helpdesk, although I realize that that may run into too much time.

      As said above, they stress that it can be done in BBB models, but I would argue that it is much more broadly applicable.

      We thank the Reviewer for these suggestions. We have emphasized the broader applicability of UFMTrack in the abstract and pointed out the public availability of the code and data.

      Can they add an experiment that shows that it also works for neutrophils for example? I understand that on paper yes it should work, but the neutrophils are of course faster etc.

      This is an excellent suggestion, but we tested UFMTrack within the current framework of ongoing research, which does not include the investigation of neutrophil transmigration across endothelial monolayers.  

      Also, the combination of different leukocytes in one TEM assay would really be a step forward. If the software can detect different-sized leukocytes, then this should be possible.

      We thank the Reviewer for this suggestion. We have added Supplementary Figure 7, demonstrating the range of cell sizes that were successfully analyzed by the UFMTrack framework throughout our manuscript. We also added a statement to the discussion that according to this data, “simply by discriminating cells by size, it is possible to extend UFMTrack to study the interaction of several types of immune cells migrating on top of a cellular monolayer under flow.”

      Extra challenges: can the method also discriminate between paracellular and transcellular migration modes? In particular for T-cells this is known to happen.

      We thank the Reviewer for this suggestion. We have added this to the potential applications of UFMTrack in the discussion section. While this differentiation is not feasible relying solely on the phasecontrast imaging data, UFMTrack can simplify this analysis by providing automatically the predictions of the transmigration locations, for analysis of the fluorescent data of the junctional labels.

      Reviewer #2 (Recommendations For The Authors):

      This paper develops an under-flow migration tracker to evaluate all the steps of the extravasation cascade of immune cells across the BBB. The algorithm is useful and has important applications. There are several points that need to be addressed, particularly about the claims made by the authors.

      Please see the comments below for more details:

      • Lines 88-92: Add a citation for the characteristics of the BBB as a barrier

      We have added two references accordingly.  

      • Lines 94-95: Can the authors indicate what models were used for these studies and how those compare to their in vitro model? In addition, can the authors say whether T cells were manually tracked in this study to translate results to the clinic and whether the results were successful when translated to the clinic? This may enhance the argument that automatic trackers are needed if the translation was not 100% successful

      This introductory paragraph summarizes in vivo and in vitro observations from several laboratories. Although these studies include manual tracking of T cells, they do not necessarily distinguish all sequential steps of the multi-step T cell transmigration cascade. Thus, automated tracking may provide additional insights, allowing for increased translation of findings to the clinic.  

      • Lines 96-98: Citing the work of Roger Kamm and Noo Li Jeon would be helpful here as they pioneered these BBB microfluidic models and have protocol papers on how to build them and how to use them for cancer cell extravasation studies. Roger Kamm has also worked on several extravasation studies with neutrophils, monocytes, and PBMCs from 3D vasculatures in microfluidic devices, under flow using pressurized fluid or recirculating pumps. Mentioning those would be helpful as they are directly related to what the authors are presenting in their paper.

      We thank the Reviewer for this comment, and we consider the work of Roger Kamm and Noo Li Jeon as very valuable for the field. However, these authors have focused on developing functional 3D microfluidic devices, including, e.g., all cells of the neurovascular unit which is not the focus of this present study that solely employed parallel flow chamber devices and endothelial monolayers.  

      • Lines 110-116: Can the authors comment on the use of ImageJ or similar automatic tracking tools and how these compare to the under-flow migration tracker developed in this paper? Several groups use ImageJ to track cellular migration successfully and in an automatic manner with short intervals between each frame. One paper that comes to mind is Chen et al: DOI: 10.1073/pnas.1715932115 where neutrophil migration in 3D was assessed with ImageJ in microfluidic devices of the vasculature. If the authors can highlight differences between their tool and what is currently available and used for automatic tracking (e.g. ImageJ), this would help in understanding the advantages of the migration tracker developed in this paper.

      • Lines 118-121: Add citations for the current state of the art for T cell extravasation tracking

      We thank the Reviewer for these suggestions. We have extended the introduction to add more details on the available tools for tracking migrating immune cells and their limitations, as well as the discussion section to emphasize the features unique to the developed UFMTrack framework.

      • Figure 1: The device used by the authors is considered to be a 2D microfluidic device with a monolayer of mouse brain endothelial cells. I would recommend the authors to carefully revise the claims made in the paper to mention that this is a 2D device as opposed to a 3D device, in order to not mislead readers who may be expecting these analyses to be performed in 3D vasculatures.

      We thank the Reviewer for this suggestion. We have included in the summary the mention of the 2dimensional nature of the employed BBB model.

      • Figure 1: The T cells used in this study are not fluorescently-labeled but the authors mention that this is an issue from current state-of-the-art tools. I would recommend that the authors remove this point as being an issue because it is not addressed in their paper. The T cells are also not labeled in this study so this limitation of other systems is not addressed in this paper.

      We apologize to the Reviewer as we do not understand this question. There will be many experimental conditions not allowing to study fluorescently tagged T cells. Therefore, UFMTrack is tailored to follow and analyze T cells and other immune cells during their interaction with endothelial monolayers independent of a fluorescence tag.  

      • Figure 1: Was the shear stress controlled manually with a syringe? Or with the use of a pressure controller? I would clarify this aspect and discuss human errors that can be introduced from manually controlling the pressure applied to the monolayer.

      We thank the Reviewer for pointing our attention to this ambiguity. We have added a mention of the automated syringe pump used to control the shear stress in the text where the values of shear stress applied to the sample are first mentioned.

      • Figure 1: Does T cell attachment occur within the first 5 minutes? Can the authors comment on how they chose this timeline and the percentage of T cells that are washed off at the second step at 1.5 dynes/cm^2? Is 30 seconds enough to ensure all the non-adhered T cells are washed off with 1.5 dyns/cm^2?

      Superfusion of the T cells over the endothelial monolayer is performed under 0.5 dynes/cm2 to allow the T cells to settle on the endothelial cell monolayer under flow. After increasing to physiological, flow non adherent T cells detach within 30 seconds, as described by the Reviewer. We have included in the Methods Section Point 7 the references describing in depth the design of the flow chamber device and methods used here.  

      • Line 154: How many images were used in the training vs. testing dataset for T cell migrations?

      We thank the Reviewer for pointing our attention to this missing information. We have added the sizes of the training and validation datasets. Specifically, the 226MPix of available imaging data was split into 154Mpix training and 37 MPix validation sets. The gap in between was introduced to avoid a correlation between validation and training set that would compromise the performance evaluation.

      • Are the supplementary videos at real speed or accelerated?

      We thank the Reviewer for pointing our attention to this missing information. The videos are sped up by a factor of 96. We have added this information to the Supplementary video descriptions.  

      • Lines 208 216: Can the authors comment on how their initial adhesion timeframe of 30sec before starting the recording at 5.5min affects the number of T cells with rapid displacement? 30 seconds may not be enough to ensure T cells have adhered to the endothelium

      Please see our comment above. The methodology used in the present assays has been set up and validated in numerous publications. We have included in the Methods Section under Point 7 the references describing in depth the design of the flow chamber device and the methods used here.  

      • Lines 275-277: Was the number of testing images 18? Can the authors comment on how this compares to training dataset size and whether these numbers are enough to achieve robust results?

      We apologize for this ambiguity in our manuscript. The framework was evaluated on 18 imaging datasets, each corresponding to 32 minutes of recording, not 18 images. We have added this clarification to the “CD4+ T cell analysis” subsection. The total size of these datasets is 18 datasets * 191 timeframe/dataset * 9.9MPix/frame = 34MPix

      • Figure 4B: Can the authors add statistics here? Individual datapoints on the error bars would be helpful too. 

      We thank the Reviewer for pointing our attention to this weakness. The data corresponds to the statistical errors as evaluated based on all cells in the 18 datasets. We have added the total number of cells in each of the endothelium stimulation conditions to the text.

      • Figure 4C-J: Can the authors put individual datapoints here as well and explain whether they considered each T cell to be one datapoint or each endothelium (averaging all T cells) to be one datapoint? 

      We thank the Reviewer for this suggestion. However, adding about one thousand points corresponding to each cell would be impractical. We thus present the distributions of the evaluated from the data metrics as a histogram on the violin plot instead of the swarm plot.

      • Figure 4: Did the authors wash the monolayers before introducing T cells? Soluble unbound cytokines may still be present and there are two different questions that would be studied here: “Is the inflamed endothelium affecting T cell migration?” (if washing was performed) or “Is T cell and microenvironmental inflammation affecting T cell migration?” (if no washing was performed)

      The endothelial monolayers are “washed” by starting the flow in the flow chamber device and this is before superfusing the T cells over the endothelial monolayer. We agree that our flow chamber device combined with UFMTrack will allow to address all these questions.

      • Figure 4I: Are all the T cells decelerating? (negative AM speed)

      We thank the Reviewer for this question. The cells are moving along the flow, which, in our experiments, is from left to right. The vector of speed is thus pointing against the x-axis, and thus the AM speed is negative.

      • Lines 302 306: Please explain how this compares to ImageJ or similar trackers that can achieve similar outputs. 

      We thank the Reviewer for this question. We have added a statement in the “T-cell tracking” section emphasizing that standard trackers are incapable of correctly capturing large displacements.

      • Lines 306-309: It is not lower for TNF stimulation though. How do the authors address this? TNF is also a pro-inflammatory cytokine.

      We have previously shown that stimulation of pMBMECs with IL-1 and TNF-a induces different cell surface levels of ICAM-1 and VCAM-1, which will influence T cell behavior on the pMBMEC monolayer.  

      • Lines 313-315: Could this be because the monolayer was not washed and soluble cytokines affected T cell response directly?

      Please see our answer to lines 306-309.  

      • Lines 319: Please cite Roger Kamm and Noo Li Jeon’s papers on BBB models with human BMECs, pericytes and astrocytes in 3D microfluidic devices.

      We thank the Reviewer again for pointing out these studies. As mentioned above, as our present study does not explore 3D models of the BBB, we think it does not fit into the framework of our study to elaborate on 3D models of the BBB. In addition, this would require the inclusion of a discussion of the work of others like, e.g., Peter Searson and others.  

      • Figure 5: Several statistics are missing from parts of the figure. Please add those.

      We apologize – but we do not understand which statistical analysis the Reviewer is missing from this Figure.  

      • Can the authors comment on the number of T cells perfused over the monolayer and if this ratio of T cells to endothelial cells makes physiological sense? Too many T cells may result in endothelium inflammation and increased diapedesis.

      The number of T cells used to suprerfuse over the endothelial monolayer is tested to avoid aggregation of T cells in suspension and thus artificial interactions with the endothelial monolayer. T cell behavior on the pMBMEC monolayer remains the same over the dilution of factor 10.  

      • Lines 381 383: How does this compare to analyses that look at the cross-section of the endothelium? It is difficult to assess transmigration looking at the top view of the endothelium. Perhaps, cross-section assessments will identify differences in manual vs. automatic tracking.

      There is, to the best of our knowledge, no microscopic device that would allow for in vitro live cell imaging of a live endothelial monolayer – this is in the presence of tissue culture medium – from the side at a resolution that would allow to define transmigration. Our current study rather shows the UFMTrack can distinguish cells moving above or below the endothelial monolayer.  

      • Figure 5J: This is probably the most important argument of the paper. If the authors can show statistical differences in their graph, this would greatly help convince readers that this tool is necessary and actually computationally efficient compared to manual work by researchers.

      We thank the Reviewer for this suggestion. However, comparing a single data point for automated measurement with four manual experimenter analysts is not a statistically sound comparison. We believe that Figure 5K is clearly showing the factor 5 difference in analysis speed as compared to manual analysis. More importantly, though, the automated analysis is taking the machine time, lifting the need for the experimenter to invest even 1/5th of the original analysis time.

      • Figure 6: Did the authors use autologous immune cells and endothelial cells? This is particularly relevant with the use of human-derived T cells (line 436) on the BMEC monolayer. Can the authors comment on non-self reactivity by the T cells encountering BMEC from another human subject?

      Autologous T cell interaction with BMECs would only be possible when using hiPSC-derived EECM-BMECs and the T cells from the same individual. All other experimental frameworks will not include autologous interactions. This is the experimental framework used by most authors studying immune cell interactions with commercially available donors. We have not studied alloreactive interactions in our assays and thus cannot further comment.  

      • Figure 6M,N,O: How does this compare to ImageJ for tracking of fluorescent cells? I recommend the authors to try that, at least for this section, as this may enhance their argument for their tool vs. standard tools like ImageJ if success rates are higher for their tool.

      We thank the Reviewer for this suggestion. We included a note on the analysis of the fluorescent datasets using the  TrackMate plugin for imageJ performed previously in our lab in the “Human T cells on immobilized recombinant BBB adhesion molecules” subsection.

      • Figure 6: Please put individual datapoints on the bar or violin plots where they are missing.

      We thank the Reviewer for this suggestion. However, adding about one thousand points corresponding to each cell would be impractical. We thus present the distributions of the evaluated from the data metrics as a histogram on the violin plot instead of the swarm plot.

      • Lines 467-471: This argument is important and should be mentioned earlier in the introduction.

      Another point that can be mentioned is the application of this platform to imaging modalities in vivo (mouse or human) given that there is no fluorescent staining in these cases. This review may be relevant: https://doi.org/10.1002/jcb.10454

      We thank the Reviewer for this suggestion. We have clarified in the introduction that UFMTrack does not require fluorescent labels of the imaged migrating cells and relies solely on the phase contrast imaging data.

      • Discussion: Please address a few more potential applications to this study. One can be cancer and immune infiltration.

      We thank the Reviewer for this suggestion. We have elaborated on additional potential applications to the discussion section.

      Reviewer #3 (Recommendations For The Authors):

      (1) Line 327-328: The authors talk about ‘As we have previously shown…pMBMEC monolayers differs between CD4+ and CD8+ cells…’. Where was this shown? If it was in a previously published article, please provide a reference.

      We have added these missing references.  

      (2) Line 353: Please provide clear location on where to find the associated information instead of stating ‘see below’.

      We thank the Reviewer for pointing our attention to this ambiguity. We have corrected the phrase to “see next paragraph”

      (3) Line 439: Please correct the acronym to BMECs

      We thank the Reviewer for pointing our attention to this typo. We have corrected it.

    1. ```{.r .cell-code} wws_7_freedom_equality <- wvs_7_russia %>% select(Q149, Q260) %>% rename(freedom_equality = Q149, sex = Q260) %>% mutate( freedom_equality = if_else(freedom_equality == 1, "Freedom", "Equality"), sex = if_else(sex == 1, "Male", "Female") ) ``` :::

      Комментарии: Все сломалось...

    Annotators

    1. The speaker aims to distill years of thinking about Functional Reactive Programming (FRP) into a concise talk.

      Quote: "So this talk is really distilling those years into like a 40-minute thing so you don't have to you don't have to go through the same thing that I did."

      The main goals are to understand what FRP is, categorize different variations, and evaluate them effectively.

      Quote: "Our goals for today are to understand what is FRP, how do we categorize different things that sort of fall into that umbrella, and then how do we evaluate those different things in a nice way."

      First-order FRP, as implemented in Elm, involves signals connected to inputs from the world, representing values that change over time.

      Quote: "The key part of a signal graph is that it has inputs from the world... it's a mouse position that's changing over time."

      Signals in Elm are infinite and cannot be created or destroyed; they model continuous inputs like mouse position or keyboard presses.

      Quote: "Another property is signals are infinite... there's no such thing as deleting a signal... the inputs to your program are fixed."

      Signal graphs in Elm are static, meaning their structure is known at startup and does not change over time, which provides several benefits.

      Quote: "Another property is signal graphs are static; there's a known structure for your application from startup all the way into the future."

      Elm's FRP model is synchronous by default, ensuring events are processed in the order they occur, which is crucial for consistent user interactions.

      Quote: "Finally, it's synchronous by default... when you type hello... you want those letters to show up in that order."

      Transformations on signals are performed using functions like lift, allowing for pure functional manipulation of time-varying values.

      Quote: "We have a function that goes from A's to B's... and we go make a signal of B."

      Stateful computations are handled using foldp (fold from the past), which accumulates state over time based on incoming events.

      Quote: "In Elm, this is called foldp... we give it a starting state... and a way to update that state."

      Signals can be merged and combined using functions like merge and lift2, enabling complex signal graphs built from simpler components.

      Quote: "Finally, we have a way to merge signals together... we can apply a function that puts them together."

      The static nature of signal graphs in Elm enables efficient execution, as everything is event-driven and stateful nodes only need to look back at their previous state.

      Quote: "Okay, what do we get when we make these design choices... the first one is efficiency... everything's event-driven."

      Elm's architecture promotes modularity and separation of concerns, dividing programs into models, updates, and views in a pure functional style.

      Quote: "So when we look at the structure of this application, it breaks up nicely into four parts... we first have a model... we have update... we have a view."

      Hot swapping allows code changes to propagate in real-time without restarting the application, facilitated by the static signal graph.

      Quote: "So we're able to change the behavior of the program while the program is running and sort of see those changes propagate automatically."

      The time-travel debugger in Elm leverages the static nature of signal graphs to record and replay events, aiding in debugging and development.

      Quote: "We can pause this program and sort of go back in time to wherever we want to go... you can start to get really cool insights about what's going on in your program at particular points in time."

      Higher-order FRP introduces dynamic signal graphs that can be reconfigured at runtime, but this comes with significant trade-offs and complexities.

      Quote: "What happens if signal graphs could be reconfigured? What is higher-order FRP? Surely higher is better... this is a very surprisingly hard question."

      Introducing join in higher-order FRP allows for signals of signals, enabling dynamic switching between signals but leading to challenges like infinite lookback and memory growth.

      Quote: "We have a signal of signals... creating a new signal may need infinite lookback... memory growth is linear with time."

      To mitigate these issues, higher-order FRP requires restricting join with advanced type systems, adding complexity to the API or language.

      Quote: "The solution isn't that this is a bad idea; it's that we only should switch to signals that have safe amounts of history... How do we restrict the definition of join with fancier types?"

      Asynchronous dataflow systems like Reactive Extensions abandon the infinite and synchronous nature of signals, allowing signals to end and defaulting to asynchrony.

      Quote: "Again, we can look at our core design and see which things we cross off... we cross off that signals are infinite... we get rid of... that it's synchronous by default."

      In asynchronous dataflow, switching signals creates entirely new signals, and concepts like hot and cold signals determine if signals produce values when not observed.

      Quote: "Whenever you create one of these signals, you're creating a totally new one... If it's hot, it's going to keep producing things, and if it's cold, it's going to stop."

      Arrowized FRP uses automata (state machines) that can be composed and switched, but these automata are not connected directly to the world, limiting their scope.

      Quote: "We cross off that signal graphs are connected to the world... these nodes aren't connected to the world... when we take it out of the graph... it doesn't keep running."

      The choice between different FRP approaches involves trade-offs; no one method is universally better, and the decision depends on application needs.

      Quote: "This isn't a competition of like which one's better but rather different points in a design space that are complementary."

      Elm integrates aspects of these different FRP models, emphasizing a static signal graph for predictability and tooling while allowing asynchrony where necessary.

      Quote: "Elm has a thing called automaton which is the same concept... you can have all these nice guarantees when you want but still integrate with a system that has something more complex going on."

      Evaluating FRP systems requires considering factors like synchronicity, ability to handle asynchrony, support for dynamic graphs, and the complexity of code produced.

      Quote: "When you want to evaluate this, the question isn't how can I get the fanciest words on my library; it's what properties do I need for my application."

      Questions to ask when choosing an FRP system include: Is it synchronous by default? Can it handle asynchrony? Can I talk about inputs? Can I reconfigure my graph?

      Quote: "The questions you want to be asking are: Is it synchronous by default? Does it allow asynchrony? Can I talk about inputs? Can I reconfigure my graph?"

      Debugging capabilities and code complexity are important considerations; the system should make debugging straightforward and keep the code maintainable.

      Quote: "What is debugging like... does the code come out nice... you're paying a complexity cost for that, and do you need that in your application?"

      The speaker encourages learning Elm to understand the principles of FRP, as it provides a foundation that can be applied to other FRP systems.

      Quote: "If you learn Elm, then it's easy to go to these other ones, and you learn these principles which you can use there."

      In conclusion, the talk provides insights into the trade-offs of different FRP designs, emphasizing that the best choice depends on specific application requirements.

      Quote: "Hopefully, this has given you some insight... it's not about which one is better but what properties you need for your application."

    1. Reviewer #1 (Public review):

      In this study, Ma et al. aimed to determine previously uncharacterized contributions of tissue autofluorescence, detector afterpulse, and background noise on fluorescence lifetime measurement interpretations. They introduce a computational framework they named "Fluorescence Lifetime Simulation for Biological Applications (FLiSimBA)" to model experimental limitations in Fluorescence Lifetime Imaging Microscopy (FLIM) and determine parameters for achieving multiplexed imaging of dynamic biosensors using lifetime and intensity. By quantitatively defining sensor photon effects on signal-to-noise in either fitting or averaging methods of determining lifetime, the authors contradict any claims of FLIM sensor expression insensitivity to fluorescence lifetime and highlight how these artifacts occur differently depending on the analysis method. Finally, the authors quantify how statistically meaningful experiments using multiplexed imaging could be achieved.

      A major strength of the study is the effort to present results in a clear and understandable way given that most researchers do not think about these factors on a day-to-day basis. The model code is available and written in Matlab, which should make it readily accessible, although a version in other common languages such as Python might help with dissemination in the community. One potential weakness is that the model uses parameters that are determined in a specific way by the authors, and it is not clear how vastly other biological tissue and microscope setups may differ from the values used by the authors.

      Overall, the authors achieved their aims of demonstrating how common factors (autofluorescence, background, and sensor expression) will affect lifetime measurements and they present a clear strategy for understanding how sensor expression may confound results if not properly considered. This work should bring to awareness an issue that new users of lifetime biosensors may not be aware of and that experts, while aware, have not quantitatively determined the conditions where these issues arise. This work will also point to future directions for improving experiments using fluorescence lifetime biosensors and the development of new sensors with more favorable properties.

    2. Author response:

      eLife Assessment

      This important study describes a computational tool termed FliSimBA (Fluorescence Lifetime Simulation for Biological Applications), which uses simulations to rigorously assess experimental limitations in fluorescence lifetime imaging microscopy (FLIM), including diverse noise factors, hardware effects, and sensor expression levels. The evidence from simulation and experimental measurements supporting the usefulness of FlimSimBA is solid. The authors may improve the application of the tool to a wide range of biological samples by providing the simulation package, currently in MATLB, in other common languages such as Python, and having better descriptions of the fitting algorithm and model assumptions. The work will interest scientists who wish to perform quantitative FLIM imaging for cells and tissues.

      We thank the editors and reviewers for the constructive feedback. We plan to provide the FLiSimBA simulation package in Python in addition to Matlab. We will also describe in more detail in the Results section our fitting method. Furthermore, we will explain more clearly in the text that our simulation package makes almost no model assumptions, and features flexibility and adaptability so that it can be used for any fluorescence lifetime measurements. We will clearly outline what are the specific examples we use for our case studies, and how users can input their own values based on the specific sensors, autofluorescence, and hardware they use.

      Public Reviews:

      Reviewer #1 (Public review):

      In this study, Ma et al. aimed to determine previously uncharacterized contributions of tissue autofluorescence, detector afterpulse, and background noise on fluorescence lifetime measurement interpretations. They introduce a computational framework they named "Fluorescence Lifetime Simulation for Biological Applications (FLiSimBA)" to model experimental limitations in Fluorescence Lifetime Imaging Microscopy (FLIM) and determine parameters for achieving multiplexed imaging of dynamic biosensors using lifetime and intensity. By quantitatively defining sensor photon effects on signal-to-noise in either fitting or averaging methods of determining lifetime, the authors contradict any claims of FLIM sensor expression insensitivity to fluorescence lifetime and highlight how these artifacts occur differently depending on the analysis method. Finally, the authors quantify how statistically meaningful experiments using multiplexed imaging could be achieved.

      A major strength of the study is the effort to present results in a clear and understandable way given that most researchers do not think about these factors on a day-to-day basis. The model code is available and written in Matlab, which should make it readily accessible, although a version in other common languages such as Python might help with dissemination in the community. One potential weakness is that the model uses parameters that are determined in a specific way by the authors, and it is not clear how vastly other biological tissue and microscope setups may differ from the values used by the authors.

      Overall, the authors achieved their aims of demonstrating how common factors (autofluorescence, background, and sensor expression) will affect lifetime measurements and they present a clear strategy for understanding how sensor expression may confound results if not properly considered. This work should bring to awareness an issue that new users of lifetime biosensors may not be aware of and that experts, while aware, have not quantitatively determined the conditions where these issues arise. This work will also point to future directions for improving experiments using fluorescence lifetime biosensors and the development of new sensors with more favorable properties.

      We appreciate the comments and helpful suggestions. We plan to present FLiSimBA simulation code in Python in addition to Matlab to make it more accessible to the community.

      One of the advantages of FLiSimBA is that the simulation package is flexible and adaptable, allowing users to input parameters based on the specific sensors, hardware, and autofluorescence measurements for their biological and optical systems. We used parameters based on one FRET-based sensor, measured autofluorescence from mouse tissue, and measured dark count/after pulse of our specific GaAsP PMT in this manuscript as examples. We will emphasize this advantage and further clarify how these parameters can be adapted to diverse tissues, imaging systems, and sensors based on individual users in our revision.

      Reviewer #2 (Public review):

      Summary:

      By using simulations of common signal artefacts introduced by acquisition hardware and the sample itself, the authors are able to demonstrate methods to estimate their influence on the estimated lifetime, and lifetime proportions, when using signal fitting for fluorescence lifetime imaging.

      Strengths:

      They consider a range of effects such as after-pulsing and background signal, and present a range of situations that are relevant to many experimental situations.

      Weaknesses:

      A weakness is that they do not present enough detail on the fitting method that they used to estimate lifetimes and proportions. The method used will influence the results significantly. They seem to only use the "empirical lifetime" which is not a state of the art algorithm. The method used to deconvolve two multiplexed exponential signals is not given.

      We appreciate the comments and constructive feedback and will more clearly describe the fitting methods in our revision.

      Two metrics are currently used to estimate lifetime in our paper, which are currently described in the Methods section ‘Experimental data collection, parameter determination, and simulation’ and ‘FLIM analysis’: (1) fitted P1: we described how lifetime histograms were fitted to Equation 2 with the Gauss-Newton nonlinear least-square fitting algorithm and the fitted P1 was used as lifetime estimation; (2) empirical lifetime, defined by Equation 5. These two metrics were used for the following reasons: (1) when the exponential decay equation of a sensor is known (for example, the FRET-based PKA activity sensor FLIM-AKAR can be described as a double exponential equation), fitted coefficients for each exponential component provide a robust way for lifetime estimate that is less sensitive to noise and background signals; (2) when the biophysical properties of sensors are unknown, or when the sensors cannot be easily described with single or double exponential equations, empirical lifetime (i.e. average lifetime values) provides an unbiased way to quantify fluorescence lifetime without assumptions of underlying models to describe sensor lifetime.

      To deconvolve two multiplexed exponential signals (Fig. 8), histograms were fitted to Equation 2 with the Gauss-Newton nonlinear least-square fitting algorithm, as described in Methods section ‘Simulation and analysis of multiplexed imaging with fluorescence intensity and lifetime data’.

      Considering the importance of these methodological details for evaluating the conclusions of this study, and the importance of appreciating the advantages and limitations of different methods of lifetime estimates (e.g. Figure 7), we will move the description of the fitting method to estimate P1 and the method of calculating empirical lifetime from Methods to Results, and will further clarify the rationale of using these different methods of lifetime estimates.

      Reviewer #3 (Public review):

      Summary:

      This study presents a useful computational tool, termed FLiSimBA. The MATLAB-based FLiSimBA simulations allow users to examine the effects of various noise factors (such as autofluorescence, afterpulse of the photomultiplier tube detector, and other background signals) and varying sensor expression levels. Under the conditions explored, the simulations unveiled how these factors affect the observed lifetime measurements, thereby providing useful guidelines for experimental designs. Further simulations with two distinct fluorophores uncovered conditions in which two different lifetime signals could be distinguished, indicating multiplexed dynamic imaging may be possible.

      Strengths:

      The simulations and their analyses were done systematically and rigorously. FliSimba can be useful for guiding and validating fluorescence lifetime imaging studies. The simulations could define useful parameters such as the minimum number of photons required to detect a specific lifetime, how sensor protein expression level may affect the lifetime data, the conditions under which the lifetime would be insensitive to the sensor expression levels, and whether certain multiplexing could be feasible.

      Weaknesses:

      The analyses have relied on a key premise that the fluorescence lifetime in the system can be described as two-component discrete exponential decay. This means that the experimenter should ensure that this is the right model for their fluorophores a priori and should keep in mind that the fluorescence lifetime of the fluorophores may not be perfectly described by a two-component discrete exponential (for which alternative algorithms have been implemented: e.g., Steinbach, P. J. Anal. Biochem. 427, 102-105, (2012)). In this regard, I also couldn't find how good the fits were for each simulation and experimental data to the given fitting equation (Equation 2, for example, for Figure 2C data).

      We thank the reviewer for the constructive feedback. We agree that the FLiSimBA users should ensure that the right decay equations are used to describe the fluorescent sensors. In this study, we used a FRET-based PKA sensor FLIM-AKAR to provide a proof-of-principle demonstration of FLiSimBA usage. The donor fluorophore of FLIM-AKAR, truncated monomeric enhanced GFP, follows a single exponential decay. FLIM-AKAR, a FRET-based sensor, follows a double exponential decay. The time constants of the two exponential components were determined previously (Chen, et al, Frontiers in pharmacology (2014)).  Thus, a double exponential decay equation with known τ1 and τ2 (Equation 1) was used for both simulation and fitting. In our revision, we will refer to our prior study characterizing the double exponential decay model of FLIM-AKAR. We will also emphasize the importance of using the right decay equations, strategies to estimate sensor decays, and how the flexibility of FLiSimBA allows users to input different forms of models to describe their specific sensor histograms. We will additionally provide data showing the goodness of fit for both simulated data and experimental data.

      Also, in Figure 2C, the 'sensor only' simulation without accounting for autofluorescence (as seen in Sensor + autoF) or afterpulse and background fluorescence (as seen in Final simulated data) seems to recapitulate the experimental data reasonably well. So, at least in this particular case where experimental data is limited by its broad spread with limited data points, being able to incorporate the additional noise factors into the simulation tool didn't seem to matter too much.

      We agree that in Figure 2C the contributions from autofluorescence, afterpulse, and background signals are small, because sensor photon count is high here. As seen in Figure 2B, when sensor photon counts are higher, the contributions from these other factors become less pronounced. The simulated data in Figure 2C were based on high photon counts because the simulated P1 value was determined by fitting experimental data. To achieve reasonable fitting with minimal interference from autofluorescence, afterpulse, and background signals, we used experimental data with high sensor expression. We will clarify these details in our revision.

    1. Risques

      Risques : 3 grandes catégories Catégorie 1 : - Risques liés à l’utilisation d’outils d’IA générative par le personnel de l’administration publique : Confidentialité de l’information sensible et protection des renseignements personnels. - Biais - Fiabilité - Dépendance technologique - Respect de la législation - Respect de la propriété intellectuelle - Génération de code non sécurisé

      Catégorie 2 : Risques de l’IA générative liés à la cybercriminalité 2.1 Risques d’utilisation de l’IA générative par les cybercriminels : - L’IA peut être utilisée pour amplifier les campagnes de mésinformation et de désinformation - L’IA peut également être exploitée pour des attaques d’hameçonnage plus sophistiquées - L’IA peut jouer un rôle dans la propagation de rançongiciels - L’IA générative renforce les menaces d'ingénierie sociale

      2.2 Risques d’exploitation des vulnérabilités d’un modèle d’IA générative par les cybercriminels: - Les attaques par empoisonnement - L’extraction de données - Les violations par abus - Perte de confiance envers les institutions publiques et le gouvernement du Québec

    1. Risques : 3 grandes catégories Catégorie 1 : - Risques liés à l’utilisation d’outils d’IA générative par le personnel de l’administration publique : Confidentialité de l’information sensible et protection des renseignements personnels. - Biais - Fiabilité - Dépendance technologique - Respect de la législation - Respect de la propriété intellectuelle - Génération de code non sécurisé

      Catégorie 2 : Risques de l’IA générative liés à la cybercriminalité 2.1 Risques d’utilisation de l’IA générative par les cybercriminels : - L’IA peut être utilisée pour amplifier les campagnes de mésinformation et de désinformation - L’IA peut également être exploitée pour des attaques d’hameçonnage plus sophistiquées - L’IA peut jouer un rôle dans la propagation de rançongiciels - L’IA générative renforce les menaces d'ingénierie sociale

      2.2 Risques d’exploitation des vulnérabilités d’un modèle d’IA générative par les cybercriminels: - Les attaques par empoisonnement - L’extraction de données - Les violations par abus - Perte de confiance envers les institutions publiques et le gouvernement du Québec

    1. Last Modified on 10/26/2023 3:41 pm EDT stages® stores Alarm History on all sites. In the

      There's some weird hyperlinking error happening in this highlighted text. Just looking at the source code in Chrome's developer tool, I can't figure out how/why that, at least for me, summons the Windows print preview dialog, is being applied so frequently.

    2. the alarm number, Priority, Alarm Event Code, the date and time of the alarm, the date and time the alarm was retrieved and locked by an operator, the time between the alarm generation and the retrieval, the time the alarm was cleared, the transmit# of the device, the Action Plan followed, the action plan's variation number if applicable, the initials of the operator who originally locked the account, and the disposition of the alarm if applicable

      This info might be better digestible in a bulleted list

    1. Author response:

      Public Reviews:

      Summary:

      We sincerely thank the reviewers for their insightful and thorough feedback. Their comments cover both technical and conceptual aspects of our project, which we have attempted to address in our provisional responses.

      First, we would like to clarify that any current lack of documentation or technical issues (such as local installation challenges) reflect the software's early stage. These aspects are receiving our full attention and are not intended to remain in their current state. As suggested, we plan to enhance the toolbox’s structure by separating it into a standalone library and a web application, alongside developing smaller satellite apps for SWC and MOD file management. We will also expand our documentation, provide a more detailed user guide, and add video tutorials for the GUI.

      Second, we have clarified the rationale behind specific implementation choices in our software, explaining why certain features of the toolbox were designed and implemented in particular ways. Our goal is to maintain a strong focus on single-cell level modeling, addressing its various aspects in great detail. We are also working on new features, such as automated parameter optimization and support for multiple output formats, to further enrich the toolbox’s functionality.

      Reviewer #1 (Public review):

      Summary:

      Dendrotweaks provides its users with a solid tool to implement, visualize, tune, validate, understand, and reduce single-neuron models that incorporate complex dendritic arbors with differential distribution of biophysical mechanisms. The visualization of dendritic segments and biophysical mechanisms therein provide users with an intuitive way to understand and appreciate dendritic physiology.

      Strengths:

      (1) The visualization tools are simplified, elegant, and intuitive.

      (2) The ability to build single-neuron models using simple and intuitive interfaces.

      (3) The ability to validate models with different measurements.

      (4) The ability to systematically and progressively reduce morphologically-realistic neuronal models.

      We thank the reviewer for their positive comments.

      Weaknesses:

      (1) Inability to account for neuron-to-neuron variability in structural, biophysical, and physiological properties in the model-building and validation processes.

      We agree with the reviewer that it is important to account for neuron-to-neuron variability. The core approach of DendroTweaks and its distinctive feature is interactive exploration of how morpho-electric parameters affect neuronal activity. In light of this, variability can be achieved through interactive updating of the model parameters with widgets. In a sense, by adjusting a widget (e.g., channel distribution or kinetics), a user ends up with a new instance of a cell in the parameter space and receives almost real-time feedback on how this change affects neuronal activity. Implementing complex algorithms to account for neuron-to-neuron variability during the validation process would detract from the interactivity aspect of the GUI. That being said, we acknowledge the importance of this issue and we will explore the options to address it more comprehensively in our revised manuscript.

      (2) Inability to account for the many-to-many mapping between ion channels and physiological outcomes. Reliance on hand-tuning provides a single biased model that does not respect pronounced neuron-to-neuron variability observed in electrophysiological measurements.

      We acknowledge the challenge of accounting for degeneracy in the relation between ion channels and physiological outcomes and the importance of capturing neuron-to-neuron variability. One possible way to address this, as we mention in the Discussion, is to integrate automated parameter optimization algorithms alongside the existing interactive hand-tuning with widgets. We are currently exploring the possibility of integrating Jaxley (Deistler et al., 2024) into DendroTweaks in addition to NEURON. This would allow for automated and fast gradient-based parameter optimization, including optimization of heterogeneous channel distributions.

      (3) Lack of a demonstration on how to connect reduced models into a network within the toolbox.

      Building a network of reduced models is a promising direction, albeit it goes beyond the scope of this manuscript. We do not plan to add support for network models to the toolbox itself. In DendroTweaks, we focus on single-cell modeling, aiming to cover its various aspects in great detail. Of course, such refined single-cell models—both detailed and reduced—are likely to be integrated into networks but this will not take place within the DendroTweaks toolbox. To support the integration of DendroTweaks-produced model neurons into networks, we will focus on better compatibility with existing formats and standards and improve exporting capabilities. It is already possible to export reduced morphologies as SWC files, standardized ion channel models as MOD files and channel distributions as JSON files. Nevertheless, as a proof of concept, we plan to generate a simple network of exported reduced models outside the toolbox and include it as a separate Jupyter notebook.

      (4) Lack of a set of tutorials, which is common across many "Tools and Resources" papers, that would be helpful in users getting acquainted with the toolbox.

      This is a valid concern that we aim to address promptly. Currently, an online user guide is available at https://dendrotweaks.dendrites.gr/guide.html. This guide introduces users to the GUI elements and covers basic use cases. We are working on video tutorials and detailed documentation, which will be available soon (as part of the revised manuscript). The toolbox will be split into two parts: a Bokeh app and a standalone library. The library will offer the core functionality, such as reducing morphology and standardizing channels, without the GUI, enabling bulk processing. It will be installable through PyPI and integrated into the app code as an external library. We will provide thorough documentation for all classes and functions in the library.

      Reviewer #2 (Public review):

      The paper by Makarov et al. describes the software tool called DendroTweaks, intended for the examination of multi-compartmental biophysically detailed neuron models. It offers extensive capabilities for working with very complex distributed biophysical neuronal models and should be a useful addition to the growing ecosystem of tools for neuronal modeling.

      Strengths

      (1) This Python-based tool allows for visualization of a neuronal model's compartments.

      (2) The tool works with morphology reconstructions in the widely used .swc and .asc formats.

      (3) It can support many neuronal models using the NMODL language, which is widely used for neuronal modeling.

      (4) It permits one to plot the properties of linear and non-linear conductances in every compartment of a neuronal model, facilitating examination of the model's details.

      (5) DendroTweaks supports manipulation of the model parameters and morphological details, which is important for the exploration of the relations of the model composition and parameters with its electrophysiological activity.

      (6) The paper is very well written - everything is clear, and the capabilities of the tool are described and illustrated with great attention to detail.

      We thank the reviewer for their positive comments.

      Weaknesses

      (1) Not a really big weakness, but it would be really helpful if the authors showed how the performance of their tool scales. This can be done for an increasing number of compartments - how long does it take to carry out typical procedures in DendroTweaks, on a given hardware, for a cell model with 100 compartments, 200, 300, and so on? This information will be quite useful to understand the applicability of the software.

      DendroTweaks functions as a layer on top of a simulation engine. As a result, currently its performance scales in proportion to the NEURON’s one. Note that the GUI displays the time taken to run a given simulation in NEURON at the bottom of the Simulation tab in the left menu. While GUI-related processing and rendering also consume time, this is not as straightforward to measure. Nonetheless, we will explore options to provide suggested benchmarking in the revised manuscript.

      (2) Let me also add here a few suggestions (not weaknesses, but something that can be useful, and if the authors can easily add some of these for publication, that would strongly increase the value of the paper).

      (3) It would be very helpful to add functionality to read major formats in the field, such as NeuroML and SONATA.

      We agree with the reviewer that support for major formats will substantially improve and ensure reproducibility and reusability of the models. As mentioned in the Discussion, we plan to add support for NeuroML. Regarding SONATA, it is indeed possible to view our models as a network with a single morphologically-detailed biophysical node receiving inputs from multiple populations of virtual nodes. In future editions of the tool we plan to expand its support for additional file formats.

      (4) Visualization is available as a static 2D projection of the cell's morphology. It would be nice to implement 3D interactive visualization.

      We offer an option to rotate a cell around the vertical axis using a slider under the plot. This is a workaround, as implementing a true 3D visualization in Bokeh would require custom Bokeh elements, along with external JavaScript libraries. Despite these implementation difficulties, we advocate for a different approach than the one used in most of the morphology viewers mentioned in the Discussion. The core idea of DendroTweaks' morphology exploration is that each section is "clickable" allowing its geometric properties to be examined in a 2D Section view. Furthermore, we believe the Graph view presents the overall cell topology more clearly than a 3D visualization.

      (5) It is nice that DendroTweaks can modify the models, such as revising the radii of the morphological segments or ionic conductances. It would be really useful then to have the functionality for writing the resulting models into files for subsequent reuse.

      This functionality is already available. Users can export JSON files with channel distributions and SWC files after morphology reduction through the GUI. In the standalone version, users can modify and export SWC files, as well as export MOD files after standardization. Please note that in the online demo version export and import functionality is currently limited, but we plan to fully enable it when submitting our revisions. We are considering separating file managers as satellite apps—one for SWC and one for MOD files. It is worth mentioning that the MOD file manager along with parsing the files and generating Python classes for visualization purposes is already capable of producing Jaxley-compatible Python channel classes.

      (6) If I didn't miss something, it seems that DendroTweaks supports the allocation of groups of synapses, where all synapses in a group receive the same type of Poisson spike train. It would be very useful to provide more flexibility. One option is to leverage the SONATA format, which has ample functionality for specifying such diverse inputs.

      Currently, each group shares the same set of parameters for both biophysical properties of synapses (e.g., reversal potential, time constants) and presynaptic "population" activity (e.g., rate, onset). The parameter that controls an incoming Poisson spike train is the rate, which is indeed shared across all synapses in a group. The suggestion to allow for variability in input properties within a group is interesting and is worth implementing. We will explore this in the revised manuscript.

      (7) "Each session can be saved as a .json file and reuploaded when needed" - do these files contain the whole history of the session or the exact snapshot of what is visualized when the file is saved? If the latter, which variables are saved, and which are not? Please clarify.

      These files capture the exact snapshot of the model's latest state. They include model parameters such as channel distributions, equilibrium potentials, and temperature. Currently, stimuli (current clamps and synapses) are not saved. However, we plan to add an option to export stimuli parameters in the same JSON file. This will also be available as part of the revised manuscript.

      References

      Michael Deistler, Kyra L. Kadhim, Matthijs Pals, Jonas Beck, Ziwei Huang, Manuel Gloeckler, Janne K. Lappalainen, Cornelius Schröder, Philipp Berens, Pedro J. Gonçalves, Jakob H. Macke Differentiable simulation enables large-scale training of detailed biophysical models of neural dynamics bioRxiv 2024.08.21.608979; doi:https://doi.org/10.1101/2024.08.21.608979

    1. Reviewer #1 (Public review):

      Summary:

      The authors introduce a novel algorithm for the automatic identification of long-range axonal projections. This is an important problem as modern high-throughput imaging techniques can produce large amounts of raw data, but identifying neuronal morphologies and connectivities requires large amounts of manual work. The algorithm works by first identifying points in three-dimensional space corresponding to parts of labelled neural projections, these are then used to identify short sections of axons using an optimisation algorithm and the prior knowledge that axonal diameters are relatively constant. Finally, a statistical model that assumes axons tend to be smooth is used to connect the sections together into complete and distinct neural trees. The authors demonstrate that their algorithm is far superior to existing techniques, especially when dense labelling of the tissue means that neighbouring neurites interfere with the reconstruction. Despite this improvement, however, the accuracy of reconstruction remains below 90%, so manual proofreading is still necessary to produce accurate reconstructions of axons.

      Strengths:

      The new algorithm combines local and global information to make a significant improvement on the state-of-the-art for automatic axonal reconstruction. The method could be applied more broadly and might have applications to reconstructions of electron microscopy data, where similar issues of high-throughput imaging and relatively slow or inaccurate reconstruction remain.

      Weaknesses:

      There are three weaknesses in the algorithm and manuscript.

      (1) The best reconstruction accuracy is below 90%, which does not fully solve the problem of needing manual proofreading.

      (2) The 'minimum information flow tree' model the authors use to construct connected axonal trees has the potential to bias data collection. In particular, the assumption that axons should always be as smooth as possible is not always correct. This is a good rule-of-thumb for reconstructions, but real axons in many systems can take quite sharp turns and this is also seen in the data presented in the paper (Figure 1C). I would like to see explicit acknowledgement of this bias in the current manuscript and ideally a relaxation of this rule in any later versions of the algorithm.

      (3) The writing of the manuscript is not always as clear as it could be. The manuscript would benefit from careful copy editing for language, and the Methods section in particular should be expanded to more clearly explain what each algorithm is doing. The pseudo-code of the Supplemental Information could be brought into the Methods if possible as these algorithms are so fundamental to the manuscript.

    1. #if defined(CONFIG_ARCH_HAS_PTE_DEVMAP) && defined(CONFIG_TRANSPARENT_HUGEPAGE) static int __gup_device_huge(unsigned long pfn, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { int nr_start = *nr; struct dev_pagemap *pgmap = NULL; do { struct page *page = pfn_to_page(pfn); pgmap = get_dev_pagemap(pfn, pgmap); if (unlikely(!pgmap)) { undo_dev_pagemap(nr, nr_start, flags, pages); break; } if (!(flags & FOLL_PCI_P2PDMA) && is_pci_p2pdma_page(page)) { undo_dev_pagemap(nr, nr_start, flags, pages); break; } SetPageReferenced(page); pages[*nr] = page; if (unlikely(try_grab_page(page, flags))) { undo_dev_pagemap(nr, nr_start, flags, pages); break; } (*nr)++; pfn++; } while (addr += PAGE_SIZE, addr != end); put_dev_pagemap(pgmap); return addr == end; } static int __gup_device_huge_pmd(pmd_t orig, pmd_t *pmdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long fault_pfn; int nr_start = *nr; fault_pfn = pmd_pfn(orig) + ((addr & ~PMD_MASK) >> PAGE_SHIFT); if (!__gup_device_huge(fault_pfn, addr, end, flags, pages, nr)) return 0; if (unlikely(pmd_val(orig) != pmd_val(*pmdp))) { undo_dev_pagemap(nr, nr_start, flags, pages); return 0; } return 1; } static int __gup_device_huge_pud(pud_t orig, pud_t *pudp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long fault_pfn; int nr_start = *nr; fault_pfn = pud_pfn(orig) + ((addr & ~PUD_MASK) >> PAGE_SHIFT); if (!__gup_device_huge(fault_pfn, addr, end, flags, pages, nr)) return 0; if (unlikely(pud_val(orig) != pud_val(*pudp))) { undo_dev_pagemap(nr, nr_start, flags, pages); return 0; } return 1; } #else static int __gup_device_huge_pmd(pmd_t orig, pmd_t *pmdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { BUILD_BUG(); return 0; } static int __gup_device_huge_pud(pud_t pud, pud_t *pudp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { BUILD_BUG(); return 0; } #endif

      seems like a check to see if pages can be grabbed. A quick skim maybe hints possible checks if huge pages can be grabbed?

    2. #ifdef CONFIG_ARCH_HAS_HUGEPD static unsigned long hugepte_addr_end(unsigned long addr, unsigned long end, unsigned long sz) { unsigned long __boundary = (addr + sz) & ~(sz-1); return (__boundary - 1 < end - 1) ? __boundary : end; } static int gup_hugepte(pte_t *ptep, unsigned long sz, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long pte_end; struct page *page; struct folio *folio; pte_t pte; int refs; pte_end = (addr + sz) & ~(sz-1); if (pte_end < end) end = pte_end; pte = huge_ptep_get(ptep); if (!pte_access_permitted(pte, flags & FOLL_WRITE)) return 0; /* hugepages are never "special" */ VM_BUG_ON(!pfn_valid(pte_pfn(pte))); page = nth_page(pte_page(pte), (addr & (sz - 1)) >> PAGE_SHIFT); refs = record_subpages(page, addr, end, pages + *nr); folio = try_grab_folio(page, refs, flags); if (!folio) return 0; if (unlikely(pte_val(pte) != pte_val(ptep_get(ptep)))) { gup_put_folio(folio, refs, flags); return 0; } if (!folio_fast_pin_allowed(folio, flags)) { gup_put_folio(folio, refs, flags); return 0; } if (!pte_write(pte) && gup_must_unshare(NULL, flags, &folio->page)) { gup_put_folio(folio, refs, flags); return 0; } *nr += refs; folio_set_referenced(folio); return 1; } static int gup_huge_pd(hugepd_t hugepd, unsigned long addr, unsigned int pdshift, unsigned long end, unsigned int flags, struct page **pages, int *nr) { pte_t *ptep; unsigned long sz = 1UL << hugepd_shift(hugepd); unsigned long next; ptep = hugepte_offset(hugepd, addr, pdshift); do { next = hugepte_addr_end(addr, end, sz); if (!gup_hugepte(ptep, sz, addr, end, flags, pages, nr)) return 0; } while (ptep++, addr = next, addr != end); return 1; } #else static inline int gup_huge_pd(hugepd_t hugepd, unsigned long addr, unsigned int pdshift, unsigned long end, unsigned int flags, struct page **pages, int *nr) { return 0; } #endif /* CONFIG_ARCH_HAS_HUGEPD */ static int gup_huge_pmd(pmd_t orig, pmd_t *pmdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { struct page *page; struct folio *folio; int refs; if (!pmd_access_permitted(orig, flags & FOLL_WRITE)) return 0; if (pmd_devmap(orig)) { if (unlikely(flags & FOLL_LONGTERM)) return 0; return __gup_device_huge_pmd(orig, pmdp, addr, end, flags, pages, nr); } page = nth_page(pmd_page(orig), (addr & ~PMD_MASK) >> PAGE_SHIFT); refs = record_subpages(page, addr, end, pages + *nr); folio = try_grab_folio(page, refs, flags); if (!folio) return 0; if (unlikely(pmd_val(orig) != pmd_val(*pmdp))) { gup_put_folio(folio, refs, flags); return 0; } if (!folio_fast_pin_allowed(folio, flags)) { gup_put_folio(folio, refs, flags); return 0; } if (!pmd_write(orig) && gup_must_unshare(NULL, flags, &folio->page)) { gup_put_folio(folio, refs, flags); return 0; } *nr += refs; folio_set_referenced(folio); return 1; } static int gup_huge_pud(pud_t orig, pud_t *pudp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { struct page *page; struct folio *folio; int refs; if (!pud_access_permitted(orig, flags & FOLL_WRITE)) return 0; if (pud_devmap(orig)) { if (unlikely(flags & FOLL_LONGTERM)) return 0; return __gup_device_huge_pud(orig, pudp, addr, end, flags, pages, nr); } page = nth_page(pud_page(orig), (addr & ~PUD_MASK) >> PAGE_SHIFT); refs = record_subpages(page, addr, end, pages + *nr); folio = try_grab_folio(page, refs, flags); if (!folio) return 0; if (unlikely(pud_val(orig) != pud_val(*pudp))) { gup_put_folio(folio, refs, flags); return 0; } if (!folio_fast_pin_allowed(folio, flags)) { gup_put_folio(folio, refs, flags); return 0; } if (!pud_write(orig) && gup_must_unshare(NULL, flags, &folio->page)) { gup_put_folio(folio, refs, flags); return 0; } *nr += refs; folio_set_referenced(folio); return 1; } static int gup_huge_pgd(pgd_t orig, pgd_t *pgdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { int refs; struct page *page; struct folio *folio; if (!pgd_access_permitted(orig, flags & FOLL_WRITE)) return 0; BUILD_BUG_ON(pgd_devmap(orig)); page = nth_page(pgd_page(orig), (addr & ~PGDIR_MASK) >> PAGE_SHIFT); refs = record_subpages(page, addr, end, pages + *nr); folio = try_grab_folio(page, refs, flags); if (!folio) return 0; if (unlikely(pgd_val(orig) != pgd_val(*pgdp))) { gup_put_folio(folio, refs, flags); return 0; } if (!pgd_write(orig) && gup_must_unshare(NULL, flags, &folio->page)) { gup_put_folio(folio, refs, flags); return 0; } if (!folio_fast_pin_allowed(folio, flags)) { gup_put_folio(folio, refs, flags); return 0; } *nr += refs; folio_set_referenced(folio); return 1; } static int gup_pmd_range(pud_t *pudp, pud_t pud, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long next; pmd_t *pmdp; pmdp = pmd_offset_lockless(pudp, pud, addr); do { pmd_t pmd = pmdp_get_lockless(pmdp); next = pmd_addr_end(addr, end); if (!pmd_present(pmd)) return 0; if (unlikely(pmd_trans_huge(pmd) || pmd_huge(pmd) || pmd_devmap(pmd))) { /* See gup_pte_range() */ if (pmd_protnone(pmd)) return 0; if (!gup_huge_pmd(pmd, pmdp, addr, next, flags, pages, nr)) return 0; } else if (unlikely(is_hugepd(__hugepd(pmd_val(pmd))))) { /* * architecture have different format for hugetlbfs * pmd format and THP pmd format */ if (!gup_huge_pd(__hugepd(pmd_val(pmd)), addr, PMD_SHIFT, next, flags, pages, nr)) return 0; } else if (!gup_pte_range(pmd, pmdp, addr, next, flags, pages, nr)) return 0; } while (pmdp++, addr = next, addr != end); return 1; } static int gup_pud_range(p4d_t *p4dp, p4d_t p4d, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long next; pud_t *pudp; pudp = pud_offset_lockless(p4dp, p4d, addr); do { pud_t pud = READ_ONCE(*pudp); next = pud_addr_end(addr, end); if (unlikely(!pud_present(pud))) return 0; if (unlikely(pud_huge(pud) || pud_devmap(pud))) { if (!gup_huge_pud(pud, pudp, addr, next, flags, pages, nr)) return 0; } else if (unlikely(is_hugepd(__hugepd(pud_val(pud))))) { if (!gup_huge_pd(__hugepd(pud_val(pud)), addr, PUD_SHIFT, next, flags, pages, nr)) return 0; } else if (!gup_pmd_range(pudp, pud, addr, next, flags, pages, nr)) return 0; } while (pudp++, addr = next, addr != end); return 1; } static int gup_p4d_range(pgd_t *pgdp, pgd_t pgd, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long next; p4d_t *p4dp; p4dp = p4d_offset_lockless(pgdp, pgd, addr); do { p4d_t p4d = READ_ONCE(*p4dp); next = p4d_addr_end(addr, end); if (p4d_none(p4d)) return 0; BUILD_BUG_ON(p4d_huge(p4d)); if (unlikely(is_hugepd(__hugepd(p4d_val(p4d))))) { if (!gup_huge_pd(__hugepd(p4d_val(p4d)), addr, P4D_SHIFT, next, flags, pages, nr)) return 0; } else if (!gup_pud_range(p4dp, p4d, addr, next, flags, pages, nr)) return 0; } while (p4dp++, addr = next, addr != end); return 1; } static void gup_pgd_range(unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { unsigned long next; pgd_t *pgdp; pgdp = pgd_offset(current->mm, addr); do { pgd_t pgd = READ_ONCE(*pgdp); next = pgd_addr_end(addr, end); if (pgd_none(pgd)) return; if (unlikely(pgd_huge(pgd))) { if (!gup_huge_pgd(pgd, pgdp, addr, next, flags, pages, nr)) return; } else if (unlikely(is_hugepd(__hugepd(pgd_val(pgd))))) { if (!gup_huge_pd(__hugepd(pgd_val(pgd)), addr, PGDIR_SHIFT, next, flags, pages, nr)) return; } else if (!gup_p4d_range(pgdp, pgd, addr, next, flags, pages, nr)) return; } while (pgdp++, addr = next, addr != end); } #else static inline void gup_pgd_range(unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { }

      policy use functions for gup_huge pte policy code function above (not right above, gotta scroll probably to find it)

    3. static int internal_get_user_pages_fast(unsigned long start, unsigned long nr_pages, unsigned int gup_flags, struct page **pages) { unsigned long len, end; unsigned long nr_pinned; int locked = 0; int ret; if (WARN_ON_ONCE(gup_flags & ~(FOLL_WRITE | FOLL_LONGTERM | FOLL_FORCE | FOLL_PIN | FOLL_GET | FOLL_FAST_ONLY | FOLL_NOFAULT | FOLL_PCI_P2PDMA | FOLL_HONOR_NUMA_FAULT))) return -EINVAL; if (gup_flags & FOLL_PIN) mm_set_has_pinned_flag(&current->mm->flags); if (!(gup_flags & FOLL_FAST_ONLY)) might_lock_read(&current->mm->mmap_lock); start = untagged_addr(start) & PAGE_MASK; len = nr_pages << PAGE_SHIFT; if (check_add_overflow(start, len, &end)) return -EOVERFLOW; if (end > TASK_SIZE_MAX) return -EFAULT; if (unlikely(!access_ok((void __user *)start, len))) return -EFAULT; nr_pinned = lockless_pages_from_mm(start, end, gup_flags, pages); if (nr_pinned == nr_pages || gup_flags & FOLL_FAST_ONLY) return nr_pinned; /* Slow path: try to get the remaining pages with get_user_pages */ start += nr_pinned << PAGE_SHIFT; pages += nr_pinned; ret = __gup_longterm_locked(current->mm, start, nr_pages - nr_pinned, pages, &locked, gup_flags | FOLL_TOUCH | FOLL_UNLOCKABLE); if (ret < 0) { /* * The caller has to unpin the pages we already pinned so * returning -errno is not an option */ if (nr_pinned) return nr_pinned; return ret; } return ret + nr_pinned; } /** * get_user_pages_fast_only() - pin user pages in memory * @start: starting user address * @nr_pages: number of pages from start to pin * @gup_flags: flags modifying pin behaviour * @pages: array that receives pointers to the pages pinned. * Should be at least nr_pages long. * * Like get_user_pages_fast() except it's IRQ-safe in that it won't fall back to * the regular GUP. * * If the architecture does not support this function, simply return with no * pages pinned. * * Careful, careful! COW breaking can go either way, so a non-write * access can get ambiguous page results. If you call this function without * 'write' set, you'd better be sure that you're ok with that ambiguity. */ int get_user_pages_fast_only(unsigned long start, int nr_pages, unsigned int gup_flags, struct page **pages) { /* * Internally (within mm/gup.c), gup fast variants must set FOLL_GET, * because gup fast is always a "pin with a +1 page refcount" request. * * FOLL_FAST_ONLY is required in order to match the API description of * this routine: no fall back to regular ("slow") GUP. */ if (!is_valid_gup_args(pages, NULL, &gup_flags, FOLL_GET | FOLL_FAST_ONLY)) return -EINVAL; return internal_get_user_pages_fast(start, nr_pages, gup_flags, pages); } EXPORT_SYMBOL_GPL(get_user_pages_fast_only); /** * get_user_pages_fast() - pin user pages in memory * @start: starting user address * @nr_pages: number of pages from start to pin * @gup_flags: flags modifying pin behaviour * @pages: array that receives pointers to the pages pinned. * Should be at least nr_pages long. * * Attempt to pin user pages in memory without taking mm->mmap_lock. * If not successful, it will fall back to taking the lock and * calling get_user_pages(). * * Returns number of pages pinned. This may be fewer than the number requested. * If nr_pages is 0 or negative, returns 0. If no pages were pinned, returns * -errno. */ int get_user_pages_fast(unsigned long start, int nr_pages, unsigned int gup_flags, struct page **pages) { /* * The caller may or may not have explicitly set FOLL_GET; either way is * OK. However, internally (within mm/gup.c), gup fast variants must set * FOLL_GET, because gup fast is always a "pin with a +1 page refcount" * request. */ if (!is_valid_gup_args(pages, NULL, &gup_flags, FOLL_GET)) return -EINVAL; return internal_get_user_pages_fast(start, nr_pages, gup_flags, pages); } EXPORT_SYMBOL_GPL(get_user_pages_fast); /** * pin_user_pages_fast() - pin user pages in memory without taking locks * * @start: starting user address * @nr_pages: number of pages from start to pin * @gup_flags: flags modifying pin behaviour * @pages: array that receives pointers to the pages pinned. * Should be at least nr_pages long. * * Nearly the same as get_user_pages_fast(), except that FOLL_PIN is set. See * get_user_pages_fast() for documentation on the function arguments, because * the arguments here are identical. * * FOLL_PIN means that the pages must be released via unpin_user_page(). Please * see Documentation/core-api/pin_user_pages.rst for further details. * * Note that if a zero_page is amongst the returned pages, it will not have * pins in it and unpin_user_page() will not remove pins from it. */ int pin_user_pages_fast(unsigned long start, int nr_pages, unsigned int gup_flags, struct page **pages) { if (!is_valid_gup_args(pages, NULL, &gup_flags, FOLL_PIN)) return -EINVAL; return internal_get_user_pages_fast(start, nr_pages, gup_flags, pages); } EXPORT_SYMBOL_GPL(pin_user_pages_fast); /** * pin_user_pages_remote() - pin pages of a remote process * * @mm: mm_struct of target mm * @start: starting user address * @nr_pages: number of pages from start to pin * @gup_flags: flags modifying lookup behaviour * @pages: array that receives pointers to the pages pinned. * Should be at least nr_pages long. * @locked: pointer to lock flag indicating whether lock is held and * subsequently whether VM_FAULT_RETRY functionality can be * utilised. Lock must initially be held. * * Nearly the same as get_user_pages_remote(), except that FOLL_PIN is set. See * get_user_pages_remote() for documentation on the function arguments, because * the arguments here are identical. * * FOLL_PIN means that the pages must be released via unpin_user_page(). Please * see Documentation/core-api/pin_user_pages.rst for details. * * Note that if a zero_page is amongst the returned pages, it will not have * pins in it and unpin_user_page*() will not remove pins from it. */ long pin_user_pages_remote(struct mm_struct *mm, unsigned long start, unsigned long nr_pages, unsigned int gup_flags, struct page **pages, int *locked) { int local_locked = 1; if (!is_valid_gup_args(pages, locked, &gup_flags, FOLL_PIN | FOLL_TOUCH | FOLL_REMOTE)) return 0; return __gup_longterm_locked(mm, start, nr_pages, pages, locked ? locked : &local_locked, gup_flags); } EXPORT_SYMBOL(pin_user_pages_remote); /** * pin_user_pages() - pin user pages in memory for use by other devices * * @start: starting user address * @nr_pages: number of pages from start to pin * @gup_flags: flags modifying lookup behaviour * @pages: array that receives pointers to the pages pinned. * Should be at least nr_pages long. * * Nearly the same as get_user_pages(), except that FOLL_TOUCH is not set, and * FOLL_PIN is set. * * FOLL_PIN means that the pages must be released via unpin_user_page(). Please * see Documentation/core-api/pin_user_pages.rst for details. * * Note that if a zero_page is amongst the returned pages, it will not have * pins in it and unpin_user_page*() will not remove pins from it. */ long pin_user_pages(unsigned long start, unsigned long nr_pages, unsigned int gup_flags, struct page **pages) { int locked = 1; if (!is_valid_gup_args(pages, NULL, &gup_flags, FOLL_PIN)) return 0; return __gup_longterm_locked(current->mm, start, nr_pages, pages, &locked, gup_flags); } EXPORT_SYMBOL(pin_user_pages); /* * pin_user_pages_unlocked() is the FOLL_PIN variant of * get_user_pages_unlocked(). Behavior is the same, except that this one sets * FOLL_PIN and rejects FOLL_GET. * * Note that if a zero_page is amongst the returned pages, it will not have * pins in it and unpin_user_page*() will not remove pins from it. */ long pin_user_pages_unlocked(unsigned long start, unsigned long nr_pages, struct page **pages, unsigned int gup_flags) { int locked = 0; if (!is_valid_gup_args(pages, NULL, &gup_flags, FOLL_PIN | FOLL_TOUCH | FOLL_UNLOCKABLE)) return 0; return __gup_longterm_locked(current->mm, start, nr_pages, pages, &locked, gup_flags); }

      fast gup functions

    4. #ifdef CONFIG_ARCH_HAS_PTE_SPECIAL /* * Fast-gup relies on pte change detection to avoid concurrent pgtable * operations. * * To pin the page, fast-gup needs to do below in order: * (1) pin the page (by prefetching pte), then (2) check pte not changed. * * For the rest of pgtable operations where pgtable updates can be racy * with fast-gup, we need to do (1) clear pte, then (2) check whether page * is pinned. * * Above will work for all pte-level operations, including THP split. * * For THP collapse, it's a bit more complicated because fast-gup may be * walking a pgtable page that is being freed (pte is still valid but pmd * can be cleared already). To avoid race in such condition, we need to * also check pmd here to make sure pmd doesn't change (corresponds to * pmdp_collapse_flush() in the THP collapse code path). */ static int gup_pte_range(pmd_t pmd, pmd_t *pmdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { struct dev_pagemap *pgmap = NULL; int nr_start = *nr, ret = 0; pte_t *ptep, *ptem; ptem = ptep = pte_offset_map(&pmd, addr); if (!ptep) return 0; do { pte_t pte = ptep_get_lockless(ptep); struct page *page; struct folio *folio; /* * Always fallback to ordinary GUP on PROT_NONE-mapped pages: * pte_access_permitted() better should reject these pages * either way: otherwise, GUP-fast might succeed in * cases where ordinary GUP would fail due to VMA access * permissions. */ if (pte_protnone(pte)) goto pte_unmap; if (!pte_access_permitted(pte, flags & FOLL_WRITE)) goto pte_unmap; if (pte_devmap(pte)) { if (unlikely(flags & FOLL_LONGTERM)) goto pte_unmap; pgmap = get_dev_pagemap(pte_pfn(pte), pgmap); if (unlikely(!pgmap)) { undo_dev_pagemap(nr, nr_start, flags, pages); goto pte_unmap; } } else if (pte_special(pte)) goto pte_unmap; VM_BUG_ON(!pfn_valid(pte_pfn(pte))); page = pte_page(pte); folio = try_grab_folio(page, 1, flags); if (!folio) goto pte_unmap; if (unlikely(folio_is_secretmem(folio))) { gup_put_folio(folio, 1, flags); goto pte_unmap; } if (unlikely(pmd_val(pmd) != pmd_val(*pmdp)) || unlikely(pte_val(pte) != pte_val(ptep_get(ptep)))) { gup_put_folio(folio, 1, flags); goto pte_unmap; } if (!folio_fast_pin_allowed(folio, flags)) { gup_put_folio(folio, 1, flags); goto pte_unmap; } if (!pte_write(pte) && gup_must_unshare(NULL, flags, page)) { gup_put_folio(folio, 1, flags); goto pte_unmap; } /* * We need to make the page accessible if and only if we are * going to access its content (the FOLL_PIN case). Please * see Documentation/core-api/pin_user_pages.rst for * details. */ if (flags & FOLL_PIN) { ret = arch_make_page_accessible(page); if (ret) { gup_put_folio(folio, 1, flags); goto pte_unmap; } } folio_set_referenced(folio); pages[*nr] = page; (*nr)++; } while (ptep++, addr += PAGE_SIZE, addr != end); ret = 1; pte_unmap: if (pgmap) put_dev_pagemap(pgmap); pte_unmap(ptem); return ret; } #else /* * If we can't determine whether or not a pte is special, then fail immediately * for ptes. Note, we can still pin HugeTLB and THP as these are guaranteed not * to be special. * * For a futex to be placed on a THP tail page, get_futex_key requires a * get_user_pages_fast_only implementation that can pin pages. Thus it's still * useful to have gup_huge_pmd even if we can't operate on ptes. */ static int gup_pte_range(pmd_t pmd, pmd_t *pmdp, unsigned long addr, unsigned long end, unsigned int flags, struct page **pages, int *nr) { return 0; } #endif /* CONFIG_ARCH_HAS_PTE_SPECIAL */

      non concurrent fast gup approach that checks for pinned page and unmaps pte or clears it

    5. #ifdef CONFIG_HAVE_FAST_GUP /* * Used in the GUP-fast path to determine whether a pin is permitted for a * specific folio. * * This call assumes the caller has pinned the folio, that the lowest page table * level still points to this folio, and that interrupts have been disabled. * * Writing to pinned file-backed dirty tracked folios is inherently problematic * (see comment describing the writable_file_mapping_allowed() function). We * therefore try to avoid the most egregious case of a long-term mapping doing * so. * * This function cannot be as thorough as that one as the VMA is not available * in the fast path, so instead we whitelist known good cases and if in doubt, * fall back to the slow path. */ static bool folio_fast_pin_allowed(struct folio *folio, unsigned int flags) { struct address_space *mapping; unsigned long mapping_flags; /* * If we aren't pinning then no problematic write can occur. A long term * pin is the most egregious case so this is the one we disallow. */ if ((flags & (FOLL_PIN | FOLL_LONGTERM | FOLL_WRITE)) != (FOLL_PIN | FOLL_LONGTERM | FOLL_WRITE)) return true; /* The folio is pinned, so we can safely access folio fields. */ if (WARN_ON_ONCE(folio_test_slab(folio))) return false; /* hugetlb mappings do not require dirty-tracking. */ if (folio_test_hugetlb(folio)) return true; /* * GUP-fast disables IRQs. When IRQS are disabled, RCU grace periods * cannot proceed, which means no actions performed under RCU can * proceed either. * * inodes and thus their mappings are freed under RCU, which means the * mapping cannot be freed beneath us and thus we can safely dereference * it. */ lockdep_assert_irqs_disabled(); /* * However, there may be operations which _alter_ the mapping, so ensure * we read it once and only once. */ mapping = READ_ONCE(folio->mapping); /* * The mapping may have been truncated, in any case we cannot determine * if this mapping is safe - fall back to slow path to determine how to * proceed. */ if (!mapping) return false; /* Anonymous folios pose no problem. */ mapping_flags = (unsigned long)mapping & PAGE_MAPPING_FLAGS; if (mapping_flags) return mapping_flags & PAGE_MAPPING_ANON; /* * At this point, we know the mapping is non-null and points to an * address_space object. The only remaining whitelisted file system is * shmem. */ return shmem_mapping(mapping); }

      policy logic. avoids locks unlike get user pages unlocked/locked which seems risky so its not supposed to be used on concurrent gup logic

    6. #ifdef CONFIG_MIGRATION /* * Returns the number of collected pages. Return value is always >= 0. */ static unsigned long collect_longterm_unpinnable_pages( struct list_head *movable_page_list, unsigned long nr_pages, struct page **pages) { unsigned long i, collected = 0; struct folio *prev_folio = NULL; bool drain_allow = true; for (i = 0; i < nr_pages; i++) { struct folio *folio = page_folio(pages[i]); if (folio == prev_folio) continue; prev_folio = folio; if (folio_is_longterm_pinnable(folio)) continue; collected++; if (folio_is_device_coherent(folio)) continue; if (folio_test_hugetlb(folio)) { isolate_hugetlb(folio, movable_page_list); continue; } if (!folio_test_lru(folio) && drain_allow) { lru_add_drain_all(); drain_allow = false; } if (!folio_isolate_lru(folio)) continue; list_add_tail(&folio->lru, movable_page_list); node_stat_mod_folio(folio, NR_ISOLATED_ANON + folio_is_file_lru(folio), folio_nr_pages(folio)); } return collected; }
    7. int __mm_populate(unsigned long start, unsigned long len, int ignore_errors) { struct mm_struct *mm = current->mm; unsigned long end, nstart, nend; struct vm_area_struct *vma = NULL; int locked = 0; long ret = 0; end = start + len; for (nstart = start; nstart < end; nstart = nend) { /* * We want to fault in pages for [nstart; end) address range. * Find first corresponding VMA. */ if (!locked) { locked = 1; mmap_read_lock(mm); vma = find_vma_intersection(mm, nstart, end); } else if (nstart >= vma->vm_end) vma = find_vma_intersection(mm, vma->vm_end, end); if (!vma) break; /* * Set [nstart; nend) to intersection of desired address * range with the first VMA. Also, skip undesirable VMA types. */ nend = min(end, vma->vm_end); if (vma->vm_flags & (VM_IO | VM_PFNMAP)) continue; if (nstart < vma->vm_start) nstart = vma->vm_start; /* * Now fault in a range of pages. populate_vma_page_range() * double checks the vma flags, so that it won't mlock pages * if the vma was already munlocked. */ ret = populate_vma_page_range(vma, nstart, nend, &locked); if (ret < 0) { if (ignore_errors) { ret = 0; continue; /* continue at next VMA */ } break; } nend = nstart + ret * PAGE_SIZE; ret = 0; } if (locked) mmap_read_unlock(mm); return ret; /* 0 or negative error code */ }

      policy use function that populates pages like the func before this.

    8. long populate_vma_page_range(struct vm_area_struct *vma, unsigned long start, unsigned long end, int *locked) { struct mm_struct *mm = vma->vm_mm; unsigned long nr_pages = (end - start) / PAGE_SIZE; int local_locked = 1; int gup_flags; long ret; VM_BUG_ON(!PAGE_ALIGNED(start)); VM_BUG_ON(!PAGE_ALIGNED(end)); VM_BUG_ON_VMA(start < vma->vm_start, vma); VM_BUG_ON_VMA(end > vma->vm_end, vma); mmap_assert_locked(mm); /* * Rightly or wrongly, the VM_LOCKONFAULT case has never used * faultin_page() to break COW, so it has no work to do here. */ if (vma->vm_flags & VM_LOCKONFAULT) return nr_pages; gup_flags = FOLL_TOUCH; /* * We want to touch writable mappings with a write fault in order * to break COW, except for shared mappings because these don't COW * and we would not want to dirty them for nothing. */ if ((vma->vm_flags & (VM_WRITE | VM_SHARED)) == VM_WRITE) gup_flags |= FOLL_WRITE; /* * We want mlock to succeed for regions that have any permissions * other than PROT_NONE. */ if (vma_is_accessible(vma)) gup_flags |= FOLL_FORCE; if (locked) gup_flags |= FOLL_UNLOCKABLE; /* * We made sure addr is within a VMA, so the following will * not result in a stack expansion that recurses back here. */ ret = __get_user_pages(mm, start, nr_pages, gup_flags, NULL, locked ? locked : &local_locked); lru_add_drain(); return ret; }

      policy use code.

    9. static __always_inline long __get_user_pages_locked(struct mm_struct *mm, unsigned long start, unsigned long nr_pages, struct page **pages, int *locked, unsigned int flags) { long ret, pages_done; bool must_unlock = false; /* * The internal caller expects GUP to manage the lock internally and the * lock must be released when this returns. */ if (!*locked) { if (mmap_read_lock_killable(mm)) return -EAGAIN; must_unlock = true; *locked = 1; } else mmap_assert_locked(mm); if (flags & FOLL_PIN) mm_set_has_pinned_flag(&mm->flags); /* * FOLL_PIN and FOLL_GET are mutually exclusive. Traditional behavior * is to set FOLL_GET if the caller wants pages[] filled in (but has * carelessly failed to specify FOLL_GET), so keep doing that, but only * for FOLL_GET, not for the newer FOLL_PIN. * * FOLL_PIN always expects pages to be non-null, but no need to assert * that here, as any failures will be obvious enough. */ if (pages && !(flags & FOLL_PIN)) flags |= FOLL_GET; pages_done = 0; for (;;) { ret = __get_user_pages(mm, start, nr_pages, flags, pages, locked); if (!(flags & FOLL_UNLOCKABLE)) { /* VM_FAULT_RETRY couldn't trigger, bypass */ pages_done = ret; break; } /* VM_FAULT_RETRY or VM_FAULT_COMPLETED cannot return errors */ if (!*locked) { BUG_ON(ret < 0); BUG_ON(ret >= nr_pages); } if (ret > 0) { nr_pages -= ret; pages_done += ret; if (!nr_pages) break; } if (*locked) { /* * VM_FAULT_RETRY didn't trigger or it was a * FOLL_NOWAIT. */ if (!pages_done) pages_done = ret; break; } /* * VM_FAULT_RETRY triggered, so seek to the faulting offset. * For the prefault case (!pages) we only update counts. */ if (likely(pages)) pages += ret; start += ret << PAGE_SHIFT; /* The lock was temporarily dropped, so we must unlock later */ must_unlock = true; retry: /* * Repeat on the address that fired VM_FAULT_RETRY * with both FAULT_FLAG_ALLOW_RETRY and * FAULT_FLAG_TRIED. Note that GUP can be interrupted * by fatal signals of even common signals, depending on * the caller's request. So we need to check it before we * start trying again otherwise it can loop forever. */ if (gup_signal_pending(flags)) { if (!pages_done) pages_done = -EINTR; break; } ret = mmap_read_lock_killable(mm); if (ret) { BUG_ON(ret > 0); if (!pages_done) pages_done = ret; break; } *locked = 1; ret = __get_user_pages(mm, start, 1, flags | FOLL_TRIED, pages, locked); if (!*locked) { /* Continue to retry until we succeeded */ BUG_ON(ret != 0); goto retry; } if (ret != 1) { BUG_ON(ret > 1); if (!pages_done) pages_done = ret; break; } nr_pages--; pages_done++; if (!nr_pages) break; if (likely(pages)) pages++; start += PAGE_SIZE; } if (must_unlock && *locked) { /* * We either temporarily dropped the lock, or the caller * requested that we both acquire and drop the lock. Either way, * we must now unlock, and notify the caller of that state. */ mmap_read_unlock(mm); *locked = 0; } return pages_done; }

      same as gup but sets/unsets mmap_lock

    10. static long __get_user_pages(struct mm_struct *mm, unsigned long start, unsigned long nr_pages, unsigned int gup_flags, struct page **pages, int *locked) { long ret = 0, i = 0; struct vm_area_struct *vma = NULL; struct follow_page_context ctx = { NULL }; if (!nr_pages) return 0; start = untagged_addr_remote(mm, start); VM_BUG_ON(!!pages != !!(gup_flags & (FOLL_GET | FOLL_PIN))); do { struct page *page; unsigned int foll_flags = gup_flags; unsigned int page_increm; /* first iteration or cross vma bound */ if (!vma || start >= vma->vm_end) { /* * MADV_POPULATE_(READ|WRITE) wants to handle VMA * lookups+error reporting differently. */ if (gup_flags & FOLL_MADV_POPULATE) { vma = vma_lookup(mm, start); if (!vma) { ret = -ENOMEM; goto out; } if (check_vma_flags(vma, gup_flags)) { ret = -EINVAL; goto out; } goto retry; } vma = gup_vma_lookup(mm, start); if (!vma && in_gate_area(mm, start)) { ret = get_gate_page(mm, start & PAGE_MASK, gup_flags, &vma, pages ? &page : NULL); if (ret) goto out; ctx.page_mask = 0; goto next_page; } if (!vma) { ret = -EFAULT; goto out; } ret = check_vma_flags(vma, gup_flags); if (ret) goto out; } retry: /* * If we have a pending SIGKILL, don't keep faulting pages and * potentially allocating memory. */ if (fatal_signal_pending(current)) { ret = -EINTR; goto out; } cond_resched(); page = follow_page_mask(vma, start, foll_flags, &ctx); if (!page || PTR_ERR(page) == -EMLINK) { ret = faultin_page(vma, start, &foll_flags, PTR_ERR(page) == -EMLINK, locked); switch (ret) { case 0: goto retry; case -EBUSY: case -EAGAIN: ret = 0; fallthrough; case -EFAULT: case -ENOMEM: case -EHWPOISON: goto out; } BUG(); } else if (PTR_ERR(page) == -EEXIST) { /* * Proper page table entry exists, but no corresponding * struct page. If the caller expects **pages to be * filled in, bail out now, because that can't be done * for this page. */ if (pages) { ret = PTR_ERR(page); goto out; } } else if (IS_ERR(page)) { ret = PTR_ERR(page); goto out; } next_page: page_increm = 1 + (~(start >> PAGE_SHIFT) & ctx.page_mask); if (page_increm > nr_pages) page_increm = nr_pages; if (pages) { struct page *subpage; unsigned int j; /* * This must be a large folio (and doesn't need to * be the whole folio; it can be part of it), do * the refcount work for all the subpages too. * * NOTE: here the page may not be the head page * e.g. when start addr is not thp-size aligned. * try_grab_folio() should have taken care of tail * pages. */ if (page_increm > 1) { struct folio *folio; /* * Since we already hold refcount on the * large folio, this should never fail. */ folio = try_grab_folio(page, page_increm - 1, foll_flags); if (WARN_ON_ONCE(!folio)) { /* * Release the 1st page ref if the * folio is problematic, fail hard. */ gup_put_folio(page_folio(page), 1, foll_flags); ret = -EFAULT; goto out; } } for (j = 0; j < page_increm; j++) { subpage = nth_page(page, j); pages[i + j] = subpage; flush_anon_page(vma, subpage, start + j * PAGE_SIZE); flush_dcache_page(subpage); } } i += page_increm; start += page_increm * PAGE_SIZE; nr_pages -= page_increm; } while (nr_pages); out: if (ctx.pgmap) put_dev_pagemap(ctx.pgmap); return i ? i : ret; }

      Literally the actual policy logic of gup. Most important piece of code right here for gup

    11. static bool writable_file_mapping_allowed(struct vm_area_struct *vma, unsigned long gup_flags) { /* * If we aren't pinning then no problematic write can occur. A long term * pin is the most egregious case so this is the case we disallow. */ if ((gup_flags & (FOLL_PIN | FOLL_LONGTERM)) != (FOLL_PIN | FOLL_LONGTERM)) return true; /* * If the VMA does not require dirty tracking then no problematic write * can occur either. */ return !vma_needs_dirty_tracking(vma); }

      Def policy code. checks if we can write to a map

    12. /* user gate pages are read-only */ if (gup_flags & FOLL_WRITE) return -EFAULT; if (address > TASK_SIZE) pgd = pgd_offset_k(address); else pgd = pgd_offset_gate(mm, address); if (pgd_none(*pgd)) return -EFAULT; p4d = p4d_offset(pgd, address); if (p4d_none(*p4d)) return -EFAULT; pud = pud_offset(p4d, address); if (pud_none(*pud)) return -EFAULT; pmd = pmd_offset(pud, address); if (!pmd_present(*pmd)) return -EFAULT; pte = pte_offset_map(pmd, address); if (!pte) return -EFAULT; entry = ptep_get(pte); if (pte_none(entry)) goto unmap; *vma = get_gate_vma(mm); if (!page) goto out; *page = vm_normal_page(*vma, address, entry); if (!*page) { if ((gup_flags & FOLL_DUMP) || !is_zero_pfn(pte_pfn(entry))) goto unmap; *page = pte_page(entry); } ret = try_grab_page(*page, gup_flags); if (unlikely(ret)) goto unmap;

      Most of these seem like sanity checks right up until line 897 i.e, 'if(!page)'* after which we seem to unmap the page.

    13. static struct page *follow_page_mask(struct vm_area_struct *vma, unsigned long address, unsigned int flags, struct follow_page_context *ctx) { pgd_t *pgd; struct mm_struct *mm = vma->vm_mm; ctx->page_mask = 0; /* * Call hugetlb_follow_page_mask for hugetlb vmas as it will use * special hugetlb page table walking code. This eliminates the * need to check for hugetlb entries in the general walking code. */ if (is_vm_hugetlb_page(vma)) return hugetlb_follow_page_mask(vma, address, flags, &ctx->page_mask); pgd = pgd_offset(mm, address); if (pgd_none(*pgd) || unlikely(pgd_bad(*pgd))) return no_page_table(vma, flags); return follow_p4d_mask(vma, address, pgd, flags, ctx); }

      places mask after following page into pte

    14. if (likely(!pmd_trans_huge(pmdval))) return follow_page_pte(vma, address, pmd, flags, &ctx->pgmap); if (pmd_protnone(pmdval) && !gup_can_follow_protnone(vma, flags)) return no_page_table(vma, flags); ptl = pmd_lock(mm, pmd); if (unlikely(!pmd_present(*pmd))) { spin_unlock(ptl); return no_page_table(vma, flags); } if (unlikely(!pmd_trans_huge(*pmd))) { spin_unlock(ptl); return follow_page_pte(vma, address, pmd, flags, &ctx->pgmap); }

      branch prediction to check if pmd is there and if it's big

    15. /* FOLL_GET and FOLL_PIN are mutually exclusive. */ if (WARN_ON_ONCE((flags & (FOLL_PIN | FOLL_GET)) == (FOLL_PIN | FOLL_GET))) return ERR_PTR(-EINVAL); ptep = pte_offset_map_lock(mm, pmd, address, &ptl); if (!ptep) return no_page_table(vma, flags); pte = ptep_get(ptep); if (!pte_present(pte)) goto no_page; if (pte_protnone(pte) && !gup_can_follow_protnone(vma, flags)) goto no_page; page = vm_normal_page(vma, address, pte); /* * We only care about anon pages in can_follow_write_pte() and don't * have to worry about pte_devmap() because they are never anon. */ if ((flags & FOLL_WRITE) && !can_follow_write_pte(pte, page, vma, flags)) { page = NULL; goto out; } if (!page && pte_devmap(pte) && (flags & (FOLL_GET | FOLL_PIN))) { /* * Only return device mapping pages in the FOLL_GET or FOLL_PIN * case since they are only valid while holding the pgmap * reference. */ *pgmap = get_dev_pagemap(pte_pfn(pte), *pgmap); if (*pgmap) page = pte_page(pte); else goto no_page; } else if (unlikely(!page)) { if (flags & FOLL_DUMP) { /* Avoid special (like zero) pages in core dumps */ page = ERR_PTR(-EFAULT); goto out; } if (is_zero_pfn(pte_pfn(pte))) { page = pte_page(pte); } else { ret = follow_pfn_pte(vma, address, ptep, flags); page = ERR_PTR(ret); goto out; } } if (!pte_write(pte) && gup_must_unshare(vma, flags, page)) { page = ERR_PTR(-EMLINK); goto out; } VM_BUG_ON_PAGE((flags & FOLL_PIN) && PageAnon(page) && !PageAnonExclusive(page), page); /* try_grab_page() does nothing unless FOLL_GET or FOLL_PIN is set. */ ret = try_grab_page(page, flags); if (unlikely(ret)) { page = ERR_PTR(ret); goto out; } /* * We need to make the page accessible if and only if we are going * to access its content (the FOLL_PIN case). Please see * Documentation/core-api/pin_user_pages.rst for details. */ if (flags & FOLL_PIN) { ret = arch_make_page_accessible(page); if (ret) { unpin_user_page(page); page = ERR_PTR(ret); goto out; } } if (flags & FOLL_TOUCH) { if ((flags & FOLL_WRITE) && !pte_dirty(pte) && !PageDirty(page)) set_page_dirty(page); /* * pte_mkyoung() would be more correct here, but atomic care * is needed to avoid losing the dirty bit: it is easier to use * mark_page_accessed(). */ mark_page_accessed(page); }

      finds page in pte. Judging by the complexity of the logic this is most likely policy code because we're literally getting user page

    16. void unpin_user_pages_dirty_lock(struct page **pages, unsigned long npages, bool make_dirty) { unsigned long i; struct folio *folio; unsigned int nr; if (!make_dirty) { unpin_user_pages(pages, npages); return; } sanity_check_pinned_pages(pages, npages); for (i = 0; i < npages; i += nr) { folio = gup_folio_next(pages, npages, i, &nr); /* * Checking PageDirty at this point may race with * clear_page_dirty_for_io(), but that's OK. Two key * cases: * * 1) This code sees the page as already dirty, so it * skips the call to set_page_dirty(). That could happen * because clear_page_dirty_for_io() called * page_mkclean(), followed by set_page_dirty(). * However, now the page is going to get written back, * which meets the original intention of setting it * dirty, so all is well: clear_page_dirty_for_io() goes * on to call TestClearPageDirty(), and write the page * back. * * 2) This code sees the page as clean, so it calls * set_page_dirty(). The page stays dirty, despite being * written back, so it gets written back again in the * next writeback cycle. This is harmless. */ if (!folio_test_dirty(folio)) { folio_lock(folio); folio_mark_dirty(folio); folio_unlock(folio); } gup_put_folio(folio, nr, FOLL_PIN); } }

      unpins and dirties page

    1. The article discusses Rama, a dataflow language and platform built on Clojure that leverages continuation-passing style (CPS) to generalize functions and enable powerful programming paradigms, especially for parallel and asynchronous code in distributed systems.

      Key Concepts:

      1. Rama Operations (deframaop):

      Functions that can emit zero, one, or multiple values.

      Use :> to emit values to an implicit continuation.

      Variables are prefixed with * (e.g., *v).

      1. Continuation-Passing Style (CPS):

      Functions receive an extra argument—the continuation—to which they pass results.

      Rama hides the continuation, simplifying the syntax compared to explicit CPS in Clojure.

      1. Emitting Multiple Times:

      Operations can emit values multiple times or not at all.

      Useful for operations like filtering or generating sequences without materializing collections.

      1. Anonymous Operations:

      Defined with <<ramaop, allowing for closures that capture lexical scope.

      Can be passed around as first-class citizens, similar to functions.

      1. Asynchronous Emission and Partitioners:

      Rama operations can emit asynchronously, enabling distributed computing.

      Partitioners like |hash, |all, and |global relocate computation across different threads or nodes in a cluster.

      Example: |hash sends computation to a task determined by hashing a key.

      1. Multiple Output Streams:

      Operations can emit to different output streams (e.g., :>, :a>, :b>).

      Allows branching control flow based on different emitted streams.

      Handled using constructs like <<branch and inline hooks :>>.

      1. Unification:

      Merges separate computation branches using unify>.

      Ensures shared code executes after different conditional paths.

      1. Loops (loop<-):

      Support for iterative processes that can emit multiple times.

      Can be combined with partitioners for distributed loops across nodes.

      1. Optimizations:

      Rama distinguishes between deframaop and deframafn.

      deframafn must emit exactly once and synchronously, allowing stack-efficient invocation.

      The compiler optimizes code to prevent stack overflows and ensure efficiency comparable to idiomatic Clojure code.

      1. Uniformity and Composition:

      Operations, conditionals, loops, and even partitioners are all treated uniformly.

      This uniformity simplifies the language and enhances code composability.

      Applications:

      Distributed Systems: Rama's ability to emit asynchronously and control execution location makes it ideal for distributed computing tasks.

      Backend Development: Expresses computation and storage needs for backends at any scale.

      Parallel and Asynchronous Code: Simplifies writing complex parallel operations without the usual boilerplate.

      Example Highlights:

      Identity Function in Rama:

      (deframaop identity-rama [v] (:> v))

      Emitting Multiple Times:

      (deframaop emit-many-times [] (:> 1) (:> 3) (:> 2) (:> 5))

      Asynchronous Partitioning:

      (|hash from-user-id) (user-current-funds $$funds from-user-id :> *funds)

      Multiple Output Streams and Branching:

      (deframaop emit-multiple-streams [] (:a> 1) (:> 2) (:> 3) (:a> 4) (:b> 5 6))

      Conclusion:

      Rama extends the capabilities of functional programming by generalizing the concept of functions through CPS and dataflow principles. It provides:

      A powerful framework for writing efficient, parallel, and distributed applications.

      A unified approach to computation that simplifies the handling of asynchronous and multi-emission operations.

      Enhanced composability and expressiveness in code, reducing boilerplate and improving readability.

      Note: While the article delves deeply into the technical aspects of Rama and its implementation details, the overarching theme is the exploration of how CPS and dataflow paradigms can revolutionize the way we write and reason about code in distributed systems.