- Sep 2020
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2
The manuscript "Evolutionary transcriptomics implicates HAND2 in the origins of implantation and regulation of gestation length" by Marinić et al. uses an innovative expression dataset in an evolutionary framework to identify a set of transcripts whose endometrial expression emerged at the eutherian stem lineage. One of these is the transcription factor HAND2. Using both existing datasets and experimental data they build a model of the activity of HAND2 and its associated protein IL15 at the maternal-fetal interface and implicate the proteins in both the evolution and disorders of pregnancy. I highly recommend this manuscript. This work illustrates the utility of evolutionary analysis for elucidating functional mechanisms of complex disorders. The authors support their evolutionary analysis with a thorough characterization, including additional experimental data, of their hypothesized gene association. This work substantially contributes to our knowledge of the evolution and diseases of pregnancy.
I have only two point of inquiry that I believe the authors should address in the manuscript:
1) Of the 149 genes that unambiguously evolved endometrial expression why was only HAND2 analyzed? I am not suggesting that each gene be followed up with this level of rigor but would you hypothesize that each of the genes you identified play a role in eutherian reproduction? Or are there other major innovations that some of these genes may be associated with? How frequently would this pattern occur by chance?
2) Figures 2F and 4F - there appears to be a gap in the data points during the third trimester (which looks like it says "thirdr"). Is there still a negative trend if each section is analyzed independently as if they were independent datasets? Aka could this linear trend be composed of two separate trends instead?
-
Reviewer #1
Parsing mechanisms of disease from the perspective of evolutionary biology is an interesting approach. This perspective may be particularly advantageous when focussing the 'bigger picture' as it is perhaps less constrained by details that tend to preoccupy more conventional disease-focussed studies, such as clinical phenotyping, timing of biopsies, sample size, validation studies etc. In this study, Marinić and colleagues made use of a wealth of publicly available data sets to argue for a role of HAND2-IL15 axis in endometrial cells in implantation and, more importantly, the onset of parturition. The observation that enhancer regions in both HAND2 and IL15 harbour SNPs associated with gestational length/preterm birth renders the study timely and compelling. However, to my knowledge, the impact of these SNPs on the expression of either gene is not known. Further, the lack of validation studies on clinical samples renders the proposed mechanism plausible but speculative, as acknowledged by the authors. There are several other issues that require clarification:
1) Fig. 1C appears interesting but there is no comparator or controls. Without comparison, for example the histotrophic phase, it appears difficult to conclude that estrogen signaling genuinely persists during pregnancy in the opossum. pESR1 staining in the tissue section is ubiquitous with no evidence of nuclear localisation, raising concerns about antibody specificity. KI67 staining may be more informative?
2) The authors used a large single-cell RNA-seq data set to map HAND2 expression at the human maternal-fetal interface in the first-trimester of pregnancy (Vento-Tormo et al. 2018). They demonstrate that HAND2 expression is confined to 3 maternal subsets, termed endometrial stromal fibroblast (ESF) 1 and 2 and decidual stromal cells (DSC). If I am not mistaken, in the Vento-Tormo paper, these populations of cells were labelled decidual stromal cells 1-3 (DS1-3), emphasizing that all these cells were decidualized, as expected in pregnancy. Vento-Tormo et al. further demonstrated that the differences in gene expression between DS subsets relate to their topography in the maternal tissue. Hence, it is confusing that the authors changed the terminology of these subsets, giving the erroneous impression of two undifferentiated ESF populations and a single DS/DSC population in pregnancy. By doing so, the inference seems to be that T-HESC, a telomerase-transformed endometrial stromal cell line used in functional studies, is a good model of ESF populations in vivo, which is doubtful.
3) Fig. 2G. The authors state that 'We also used previously published gene expression datasets (see Methods) to explore if HAND2 was associated with disorders of pregnancy and found significant HAND2 dysregulation in the endometria of women with infertility (IF) and recurrent spontaneous abortion (RSA) compared to fertile controls' - This bold statement is based on microanalysis of merely 5 biopsies in each group. Considering the intrinsic temporo-spatial heterogeneity of the cycling endometrium, this sample size is grossly inadequate. The microarray study was published in 2011. In fact there are several more recent and more robust datasets available (e.g. 115 IF biopsies in GSE58144 and 20 RM biopsies in GPL11154). These comments also apply to Figure 4G.
4) The authors also state 'HAND2 was not differentially expressed in ESFs or DSCs from women with preeclampsia (PE) compared to controls (Figure 2G).' It is unclear which dataset this was based on. The authors' claim seems to indicate that this was single-cell data? In any case, the sample size is again grossly inadequate to draw robust conclusions without further validation in a much larger cohort of samples.
5) Figure 3. The authors decided to knockdown HAND2 in T-HESC, a telomerase-transformed endometrial stromal cell line, and performed RNA-seq 48 h later. The cells were not decidualized or even treated with progesterone. Hence, the rationale for this experiment, and its relevance to the in vivo situation, is genuinely lost on me. See also comment regarding the renaming of DS subsets into ESF. In an undifferentiated state, these cells are not representative of gestational cells (with the possible exception that decidual senescence is characterised by progesterone resistance, i.e. re-activation of genes that are suppressed by progesterone). More importantly, as HAND2 is critical for the identity of these cells, perhaps knockdown triggers a stress response? For example, from the data presented in Supplementary Table 6 (it would be helpful to add gene names), on of the strongest up-regulated gene upon HAND2 knockdown is BLCAP2 [Log2(FC): 10.2], which encodes a protein that reduces cell growth by stimulating apoptosis.
6) The authors illustrated the importance of examining the right cellular state: knockdown HAND2 in T-HESC increases IL15 expression whereas it is well established that HAND2 knockdown in decidual cells decreases IL15 expression. Further, IL15 is strongly induced upon decidualization and previous studies on primary endometrial stromal cells demonstrated that IL15 secretion is undetectable in undifferentiated cells whereas it is abundantly secreted upon decidualization (PMID: 31965050). Thus, to be informative, the authors should repeat HAND2 KD in decidualizing T-HESC and measure IL15 secretion in both states, with and without HAND2 knockdown.
7) Fig. 3B - it is unclear what is compared here: genes deregulated upon HAND2 knockdown in T-HESC versus knockdown NR2F2, FOXO1 and GAT2 in decidualized primary cultures? If this is the case, the comparison is not informative as it involves two different cell states. It is surprising that FOSL2 was not included in this analysis.
8) I do not understand the relevance of the experiments described in Figure 5 to the context of gestation length or preterm birth. Trophoblast invasion will have been completed in the 2nd trimester of pregnancy - what is the purpose/message of these experiments? What is the level of IL15 secreted by these cells? Again the T-HESC appears not decidualized - so, what is the relevance to either the midluteal implantation window or gestation?
9) What is the evolution of IL15 expression at the maternal-fetal interface? Does it parallel HAND2?
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
Parsing mechanisms of disease from the perspective of evolutionary biology is a powerful approach. The manuscript by Marinić et al. uses an innovative expression dataset in an evolutionary framework to identify a set of transcripts whose endometrial expression emerged at the eutherian stem lineage. One of these is the transcription factor HAND2. Using both existing datasets and experimental data the authors build a model of the activity of HAND2 and its associated protein IL15 at the maternal-fetal interface and implicate the proteins in both the evolution and disorders of pregnancy. The work illustrates the utility of evolutionary analysis for elucidating functional mechanisms of complex disorders and substantially contributes to our knowledge of the evolution and diseases of pregnancy.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3
This work presents a method to analyze integrated mutation and transcript data to identify mutations in individual genes that drive similar and divergent transcriptional signatures. Overall the work appears novel and provides potential insights that could generate hypotheses worthy of further study. The work is limited in that confirmation is done only for a set of mutations on GATA3 with existing drug sensitivity cell line data. It would be helpful to have an indication that more than a single result from the large study provides validated insights.
Concerns:
1) While the approach is nicely detailed, one critical aspect remains unclear. An AUC is generated for each prediction of mutation from transcriptional signature based on cross-validation. I could not deduce from this statement exactly how this was done given in the introduction here of a mean score: "We measured a classifier's ability to identify a transcriptomic signature for its assigned task using the area under the receiver operating characteristic curve metric (AUC) calculated using samples' mean scores across ten iterations of four-fold cross-validation."
2) The claim "These results are striking in that predicting the presence of a rarer type of mutation should, everything else being equal, be more difficult owing to decreased statistical power" is really applicable to a hypothesis test, so it is not immediately obvious that is applies in a case of cross-validation generating an AUC.
3) The claim that a Spearman correlation of AUCs between methods is a validation of robustness is difficult to accept. Note that if you uniformly subtracted 0.5 from every AUC, the result would give a Spearman correlation of 1 with the original data, but it would not be a very robust result. Why is Pearson correlation not used?
4) It is clear that many classifiers were actually run, and it would be helpful to have the number actually summarized. This ties into the concern with only a single validation in drug sensitivity data, since there may be false discoveries given a large number of classifiers.
-
Reviewer #2
In this study, authors analyzed the association between types of somatic mutations and the downstream effects on the transcriptome using data obtained from many large tumor data consortia such as METABRIC, TCGA etc. Subsequently, authors systematically show functional relevance using CCLE data.
Concerns:
Using the tumor profiling data from various consortia, several groups have shown these associations using different statistical methodologies (PMID:21555372, PMID: 26436532, PMID: 27127206 and thereon). In that light, results described in this study are correlational and some are obvious. It is not clearly described what transcriptional programs are impacted by mutation subgroups and how distinct they are from other tumor types with similar mutation subgroups. Also, it is not clear if these distinct mutation subgroups carry any clinical significance such as outcomes. Furthermore, transcriptional programs are also under regulation by DNA methylation and its role in defining the transcriptional program under the influence of mutation subgroups is not described.
Specific Concerns:
1) What data normalization and batch correction methods were applied on expression data from TCGA, METABRIC and other datasets.
2) What clustering methods were applied for subsequent UMAP projection.
3) Although association between mutation sub-groups and expression is described, it is not clear if expression profile of a group of genes found in the analysis. If so, functional significance of those co-regulated genes is not described.
4) Page 35 (lines 781-782); What is the biological and statistical rationale for removing neighborhood genes. There is significant neighborhood effect in certain cancers such as ccRCC where 3p is significant for tumorigenesis and progression.
5) Statistical methods and reasons of their application on the data is not well described. Moreover, linearity in describing the methods on data from start is not clear thus leading to confusion. Multiple correction sections, although mentioned are vague.
6) Earlier studies have shown concordance between RNA-Seq and microarrays. In that context, page 16; lines 348-351, why do the authors assume differences exist between these platforms.
7) Manuscript is long and difficult to read with emphasis on some obvious things. Manuscript can be shortened for easy reading.
-
Reviewer #1
While this is an important area, the organization and results presentation render this current form of the manuscript unacceptable. Some specific challenges are described below.
1) Throughout the manuscript, the authors report AUC on the training set as the primary metric of assessment and to compare models between genes. However, these performance metrics are more valid for cross-validation and may be sensitive to the differences in sample size introduced by the number of mutations. The authors would be better served by using the permutation-based statistic they develop later in the results throughout to report results.
2) The authors develop a permutation based statistic to assess performance in a manner that controls for sample size presented as part of the results and relegate most of its description to the supplemental methods. This is a critical part of evaluation that should appear in the main manuscript and used for all results presented in the manuscript. This is of particular importance for the comparison between TCGA and METABRIC performance, which have different sample sizes.
3) Several hypotheses about the function of specific mutations or mutational groupings are made throughout the manuscript based solely on the AUC prediction values. These appear speculative and could be better grounded in results by evaluating the function of the genes in the transcriptional programs that underlie the prediction (e.g., using feature importance scores to determine specific genes associated with the classifier.
4) It is unclear why specific genes are selected for presentation in the manuscript. These appear cherry picked to describe well performing genes and do not do a comprehensive presentation of the performance of the algorithm, particularly in the first subsection of results "Subgrouping classifiers uncover alteration divergence in a breast cancer cohort" and "Subgrouping classifier output reveals the structure of downstream effects within cancer genes." The latter section particularly includes a substantial amount of biological description of function based solely on performance that is not grounded in the results presented.
5) The definition of "subgroupings" is not clearly described. It is not possible to follow as written how the 7598 groupings are determined and how these are used in the machine learning framework. This needs to be significantly clarified.
6) It is unclear why HER2 amplifications are a focus of analysis for Luminal A subtype breast cancer samples, which are by definition HER2-.
7) An expanded presentation of the results of relative classification accuracy by gene and cancer type would be useful for evaluating the further impact of cancer-type on performance to determine the role of the biology on mediating mutations. In particular, it would be useful to evaluate whether cancers with different cell type composition (e.g.,large fibroblast content in messenchymal HPV- HNSCC tumors) impact the results of the classifier. A similar comparison would be useful between in vivo tumors and in vitro cancer types from the gene expression profiles in CCLE.
8) The GitHub links for the software presented in this paper do not work.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
The reviewers are in agreement that the authors present an innovative classifier framework to predict mutational status and subgroups based upon transcriptional profiles. They perform a comprehensive analysis across cancer subtypes to assess context-dependence of mutations and link these classifiers to cell line data to further predict therapeutic outcomes. Overall the work appears novel and provides potential insights that could generate hypotheses worthy of further study. While this is an important area, the work is limited in several ways. These include numerous issues with the statistical methods used, lack of clarity as to whether the results were significant, potential concern about cherry-picking results, and the need to consider alternative factors contributing to the reported relationships, coupled with weaknesses in the organization and presentation of the results.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3
General assessment of the work: The authors report a study of the mesial temporal lobe (MTL), particularly focusing on structural/functional changes related to transition regions from six layer isocortex to three layered allo-cortex. This group uses their expertise in imaging processing techniques to define the anatomical regions of the mesial temporal lobe transition from isocortex to allocortex using the BigBrain high-resolution histological reconstruction. Using this single high-resolution histological image, they show intensity changes which correlate with the isocortex/allocortex transition. They then use this high resolution reconstruction to coregister to rs-fMRI, and define effective connectivity within the mesiotemporal lobe. Finally, they show variation rs-fMRI global patterns in relationship to the iso-to-allocortical axis, as well as the mesial temporal a/p axis.
Substantive concerns:
This is an interesting study which shows novel relationships between mesial temporal structures and whole brain functional organization. As the authors point out, the novel part of the study involves defining cytoarchitectural regions, and correlating these changes with both local and global function as defined by BOLD fMRI. This is a novel study examining the iso-allocortical transitions with the MTL, and correlating them with local and global rs-fMRI changes. As the authors state, the global rs-fMRI findings related to the anterior-posterior axis of the MTL are not new, but add complementary findings in comparison to the iso-allocortical transition findings. Given this, I will focus my comments on the use of the BigBrain image, and definition of the MTL transitions for use in defining regions in the rs-fMRI images.
1) With the BigBrain data, only the right hippocampus was used for segmentation, due to a rip in the histopathological sections of entorhinal cortex on the left. It is therefore assumed that the right MTL segmentations were inverted and also used for the left MTL rs-fMRI analysis. If this is the case, it should be more clearly stated in the methods. Also, discussion should be added to the possible implications for results, both in respect to replicating the histological intensity findings (which could be tested in two hippocampi if both right and left were processed) and the known structural differences between the right and left hippocampi.
2) I had concerns that using the higher resolution BigBrain image as a template for the 8 nodes in the MTL for the much lower resolution rs-fMRI images would be problematic for signal to noise ratio. However, the authors have convincingly shown consistent findings when controlling for signal to noise ratios.
3) The authors mention (and reference) the correlation of histopathological cellular staining intensities with cellular densities and soma size in the methods section. Given the centrality of this concept to their findings of the BigBrain data, some addition to the discussion about this concept and the underlying evidence for correlation of staining intensity and cellular densities and soma size would be helpful.
-
Reviewer #2
This paper does a very good job of underscoring the importance of characterizing the structural organization of the cortex at a deep level in order to inform functional organization. The authors present an exciting and innovative method of bridging post-mortem cytoarchitecture with in vivo functional MRI, allowing for a powerful and compelling investigation of MTL micro-architecture. This work has important implications for how information transfer occurs through macro-structural and more local brain circuits. The two major findings regarding the allo-iso and the anterior-posterior gradient are supported by the previous literature, but so far characterization of this organization in humans in vivo has been somewhat limited. Most of my suggestions below are regarding points that could be clarified or methods that were unclear.
1) Was there an a-priori prediction regarding the "multi-demand" network? This part of the narrative seemed to come out of the blue and could use more background.
2) Some of the methods are not fully described and are hard to understand. For example, the surface models that are used to sample and model the properties of the microstructure at different cortical depths could be described in more detail. I was also having trouble understanding two things about the "confluence" or "intersection" between the allocentric and isocentric cortices. I was left wondering if the intersection is defined as a plane in surface space, demarcating the separation between hippocampus and entorhinal cortex? Is the confluence/intersection defined based on the manual hippocampal subfields (i.e. medial boundary of the subiculum) or is it defined some other way using the surface profiles/features? Finally, how is geodesic "distance" computed? I would suggest adding a figure to give an overview of these aspects of the methods.
3) Related to the point above, I get the impression that this data shows there is no strict boundary between the allo and iso-cortex but rather that there is a somewhat smooth gradient. This point could be made more clear in the abstract and discussion. What implications does this particular finding have for theories of MTL subregion function?
4) When r-values are reported to differ for different gradients (e.g. iso versus allo) it is important to test for a significant difference in the slopes (e.g. Fisher r-to-z transform or similar) to know if the relationships are statistically different from one another.
5) This paper builds nicely on other work by DeKraker and colleagues (2019) that has analyzed the microstructural properties of the hippocampus. I think the readers of this paper would appreciate a brief description of how this investigation is similar/different from that work. For example, are the "features" identified here largely overlapping with those identified by DeKraker, and if not, how do they differ here?
6) In the effective connectivity analysis of the MTL, how is variability of the MTL anatomy taken into account? For example, the fusiform and parahippocampal regions of interest will contain highly variable anatomical structures across subjects (e.g. different folding patterns of the collateral sulcus). Given that the focus on anatomical specificity is a major strength of this paper, I would be curious to know how anatomical variability/specificity is accounted for when the data is morphed into MNI152 volume space.
7) I was unsure which analyses were replicated in the Human Connectome Project (HCP) dataset. It is stated that the isocortical functional gradients were re-generated within the HCP cohort and that results were "highly similar" (p. 18) to the original dataset. Was this similarity formally tested?
-
Reviewer #1
Thank you for inviting me to review this manuscript by Paquola and colleagues, in which the authors used a combination of high-resolution anatomical data, machine learning, spectral DCM and resting functional connectivity measures to interrogate the relationship between structural and functional gradients of organization within the mesial temporal lobe.
The study is broken into four related sections. In the first section, the authors analysed vertices within a set of mesial temporal lobe structures using a random-forest algorithm, which identified a set of microstructural profiles across the structure. They then interrogated these profiles for evidence of an iso-to-allometric axis, which is a principle known to characterise the transition from 6-layered isocortex (in entorhinal cortex) to 3-layer allocortex (in the hippocampal formation). The authors found evidence consistent with this transition in the BigBrain data, particularly with respect to the skewness of the distribution of thickness across the layers.
In the second section, the authors use Spectral DCM on resting state data from a group of 40 individuals. They then relate the results of the spectral DCM model to the gradients identified using structural anatomy. This section was well-motivated and conducted.
In the third section, the authors compare the structural gradient to resting state functional connectivity with vertices within the cerebral cortex. The results here were quite compelling, showing a dissociation between the iso- and allo-cortical poles in the MTL in which the iso-cortex was correlated with fluctuations in the lateral dorsal attention and frontoparietal networks, whereas the allo-cortical pole was correlated with vertices in the default mode and medial occipital regions.
In the final section, the authors conducted a number of checks of their analysis, including an SNR test to ensure that the temporal lobes (a notorious site for MRI signal dropout) were adequate, and a substantial replication analysis. They should be commended for these steps, and also for making their code freely available.
Comments:
1) Section 1: I wonder whether the manuscript might benefit from the unpacking of the random forest results. Is there an intuitive way to characterize skewness that may benefit the reader - such as a particularly uneven spread of thickness distributed across the layers? And is this finding something that we might expect, given the hypothesized gradient of iso-to-allocortex in the MTL?
2) Section 1: Along these lines, is it fair to single out an individual measure from the random-forest regression as being the most salient? From my understanding (which might be mistaken), the weights on a particular variable in a regression need to be viewed in context of the performance of the whole model.
3) Section 2: One minor comment is that it might be helpful for the reader if the "in" and "out" effective connectivity directions were incorporated into the matrix in Figure 2A.
4) Section 2: I wasn't sure that I followed the logic of the experiment in which the authors split the MTL data into thirds to test for the consistency of their results. Were each of these sufficiently powered to allow for direct comparison with the main effect? Did the boundaries between these models cut across known regional areas? Perhaps a different way to achieve the same ends would be to use bootstrapping in order to provide a confidence interval around the relationship between structure and function?
5) Section 3: Did the authors hypothesize the iso vs. allo-cortical relationship to resting state networks a priori, or was it discovered upon exploration of the data. Either is fine, in my opinion, but I think it would benefit the reader to have these results placed in the context of the known literature.
6) Section 3: Do the authors expect that the patterns identified in the MTL will relate to subcortical gradients identified in other structures, such as the cerebellum (Guell et al., 2018), thalamus (Müller et al., 2020, and basal ganglia (Stanley et al., 2019)? See also Tian et al., 2020 for general subcortical gradients.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
All three reviewers saw great merit in your work and were enthusiastic about its potential. Nonetheless, each reviewer raised several substantive concerns. Broadly speaking, we see the essential revisions as (1) providing additional clarity with respect to methods, (2) further unpacking of some of the results, as well as conducting a few targeted statistical analyses (i.e., to test for differences in slopes), and (3) clearer positioning of the current work as it relates to the existing literature.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
General Assessment:
The role of visual experience with faces in the formation of face-specific neural "modules" is tested in a deep convolutional neural network model of object recognition, AlexNet. A modified version of the ILSVRC-2012 training dataset was constructed by removing all images with primate faces, removing remaining categories with fewer than 640 images, and re-training the deprived network: d-Alexnet. d-Alexnet was compared to pre-trained Alexnet on classification performance, quality of fit to fMRI data, strength of face-selectivity, representational similarity, and learned receptive field properties. The authors argue that face-selectivity is significantly reduced, but not eliminated, with the deprivation, and that this reduction is consistent with an interpretation that d-Alexnet represents faces more similarly to objects than Alexnet. While this work is well-motivated and timely, there are substantial issues in the conceptual approach, the methods used, clarity of the results, and most importantly, the strength of the conclusions.
Major Concerns:
1) The validity of these results is uncertain due to a) insufficient reproducibility within this work and b) fragile definitions of face-selectivity.
a) Given that small changes in weight initialization or training procedure can have a large effect on learned representations (see Mehrer et al. 2020, https://www.biorxiv.org/content/10.1101/2020.01.08.898288v1.abstract ), the authors must demonstrate that their results hold across multiple initializations of each network type. Several key results hinge on the number and identity of "face-selective" channels (Figure 2, 3c-e) and only a single instance of each model type is used. In particular, the result that 2/256 channels are "selective" in d-Alexnet compared to 4/256 in Alexnet is likely sensitive to small variations in the methods, including the choice of evaluation stimuli and the initialization of the weights. If the models were re-trained, could the ratio be 4 channels to 4 channels, 0 channels to 2 channels, or some other result? With only a single instance of each model and such a small (and potentially unstable) number of face-selective channels in each model, I am not convinced that these results support the claims made.
SUGGESTION: Report results averaged across multiple initializations of each model to demonstrate robustness. Statistical tests should be conducted across models (as if they were individual subjects) to demonstrate the significance of any effects found.
b) The definition of "selectivity" is potentially fragile and may not hold when tested with more standard evaluation sets. In the primate face-selectivity literature, functional localizers are used to compare face responses to non-face responses. These localizers have much stronger controls over low-level features than the stimuli used to evaluate selectivity in this work. I am especially concerned that the faces (from FITW) differ from non-face objects (from Caltech-256) in low-level properties such as image resolution, pose, background, contrast, luminance, and more. Furthermore, selectivity is typically defined in the field as a continuous quantity (e.g., t-contrast, d-prime, face-selectivity-index) and is not often assessed in a binary fashion by the number of units significantly more responsive to faces than the second-best category. Many of these continuous metrics also incorporate variance in responses as well as the mean of responses. Thus, the designation of channels as "selective" or "not-selective" in this work based on mean responses to only 2 of the 205 categories (L101) prevents the reader from understanding how the distribution of face-selectivity shifted under the deprivation, which is one of the primary claims. Instead, we only see the number of selective channels after a binary cutoff, which may be sensitive to initialize and the stimulus set used to evaluate selectivity.
SUGGESTION: Compute selectivity using evaluation sets in which faces are better matched to non-face objects. Report the distribution of selectivity for each channel before and after deprivation.
2) Because one model in the comparison is pre-trained and the other is trained from scratch, there is the possibility that all of the differences between the models are due to differences in the training that are independent from the content of the training images.
a) In the regression analysis, is it the case that non-selective channels also show differences in R2? For example, if the d-Alexnet is worse on the training task (d-ImageNet) than Alexnet, we expect a general reduction in its ability to explain neural responses (see e.g. Yamins et al., 2014). The claims that face-selectivity is specifically impaired in d-Alexnet need to be supported by demonstration that non-selective channels are equally good (or poor) fits to vertices in face-selective regions. Furthermore, the authors do not demonstrate that face-selective channels are better than non-selective channels in either model type, which is useful context for understanding whether the correspondence between face-selective channels and face-selective brain regions is meaningful.
SUGGESTION: report non-selective channel fits to the same vertices for each model type and compare to face-selective channel fits.
b) L366: the authors write that "the d-Alexnet was initialized with values drawn from a uniform distribution". This is not standard practice; in fact, the kernel weights in the original AlexNet model were initialized from a Gaussian distribution. To make comparisons to the non-deprived model, the authors need to also retrain the non-deprived model to account for the potential confounds between their training/initialization procedure and that used in the pre-training.
SUGGESTION: re-train the non-deprived AlexNet in-house, then compare that model to d-AlexNet.
- A major conceptual issue is in the definition of a "face module". Despite "face module" in the title, a working definition of "face module" is not clearly provided in the manuscript. Context clues suggest that the authors may consider any face-specific process evidence of a "face module", but the experiments performed indicate that a specific set of criteria were explored: selectivity for faces, different representations for faces and non-face objects, holistic processing, etc. Especially given that the results of this work indicate some residual face-selectivity, a clear definition of "face module" - grounded in the existing literature - is needed to evaluate the claims provided.
SUGGESTION: clearly define what the "face module" is in the brain, then explain what the corresponding evidence for a "face module" would be in the DCNN.
4) A number of analyses are not well-motivated or are lacking in detail
a) The analysis of the "empirical receptive field" is lacking in detail and motivation, and the color-scale is both nonlinear and missing a label. Specific questions:
i) How should this result be compared to data in primate face-selective regions?
ii) Is this result a trivial consequence of the difference in number of activated units (panel D)?
iii) What are the units of the colormap?
iv) Why are only two channels shown for AlexNet if 4 channels are face-selective?
v) Is the extent of the empirical receptive field quantified?
vi) How should the reader think about empirical receptive fields in a weight-shared convolutional architecture?
b) The evaluation of the face-inversion test is poorly motivated. The face-inversion effect indicates that human subjects are better at remembering upright faces than inverted faces. However, the analysis performed here evaluates the magnitude of the response of face-selective channels. If anything, a classification task is needed to compare to the human task, because the "face inversion effect" cited is not simply that face-selective units respond more strongly to upright than inverted faces, but that the activation of the units supports differences in classification between upright and inverted faces.
SUGGESTION: At minimum, justify 1) why the magnitude of channel response is a good measure of the face inversion effect or 2) remove the claim that the models do/don't exhibit the behavioral effect.
-
Nancy Kanwisher (Reviewer #1):
Xu et al use deep nets to ask whether face selectivity, and face discrimination performance, can arise in a network that has never seen faces. By painstakingly removing all faces from the training set, and comparing Alexnet trained with and without faces, they claim to find, first, that the face-deprived network does not have deficits in face categorization or discrimination (relative to the same network trained with faces), second that the face-deprived network showed some face-selectivity, and third that face deprivation reduced face selectivity. They conclude that "domain-specificity may evolve from non-specific experience without genetic predisposition, and is further fine-tuned by domain-specific experience."
I love the question and the general strategy behind this study, and indeed we have long discussed doing something much like this in my lab, and we presented a preliminary result of this kind at VSS years ago (https://jov.arvojournals.org/article.aspx?articleid=2433862 ). It is a great use of deep nets to ask what kinds of structures can in principle arise with different kinds of training diets. Xu et al are also to be congratulated for the huge effort they went to in curating a data set of stimuli with no faces, for which they are correct no current algorithm is adequate, requiring a huge amount of labor-intensive human effort.
Nonetheless, despite my might enthusiasm for the question, the general logic of the study, and the major effort to create the training set, I do have a few significant concerns about the paper:
1) The biggest problem in the paper in my view is that although regular Alexnet saw faces in the training set, it was not trained on face discrimination, and its performance on this task is very low (66%). That is above chance but very much lower than a network that is actually trained on face discrimination. In our studies, which are typical of this literature, we find that when Alexnet is trained on the VGG-Face dataset identification of novel faces is around 85% correct (top-1). So to say that the face-deprived network performed no differently from the face-experienced network on a face discrimination task, while true, is misleading, because really this reflects the fact that neither was trained on face discrimination and both do pretty badly. And perhaps more importantly, for faces humans have learned, their typical face recognition accuracy would be way higher than 66% correct. So, the face-deprived network really does very badly compared to a real face-trained network, or to humans, and does not represent a strong case of preserved face discrimination despite lack of face experience. Instead, it reflects the kind of face recognition performance one would expect from an object recognition system or a prosopagnosic patient: above chance but not very accurate. Thus, I think the behavioral data show not preservation of face perception abilities in a network trained without faces, but low performance at face discrimination, much like a network that has seen faces but not been trained to discriminate them.
2) The claim that "face-selective channels already emerged in the d-AlexNet" is similarly overstated in my view, given that only two such units were found and the selectivity of the one we are shown (on the right in Figure 2a) is weak. Although the authors concede that the selectivity of these two units is lower than found in Alexnet trained with faces, that understates the case, as Figure 2a shows. The analysis in Figure 2b, correlating responses of face-selective channels from Alexnet to natural movies, with brain responses to the same movies, is interesting but doesn't tell us what we most need to know. Several public data sets include the magnitude of response of FFA and OFA to a set of 50-100 images, and I would find it more useful to compare those to the response of Alexnet face units to the same images.
A small point: Only human and primate faces were removed from the dataset, but I would think other animal faces (e.g. cats and dogs) should produce some relevant training. Certainly face-selective regions in the human brain respond strongly to animal faces, as several studies have shown. This might be worth considering in the discussion when potential reasons for the emergence of face-selective channels are discussed (line 229-236).
For the reasons above, I don't think the results of this study strongly support the conclusion that "the visual experience of faces was not necessary for an intelligent system to develop a face-selective module". At least the "face-specific module" so claimed is a far cry from the human face processing system in both neurally measured selectivity and behavioral performance.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 3 of the manuscript. Thomas Serre served as the Reviewing Editor.
Summary:
In general, the reviewers and myself agreed that the study had strength including the question being asked and the general strategy used. We also thought that it was a great use of deep nets to ask what kinds of structures can in principle arise with different kinds of visual training diets. The authors should also be commended for the huge effort that went into curating ImageNet to remove images containing faces requiring a huge amount of labor-intensive human effort.
At the same time, as you will see, the reviewers found a number of shortcomings in your study. Most of them could be addressed with (a lot of) additional work but, unfortunately, one issue raised seems impossible to convincingly address. Specifically, the accuracy of both the face-deprived network and the control network for face discrimination is far below that of both comparable networks specifically trained for face discrimination and most likely human observers (although this was not tested). Hence, the study does not represent a strong case of preserved face discrimination despite lack of face experience. To paraphrase the reviewer: "Instead, it reflects the kind of face recognition performance one would expect from an object recognition system or a prosopagnosic patient: above chance but not very accurate. Thus, I think the behavioral data show not preservation of face perception abilities in a network trained without faces, but low performance at face discrimination, much like a network that has seen faces but not been trained to discriminate them."
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In this manuscript, Carvajal and coworkers prepared a recombinant LUBAC complex, composed of the full-length HOIP, HOIL-1L, and SHARPIN subunits, and analyzed its 3D structure by electron microscopy. This is the first report to show that the LUBAC complex has an elongated, asymmetric crescent-like structure, although it is low resolution. Moreover, the authors examined the intra- and inter-domain associations by cross-linking mass spectrometry, and investigated the oxyester-linked heterotypic branched ubiquitin chains produced through the E3 activity of HOIL-1L. These results are novel and intriguing; but unfortunately, this study has not provided detailed clarifications of the LUBAC structure and catalysis.
Major comments:
1) How about the EM structure from peaks I and III in Suppl. Fig. 1A? Peak I eluted in a higher molecular weight fraction than that of thyroglobulin (670 kDa). Is it possible to form a LUBAC complex consisting of trimers with 1:1:1 stoichiometry between the HOIP, HOIL-1L, and SHARPIN subunits? Peak III predominately includes HOIL-1L and SHARPIN, but lacks HOIP. Therefore, it seems possible to estimate the subunit organization in the 3D structure. Please clarify whether the 3D structure shown in Fig. 2B represents monomers or dimers with 1:1:1 stoichiometry between the HOIP, HOIL-1L, and SHARPIN subunits.
2) On pages 7-8: The authors emphasize the interaction of the RBR domains of HOIP and HOIL-1L, based on their XL-MS analysis, and speculate that LUBAC may have a single catalytic center. However, since multiple interactions in-between LUBAC domains are detected (Figs. 3B-E), the authors need to explain why they focused on this particular interaction. It will be interesting to analyze the effect of E2 or E2~Ub.
3) In Fig. 4B, why could the mixed LUBAC subunits generate a linear chain, but not an oxyester-linked branched Ub4? Does it form a high molecular weight complex in gel filtration? Please indicate the anti-ubiquitin blot in Figs. 4B and 4C to clarify the doublet migration in M1-Ub3.
4) In Figs. 4E and 5A, it is interesting that Cezanne and vOTU could cleave ester-linked branched Ub4, although the molecular bases of these reactions were not revealed. Are the NH2OH-sensitive His-Ub3 and Ub2 generated by LUBAC, as shown in Fig. 5B, cleavable by Cezanne and vOTU? Please indicate that the Ub2 remaining after the OTULIN-treatment (Fig. 4E) is sensitive to NH2OH or not.
5) Why did the NH2OH-treatment in Figs. 5F and 6C cause a drastic decrease in the linear ubiquitin level? The previous PNAS paper from Cohen's group showed a partial reduction in the molecular weight of the Ub chain bound to IRAK and Myd88 after NH2OH-treatment. In contrast, the current data seem to indicate that most of the LUBAC-generated ubiquitin chains were composed of an ester-linked Ub chain, but not a linear chain. Please indicate the lower molecular weight region of the immunoblot. It is surprising that GST-NEMO(250-412) almost non-specifically captured a variety of Ub chains. How about employing GST-NEMO-UBAN alone or M1-TUBE to specifically pull-down the linear polyubiquitin-containing chains?
6) On page 11, 2nd paragraph, although the authors described that "the restriction analyses showed that the ubiquitin chains assembled by LUBAC contained non-linear di- and tri-ubiquitin chains", the di-ubiquitin is barely detectable in Fig. 6B.
7) On the bottom of page12, the authors mentioned that "LUBAC with HOIL-1L T203A,R210A assembled ubiquitin chains more efficiently than WT-LUBAC, but less efficiently than HOIL-1L C460". However, in Fig. 6E, LUBAC with HOIL-1L T203A,R210A seems to have the most powerful E3 activity. Moreover, it is not clear if the partial impairment of branching activity is due to HOIL-1L T203A,R210A, since the upper band of Ub4 has a good signal. Therefore, the authors should reconsider the scheme shown in Fig. 7. The NH2OH-sensitive upper band of Ub3 did not react with an anti-linear ubiquitin antibody, in contrast to the pan-ubiquitin antibody. These results suggested that the upper band of Ub3 consists of two ester-linked branched ubiquitins on single ubiquitin. Does it bind HOIL-1L NZF? If not, then HOIL-1L NZF apparently does not contribute the ester-linked branched ubiquitination activity of LUBAC.
-
Reviewer #2:
The manuscript by Carvajal et al. describes a study on the LUBAC complex. They build upon the striking and highly significant discovery that the HOIL-1 protein is an active ubiquitin E3 ligase with non-lysine esterification activity. This discovery was initially demonstrated by Kelsall et al. As the original findings by Kelsall et al. were quite unexpected, and in part contrary to a study from the Iwai lab, the findings presented here corroborating the former study are of great importance for the field.
Testament to the challenges with structure determination of the LUBAC complex, little structural information is known, despite its discovery over 10 years ago, few structural insights have been obtained. Carvajal et al. report an insect-based expression and purification system for preparing recombinant LUBAC and present a low-resolution structure of the LUBAC complex consisting of sharpin, HOIL and HOIP at 1:1:1 stoichiometry. The structure is supported by mass photometry and most informatively, crosslinking mass spectrometry. However, the low resolution of the negative stain EM LUBAC structure does not allow placement of the individual subunits but does reveal an asymmetric elongated dumbbell shape. Complementary XL-MS data suggests the catalytic RBR modules from HOIP and HOIL-1 are in proximity. They build upon the work of Kelsall et al. by demonstrating that HOIL-1 retains its esterification activity when part of the LUBAC complex. This is notable as it allows prior LUBAC-associated function to be implicated with non-lysine ubiquitination. The manuscript implies that a major function of HOIL-1 esterification activity is to introduce ester branch points within linear Ub chains, and this is observed within cells after TNF stimulation. Intriguingly, at the end of the manuscript they propose that HOIP and HOIL-1 might undergo ubiquitin relay, reminiscent of that reported for MYCBP2 by the Virdee lab.
Overall the manuscript is an important contribution. Some additional experiments should be carried out. Furthermore, the manuscript in its current form affords only a modest advance over the Kelsall et al. study. Additional experiments should also be carried out to address this as stated below.
1) The grey unannotated regions (Figure 3) in sharpin, HOIL and even HOIP to a degree demonstrate anomalously promiscuous crosslinking. Could the authors comment and perhaps add some discussion to the paper? Does this suggest these unannotated regions are highly dynamic? Might this relate to the difficulty in solving higher resolution structures?
2) Thr12 and Thr55 were identified as potential ester linkage sites within polyUb species. However, their mutation did not abolish formation of the hydroxylamine sensitive bands. The authors should state the observed ubiquitin sequence coverage in the MS experiment. Which regions were not covered?
3) To confirm that the residual oligomeric Ub species after OTULIN treatment are exclusively ester-linked, a subsequent hydroxylamine treatment step should be performed.
4) The authors hypothesise that a key function of the HOIL-1 esterification activity is to form heterotypic chains. Whilst this might be the case, the alternative hypothesis that HOIL-1 primes substrates via an ester linkage, which are then linearly extended by HOIP, is also equally valid. Particularly as multiple substrates have been reported to be modified with linear chains yet HOIP appears to be tailored to modify a Ub substrate exclusively. The authors should discuss this alternative hypothesis and also how and why both systems might be important.
5) Perhaps in further support of substrates being the most abundant ester linked species, NEMO enriched linear chains from TNF treated cells show a much more pronounced collapse compared to the ester-linked Ub-Ub linkages produced in vitro in the absence of substrate. It would greatly strengthen the paper if they could add a recombinant substrate to the in vitro reaction (e.g. IRAK1/2 or MyD88). I am not sure about the feasibility of this.
6) Finally, the suggestion that HOIP-HOIL Ub relay might be at play is exciting and implies that E3-mediated Ub relay might be a prevalent process. In principle it should be possible to test this by impairing E2 binding to the RING1 domain in HOIL in the LUBAC complex. A steric mutation (e.g. X to Arg) would be a more elegant approach than mutation of the zinc coordinating cysteine. If relay is at play then the LUBAC should still be able to form ester linkages.
-
Reviewer #1:
Carvajal et al. provide a novel mechanistic insight to the function of HOIL-1L in the formation of heterotypic ubiquitin chains in the context of the full LUBAC complex. This expands on recent work suggesting HOIL-1L has the intrinsic ability to form oxyester-type linkages on its own, and nicely describes the phenomenon in the context of LUBAC both in vitro and in cells. Initial descriptions of the preparation of pure and stoichiometric LUBAC complex are clear and will be of utility to the field. The authors use negative stain EM to structurally characterize the complex, but conformational flexibility prevented the generation of a reliable 3D model for de novo model or docking of known components. The organization of the complex is also described by XL-MS, which enabled the authors to suggest positions the RBR domains of HOIP and HOIL-1L in proximity along with the NZF domain of HOIL-1L into a putative catalytic center. Visualization of a unique triUb or tetraUb conjugate is analyzed with gel-based assays to assess determinants associated with its formation or destruction. The unusual species are formed only in the presence of co-purified LUBAC containing catalytically active HOIL-1L, but without requirement for the previously suggested T12 acceptor residue within Ubiquitin. Further, the heterotypic chains are removed by treatment with hydroxylamine (a nucleophilic acceptor of oxyester-linked Ub) or treatment with Cezanne (a deubiquitinase with K11 linkage specificity) but not OTULIN, a deubiquitinase specific for Met1 linkages. The work is given cellular context by induction of LUBAC activity in response to TNF signaling in lysates of MEFs with wild-type or mutant HOIL-1L. Indeed, more hydroxylamine-sensitive Ubiquitin chains are formed (and immunoprecipitated by the Linear-chain binding NEMO construct) in the wild-type but not HOIL-1L catalytic mutant MEFs upon TNF stimulation.
This clearly written and well-organized manuscript presents new insights into LUBAC assembly and its formation of heterotypic chains. While it is unfortunate that the seemingly well-behaved, monodisperse, stoichiometric complex could not be further structurally characterized, the biochemical characterization of heterotypic Ub formation is thorough and the study constitutes an impactful advance in our understanding of polyubiquitin formation, non-traditional chain linkages, and the LUBAC.
My primary criticism is centered on the 3D structure presented - what does it really contribute to the study? The 2D analyses demonstrate the substantial flexibility of the complex, and projections generated from the 3D structure only marginally match the selected projections shown in Figure 2. If EM analyses are meant to support the biochemical reconstitution of the active LUBAC complex, then the 2D class averages are more than sufficient. Based on the 2D data, and the fact that there are many class averages that are not recapitulated by 2D projections (and vice versa) it is highly unlikely that the purified complex is consistent with a single 3D structure. If the authors were able to use negative stain of complexes, where individual subunits contained identifiable tags (e.g. GFP, MBP), to localize subunits and corroborate the XL-MS, perhaps a 3D model would be appropriate, but as it stands, I don't see the utility of the 3D density.
One other issue has to do with the 2D XL-MS plots. I've always found these plots to be particularly uncompelling representations of 3D structures. In particular, circus plots such as Figure 3B are difficult to interpret. Is it possible to "weight" the quantity or confidence of observed crosslinks, such that the reader's attention would be drawn to the most important and obvious linkages? This could be accomplished by using different line widths, color shade, or the presentation of multiple plots at distinct cutoff values. Further, the pair-wise domain representation similarly gives the impression that a single domain (or even single residue) is caught crosslinking to almost every part of the opposing protein {a straight line in the plot which contains many dots) in several instances. This could similarly benefit from thresholding or a more cautious description. Can it truly be inferred that the red RBRs and green NZF of HOIP and HOIL-1L are forming a catalytic center, when grey linker-regions are over-represented in the plot? It may also be visually more appealing to make non-domain grey regions significantly smaller in thickness than known domains or even just a linking line, in all representations 3A-3E and 6D.
I do not review anonymously, and I applaud the authors for publicly sharing their submitted manuscript on the bioRxiv preprint server. This practice enables others to benefit from findings presented in this research, as well as providing the authors with feedback from the community prior to completion of formal peer review. A postdoc in my lab, Randy Watson, helped me with this review.
-Gabe Lander
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
This manuscript by Carvajal et al. provides novel insights into HOIL-1L's activity within the LUBAC complex in synthesizing heterotypic, branched ubiquitin chains through oxyester-bond formation. The authors successfully produced and isolated recombinant LUBAC, containing full length HOIL-1L, HOIP, and SHARPIN, and although the intrinsic flexibility prevented a higher-resolution 3D-structure determination, negative-stain EM combined with crosslinking mass spec revealed important new information about the architecture of this complex. Based on the observed spatial proximity of HOIL-1L's and HOIP's catalytic RBR domains, the authors propose an intriguing ubiquitin relay mechanism between these E3 ligases in LUBAC.
The reviewers agreed that this work represents an important contribution to the field, as it corroborates and extends previous findings of HOIL-1L's non-lysine esterification activity. However, the advance and impact could be improved by some additional experiments to further strengthen the mechanistic conclusions.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
General assessment:
The manuscript "Calponin-Homology Domain mediated bending of membrane associated actin filaments" by Palani et al investigates the role of truncated versions IQGAP1 (from yeast and humans) in forming ring-like structures on lipid supported bilayers in an in vitro TIRF assay. This reviewer is still confused by the mechanism that the "curly" truncation uses to bend actin filaments and context between this new "curved actin" generating mechanism with the mechanisms for generating actin rings in other contexts could help the reader understand this advance with more clarity. The authors mention several physiological contexts where the formation of actin rings might apply (associated with mitochondria, in axons, and during cell division in the actomyosin ring) however do not follow up with experiments addressing these specific ringed structures, rather non-specific cortical actin rings in several cell types. While this work has strong potential and is very intriguing additional support/clarification is required to back the claims made by the authors.
Numbered summary of substantive concerns:
1) The visual components of this work are striking. However, the accompanying quantification is somewhat confusing. Throughout the text mean values are listed for various parameters beyond those shown in the figures and it will improve the flow of the manuscript/aid the reader if these were represented as panels in each figure. Further, at least 3 FOVs should be analyzed for all analysis, from independent experiments, however it appears that a single FOV was measured in several figures (i.e. Figure 3 sup 1; Figure 3 sup 2). Other experiments also have relatively low "n" (i.e. 6 filaments measured for the analysis in Figure 2 sup 2). Do these N values have enough statistical power to support these conclusions?
2) In the movies provided it looks like many of the "rings" are formed away from the coverslip and "fall" down into the TIRF field. Are these movies the most representative of ring formation for these versions of IQGAP? A comparison to actin filaments "alone" but with the lipids might ease this concern.
3) Are the two IQGAP1 truncations dimers or monomers? Based on sequence alone it appears the dimerization domain is lacking from these constructs, but the SNAP-labeled images in Figure 2 have bright punctate and dimmer filament-like structures. The addition of a model or further clarification on how this arrangement of labeled IQGAP leads to ring formation would aid the reader.
4) From the image presented in Figure 4 the "rings" from the human IQGAP1 truncation look substantially different than that from the yeast version - they are much larger (about 5x) and while "curvy" not exactly tight rings like I can see in the yeast examples. Yet the quantification as presented looks very similar. Is there a different optimal lipid content between mammalian or yeast lipids? Is the longer unstructured region in the mammalian isoform contributing to the difference?
5) The authors should provide an explanation in the body of the manuscript of what "curly" constructs are being used in mammalian cells. From the methods it looks like the yeast truncations are being expressed. This should be compared to the mammalian version. Additionally, are the cellular rings a similar size to those observed in vitro (perhaps from the example in mammalian cells they are, but not for the yeast?). Additionally, this work would be really sing the in vitro rings were linked to a specific population(s) of cellular actin rings - what is the nature of the cortical rings analyzed by the authors? Are these actin associated mitochondria? Where is IQGAP1 during cell division?
-
Reviewer #2:
In this manuscript, Palani and coworkers investigate the structural effects of binding of a fragment of the IQGAP family of proteins, called "curly", to actin filaments. When tethered to a supported lipid bilayer, curly induces curvature in actin filaments, ultimately giving rise to ring-shaped filament structures. Filament decoration by tropomyosin increases the propensity of ring formation, and introduction of myosin II filaments induces constriction.
This manuscript presents novel and intriguing insights into the mechanisms that regulate the formation of cytoskeletal structures with curved geometries. The manuscript is well written, and the experiments are logically described. As such, this paper is sure to be of interest to a broad audience.
Below are a few suggestions I would like to see addressed:
1) What is the magnitude of curly's affinity for actin filaments? How does this compare to the binding affinity of the isolated CH domain?
2) Given that curly is proposed to contain two actin-binding sites, has this protein ever been observed to bundle filaments? Also, do multiple filaments ever become incorporated into the same ring?
3) How does the counter-clockwise direction of curvature of the actin rings compare to the helical pitch of the actin filament? In other words, are the actin subunits being wound tighter around the filament's long axis or are they being loosened?
4) The authors compare the structural effects of curly binding to those produced by cofilin. Cofilin binding has been reported to alter the twist of actin filaments. Is this what is proposed to happen for curly-bound filaments as well?
5) At the bottom of page 3, the authors state that: "Importantly, the uni-directional bending supports the hypothesis that the binding site of curly with actin filaments defines an orientation, and the propagation of a curved trajectory once established indicates a cooperative process."
Cooperativity implies that a process becomes easier once it is started. Do the authors have evidence that it becomes easier to bend the filament along its length once the first binding/bending event occurs? Or is it possible that the additive effect of multiple filament bending events eventually generates a ring-like shape?
6) It is unclear to me how the model of the myosin II-bound actin ring in Figure 3 Supplement 4 Part E illustrates a possible mechanism for myosin-induced constriction of the actin ring. If I am interpreting the schematic correctly, the authors indicate that ring constriction occurs via the application of force in the upward direction to the inner portion of the filament on the left side of the ring, and in the downward direction to both the inner and outer parts of the filament on the right side of the ring. However, it is my understanding that pulling simultaneously on the outer and the inner parts of the filament on the right side of the ring would not stimulate constriction. I believe one would have to pull on only one of those outer and inner segments at a time to slide them along each other and constrict the ring.
If I am misunderstanding the schematic, can the authors correct me by expanding on their proposed mechanism?
7) How constrained are the motions of Rng2 in S. pombe? Once Rng2 localizes to cytokinetic nodes, do the nodes move around enough to be mimicked by tethering curly to the supported lipid bilayer?
8) The reference to the Tebbs and Pollard paper has an incorrect author listing in the References.
9) The filament on the left in Figure 1A has a left-handed helical twist and should be corrected. The same is true for the filaments in Figure 3 Supplement 2, and Figure 3 Supplement 3.
-
Reviewer #1:
The IQGAP family proteins interact with actin, and contribute e.g. to the formation of cytokinetic rings. Here, Palani et al. provide evidence that the N-terminal fragments of these proteins, composed of a CH domain and 'unstructured region', contain two separate actin-binding sites and can bend actin filaments into rings. This activity requires anchoring of the IQGAP fragment, which they named 'Curly', on the surface of a membrane. Moreover, they demonstrate that actin filament bending by Curly can be enhanced by addition of tropomyosin, and that myosin II can contract these actin rings.
Major comments:
1) The authors discuss on pages 1 -2 how full-length Curly and its various deletion constructs bind actin filaments. However, actin-binding was not properly tested for any of the constructs used in this study. Thus, the authors should carry out proper actin filament co-sedimentation assays for all constructs. The assays should be performed with a constant concentration of Curly, and varying the actin concentration (form 0 uM to e.g. 8 uM) to obtain binding curves, and to be hence able to compare the F-actin affinities of different constructs.
2) The cell biology data presented in Fig. 4 and Fig. 4 - figure supplement 2 are not particularly convincing. The authors should thus perform a careful quantification of F-actin curvature and 'actin ring frequency' in cells transfected with plasmids expressing (i). EGFP, (ii). EGFP-Curly, and (iii). an EGFP-Curly mutant defective in ring formation. Because EGFP-Curly most likely does not associate with the plasma membrane in cells, it is somewhat confusing how it could still induce the formation of actin rings. Thus, the authors may observe much more robust actin ring formation in cells if they would use a membrane-anchored Curly-EGFP instead of soluble EGFP-Curly.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
This manuscript reports the structural effects of a fragment of the IQGAP family proteins, called "curly", on actin filaments. When tethered to a supported lipid bilayer, Curly induces curvature in actin filaments, ultimately giving rise to ring-shaped filament structures. Moreover, this study demonstrates that filament decoration by tropomyosin increases the propensity of ring formation, and introduction of myosin II filaments induces constriction of actin rings.
The findings presented in this manuscript are potentially very important. However, in some cases the results are somewhat preliminary and lack essential controls. Thus, additional experiments and data analysis are required to strengthen the study.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
This study investigated gene expression profiles related to diabetic retinopathy by using several strategies. First, they tested differential gene expression associated with response to glucose by comparing lymphoblastoid cell lines (LCLs) between cases (with retinopathy) and controls (without retinopathy) with type 1 diabetes. Secondly, they identified significant eQTLs from gene expression analysis and public gene expression databases and then tested significant eSNPs by the meta-analysis GWAS using independent cohorts. Furthermore, they confirmed one gene expression, the FLCN gene, to be a mediator of diabetic retinopathy by the Mendelian Randomization method. The aims of the study are clear and the paper is well organized. However, the following points should be addressed.
Comments:
1) It is confusing that the authors used different selection criteria for gene identifications. In Results (Line 472), they identified 19 differential response genes (P <0.05) between retinopathy cases and controls. However, they have selected the top 103 genes with P<0.01 (Results, Line 494) for further investigation. The reason for this is unclear. I assume that the FLCN gene is in the top 103 gene set but not in the above 19 gene set. Explanations are needed for including specific genes for different analysis purposes.
2) The authors selected LCLs from individuals of 3 groups, non-diabetes (nDM), type 1 diabetes without retinopathy (nDR) and type 1 diabetes with proliferative diabetic retinopathy (PDR). I didn't see much benefit of utilizing nDM samples in the analysis. Although both gene expression and GSEA methods were conducted, the results were not relevant to diabetic retinopathy. What is the purpose of including these samples?
3) Similarly, it is not clear what the purpose of using the gene set enrichment analysis (GSEA) was. My understanding is that the authors performed most analyses to identify genetic components by gene-based or SNP-based methods in the manuscript.
4) The authors tested gene expression profile and associations using data from type 1 diabetic retinopathy. However, for the confirmation with UK BioBank (UKBB) data, they included all samples with both type 1 and type 2 diabetes. Did you perform the analysis stratified by the type of diabetes? Do you have any explanations of possible differences?
-
Reviewer #1:
This paper is based on the analysis of a blood cell line of 22 subjects from three different groups in relation to diabetic eye disease. It includes first a transcriptome analysis based on microarrays. Then the studies are mainly based on bioinformatics analyses with GWAS meta-analysis and GTEx data extraction. The in silico study is followed by a so-called validation in the UK biobank.
The overall strategy is sound and the paper well written. It remains that the whole paper and it’s conclusions are based on a very small number of samples and not supported by strong experimental data about causality. This reviewer is surprised that the title only focused on "Mendelian randomization", which is an overstatement of this gene expression study. In addition stating that RM "identifies folliculin expression as a mediator of diabetic retinopathy" is also an overstatement for this reviewer (the mediator effect is not shown). Overall, the small group of studied subjects present huge differences in duration of diabetes and glucose control, the 2 main risk factors for retinopathy. How can you differentiate the biological effects of long term high glucose and their impact on retinopathy? In other words is it possible to change the title to "Mendelian randomization identifies folliculin expression as a mediator of long term uncontrolled diabetes"?
Based on the transcriptome analysis this reviewer is afraid that the conclusion "This finding suggests that chronic glucose exposure depresses cellular immune responsiveness and may explain in part the increased risk of infection found in patients with diabetes" is not based on evidence as authors selected transcripts of their choice and also because causality is not shown. "Individuals with diabetic retinopathy exhibit a differential transcriptional response to glucose". Note that the level of association shown (especially for PDGF) is somewhat marginal. "Genes with differential response to glucose are implicated in the pathogenesis of diabetic retinopathy." This part is the most intriguing and original but it is based on expression in many tissues and thus the title is also overstated: it shows some kind of association but certainly not that these 103 genes "are implicated" in retinopathy.
"Folliculin (FLCN) is a putative diabetic retinopathy disease gene" this part is also interesting (and includes some in vivo experiments) but this reviewer wants to stress that the original whole genome gene expression study did not detect FLNC as differentially expressed in the blood cells of the patients with retinopathy. Why?
It is also noteworthy that to this reviewer's knowledge no GWAS found SNPs near FLCN associated with diabetes or complications. This is worrying.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
This study investigates gene expression profiling related to diabetic retinopathy using several strategies including differential gene expression associated with response to glucose by comparing lymphoblastoid cell lines (LCLs) between cases (with retinopathy) and controls (without retinopathy) with type 1 diabetes. The study identified significant eQTLs from gene expression analysis and public gene expression databases and then tested significant eSNPs by the meta-analysis GWAS using independent cohorts. The expression of one gene, FLCN, to be a mediator of diabetic retinopathy by the Mendelian Randomization method was confirmed.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In their manuscript "Characterization of the dynamic resting state of a ligand-gated ion channel by cryo-electron microscopy and simulations", Rovšnik et al. describe a structural study of the GLIC ion channel under 3 pH conditions combining cryo-electron microscopy and molecular dynamics simulations. Their aim is to shed light on the resting state (neutral pH) structure of this ion channel, that has previously been described by a crystallographic study with intriguing observations. Although the authors do not really say so explicitly, it seems their interpretations of the new data largely confirm the conclusions of that previous work. This is a major point that needs to be made explicit: does their study confirm (and to what extent) the one by Delarue (ref [27]) and how similar are the structures. Here a comparison of the pH7 cryo-EM and x-ray density maps could be a welcome analysis. The important related question is: what new information (in terms of the ion channel function etc., not in terms of structure determination methodology) do we learn from this study compared to ref [27]? This should also be made more explicit and be implemented by taking into account intrinsic uncertainties in the study (see next paragraph).
One concern - quite honestly raised by the authors themselves - is to what extent the cryo-EM maps obtained at ph3 and ph5 may represent the expected functional state, or incorporate some artefactual conformational substates, as they seem to lack a few key features of an open/active state that would be expected under these conditions. For the ph7 state as well, it cannot be excluded that the observed conformation bears some traits of desensitized or intermediate states, as is mentioned in the present manuscript. These overall uncertainties are somehow convoluted with the interpretation and analysis of the data, and in the present version of the manuscript it needs to be made clear much earlier that most of the interpretations only hold/make sense if one assumes certain hypotheses (eg that the pH7 structure is a resting one and not any of the other possibilities for instance, etc.), which otherwise is perfectly fine.
The last major concern about the manuscript concerns the computer simulations. The protonation states adopted to represent activating or resting simulations are not explicitly given in the paper, nor the choices discussed and justified in any way, whereas this seems to be a rather controversial issue for the simulation of this particular pH-gated channel as literature attests, and obviously a central one with respect to the questions studied in the present work. Also, are there indications in the cryo-EM derived structures on specific protonation states (eg two acidic side chains very closeby may indicate at least one is deprotonated, etc.)? The next issue that has not been mentioned, but seems quite critical to assess whether activating simulations actually go the right way, is about the wetting/dewetting of the channel pore. Are they stably water-filled in any of the simulations? This is one of the metrics actually used in ref. [21] and a few of which have been adopted for the analysis in Fig. 5 of this paper. A more detailed comparison with that computational work seems rather commendable, as well as probing more of the metrics that are employed there. Also, the discussion of Fig. 5 results should be extended, as it is not clear how to interpret this important figure. Why were the simulations ordered as they are? And how consistent are the observed trends for ECD radius, twist and upper spread?
-
Reviewer #2:
This article reports 3 new structures by cryo-EM of a bacterial pentameric ligand-gated ion channel (pLGIC) known as GLIC, in its resting form, at 3 different pH: pH 7, pH 5 and pH 3. The resolution extends from 4.1 Å for the first one and to 3.4-3.6 Å for the last two. Since GLIC is gated by protons, one should see at least two different forms, resting and active, at the various pHs. The main results are the following:
1) The structure at pH 7 is in a resting state and is highly flexible
2) It becomes much less flexible at pH 5 or pH 3, but the pore remains closed
3) All three structures were obtained in detergent (not in nano-discs)
In itself, this is a valuable article with a lot of new interesting information. However, I suggest to consider the 4 following points to improve the manuscript. In a nutshell, I see 3 main points in the analysis of the structures that should be addressed, plus a methodological issue.
1) The fact that GLIC at pH 7 in its resting form is highly flexible was already known before this study and has been extensively documented in the article that describes the x-ray structure at 4.4 Å (Sauguet et al., 2014, Ref. 27) because the asymmetric unit of the crystal contains in fact 4 different pentamers in different conformations. This should be better discussed in the article, in particular in relation with Figure 4 of Ref 27, where the dynamical nature of the resting state is clearly mentioned.
2) While the analysis of differences between GLIC structures at 3 different pH is well conducted, there is no detailed comparison with the other crystal structures of the same ion channel GLIC, which are listed in the manuscript (p. 2, line 27 to p. 3 line 6): the crystal structures of the resting state, the activated state, a locally-closed state and a possible desensitized state. One should expect at least a panel in a principal Figure of a detailed comparison between these structures. To understand the differences between the 3 structures presented here (pH 7, pH 5 and pH 3) and other known structures of GLIC, a projection of these 3 structures on various 2D maps should be presented using relevant variables (RMSD are rather useless here), along with representative structures of all other known forms of GLIC: the open form (4HFI), the 4 structures in 4NPQ and the locally closed form in 3TLT. See B. Lev et al, PNAS 2017 for such variables, in Figure 4 and 5 (ECD radius, beta expansion, M2-M1(-) distance, ECD twist).
3) While it is surprising to observe that the pH 3 structure is still in a resting form, it is possible to interpret this as the left side of the minimalist reaction path of the allosteric transition that looks like this:
pH 7 closed <-> open
^ ^
| |
v v
pH 4 closed <-> open
However, the reaction path of the gating transition is unlikely to be this simple. The dynamics of the gating transition in GLIC has been extensively studied in B. Lev et al., PNAS 2017 by long MD simulations and the string method. Unfortunately, this article is not cited in the present work, nor any detailed comparison of its conclusions with the proposed pathway presented in Figure 6A. In particular, Lev et al. insist on the role of the salt-bridge D32-R192, that gets broken to form another salt bridge D32-K248 in the open form. Do the 3 new GLIC structures solved in this new work confirm the importance of this salt bridge in driving the transition or not? In p. 6 the authors analyze specifically the conformations of the side-chain K248 but do not mention this possibility.
4) Methodology (p. 10) The paper reports both a new and interesting method to refine models in cryo-EM maps using MD simulations with adaptive constraints and the resulting refined models. But the validation of the method itself on well documented test cases is missing (unless I missed something). In other words, there is some sort of a circular argument here: a new method is presented that allows good sampling and flexibility in the refinement under experimental constrains, but the justification is simply the output of the method, namely fitted -and flexible- models. While it is possible that the new method is superior to other extant and validated methods in speed, is it as accurate - or more?
Specific comment on the Figures:
Figure 1: The structure at pH 3 has (overall) a slightly higher local resolution than at pH 5. Any comment?
Figure 2: Does K248 makes a salt bridge with D122 (Panel B)?
Figure 4: Rmsd do not bring a lot of information. Could the authors map their structures, along with all other known GLIC structures, on 2D maps with essential parameters such as ECD twist angle, M2-M1(-) distances as in Figure 4 and Figure 5 in Lev et al., PNAS, 2017?
Figure 5: Again Rmsd -and their distribution- plots do not bring a lot of information. Also,
1) Which pentamer has been used for the pH 7 X-ray form? (there are 4 of them in the asymmetric unit). Would the result be different with a different pentamer?
2) I strongly oppose the names of the so-called pH5 and pH3 cryo Activating forms: they are not Activating, but merely the same structures with different sets of electrostatic charges. This is misleading, the reader might think it is an experimental structure (cryo). Best if the words Resting and Activating are changed to Deprotonated and Protonated, respectively.
Figure 6: Panel A should be compared and discussed with Figures 3 & 4 in Sauguet et al., PNAS 2014, as well as with the Discussion in Lev et al., PNAS 2017.
-
Reviewer #1:
This manuscript reports cryo EM structures of the GLIC channel under resting (high pH), partially (pH 5) and fully (pH 3) activating conditions. The structures reveal some features that were not so well resolved in previous X-ray structures and use simulations to suggest a dynamic structure at high pH, indicative of an ensemble of resting state conformations, compared to a more compact and well-defined structure under activating conditions. This idea is not entirely new, however, as it was a conclusion of the resting state X-ray structure paper of Delarue and co-workers (ref.27). The study also sees changing structural elements that might imply roles in gating, such as with loop F and interactions of E243, though also suggested in past X-ray structures. It is surprising that all structures, including under maximally activating conditions, are completely closed, and the explanation for this is not compelling. Another surprising outcome is that the distributions from simulations of the resting state at high pH based on the new cryo EM structure are so different to those obtained using the past X-ray structure, and there are indications of lack of convergence of these simulations.
The findings and discussion of Delarue and co-workers in Ref27 could be more prominent, including in the introductory statement, which could be cited along with refs 11,14,15 as a solved resting state, and not just described as being of low resolution. I refer to Fig.3c of ref.27 which conveys the idea of the diverse resting state distribution in that paper.
In regards to the "relative novelty" of the methods used for MD fitting to cryo EM data, it is not obvious how different the approach is to standard MDFF flexible fitting strategies. Although there is brief mention in the discussion section, it is not clear from the introduction and methods how novel the approach is. I do suggest, however, that it does not make sense to refine the structure with simulations of GLIC in a POPC lipid bilayer, when the cryo EM involved detergent solubilised particles. Fitting MD should have been done in micelles as it is not appropriate to refine in a different environment to which it was solved.
The authors claim higher RMSD for pH 7, but fig.4A suggests divergence of simulations in 1us. It seems the simulations would need to run longer to reach an equilibrium distribution. It is curious that such divergence is not evident in high pH X-ray structure simulations in the same figure. Does this suggest the cryo EM structure at high pH is unstable? Is this increasing RMSD spread uniformly or due to changes in particular parts of the protein during MD? I note that subsequent analysis, such as fig.5, revealing no maximum in the distribution for ECD bloom compared to X-ray simulations at high pH, may be due to not yet converging on an equilibrium for the resting state (and pre-equilibration period not being excluded).
Despite the pH 7 cryo EM simulations likely being not yet equilibrated, leading to some uncertainty about the meaning of the distributions in Fig.5, it is clear that low pH leads to a more tightly bound ECD bloom range than pH 7 in that figure. Although the effects of pH are similar between cryo EM and X-ray starting structures, why is the peak in Fig5b ECD twist also so different for pH 7? This also could be an artefact of lack of equilibration. Differences are also noted at low pH.
Fig5c is striking. It suggests cryo EM at low pH has failed to capture an open pore, whereas X-ray was able to capture an enlarged pore radius. The authors write that this was initially surprising, having all low pH structures closed, but consistent with past X-ray with one structure partially closed. But here all structures look completely closed, whereas a fairly even mix of open and closed TMDs may have been anticipated at low pH, at worst. The possible artefact due to interaction with the glow-discharged cryo EM grid could be better explained for the reader. On page 16, the authors say the closed pores do not look like they would expect for a desensitised state. This also needs a better explanation with more specifics. They then suggest it may be because at low pH the pore can flicker and the open pore has a high free energy. Why is the open state expected to be high free energy at low pH? Doesn't the pH50 of 5 suggest the equilibrium is shifted to the open channel (lower free energy) by pH 3, as also suggested by previous free energy analysis in ref.21? While fig.6 is used to illustrate a reduction of number of closed states to the left with lowered pH, "priming" the protein for gating, again it does not make sense to me that at low pH the free energy of the open state on the right is higher than the closed state on the left.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
This manuscript reports cryo-EM structures of the pentameric ligand-gated ion channel GLIC at pH 7, 5 and 3. The reviewers have appreciated several aspects of the manuscript, which combines experiment with simulation to describe the GLIC channel's resting state. However, concerns have been raised. The reviewers have questioned what has been gained in addition to previous work on the structure and mobility of the resting state (Sauguet et al. PNAS 2014; ref.27), not described in this manuscript. How do the new structures compare to past X-ray structures/density maps? The reviewers raise questions about the functional states found. In particular, while rigidification at pH 5 or 3 is interesting, normally it should switch to the open state, especially at pH 3, and why this has not occurred is not explained well. Several concerns have been raised about the simulations and what is learned. This includes protonation state choices (not discussed or justified), why flexible fitting was conducted in a bilayer instead of a micelle (which may impact regions of the map less well defined), and have the simulations converged? The reviewers note lack of informative analysis, leaving us in the dark as to the functional states visited. It has been suggested that analysis in collective variable space would be needed, such as defined in Ref.21 (not discussed in this manuscript), so that the reader can observe if structural features change, despite maintaining an apparent resting conformation (e.g. does the D32-R192 salt bridge break; does the pore wet/dewet)?
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The manuscript by Karamanlis and Gollisch examines the responses of mouse retinal ganglion cells (RGCs) to natural stimuli. The primary conclusion of the manuscript is that spatial integration of stimuli within the receptive field is nonlinear. This nonlinear integration is consistent with "local signal rectification". This results in a set of RGCs that are sensitive to spatial contrast within the RF. The Authors also note the presence of cells that are suppressed by contrast and cells that prefer uniform stimulation of the RF. To reach these conclusions the authors use multi-electrode array recordings from isolated mouse retina. Spatial RFs are estimated using white noise stimuli, which are then used to generate a null-model for linear spatial summation. They compare predictions of this null-model to the responses of the same RGCs to briefly flashed natural images. The authors find some RGCs that are consistent with this null model and many that are not consistent. The authors correlate deviations from linear spatial summation to deviations revealed by contrast reversing gratings. They also used a mixed-contrast, flashed-checkerboard paradigm to map the contrast tuning and rectification of RF subunits. Finally, the authors show that some of these results track with functionally distinct RGC types such as direction-selective and "IRS" RGCs.
The data and analyses presented in this manuscript are high quality. However, I think the study is largely consistent with many previous studies that demonstrate nonlinear spatial integration among RGCs in the mammalian (including mouse) retina. I think the Authors view the use of natural stimuli as a major departure from previous work, but I'm not convinced of this for two reasons. First, I don't see a compelling reason to think that results using contrast reversing gratings or other 'textured stimuli' (e.g. Schwartz et al Nat Neuro 2012) would fail to generalize to flashed natural scenes. Second, the implicit claim here is that a 200ms flashed natural scene interleaved with an 800ms gray screen is a natural stimulus. I think this assumes a lot about the space-time separability of the RF mechanisms, and these assumptions are not well justified.
Major Concerns:
1) I think the introduction of the manuscript is building a straw man argument, suggesting that many (or most) scientists think the retina is predominantly linear. A pubmed search of 'retinal ganglion cell' and 'nonlinear' produced more than 300 studies. Specifying subunit nonlinearity produces 28 studies. The discovery of subunit nonlinearities is roughly 50 years old and many manuscripts demonstrate Y-like receptive fields are more common across RGC types than X-like receptive fields.
2) The authors seem to be arguing that the spatial nonlinearities engaged by the contrast reversing gratings are not the same as those engaged by their natural scenes (Figure 3). However, I think the authors are assuming too much that the spatial and temporal components of the RFs are separable. The flashed natural scenes are interleaved with relatively long gray screens. The contrast reverse granting are reversed in a square-wave fashion with no interleaved gray screen. These distinct spatiotemporal dynamics in the stimuli seem likely to explain the difference. This would also seem likely to explain why the flashed checkerboards in Figure 4 produced results more correlated to flashed scenes in Figure 1. In summary, I don't see a strong reason to think the authors are observing anything other than subunit rectification of the sort described by Hochstein and Shapley in the 1970s and followed up in many subsequent studies.
3) It is not clear to this reviewer that flashed natural images interleaved by a gray screen is qualitative more natural than white noise, sinusoidal gratings, or square-wave gratings.
4) The null-model constructed by the authors in Figure 1 assumes the RF follows a specific functional form (e.g. Gaussian). However, many studies show that individual RFs frequently exhibit strong deviations from a Gaussian RF. To what extent are the deviations from the null model produced by deviations from linear summation or just linear mechanisms that deviate from the specific parametric form imposed by the model?
5) It was unclear how the authors rule out the contribution of differences in (nonlinear) temporal integration to the effects in this study. In general, RGC RFs are not space-time separable, and it seems that the analyses in the manuscript assume they are.
6) This study overlaps significantly with Cao, Merwine and Grzywacs (2011), 'Dependence of retinal Ganglion cell's responses on local textures of natural scenes', Journal of Vision. This article is not cited here, but in my view, the major conclusions are similar.
7) In my experience, the strength of subunit rectification can be labile during ex vivo experiments. What controls have the author's performed to ensure the effect they are studying remain stable over the duration of their recordings?
-
Reviewer #2:
Summary:
Understanding how retinal ganglion cells respond to natural stimuli is a central but daunting question, which retinal neurophysiologists have begun to tackle recently. Here Karamanlis and Gollisch perform large-scale multi-electrode recordings in the mouse retina and demonstrate that the responses of many ganglion cells cannot be predicted by standard linear-nonlinear models (L-LN). They go on to test a variety of clever artificial stimuli that emphasize and allow for the quantification of the non-linear aspects of RGCs responses and convincingly demonstrate that non-linear processing is associated with sensitivity to fine spatial contrasts (subunits) and local rectification. While these aspects of RGC receptive fields have been previously described, demonstrating their applicability to natural vision is a significant advancement.
Major Comments:
My first main concern is with the way the paper is written. It does not highlight the significant advancements but rather emphasizes what is already known from other studies. For example, many of the conclusions of non-linear spatial integration & signal rectification arising in bipolar cells have been well described previously. By contrast, novel aspects like the sensitivity of reversal gratings being unrelated to LN model performance for natural scenes should be explained more in detail. The authors should more clearly state the major advancements that are being made here beyond what has already been shown previously (e.g. Turner and Rieke, 2016)
Second, the authors never include non-linear subunits in their model to demonstrate improved performance. Testing models with filters that incorporate rectification and convexity as experimentally determined will enable them to show their utility more convincingly. Without this, the reader is left with the conclusion that there are RGCs that exhibit non-linear or linear spatial integration (already known) and that non-linear integrators cause LN models to perform poorly with natural images (Turner and Rieke, 2016).
Third, I'm not sure how 'natural' their natural images are, given static images are flashed over the cell intermittently. While such stimuli might simulate some sort of saccadic eye movements, whether this is relevant for mouse vision is not clear. Would linear models be more predictive for responses to natural movies? Some discussion on this issue would be helpful.
-
Reviewer #1:
This paper investigates how retinal ganglion cells integrate inputs across space, with a focus on natural images. Nonlinear spatial integration is a well-studied property of ganglion cells, but it has been largely characterized using grating stimuli. A few studies have extended this to look at spatial integration in the context of natural images, but we certainly lack a comprehensive treatment of that issue. The current paper has a number of strengths - notably using a number of complementary stimuli and analysis tools to study a large population of ganglion cells and linking properties of responses to artificial stimuli with those to natural stimuli. It also has a few weaknesses (some detailed carefully in the paper) - such as the inability to identify ganglion cell types (aside from a few), and to pinpoint specific circuit mechanisms. These are limitations of the techniques used. This is not a request as much as setting the context of the contribution of the paper. Generally the paper was in good shape, and the data supported the conclusions well. I do think there are a number of issues that could be strengthened. Those are listed below in rough order of importance.
Statistical correlations in natural scenes:
A number of analyses in the paper rely on estimating the spatial contrast from an image and comparing the dependence of various measures of the cells' responses on spatial contrast. A danger in this analysis is that spatial contrast is likely correlated with many other statistical properties of the image, so attributing a given response property to spatial contrast has some potential confounds. This issue should be discussed as a possible caveat, unless the authors can rule it out. The paper, accurately, describes the results in terms of correlations (and not causal relationships), but some discussion of the complexity of natural image statistics would be helpful.
Comparison of grating and natural scene spatial scale:
The section starting around line 233 was confusing for several reasons. First, this section starts by measuring the spatial scale associated with the grating responses, and then comparing that to LN model performance for natural inputs. It's not clear why the spatial scale is the relevant aspect of the responses to gratings. Indeed, the next paragraph provides a measure of the relative sensitivity of the nonlinear and linear response components (via a comparison of F1 and F2 responses). It would be helpful to include some initial text to motivate the different measures of the grating responses and to anticipate that you will look at both spatial scale and sensitivity. A related issue that bears more directly on the scientific conclusions comes up later in the blurring experiments. The issue is whether it is valid to directly compare the apparent spatial scale of nonlinear responses to images (estimated via blurring) with that of the grating responses. Natural images should have much higher power at low spatial frequencies, and this may strongly impact the spatial scale identified with the blurring experiments.
Clustering of orientation-selective cells:
An interesting suggestion in the paper is that the orientation-selective cells can be divided into two groups that differ in their spatial integration properties. Do these groups represent different orientations, as suggested in the text? That seems a simple piece of information to add. Related to this, I would suggest moving Figure S4 into the main text.
Presentation of checkerboard stimuli and results:
The checkerboard analysis, particularly how it isolates properties of spatial integration, could get introduced more thoroughly for a reader unfamiliar with it. A related issue is how well the chosen isoresponse contour captures structure in the full distribution of responses. In some cases that looks pretty good, but in others it is less clear. Could you add a supplementary figure or something similar that characterizes how consistent the isoresponse contours are for different response levels?
Drift in responses over time:
Some of the rasters - e.g. the bottom left in Figure 1C - show considerable drift over time. It is important that this drift not be interpreted as a failure of the LN model and hence indicative of nonlinear spatial integration. Can you test for drift like this across cells, and exclude any that seem potentially problematic? More generally, some assurance that the variability in the responses for a given generator signal value is real variability across images is needed.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
All of the reviewers expressed concerns about the advance that the work described in the paper represents. These issues were a focus of the consultation among the reviewers. The main concern is that the work needs to go beyond demonstrating that some ganglion cells exhibit nonlinear integration for naturalistic inputs - as that point is quite well established in the literature. The comparison between natural stimuli and gratings could help in this regard, but several issues confound that comparison (e.g. differences in dynamics of the two types of stimuli). These concerns are detailed in the individual reviews below.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The authors study the effect of confinement on the alignment of REF cells confined within circular micropatterned islands. They observed that the cells are aligned perpendicularly to the boundary after 48h, contrary to other elongated cells such as NIH-3T3. After testing several subclones of that cell line, they identified cell contractility and cell-cell adhesion that affect the organization of the cells in the circular patterns. They confirmed this using drugs that affect contractility and disrupt cell adhesion. Then they compared their results to a continuum model and to a voronoi model.
The science is interesting. Many cell types are elongated and do align with their neighbors. The fact that these cells align perpendicularly to a boundary is curious, and deserved to be studied in depth. 3 similar papers came out on arxiv from the Roux group. They should be discussed in the manuscript and cited.
It is not clear what is the "condensation" process the authors are referring to and how this is related to the boundary alignment of the REF cells. Please, read the work of trepat et al on active dewetting published in 2018. I do not know what the author means by tendency. Some it condensed, sometimes It does not? IT is not a scientific term. I would advise choosing different wording to explain their results. Condensation is the first word in the title of the manuscript, still it appears for the first time in the text on page 18, and is poorly defined. It is never well explained and the 2 terms always come up together, condensation and tendency, like if the author does not know themselves what to call what they are observing.
There is a lot of data, analysis and model, but it is very confused, not well organized and poorly presented, which prevents me from judging the quality of the interpretations. The authors chose to show all the analysis they could do in the figures, and therefore there is no clear take-home message. Are all those plots necessary?
It was a very difficult paper to read. Often, terms like nematic, or symmetry are misused. Such words have a very specific meaning, in particular for liquid crystal physisicst, which are one of the targeted audience for this paper. The figures are not clear. They at the same time put too much information and not enough. There are too many graphs, I don't know which one is important. Please, plot the 2 cells types in the same graph instead of showing one graph/cell type. At the same time, there is often not enough information to understand what the authors are plotting, and what is the take-away message.
Below, I have specific comments about the text, not so much about the science. Again, I found it very hard to read and understand, hence I am not able to judge the quality of the research at that point.
Specific comments about the figures: Fig 3: What is the unit of the heat maps? Please add fluorescent image, and average for the second row, and for the plot, please, add a label "normalized mean intensity" of what?
I do not understand Fig 4. The captions just reads the labels of the plots, it does not tell me the results, nor the relevance.the is no information in the caption, please revise.
What is the main result of Fig5? The title could not be more vague: "Voronoi cell modeling predicts REF 2c cell behaviors in circular pattern.". Please give specific titles to your figures that help the reader understand the take-away message. Please change the contrast of Fig. 5A. All the disks look black to me. I have difficulties trusting statistical analysis. The top right plot of fig 5C looks totally flat to me. Why is there sometimesa statistical analysis and sometimes not? ( %B 1,2,4 and %C1 have no statistical analysis).
Same critics for Figure 6.
About the abstract: The terminology is vague and confusing, which I think that the authors have not fully characterized the connection between their experiments and the physics of liquid crystals. examples: "to form nematic symmetry" "to form a new type of symmetry" "new symmetry?" changing boundary condition does not mean you are changing the symmetry of the liquid crystal...
Strong adhesive interaction... MDCK also have strong adhesive interactions, therefore the comparison is not adequate, please revise.
What does "condensation tendency" mean? What does "prestrech" in the last sentence of the abstract mean? Is the tissue under stretch? There is no reference to stretch in the abstract before that.
Comments about the introduction: The introduction is scattered, very confusing as it mixes results from a broad range of model systems. For example in 4 successive sentences, we have: adipocytes, then fish then reconstituted asters, then back to muscle cells. This looks like a laundry list... Same thing in the next paragraph: neural crest cells, mesenchymal stem cells, chondrocytes, At this point, it is not clear what cells types the authors are studying and why it is relevant to all the others cited in the introduction.
Cell condensation is not "unique" to their cell types. MDA-MB-231 also do that ( ref: TRepat et al, Active wetting of epithelial tissues, 2018).
"to robustly self-organize in polarized organization", please rephrase
"mechanical variable have been used to describe the mechanical behavior of a cell monolayer", please rephrahe, this is way too vague. What are you trying to say?
Why epitheial-like? Why not just epithelial? Are these cells different?
What does "presented cytoskeleton" mean?
3T3 cells are not incompressible. No cell types are. They divide all the time.
You can have radial alignment in a nematic liquid crystal, it is called homeotropic anchoring. It has nothing to do with the symmetry of the liquid crystaline units.
Condensation driven by chemotaxis? I never heard that. See again TRepat et al 2018. The cells are confined in a similar circular island, there is no chemotaxis.
References were not properly cited. As an example ref [11] does not talk about the effect of confinement at all.
About the methods: Manual tracking is passe. There are robust methods to automatically track cells. You are already segmenting the tissue, why not tracking the cells automatically this way?
"The average speed for each cell was calculated as the total migration length of each cell divided by the total time". So if I track the cells for long enough and they diffuse randomly, the average speed is 0? Does not really make sense. For how long were the cells tracked? Are all the trajectories the same length?
About the model: What other types of stress were neglected in the model and why? Especially, if you are trying to model a nematic liquid crystal, why not take into account the nematic elastic stress?
Why nematic-like? This is confusing as is much of the terminology used in this manuscript.
-
Reviewer #2:
This article reports the radial alignment of rat embryonic fibroblasts at the periphery of circular confinement patterns. The authors experimentally isolate that contractility, adhesion and stiffness gradient are necessary to obtain this alignment. They further devise continuum and discrete models, with only two free parameters, to describe the mechanical origin of such cellular arrangements.
The article is an interesting contribution to the field, with the discussion and conclusion well supported by the experimental data. It is further well written, with a good logic.
1) The authors should explain (e.g., in an appendix) how they solve Eqs.(7-9) and how they run their Voronoi simulations (or indicate which solver/package they use if those already exist).
2) A movie showing the formation of the radially aligned cell pattern would be a good addition, even if the dynamics are not discussed in the article. The x,y,t axes should be labelled (with units) in Fig.1-Supp.1.
3) p.17 l.3, "stiffnesses" instead of "substrates"?
4) p.20 l.7, the authors should better explain how Fig.1-Supp.4 supports a homogeneous isotropic contractility.
5) The authors should show some of the images used to extract actin fibers structure (or are these shown in Fig.3?). Is Fig.4-Supp.1 obtained for REF 2c?
6) p.24 l3, the authors may comment on how stiffness anisotropy could be incorporated in their model to explain inner cells' circumferential alignment. The author should plot the structure parameter (k_h) vs radial distance instead of giving a table (Fig.4-Supp.1 and Fig.6-Supp.1); they should use the same origin (the center of the circle) for the radial distance in the ring experiments (x-axis in Fig.6B and Fig.6-Supp.1A vs x-axis in Fig.7 and Fig.7-Supp.1) to facilitate comparisons.
7) The authors should clarify what they mean by "clear boundary junctions" (p.18 l.9) when describing Fig.2D, which is challenging to discern.
8) In Fig.4, are the authors showing the strain or the stretch ratio? It would help to start the y-axis at 0 in Figs.4A-B. At which distance are the radial strain and stress evaluated in Figs.4C-D? Are the pre-stretch ratio and stiffness gradient challenging to evaluate from the experiments (p.20 l.4)? Can the authors comment on the values needed for these model parameters to see radial alignment in the simulations? Are they realistic when compared to the experimental data?
-
Reviewer #1:
The manuscript by Xie et al combines an impressive array of experimental and modeling approaches to study cell morphological changes due to stiffness heterogeneities and contractility.
1) The assumption of a purely elastic process needs substantiation. Fig. 1A shows a dramatic increase in the number of REF2c cells from 24 to 48 hours, suggesting that cells are proliferating. This, together with continuous remodeling of cell-cell contacts, would result in deformations that dissipate elastic energy. Neither modeling approach accounts for this. It would be important for authors to incorporate these behaviors, or to provide evidence that cell proliferation and remodeling are unimportant, and similar between the three cell populations being compared.
2) The assumption that contractility is uniform needs to be substantiated. Work cited (Tambe et al) shows on the contrary that collective cell behaviors exhibit highly heterogeneous active stresses. Experimentally, there are a few potential ways at this. Authors could use the stiffer (1 MPa) micro post cultures, which recreate radial alignment seen on micropatterned PDMS islands, and compute force variations from post deflection. Alternatively, authors could perform short time lapse experiments to measure deformations following treatment with blebbistatin or Y27632. Yet another option would be to perform staining for contractile proteins such as phospho-myosin light chain, GTP-bound RhoA, or others, to confirm they are uniformly distributed despite the heterogeneity of F-actin (although this reviewer is skeptical that such experiments would reveal uniform contractility when F-actin is nonuniform). Finally, if no experimental support is possible, then authors could turn to model simulations to test whether spatial heterogeneities in contractility alter the overall behavior of the system (although, again, this reviewer is skeptical that such simulations would suggest the heterogeneity of contraction is unimportant). In addition to either modeling or experimental support for the assumption that contractility is uniform, authors should provide examples from the literature on related systems that support this assumption.
3) The importance of a stiffness gradient in the cell population is one of the key aspects of this work. However, evidence for the existence of such a gradient is provided only by staining for F-actin, which is insufficient. While F-actin is indeed a key cytoskeletal component in defining the stiffness of cells, the link between intensity of staining and stiffness needs to be proven. Only a single reference is provided, which focused on one specific cancer cell line and the role of stress fibers - a specific configuration of F-actin together with myosin - in stiffening the cell. Moreover, given that F-actin interacts with nonmuscle myosin to form the key contractile machinery of most cell types, heterogeneity in F-actin likely implies heterogeneity in contractility as well. There are also concerns with the measurement of F-actin abundance, including need for statistics on the spatial distribution, and to normalize per cell to reflect variations in F-actin as opposed to simply variations in cell density, which are also present (Fig. 1A). Finally, the F-actin gradient is only shown and quantified when intensities are summed over many samples. It would be important to demonstrate a significant gradient within individual samples, and how it varies across samples.
4) Greater integration between modeling and experiment would strengthen the manuscript. This is particularly true of the continuum model, where it is nontrivial to relate strain and stress to cell shape changes, given that cell shape is not simply an affine elastic deformation owing to stresses acting on it, but instead a response to stresses integrated with cell autonomous behaviors. There is a large body of literature on the alignment of cells relative to the direction of applied static or dynamic stretch. This mechano-responsivity that dictates cell shape is not considered in the present study. Even without considering these complicating cell behaviors, it is not clear how the magnitude of stress or strain relate to the change in cell shape. In addition, authors would ideally make use of the models to pinpoint what underlies the distinct polarization phenotypes between REF2c, REF11, and 3T3 cell types.
5) The importance of cell-cell adhesion is another crux of the story, pointing to differences underlying the various polarization phenotypes. However, the only experimental support for this is via treatment with a calcium chelator, EGTA. Only one reference is provided for this method (#35, Chen et al), yet Chen et al appear not to have used EGTA at all, and instead disrupted E-Cadherin using neutralizing antibodies. This is a much more specific and direct approach that the authors of the present study should consider in place of EGTA. In the absence of this or similarly targeted approaches (RNAi, etc), the authors should include control experiments that demonstrate this rather broad perturbation does not alter contractility or cell-substrate interactions. This could be done at least in part, by using the traction force measurement system the authors have devised. It is particularly important to do so given the importance of calcium for cytoskeletal contraction via calmodulin. A second experiment authors could supplement this with is pharmacologic inhibition of calcium-depdendent contractility, with the hope/expectation that calmodulin-mediated contractility does not predominate this system. Even with these experiments, however, authors need to provide support from published work that this method of disrupting cell-cell adhesion is well established.
6) The system is quite artificial with respect to in vivo conditions in most contexts. This on its own is not a limitation, as such approaches can still be used to reveal fundamental insights into the mechanisms of cell behaviors and interactions, employing approaches that are not feasible in vivo. However, it is important to tie the specific behaviors and outcomes of this study directly to events of developmental, physiologic, or pathologic importance. While authors do broadly invoke these as motivations for the work, the true impact of the findings is not fully realized without more direct links. Further, because the work is largely descriptive, and lacks direct measurement of cell generated forces, it does not truly take full advantage of the artificiality of the system.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
This manuscript is in revision at eLife.
Summary:
The authors study the effect of confinement on the alignment of REF cells confined within circular micropatterned islands. They observed that the cells are aligned perpendicularly to the boundary after 48h, contrary to other elongated cells such as NIH-3T3. After testing several subclones of that cell line, they identified cell contractility and cell-cell adhesion affect the organization of the cells in the circular patterns. They confirmed this finding using drugs that affect contractility and disrupt cell adhesion. Then they compared their results to a continuum model and to a Voronoi model.
Enthusiasm for the work is diminished by the limited experimental support for key assumptions of the conceptual and math models (e.g. existence of stiffness gradient, assumption of uniform contractility, use of calcium chelator to show importance of adhesion). Further, integration of model and experiment could be improved, and some of the narrower assumptions of the models (e.g. omitting cell proliferation, remodeling of cell-cell contacts, and cell-substrate interactions, assuming uniform contractility) need better justification. Also, a clear correlation to specific events in development, physiology, or disease would highlight the broader impact of the work beyond a very specific event in a carefully engineered system. Finally, 3 similar papers came out on arxiv from the Roux group. They should be discussed in the manuscript and cited.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
While this paper develops some useful tools for targeting neurons expressing different isoforms of the FoxP transcription factor, the broad expression of FoxP (~1800 neurons throughout the brain and VNC) makes it challenging to interpret the general motor deficits that result from knocking out FoxP expression during development. The study lacks a structural or physiological link between the low-level genetic manipulations (elimination of FoxP expression) and high-level behavioral phenotypes (abnormal locomotion and landmark fixation).
-
Reviewer #1:
This is an elegant molecular manipulation of the FoxP gene, coupled with anatomical description of the neuronal distribution of isoform expression in the brain and ventral nerve cord of the fly.
Isoform B functional knockouts show behavioral abnormalities in flies' ability to walk toward a dark vertical bar representing naturally attractive landscape features like plant stalks. FoxP isoform B manipulated animals walk slower and are less adept at targeting the dark bar. Knocking out all FoxP isoforms has similar behavioral effects as knocking out FoxP-iB alone.
FoxP is expressed broadly throughout the peripheral and central brain and in the ventral nerve cord, throughout development. Expression within leg motorneurons and the protocerebral bridge of the central complex is required for normal walking visual fixation, which is entirely consistent with what we've been learning about the functional organization of this brain region for spatial navigation.
The problem here is that the conceptual gap between molecular manipulation of the FoxP gene and the behavioral phenotype is wide. Absent any understanding of either the cell physiological mechanisms of action of FoxP, or the function of FoxP-positive neural circuitry involved in the behavior being explored, the advance remains preliminary.
Even in the case where identified neurons that have recently been implicated in bar fixation by walking flies, which the authors demonstrate express at least some FoxP isoforms, broad FoxP knockout had no effect on the behavior. As the work is currently presented, there is not enough resolution between FoxP expression, cell circuit function, and behavior for the work to make a sufficiently compelling case.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
The reviewers thought that the work was of quality and that the paper develops some useful tools for targeting neurons expressing different isoforms of FoxP. However, they also felt that there is a conceptual gap between the molecular manipulation of FoxP and the behavioral phenotype, with little understanding of the mechanisms of action of FoxP and of the function of FoxP in the neural circuitry involved in the behavior.
The broad expression of FoxP in ~1800 neurons makes it challenging to interpret the motor deficits that result from knocking out its expression during development. Although neurons that express FoxP have recently been implicated in bar fixation, the behavioral phenotype of the FoxP knockout is difficult to interpret. Therefore, the integration of FoxP expression, the function of the circuit involving FoxP and the behavior is not sufficiently clear.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The study by Pesoli et al. uses MEG acquisition in sleep deprived participants in order to explore the functional integration derived from MEG source reconstructed connectivity and its potential link to attentive functions. The study is well conducted with an appropriate size to explore global graph measures derived from MEG connectivity.
1) My major concern is that the authors' main claim that MEG connectivity is correlated to attentive function has at best very weak support from the presented data. Though the authors claim in the methods that all analysis were FDR corrected the correlational analysis linking behavior to MEG connectivity does report uncorrected values. E.g. the correlation between Alpha-MEG Degree of Right superior Occipital gyrus bases on a statistical test on 90ROIs x 5 frequency bands x 2 nodal metrics which would result in a Bonferroni threshold of p=0.05/900, the reported p=0.009 is by orders larger than this threshold. This problem applies (on different levels) to all correlations reported in Fig. 6. In order to limit the amount of false positives more stringent statistical thresholding would be needed to analyze the link between connectivity and behavior (a good starting point to solve this issue can be [Makin et al. 2019, elife]). Related to this issue: the hypothesis 'such topological rearrangements would relate to cognitive performance' is highly underdetermined and the authors could stress the strong exploratory character of this study more in both abstract and introduction.
2) The link to previous literature unclear for the connectivity measure used (Phase Linearity Measurement): the authors should shortly address in a paragraph what we should expect e.g when comparing the measure to more frequently used connectivity measures such as amplitude envelope coupling or coherence (Colclough et al. 2016, NeuroImage). What are the differences of the used measure and why did the authors choose this measure instead of a more frequently used measure?
3) I was generally missing a consistent definition of the term integration: why did the authors choose the selected graph metrics to measure integration and how do the graph metrics show that the brain loses integration (like they state in the title of the article). The use of all graph measures should be clearly motivated: why did the authors choose these measures and what are they planning to measure to support their hypothesis?
4) 'In particular, with regards to TS, median reaction times (in ms; median RT) to both repetition and switch trials, and angular transformations of the proportion of errors resulting from the two experimental sessions were submitted to two-factor repeated-measures ANOVA, instead, SC as well as all dependent variables obtained from LCT (number of hits and number of rows completed), were submitted to paired t-test.' This sentence is difficult to understand, I did not understand why in one case you use only posthoc t-tests and in the other case an ANOVA.
5) Data availability: 'All data generated or analyzed during this study are included in the manuscript and supporting files.', the authors should include a more detailed description of where the interested reader can find data and code. Is it available on request or will it be provided in a repository?
-
Reviewer #2:
This study employs the use of MEG to incorporate both spatial and temporal strengths of previous fMRI and EEG studies to uncover the effects of sleep deprivation on brain function. While the motivation is clear, there are some issues with methodology and the writing is difficult to understand in many places.
Introduction:
1) L32-33 This sentence is not clear - 'neuroimaging techniques allowing us to overcome the concept of specific control vs. a distributed property'. Can you use a term like 'distinguish' or 'clarify'?
2) L56-68 It would be better to talk about overall function of neural oscillations (SWA and spindles) during sleep on executive function and memory consolidation (systems consolidation/synaptic downscaling theories), rather than 'increases', as your study does not augment SWA per se. In fact sleep deprivation does augment SWA in the subsequent recovery period as an indicator of sleep pressure/intensity but we wouldn't consider this as beneficial.
3) L100 - Can you briefly explain here why these tasks were chosen - e.g. if they have been used in prior SD work with other imaging modalities.
Results:
1) L-173 - you're not really comparing between two groups... should read conditions
2) L204-216 - correlation assumes independence of observations, here you are combining both T0 and T1 conditions and combining them in 1 plot. This is problematic, also if you split these, some relationships look like they are going in opposite directions (e.g. Fig. 6b). Why not correlate change scores (brain/behavior) with each other?
Discussion:
1) L277 - There is a lot of discussion about the loss of integration measures during SD, however, the leaf fraction which is supposed to indicate integration of the networks is not significant between conditions.
2) L252 - Most of the manuscript is set up for the reader to expect that SD would primarily affect frontal lobes and top-down cognition. However, the findings here are somewhat opposite - occipital regions associated with processing of visual stimuli are the ones that show altered diameter and degree metrics - but the authors claim that bottom up processing does not suffer from the effects of SD (L294). These findings need to be reconciled, and also with prior work.
3) L293 - even if task engagement were a factor, we would not typically expect that participants would perform better after SD (maintained performance might be possible). This could suggest a practice effect at play here - since the first session was always the well-rested session.
Methods:
1) L315 - Can you show in a table descriptives for the actigraphic assessments of sleep the night before the experiment?
2) L378 - disjoint sentence
3) L400 - what does 'on the letter a beamforming procedure was performed' mean?
4) L436 - there appears to be no counterbalancing across conditions here as all participants completed T0 first before T1. This could lead to practice effects confounding some of the interpretations. There is a statement about reduction of learning effects using different parallel forms from the LCT (L330) but it is not clear what this means. Can you show within each session (rested/SD) whether or not you see improvements in performance as the task progressed?
-
Reviewer #1:
In this study, 34 participants underwent 24 hours of sleep deprivation. They performed two tasks (letter cancellation and task switching) before and after sleep deprivation. Graph metrics were computed based on resting-MEG data. The authors showed that participants performed worse in the letter cancellation task after sleep deprivation, but performed better in task switching after sleep deprivation. They showed that certain graph metrics were changed after sleep deprivation and some of these metrics were correlated with task performance changes in task switching, but not letter cancellation.
1) I think it's quite worrisome that participants actually performed better at task switching after sleep deprivation. I wonder if there's a serious flaw in the experimental procedure. One possibility is practice effect since participants performed the task before they were sleep deprived and then performed the task again after sleep deprivation.
2) While the minimal spanning tree (MST) has been used in some papers, it seems to me that the resulting tree might be sensitive to noise. Besides, such pruning does not seem biologically plausible. I would suggest the authors repeat their analyses using more standard approaches, while taking into account potential pitfalls ( https://www.sciencedirect.com/science/article/pii/S105381191730109X )
3) False discovery rate was not reported.
4) It's unclear the sequence of experimental procedure. Perhaps I missed it but were the tasks performed before or after the MEG/MRI acquisition? I only knew the tasks were not performed during MEG because the authors mentioned in the discussion that "the brain measures are made at rest and not during the execution of the task." Seems pretty important to mention this more prominently in the manuscript.
5) The title states that "Loss of integration of brain networks after one night of sleep deprivation underlies worsening of attentive functions". However, the authors' results contradict the title, since network measures did not correlate with worse letter cancellation task (LCT) performance, but correlated with better task switching performance! The same issue is present in the abstract, where the authors state that "brain network changes due to SD selectively impaired attention", yet the authors reported that "LCT performance and NASA score were not correlated with topological data".
6) It's hard to follow the results section without first reading the methods section. This is fine if the methods section was before the results section. However, in this manuscript, the results section was before the methods section. Therefore, the authors should provide more methodological overview in the results section. For example, graph theoretic terms like BC and Diameter in Alpha were used in the results section with no explanation.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
This study utilizes MEG to study the effects of sleep deprivation on functional network integration, attention and task-switching. The strength of this study is that this is perhaps the first MEG sleep deprivation dataset and thus, the community would benefit from this data. However, the reviewers felt that there were potentially serious issues with the study design and statistical analyses. More specifically, the improvement in task switching performance after sleep deprivation might simply be due to practice effects. Without counterbalancing T0 and T1, it is unclear how this issue could be resolved. Furthermore, there were concerns about the pooling of T0 and T1 conditions in the correlations with KSS and task performance, as well as issues with multiple comparisons correction.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Holmgren et al describe a novel model of reversible mechanical damage to zebrafish neuromast hair cells. The authors demonstrate that when zebrafish are exposed to strong currents, neuromast morphology, hair cell number, innervation, and MET function suffer various types and degrees of damage, from which the NMs recover within 2 days. Additionally, they show macrophage recruitment to damaged neuromasts, where they may be phagocytosing synaptic debris. Based on various mechanistic and phenotypic commonalities (involvement of ROS, stereocilia and synapse phenotype), the authors argue that this model is a good approximation of noise-induced hair cell damage in mammals.
Overall impact:
This reviewer agrees that a "noise" damage model in the zebrafish would be a powerful tool to better understand the mechanisms underlying noise-induced hearing loss. However, due to various weaknesses of the data (detailed below), the main claims of the paper are not sufficiently supported. In addition, noise-induced hearing loss has been previously modeled in the zebrafish model. The present model, therefore, does not provide a significant methodological innovation.
Major concerns:
1) As the authors point out, zebrafish hair cells can be regenerated. With that in mind, and to make the relevance for mammalian hair cell repair clear, a clear distinction between mechanisms mediated by "repair" or "regeneration" needs to be made. The authors discuss that proliferative hair cell generation can be excluded based on the short time period, but suggest that transdifferentiation might be involved. Recovery of NM hair cell number occurs within the same 2 hour period in which NM morphology and hair cell function improved, making it difficult to determine the extent to which "regeneration" contributed to the recovery. The amount of transdifferentiation has to be shown experimentally (lineage tracing?).
2) The classification of "normal" vs "disrupted" is vague and not quantitative. The examples shown in the paper seem to be quite clear-cut, but this reviewer doubts that was the case throughout all analyzed samples. Formulate clear benchmarks and criteria for the disrupted phenotype (even when blind analysis is performed).
3) Sustained and periodic exposure: These two exposure protocols not only differ with respect to sustained vs periodic, they also differ in total exposure time (Fig 2B). This complicates the interpretation, especially considering the authors own finding that a pre-exposure is protective.
4) The data on the mitochondrial ROS aspect seems not well integrated into the overall story.
5) It is surprising that the hair bundle morphology was not assessed after recovery. This is crucial. Overall, it would be good to see some quantification of the SEM data, e.g. kinocilia length and number of splayed bundles.
6) Behavioral recovery (measured as number of "fast start" responses) was also not assessed. This is essential for determining the functional relevance of the recovery.
7) This reviewer is not yet convinced that this damage model displays enough commonalities to mammalian noise damage to justify the ubiquitous use of the term "noise" throughout the manuscript. It would be more prudent to use a more careful term along the lines of "mechanical overstimulation-induced damage".
8) Overall, there was a lack of experimental and analysis detail in the results section. For example, how was afferent innervation quantified? Just counting GFP labeled contacts to hair cells? There was also inconsistency in the use of two variations of the mechanical damage protocol, the time points at which repair was assessed, and whether the damage was quantified in all neuromasts or in normal vs. disrupted neuromasts separately, making the data difficult to interpret.
-
Reviewer #2:
Holmgren et al. describe the development of a model for hair cell noise damage using the zebrafish lateral line line system. Using an electrodynamic shaker, the authors induce quantifiable damage and death of hair cells after a two-hour treatment. They describe gross morphological changes of hair cells, changes in innervation and synapse distribution. In addition they describe disruption of stereocilia and kinocilia, as well as reduced mechanotransduction-dependent uptake of FM1-43 dye. Damage is no longer detectable several hours after insult, demonstrating recovery.
1) While the findings are carefully measured and described, the effects of insult on hair cells are relatively minor, with a change in hair cell number, extent of innervation or synapses per hair cell (Figs 3 and 4) in the range of 10% reduction compared to control. One potential value of the model would be to use it to discover underlying pathways of damage or screen for potential therapeutics. However with these modest changes it is not clear that there will be enough power to determine effects of potential interventions.
2) The most dramatic phenotype after shaking is a physical displacement of hair cells, described as disrupted morphology. However it is not clear what the underlying cause of this change. Are only posterior neuromasts damaged in this way? Is it a wounding response as animals are exposed to an air interface during shaking? It is also not clear to what extent this displacement reveals more general principles of the effects of noise on hair cells. Additional discussion of underlying causes would be welcome.
3) Because afferent neurons innervate more than one neuromast and more than one hair cell per neuromast, measurements of innervation of neuromasts (Figure 3) or synapses per hair cell (Fig 4) cannot be assumed to be independent events. That is, changes in a single postsynaptic neuron may be reflected across multiple synapses, hair cells, and even neuromasts. This needs to be accounted for in experimental design for statistical analysis.
4) The SEM analysis provides compelling snapshots of apical damage, but could be supplemented by quantitative analysis with antibody staining or transgenic lines where kinocilia are labeled. The amount of reduced FM1-43 labeling is one of the more dramatic effects of the shaking insult, suggesting widespread disruption to mechanotransduction that could be related to this apical damage. Further examination of the recovery of mechanotransduction would be interesting.
5) A previous publication by Uribe et al.2018 describes a somewhat similar shaking protocol with somewhat different results - more long-lasting changes in hair cell number, presynaptic changes in synapses, etc. It would be worth discussing potential differences across the two studies.
-
Reviewer #1:
In the manuscript titled "Mechanical overstimulation causes acute injury followed by fast recovery in lateral-line neuromasts of larval zebrafish" by Holmgren et al., the authors develop a method to overstimulate hair cells and determine some of the consequences of this overstimulation. The overarching goal of this work is to develop a model for noise-induced hair-cell damage in the zebrafish. The authors use the lateral line for their studies and stimulate hair cells using an electrodynamic shaker which generate significant aqueous agitation. The authors demonstrate physical damage to hair cells of the lateral line that are dependent on the position of the neuromast. The damage includes alteration of afferent synapses, afferent neurite retraction, limited damage to hair bundles and a decrease in mechanotransduction. After damage, they show macrophage recruitment and quick recovery of hair cell neuromasts, which is surprising.
The paper is interesting in that it brings a new capacity to the zebrafish animal model: mechanical overstimulation of the hair cell. Tempering this is a general feeling that the authors do not dig deep enough in the current form of the manuscript, but this could be remedied. More specifically, the authors are making a model in zebrafish for noise-induced damage, so they need to show that this model is similar to mammals in the way hair cells are damaged. This is done in the manuscript, but it is limited and should be expanded as suggested below.
Major comments
1) The authors use a vertically-oriented Brüel+Kjær LDS Vibrator to deliver a 60 Hz vibratory stimulus to damage lateral line hair cells. It is not made clear on why this frequency was selected. Did the authors choose this frequency because they screened a number of frequencies and this is the one that did the most damage to hair cells or was it chosen for another reason? Or, do all frequencies do the same amount of damage? The authors should screen a number of frequencies and choose the stimulus that does the most damage to hair cells. This would set the field in the best direction, should members of the community attempt this new technique. It is not necessary to repeat all of the experiments, but the authors should show which frequencies are best for inducing damage.
2) The SEM images of the hair bundle are beautiful and do show damage to the hair bundle, but historically speaking older studies in mammals have shown that the actin core of the stereocilia is damaged. It would be critical to know if this was the case. Showing damage to the kinocilium and stereocilia splaying is a start, but readers would need to know if the actin cores are damaged. So, TEM should be used to find damage to the actin cores of stereocilia.
3) I think the use of "Noise-exposed lateral line" as a term for mechanically overstimulated lateral line hair cells is not correct and could be misleading. The lateral line senses water motion not sound as the word noise would imply. Calling the stimulus "noise" should be removed throughout.
4) Decreases in mechanotransduction are shown by dye entry. These results should be strengthened using microphonic potentials to determine the extent of damage. This experiment is not necessary but would improve the quality of the document.
5) In figure 2, PSD labeling is not clear.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript. Doris K Wu (NIDCD, NIH) served as the Reviewing Editor.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The paper by Fair (Gilad) and colleagues examined the determinants of gene expression variation within human and chimpanzee populations. Studies focused on an analysis of left ventricle in 39 chimpanzees and 39 human samples. The authors first developed a strategy to measure "dispersion", or gene expression variance after regressing out the effects of mean expression. This metric of dispersion was correlated between human and chimpanzee in most genes, but there were substantial differences between species that could not be explained by changes in mean expression level. Highly dispersed genes were enriched for genes with a higher amino acid divergence, TATA boxes, and cellular composition. In fact, the authors found that changes in cellular composition between samples were highly correlated with expression dispersion, wherein genes that were markers of specific cell populations were highly dispersed. Analysis of eQTLs discovered that genes which are variable based on eQTLs in one species were enriched for eQTLs in the other.
Overall, there are many good things about this paper. The data will be of broad utility to the comparative genomics community: the authors added RNA-seq data from the left ventricle of 21 chimpanzees and high coverage complete genomes from 39. The calculation of power for discovering differentially expressed genes as a function of sample size at the beginning of the paper is a thoughtful analysis that is useful to many in the community. As I have come to expect from these authors, all of the analyses are extremely thorough and well-executed. The statistical tests are appropriate and rigorous. Results are interpreted in a conservative fashion.
The main issue is that the authors are not able to conclusively disambiguate between different causes of dispersion. Genetics, cell type, and technical variation may all contribute to dispersion. The authors state this very clearly throughout the manuscript. In part, this may reflect the authors' underselling their results somewhat. But in part, this really does reflect reality: Cell type is a major confounder that may provide false signals in other analyses.
Major comments/ suggestions:
1) Did the authors test directly whether eQTLs were enriched in genes with a high dispersion? I could not find this going back through the paper. This seems almost trivially likely to be true. I may have missed this result? Or did the authors worry this is too likely to be confounded with cell type? Either way, this seems like a result that may be useful to show even if the authors did acknowledge that it was likely to be confounded.
2) Did the authors consider looking for cell-type QTLs? They state several times in the paper the possibility that genetic factors may influence cell types. They have enough data - at least in humans - to obtain QTLs for specific cell types, as others have done (Marderstein et. al. Nat Comms 2020; Donovan et. al. Nat Comms 2020). If these cell type QTLs were enriched near genes with a high dispersion, this may bolster the author's argument that genetic factors underlie dispersion by affecting cell type composition.
3) The scRNA-seq reference used for estimating cell types in heart tissue was derived from mice. Could this lead the authors to underestimate the degree to which cell types drive dispersion in genes that are variable between human and chimp? Genes that are variable between human/ chimp may also be more likely to be variable between either species and mouse, and perhaps this variability has led to them becoming more/ less of a marker of a specific cell population (and hence their dispersion in primates does not correlate with cell type specificity in mouse).
4) Have the authors tried estimating dispersion on top of what is expected based on differences in cell type? There are several strategies that might work for this: There are new strategies for estimating a posterior of cell type specific expression from a bulk sample, conditional on scRNA-seq data as prior information (Chu and Danko, bioRxiv, 2020). These cell type specific expression estimates could then be analyzed for dispersion. Alternatively, it may also work to regress the estimated proportion of each cell type out of the dispersion estimates. While there are certainly a lot of pitfalls with using these strategies, especially in the setting shown here (all of this would work better if there were species matched reference data), they might provide an avenue for depleting the contribution of cell type differences from dispersion estimates.
5) Can the authors add a dotted line to show the shape of the distribution for genes with low dispersion, or where dispersion is shared in both human and chimpanzee, in figure 4b? Is this different from genes that are dispersed in either chimp or human?
6) Type. pp. 20. "... in only in ..."
-
Reviewer #2:
In this study, Fair et al. focused on assessing inter-individual variability in gene expression, which has been shown to be heritable and associated with disease susceptibility. More specifically, unlike many studies focused on mapping associations between genetic and gene regulatory variation, authors paid attention to the group dispersion/variance of gene expression among samples as well as the evolutionary processes that shape the differences in gene regulation between individuals in humans or any other primate. Using computational deconvolution, they found that cell-type heterogeneity determines expression variability in both species. They also found a significant overlap of orthologous genes associated with eQTLs in both species. They concluded that gene expression variability in humans and chimpanzees often evolves under similar evolutionary pressures. The manuscript, in general, is well prepared. For example, authors put supplementary figures within the main text whenever they are supposed to be, which is convenient. The authors collected data from 39 human vs. 39 chimp primary heart tissue samples. The sources of human samples include 11 (old study)+28 (GTEx) and chimp samples 18 (old)+21 (new). Twenty-one new specimens are generated specifically for this study. This study involves a large number of tests, but the main problem is the lack of a coherent central hypothesis.
Major comments:
1) The first test authors conducted is to identify differentially variable (DV) genes. A total of 2658 DV genes were identified. The problem of the result is that almost equal number of up- and down-regulated DV genes symmetrically distributed around DV=0. Often, this is an indication of a lack of biological signals in data analysis. This might be due to the pooling of gene groups with diverse functionality together. Therefore, this reviewer suggests that authors should break down genes into subgroups to detail the up and down-regulatory patterns with the hope that some of the gene groups give interpretable results
2) The second test is to correlate the higher coding sequence conservation with lower dispersion. Again, the positive result is not unexpected. There are many indirect and/or confounding factors that may explain the effect. This reviewer, however, understands it is impossible to control them all (also authors have attempted to address some of them in the next few tests). However, here it is better to add exploratory analyses for genes in different functional groups and also give examples of outlier genes that do not follow the rule.
3) The third test is to examine the correlation between gene expression variability with single-cell type heterogeneity of samples. Authors first used Tabula Muris dataset to show dispersion is strongly correlated with cell-type specificity/diversity. If this is true, then the point that authors really wanted to demonstrate is, in fact, hampered. Authors might really want to show the "true" single-cell variability (see, for example, PMID: 31861624) is correlated with the level of group variance of gene expression.
4) The fourth test authors conducted is to show that dn/ds and pn/ps ratios of genes are correlated with gene expression variability (variance). However, because of the existence of heterogeneity of cell-type composition in samples, any correlation observed may be utterly biased by this single uncontrollable confounding factor. Furthermore, heart tissues contain an over-abundant expression of genes encoded in the mitochondrial genome. The expression level of these mt-genes may vary substantially between samples and reflect the health status of primary sample donors. PEER normalization may have to take this into account as a covariant.
5) Several other tests authors performed are around eQTLs (eGene overlap and eSNP overlap) between the two species. These are typical tests evolutionary biologists usually try to do whenever data is available. However, the issues with these types of tests are the low power in general. More importantly, in order to be consistent with previous tests which are all around the explanation of gene expression variance, this part should address the overlap between expression vQTLs in humans and chimps.
-
Reviewer #1:
This is a solid study, with a large sample size, identifying quantitative trait loci (eQTLs) in humans and chimpanzees, using gene expression data from primary heart samples. The authors complemented the analysis of gene expression with a comparative eQTL mapping, as opposed to relying on mean expression levels, as most studies like this one do.
1) I would like to see more discussion about the inter-relatedness of the chimpanzees in the analysis of gene expression. Is that contributing to the power of the DE analysis, which has really high numbers of DE genes. That may certainly be due to the large samples size, but should be addressed. Related to that, the support that the gene-wise dispersion estimates are well correlated in humans and chimpanzees overall (Fig1C, and S4) seems qualitative. It looks like the chimpanzees might have less dispersion overall?
2) What do the authors think these findings mean for study systems outside of humans and captive chimpanzees? Both on the technical level (e.g. sample size), and for how their approach could be helpful outside of these species. Generalizing this approach would broaden the impact and audience of the paper.
3) Just a comment that I appreciated the thoughtfulness of the possible technical confounds in the results and discussion.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
This is a solid study, with a large sample size, identifying quantitative trait loci (eQTLs) in humans and chimpanzees, using gene expression data from primary heart samples. The authors complemented the analysis of gene expression with a comparative eQTL mapping, as opposed to relying on mean expression levels, as most comparative studies like this one do. Also unlike many studies focused on mapping associations between genetic and gene regulatory variation, the authors paid attention to the group dispersion/variance of gene expression among samples as well as the evolutionary processes that shape the differences in gene regulation between individuals. The calculation of power for discovering differentially expressed genes as a function of sample size at the beginning of the paper is a thoughtful analysis that is useful to many in the community. All of the analyses are extremely thorough and well-executed. The statistical tests are appropriate and rigorous. Results are interpreted in a conservative fashion.
The main limitation is that the authors are not able to conclusively disambiguate between different causes of dispersion. Genetics, cell type, and technical variation may all contribute to dispersion. The authors state this very clearly throughout the manuscript. In part, this may reflect the authors' underselling their results somewhat. But in part, this really does reflect reality: Cell type is a major confounder that may provide false signals in other analyses.
-
-
-
Reviewer #2:
General assessment:
This work utilizes two Spiroplasma populations as the materials to study the substitution rates of symbiotic bacteria. A major finding is that these symbionts have rates that are ~2-3 orders higher than other bacteria with similar ecological niches (i.e., insect symbionts), and these substitution rates are comparable to the highest rates reported for bacteria and the lowest rate reported for RNA virus. Based on these findings, the authors discussed how this knowledge could be used to infer and to understand symbiont evolution. The biological materials used (i.e., symbionts maintained in fly hosts for 10 years and cultivated outside of the host for > 2 years) are valuable, the technical aspects are challenging, and the answers obtained are certainly interesting. The key concern is the limited sampling of other bacteria for comparison to derive the conclusions.
Major comments:
1) The key concern regarding sampling involves several points. (a) The two populations represent the species Spiroplasma poulsonii. Is this species a good representative for the genus? Or is it an exception because it is a vertically inherited male-killer? Most of the characterized Spiroplasma species appear to be commensals and are not vertically inherited. (b) The other species with a comparable rate is Mycoplasma gallisepticum (i.e. a chicken pathogen that spreads both horizontally and vertically). Mycoplasma is a polyphyletic genus with three major clades. While closely related to Spiroplasma, their hosts and ecology are quite different. Do all three groups of Mycoplasma have such high rates? If so, are the high rates simply a shared trait of these Mollicutes and has nothing to do with the distinct biology of S. poulsonii? How about other Mollicutes (e.g., Acholeplasma and phytoplasmas). (c) The group "human pathogens" in Fig. 2 show rates spreading across four orders of magnitude. This is too vague. How many species are included in this group? Are their rates linked to their phylogenetic affiliations? (d) Did Fig. 2 provide comprehensive sampling of bacteria? How about DNA viruses? Michael Lynch has done extensive works on mutation rates (e.g., DOI: 10.1038/nrg.2016.104), some of those should be integrated and discussed.
2) This study is based on two lab-maintained populations. How may the results differ from natural populations? I understand that no estimate may be available for natural populations and additional experiments may not be feasible, but at least a more in-depth discussion should be provided.
3) The authors use adaptation as a key explanation for several of the findings. Stronger support and alternative explanations are needed. For example, why genome degradation may be used as a proxy for host adaptation (line 497)? If this explanation works only for sHy but not the other strain within the same species (i.e., sNeo), is this still a good explanation? Similarly, for the arguments made in lines 524-528, supporting evidence should be presented in the Results. For example, what are the rate distribution of all genes? Do those putative adaptation genes have statistically higher rates and/or signs of positive selection?
4) The chromosome and plasmids have very different rates (lines 315-316). Since this study aims to compare across different bacteria, perhaps the analysis should be limited to chromosomes for all bacteria.
5) Formal statistical tests should be performed to test the stated correlations (e.g., lines 360-361, genome size and the number of insertion sequences).
6) Fig. 5. The differences in CDS length distribution should be investigated and discussed in more details. The authors stated that they have re-annotated all genomes using the same pipeline, so this finding cannot be attributed to the bioinformatic tools. If these findings are true (rather than annotation artifacts), it is quite interesting. How to explain these? Why is Sm KC3 so different from all others?
7) Lines 467-479. Multiple lineages have purged the prophages is an interesting hypothesis and may be important in furthering our understanding of these bacteria. More detailed info (e.g., syntenic regions of prophage sites across different species) should be provided in the Results to support the claim. Perhaps the sampling should be expanded to include the Apis clade (i.e., the clade with the highest number of described species within the genus) to test if the prophage invasion occurred even earlier or independently in multiple lineages. Additionally, CRISPR/Cas systems are known to have variable presence across Spiroplasma species (DOI: 10.3389/fmicb.2019.02701). How does this correspond to prophage distribution/abundance?
Minor comments:
1) Lines 32, 517, and possibly other parts: Use "increased" or "decreased" to describe the rate differences are inappropriate because these imply inferences of evolutionary events after divergence from the MRCA, which are clearly not the case. It would be more appropriate to use "higher" or "lower" to describe the difference.
2) Lines 31-32. This is too vague. For the rates, the description should be more explicit (e.g., higher by X orders of magnitude). The term "symbiont" is also vague. Broadly speaking, all human pathogens (included in Fig. 2) or plant-associated bacteria could be considered as symbionts as well. Would be better to define this point more clearly.
3) Fig. 1. The alignment is off. For example, June should be located near the middle between two tick marks.
4) Line 207. This is confusing. There should not be 6 circular chromosomes.
5) Line 211. Why is the hybrid assembly more fragmented?
6) Methods and Results. More detailed information regarding the sequencing and assembly should be provided. For example, how much raw reads were generated for each library? What are the mapping rates? How much variation in observed coverage across the genome?
7) Lines 341-342. How to establish an expected level of synteny conservation?
8) Line 487. I do not see how this statement could be supported by Fig. 5. Also "less pronounced" is vague.
-
Reviewer #1:
The paper has potential. It's not there yet.
The paper presents a sequencing study describing the evolution of Spiroplasma over various years in lab cultures. Spiroplasma is a fascinating bacteria that induces some unique phenotypes including enhancing insect immunity or "protection" and male-killing. The premise for the study was that sometimes these phenotypes disappear in cultures and thus the bacteria is likely quickly evolving and subject to frequent mutation. The researchers sequence various cultures of Spiroplasma (sHy and sMel), assemble and annotate genomes, compare the genomes, quantify the rates of evolution and compare these rates to some other studies on viruses, human microbiota/pathogens, and wolbachia. They find that Spiroplasma evolve real fast and speculate that the mechanism for this is a lack of various Mut repair enzymes. They look at fast evolving proteins of interest including RIP toxins which kill nematodes and spaid which is an inducer of male killing. So essentially the big result here is that Spiroplasma evolves real fast.
In my opinion the paper is weak in a few senses. It doesn't reflect hypothesis driven science. It's mostly observational data and the researchers do not test any hypotheses. Now I don't think this is a deal breaker, but I do think it weakens the paper. Also, my comment should not imply that there isn't valuable data herein; and in fact I think the other big weakness is that the researchers do NOT exploit the true value of the data to derive and test novel hypotheses.
For example: one aspect I was most excited about was to see how the researchers dissect and annotate evolutionary differences induced by axenic culture systems. The authors have the ability to compare and contrast genomes of Spiroplasma cultured in host insects AND Spiroplasma cultured without insects in axenic culture. Within these genome comparisons are likely novel insights that could shed light on mechanisms of maternal transmission and mechanisms of cell invasion etc... However, I was shocked to see that there is no in-depth analysis of specific proteins that are changing and evolving in these two diverse culture systems. I thought the analysis was entirely insufficient and didn't extract or present the real value of the datasets here. There are some brief mentions in the discussion of adherin binding proteins, but that was essentially it. I think the researchers focused too much on the past, ( the RIP toxins and spaid) rather than pointing out new interesting genes and hypotheses about them.
For example: Maternal transmission would no longer be required in axenic culture, what genes got mutated? This is perhaps the most interesting thing that is not even touched upon.
So essentially my main criticism is the added value from this paper which is the potential ability to compare symbiont genomes in hosts to symbionts with Axenic culture was NOT exploited. Given the novelty and impact of the axenic culture studies by Bruno, I would have hoped to see this upfront.
Also there are some paragraphs comparing broad genomic differences between sHy and sMel, but I didn't think the differences in how these genomes evolved over time in comparison to their earlier selves was emphasized or explained in enough detail.
Another example of not exploiting the value of the data: The plasmids are usually where much of the action is in microbes. There should be detailed annotations and figures of the plasmids. Tell me what is on them. Tell me which genes are evolving. Tell me if there are operons. Tell me what pathways are in the plasmids. I found the discussions of plasmid results wholly lacking. I also inherently felt that discussions of plasmids should be kept completely separate from discussions of chromosome evolution, regardless of similar rates of evolution or not... Plasmids are unique selfish entities and I imagine their evolution is wholly distinct from the evolution of chromosomes. They deserve their own sections and figures (in my opinion).
The figure legends are completely insufficient and they ask me to read other papers to understand them, which is annoying.
Other minor comments:
What about presence/absence of recA?
There are differences in dna extraction prior to genome sequencing for each of the strains. I suspect this is because different individuals sequenced different genomes. But I worry that different protocols could produce different results and therefore a comparison might be tainted by dna extraction and library prep specifics. Can you at least explain to the reader why this is not an issue, if it is not an issue?
Examples:
181 - why were heads removed? Why was this dna extraction protocol here different from the hemolymph extraction protocol? Might this have changed anything?
195 - how much heterogeneity do you expect in any given fly. Do you have SNP data differences amongst good reads that could point out different alleles within a Spiroplasma population within an individual fly? It would be interesting to know which genes have a large amount of different alleles.
199 - another DNA extraction protocol. There isn't consistency here. If the reads and coverage are good enough, it shouldn't be a problem. But if there were data issues or assembly issues, this would raise concern in my mind. Can the researchers discuss or alleviate concerns here? Some assemblies have 6 chromosomes, some have 3 chromosomes. I presume these were different strains of Spiroplasma and not the same one?
Figure 1: were the samples that are 6 years apart (red) sequence in exactly the same way with the same technology? Could this produce any relics? Also, why display information for sMel in a table and information for sHy in a figure? Can't you creatively standardize a visual means of showing this information and compile information to one item?
I wonder what would happen if you took the same sample and did different DNA extraction protocols, different library prep protocols, and different illumina rounds of sequencing and independent algorithm assemblies... how much would they come out the same? Has anyone ever done this experiment? Is there any reference for this control that shows they would in fact come out the same? This is essentially what I am worried about here. This could be a minor issue, if the researchers could just confidently explain why this is NOT an issue.
Line 30 - you introduce sHy and sMel without defining what they are yet? Clarify immediately that they are both S.poulsoni
line 247 - They found fragmented genes with orthofinder, if it was less than 60% length homology... why set an arbitrary cutoff of 60? Anything less than 100 is possibly a pseudogenization if the last amino acid is important, or the C-terminus is important, which it often is... What is the rationale here?
To quantify an evolutionary rate, I read that they counted the number of changes in 3rd codon wobble positions/year. Why just wobble codons... why not all SNPs period? But then in the figure 2, it seemed like they are tallying a percentage of a total 100% = 570 "variants" or changes in the sequences (I wouldn't use the word variants, as this makes me think of strains; better to say "changes", no?). These changes include snps, insertions, deletions, and "complex"... no idea what complex is? The figure legends are completely insufficient. And I still don't know if you are tallying in some kind of number of recombinations and psuedogenizations into the mix (I assume these are included in the frame-shifts)? The quantification is murky to me.
The adhesin proteins are evolving fast. But aren't Spiroplasma commonly intracellular... so why would it be binding an extracellular protein? ... can you discuss this? I presume invasion or something?
There might be a correlation with genome size and speed of evolution. You mention this in the discussion, but briefly. Can you elaborate on this, especially because Spiroplasmas are close to mycoplasmas which are REALLY small genomes.
Figure 3 is really confusing. I assume FS is frameshift, is IF induced fragmentation? After about 10 minutes I could decode it. Is this really the best way to think about these results? Perhaps? But perhaps not? ARP? I think it's adhesin stuff, but you don't say this until later.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript. Vaughn S Cooper (University of Pittsburgh) served as the Reviewing Editor.
Summary:
This work uses Spiroplasma to study the substitution rates of symbiotic bacteria, which are ~2-3 orders higher than other insect symbionts, and approaching rates reported for viruses. The use of symbionts maintained in fly hosts for 10 years and cultivated outside of the host for > 2 years are valuable, and the study is interesting. The key concern is the limited sampling of other bacteria as comparative taxa to derive the conclusions. This makes the report somewhat premature. Further analyses of existing data are also required. Equally important, the study needs to be better placed in the context of what's known about mutation rates varying as a function of effective population size, to better locate this study in the broader literature on the evolution of mutation rates.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Bola and colleagues set out to test the hypothesis that vOT domain specific organization is due to the evolutionary pressure to couple visual representations and downstream computations (e.g., action programs). A prediction of such theory is that cross-modal activations (e.g., response in FFA to face-related sounds) should be detected as a function of the transparency of such coupling (e.g., sounds associated with facial expression > speech).
To this end, the Authors compared brain activity of 20 congenitally blind and 22 sighted subjects undergoing fMRI while performing a semantic judgment task (i.e., is it produced by a human?) on sounds belonging to 5 different categories (emotional and non-emotional facial expressions, speech, object sounds and animal sounds).The results indicate preferential response to sounds associated with facial expressions (vs. speech or animal/objects sounds) in the fusiform gyrus of blind individuals regardless of the emotional content.
The issue tackled is relevant and timely for the field, and the method chosen (i.e., clinical model + univariate and multivariate fMRI analyses) well suited to address it. The analyses performed are overall sound and the paper clear and exhaustive.
1) While I overall understand why the Authors would choose a broader ROI for multivariate (vs. univariate) analyses, I believe it would be appropriate to show both analyses on both ROIs. In particular, the fact that the ROI used for the univariate analyses is right-hemisphere only, while the multivariate one is bilateral should be (at least) discussed.
2) The significance of the multivariate results is established testing the cross-validated classification accuracy against chance-level with t-tests. Did these tests consider the hypothetical chance level based on class number? A permutation scheme assessing the null distribution would be advisable. In general, more details should be provided with respect to the multivariate analyses performed, for instance the confusion matrix in Figure 5B is never mentioned in the text.
3) I wonder whether a representational similarity approach could be useful in better delineating similarity/differences in blind vs. sighted participants sounds representations in vOT. Such analysis could also help further exploring potential graded effects: i.e., sounds associated with facial expression (face related, with salient link to movement) > speech (face related, with less salient link with movement) > animals sounds (non-human face related) > object sounds (not face related at all). The above-mentioned confusion matrix could be the starting point of such investigation.
-
Reviewer #2:
The study by Bola and colleagues tested the specific hypothesis that visual shape representations can be reliably activated through different sensory modalities only when they systematically map onto action system computations. To this aim, the authors scanned a group of congenitally blind individuals and a group of sighted controls while subjects listened to multiple sound categories.
While I find the study of general interest, I think that there are main methodological limitations, which do not allow to support the general claim.
Main concerns
1) Auditory stimuli have been equalized to have the same RMS (-20 dB). In my opinion, this is not a sufficient control. As shown in Figure 3 - figure supplement 1, the different sound categories elicited extremely different patterns of response in A1. This is clearly linked to intrinsic sound properties. In my opinion without a precise characterization of sound properties across categories, it is not possible to conclude that the observed effects in face responsive regions (incidentally, as assessed using an atlas and not a localizer) are explained by the different category types. On the stimulus side, authors should at least provide (a) spectrograms and (b) envelope dynamics; in case sound properties would differ across categories all results might have a confound associated to stimuli selection.
2) More on the same point: the authors use the activation of A1 as a further validation of the results in face selective areas. Page 16 line 304 "We observed activation pattern that was the same for the blind and the sighted subjects, and markedly different from the pattern that was observed in the fusiform gyrus in the blind group (see Fig. 1D). This suggests that the effects detected in this region in the blind subjects were not driven by the differences in acoustic characteristics of sounds, as such characteristics are likely to be captured by activation patterns of the primary auditory cortex." It is the opinion of this reader that this control, despite being important, does not support the claim. A1 is certainly a good region to show how basic sound properties are mapped. However, the same type of analysis should be performed in higher auditory areas, as STS. If result patterns would be similar to the FFA region, I guess that the current interpretation of results would not hold.
3) Linked to the previous point. Given that the authors implemented a MPVA pipeline at the ROI level, it is important to perform the same analysis in both groups, but especially in the blind, in areas such as STS as well as in a control region, engaged by the task (with signal) to check the specificity of the FFA activation.
4) I find the manuscript rather biased with regard to the literature. This is a topic which has been extensively investigated in the past. For instance, the manuscript does not include relevant references for the present context, such as:
Plaza, P., Renier, L., De Volder, A., & Rauschecker, J. (2015). Seeing faces with your ears activates the left fusiform face area, especially when you're blind. Journal of vision, 15(12), 197-197.
Kitada, R., Okamoto, Y., Sasaki, A. T., Kochiyama, T., Miyahara, M., Lederman, S. J., & Sadato, N. (2013). Early visual experience and the recognition of basic facial expressions: involvement of the middle temporal and inferior frontal gyri during haptic identification by the early blind. Frontiers in human neuroscience, 7, 7.
Pietrini, P., Furey, M. L., Ricciardi, E., Gobbini, M. I., Wu, W. H. C., Cohen, L., ... & Haxby, J. V. (2004). Beyond sensory images: Object-based representation in the human ventral pathway. Proceedings of the National Academy of Sciences, 101(15), 5658-5663.
-
Reviewer #1:
Bola and colleagues asked whether the coupling in perception-action systems may be reflected in early representations of the face. The authors used fMRI to assess the responses of the human occipital temporal cortex (FFA in particular) to the presentation of emotional (laughing/crying), non-emotional (yawning/sneezing), speech (Chinese), object and animal sounds of congenitally blind and sighted participants. The authors present a detailed set of independent and direct univariate and multivariate contrasts, which highlight a striking difference of engagement to facial expressions in the OTC of the congenitally blind compared to the sighted participants. The specificity of facial expression sounds in OTC for the congenitally blind is well captured in the final MVPA analysis presented in Fig.5.
-The use of "transparency of mapping" is rather metaphorical and hand-wavy for a non-expert audience. If the issue relates to the notion of compatibility of representational formats, then it should be expressed formally.
-The theoretical stance of the authors does not clearly predict why blind individuals should show more precise emotional expressions in FFA as compared to sighted - as the authors start addressing in their Discussion. In the context of the action-perception loop, it is even more surprising considering that the sighted have direct training and visual access to the facial gestures of interlocutors, which they can internalize. Can the authors entertain alternative scenarios such as the need to rely on mental imagery for congenitally blind for instance?
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 3 of the manuscript.
Summary:
While the work addresses an interesting research question, several shortcomings have been raised by three independent reviewers. A first issue is the lack of theoretical clarity and linkage with prior work, as discussed by Reviewer 1 and Reviewer 2. A second critical set of concerns is raised by all reviewers with the need for several additional analyses to nail down the interpretations proposed by the authors. Reviewer 2 specifically raised concerns regarding the interpretability of activation in auditory cortices, while Reviewer 3 provides insights on the MVPA analysis and suggests the possible use of RSA to clarify the main findings.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Many of the genes whose expression is induced by the integrated stress response (ISR) encode aminoacyl tRNA synthetases. Why is expression of so many synthetases enhanced in the ISR and what is the functional significance of this induction are important unresolved questions. This manuscript focuses on the tyrosyl tRNA synthetase, which is induced by the ISR in response to different stress conditions. The study suggests that induced expression of TyrRS in response to oxidative stress leads to nuclear localization of the enzyme where it then binds to DNA targets and recruits key transcription factors that control selected gene expression that ultimately controls protein synthesis levels late in the ISR. The TyrRS dampening of translation late in the ISR apparently occurs independent of the levels of eIF2 phosphorylation.
These ideas are a potentially interesting mechanistic feature of the ISR that builds on prior reports from this lab. However, there are major reviewer concerns about the manuscript. The manuscript uses different HEK cell models do not appear to be comparable in key ways. Hence one cannot readily integrate the results between the different models and there are important gaps in each. Additionally, key controls and assays are missing from each of the studied models. Because of these major concerns, the stated conclusions are not sufficiently supported from the experimental results. A portion of these concerns are highlighted below. These concerns diminished enthusiasm for the manuscript.
Reviewer concerns:
1) Figure 1: A major concern with the manuscript is that key controls and measurements are missing in experiments. The manuscript implies that prior publications have some of these measurements but this is problematic in many ways. In Figure 1A should also measure TyrRS levels and compare these to endogenous TyrRS induced in by oxidative stress. Determine the timing and duration of the anticipated induction of TyrRS expression for endogenous translation. Are the levels comparable with the rescued expression system (shown in this study) and is there induced expression of the engineered TyrRS by stress? If not, is this problematic with the proposed ISR induction model? Does this proposed translation dampening (Fig. 1B) involve continued reduction of translation initiation or elongation? Does the TyrRS +/- nuclear localization reduce global translation in the absence of eIF2 phosphorylation function?
The H2O2 treatment involves an initial insult and presumably the H2O2 is quickly dissipated. Therefore, one is likely not measuring the length of H2O2 exposure but rather the time after a short duration of stress. Other stress treatment regimens, including those involving oxidative damage, can be continuous. In Fig. 1C and other measures the synthetases, especially TyrRS, to show the level of overexpression.
2) Figure 2 and supplement: The ChIP analyses appears to feature overexpression of TyrRS (tagged versions different than those used in Fig. 1?). Are immunoblot measurements of the versions of TyrRS in Fig 1A applicable to those in Fig 2? A key feature of this pathway is that TyrRS expression late in the ISR directs the nuclear localization of the enzyme. Test this model with versions of TyrRS whose expression levels and regulation are appropriate in the ISR. Does the mRNA measurements in Fig. 2B involve +/- oxidative stress? This is critical to the proposed model.
3) Figure 3: Explain more clearly the mini-TyrRS and its utility. This point is also germane to earlier figures.
4) Figure 4: Be clear about the expression levels of the tagged TyrRS for the MS studies. Be sure to provide statistical information and support documentation in the methods and supplemental tables. Would be helpful to include the nuclear exclusion mutant with the co-IP. The analysis of the E196K mutant of TyrRS needs fuller development (e.g. with the stress condition) and clarity.
5) Figure 5: Regarding biological implications and cell survival, one finds it difficult to separate altered TyrRS charing of tRNA(Tyr) in this equation. Show the different mutants and arrangements do not alter aminoacylation of tRNA(Tyr).
-
Reviewer #2:
This paper presents a very compelling story: TyrRS has an important moonlighting function in the nucleus involving regulated gene expression via the recruitment of transcriptional co-regulators that is subordinate to TyrRS' ability to sense changes in the cellular environment. If proven correct this notion stands to influence our thinking about cellular stress responses. Therefore, the task of the reviewers is simply to critically evaluate the evidence; the significance of the claims is not in question.
According to the authors, by a mysterious process, that is not expanded on here, under oxidative stress conditions (200 µM H2O2-treatement of HEK293 cells for extended periods) a small fraction of TyrRS finds its way to the nucleus, where it selectively represses genes involved in the ability of cells to synthesize new proteins. The consequence of this selective transcriptional repression is a sustained oxidative stress-induced repression of protein synthesis that is entirely dependent on this nuclear translocation event.
The formative experiment supporting this chain of events is a comparison of cells in which the endogenous TyrRS has been inactivated by RNAi and rescued in trans, either by a wildtype TyrRS (i.e. one subject to this regulated nuclear translocation event) or a TyrRS bearing mutations in its nuclear localization signal (242KKKLKK247 to NNKLNK. Figure 1A shows that rescue with the NLS mutant TyrRS leads to superbasal (> complete) recovery of protein synthesis, whereas rescue with the wildtype TyrRS is associated with sustained stress-dependent decrease in protein synthesis.
This foundational experiment is not described in any detail, nor are its key tenets confirmed experimentally, instead the reader is referred to two previous papers, Fu 2012 describing the NLS mutations and Wei 2014 describing the implementation of this allele swap). Neither the extent of the inactivation of the wildtype allele nor the extent of the rescue are presented. Nor, for that matter, is there evidence that in the cells tested in Figure 1A the NLS mutation indeed abolishes the stress-dependent nuclear import of TyrRS. The WT-rescued cells are not even compared to the parental cells. These weaknesses are compounded by the inherent unreliability of any comparison of two clades of cells, as near as one can tell the authors have compared here two preparations of cells to which they attribute diverse properties.
Given how much is hanging off the phenotypic comparison of the WT and NLS mut TyrRS, it seems reasonable to impose a much higher standard on the experimental system. In 2020, this amounts to an allele replacement of the endogenous TyrRS with a silently-marked wildtype and NLS (and other) mutant coding sequences. Given the essentiality of TyrRS this should be a simple matter, using CRISPR/Cas9 to target the endogenous locus and offering a repair template to bring in the new alleles. Once implemented this method will produce numerous independent stable clones with the desired genotypes that can then serve in a comprehensive phenotypic analysis that traverses the problem of random clonal variation and phenotypic drift in clades of puro-resistant cells (that plagues the interpretation of the experiments shown here) It is uncertain if the above would be enough. The NLS of TyrRS is also involved in tRNA binding and potentially in other aspects of the charging reaction. Thus, mutations in that sequence rather than purely interfering with the putative nuclear functions of TyrRS, may also compromise the protein's more conventional function, with important and unanticipated phenotypic consequences. Fu et al. 2012, have made an effort to address this issue by comparing the affinity of WT TyrRS and the NLS mutants for tyr-tRNA (Table 1 therein) and by measuring tRNA acylation (Figure 2B, therein). The upshot of these measurements is that mutations in NLS severely compromise tRNA binding and acylation and even the weakest mutation, used here, has a measurable defect. These findings call into question the sweeping conclusions regarding the functionality of the NLS mutation. Therefore, to convince the sceptic the authors need to provide parallel evidence that selectively compromising nuclear transport of TyrRS is at the heart of the phenotypes observed.
In this vein it is notable that whereas in Wei 2014, study of the phenotypic consequences of the NLS mutation (on the cells' response to DNA damage) was buttressed by manipulation of angiogenin, an agent putatively implicated in the signal that sends TyrRS to the nucleus in stressed cells, no such attempt is made here; is angiogenin no longer believed to play a role? If not, it is incumbent on the authors to discover such trans-acting factors, and study the effect of their manipulation on the phenotype. This may be challenging, but the important claims for discovery made here must be matched by equally convincing experiments.
And then there is the surprising fact that in Wei 2014 and here the same cells exposed to the same stress seem to have very different consequences to gene expression programmes - where was the nuclear TyrRS-induced downregulation of 'translation' genes in 2014? Were none included in the 718 genes on the SmartChip Real-Time PCR System (WaferGen Biosystems)? Furthermore in 2014 Wei et al were concerned about the confounding effects of the different TyrRS alleles on protein synthesis, as the basis for the effects on DNA damage response (in their words: 'Considering that a simple knockdown of TyrRS may affect global transcription through a general effect on translation...'), yet dismissed this concern only to return now with a new version of reality whereby translational effects are all important. These issues need to be discussed and accounted for.
In summary, this is a paper presenting a very interesting but inadequately supported idea.
-
Reviewer #1:
Previous work has shown that the nuclear import of TyrRS is stimulated under stress and that nucleus-localized TyrRS functions through the transcriptional machinery to promote the expression of DNA damage response genes for cell protection. In this work, evidence is presented that nuclear TyrRS also inhibits bulk translation in a manner correlated with its association with several AARS-encoding genes and that for elongation factor eEF1A, and recruitment to these genes of HDACs. Mutation of the TyrRS NLS, whose function in nuclear localization provides for coupling between low tRNATyr binding and nuclear localization, was found to derepress bulk translation after prolonged oxidative stress by H2O2, without altering eIF2 phosphorylation levels or mTOR activation, and overexpression (o/e) of TyrRS can reduce protein synthesis, in a manner enhanced by the E196K mutation associated with Charcot-Marie-Tooth disease (CMT), shown previously to enhance TyrRS association with transcriptional co-repressors. ChIP-Seq of overexpressed V5-tagged TyrRS showed binding to only 17 sites, of which 15 are within gene coding sequences, among which four encode TyrRS, TrpRS, SerRS and GlyRS, and a fifth encodes elongation factor eEF1A. These results were confirmed by ChIP analysis of endogenous TyrRS, using the HisRS gene as negative control; and the occupancies were shown to increase on H2O2 treatment. The expression of these AARS/eEF1A gene transcripts was shown to be reduced by o/e of TyrRS, in a manner enhanced for at least some of them by the E196K CMT mutation; and the repression was shown to be eliminated by the NLS_mut for YARS expressed at native levels. Reductions in AARS/eEF1A protein expression were also observed on WT TyrRS o/e. Sequence analysis of the genes showing TyrRS binding by ChIP-seq led to identification of a motif that was shown to be required for binding to TyrRS in vitro in EMSA assays with either purified TyrRS or in extracts from cells overexpressing it, in a manner requiring the full-length TyrRS and not only the catalytic core of the enzyme. It was not shown however that eliminating this motif from any of the target genes attenuated their repression by nuclear-localized TyrRS. Mass spec analysis of affinity-purified, overexpressed TyrRS identified interacting proteins, and several of which were shown to be coimmunoprecipitated with endogenous TyrRS in non-stressed cells, including the transcription cofactors Trim28, HDAC1, and subunits of the NURD co-repressor/histone deacetylase complex. ChIP assays showed that overexpression of TyrRS lead to decreased levels of H3K27Ac, a histone mark of active transcription, and elevated occupancies HDAC1, TRIM28, or NURD subunit CHD4 in non-stressed cells at the AARS/eEF1A genes, with either TRIM28/HDAC1 or CHD4 being observed for all of the genes except the TyrRS gene that shows all three cofactors present. Based on these results, the authors conclude that increased nuclear localization of TyrRS on oxidative stress leads to increased binding of TyrRS to the AARS/eEF1A genes with attendant direct recruitment of either TRM28/HDAC1 or NURD, leading to transcriptional repression of these genes, which is responsible for the reduction in bulk protein synthesis observed after prolonged H2O2 treatment. They go on to provide evidence that cell survival in H2O2 is enhanced by nuclear association of TyrRS (dependent on the NLS), and that in its absence, conferred by the NLS_mut, apoptosis is increased. They also show that ROS increases by preventing TyrRS nuclear localization by the NLS_mut, and that this effect as well as decreased cell survival for this mutant in H2O2 can be rescued by the translation elongation inhibitor harringtonine.
The results presented in this report provide some support for the main conclusions of the paper and the overall model presented in Fig. 4F. However, as detailed below, many of the main conclusions of the paper are based on correlations and lack direct experimental support, and a number of the experiments are not comprehensive enough with sufficient conditions and controls to establish that the effects observed can be attributed to enhanced nuclear localization of TyrRS in response to H2O2. Considering the statements in the abstract, the evidence is reasonably strong that nuclear localization of TyrRS leads to inhibition of global translation at a stage later than that of eIF2α/ATF4 and mTOR responses, and that excluding TyrRS from the nucleus increases apoptosis under prolonged oxidative stress (although even this last point requires better documentation). However, the evidence is inadequate in several respects to claim that TyrRS directly represses the transcription of translation-related genes by recruiting TRIM28 or NURD complex, and as claimed on p. 13 of the Discussion, that the repression of the four AARS genes and the gene for eEF1A accounts for the reduction in bulk protein synthesis on H2O2 treatment.
Major issues:
-Evidence is lacking that the binding of TyrRS to the AARS/eEF1A genes is functionally important for the repression of any of the 6 putative target genes upon increased nuclear localization of TyrRS conferred by the NLS_mut or in response to H2O2. This would require ChIP analysis of TyrRS binding to the target genes for WT vs. NLS_mut TyrRS in H2O2-treated cells; and CRISPR mutagenesis of the putative TryRS binding site in the genome and analysis of transcription in the presence and absence of H2O2 for at least one of the putative TyrRS target genes.
-Evidence from ChIP analysis is lacking that TRIM28, HDAC1, or the NURD complex are recruited to the AARS/eEF1A genes at native levels of TyrRS in a manner dependent on the NLS and stimulated by H2O2, as the ChIP experiments involved only overexpressed WT TyrRS in non-stressed cells. It is also unclear whether H3K27Ac levels at the putative target genes decline at endogenous levels of TyrRS on treatment with H2O2. Similarly, evidence is lacking that the physical association of TyrRS with these co-repressors is dependent on the NLS and stimulated by H2O2, as the co-IP analysis was limited to endogenous WT TyrRS in non-stressed cells.
-Evidence is lacking that the cofactors TRIM28, HDAC1, or CHD4 are required for the down-regulation of target gene transcription on H2O2 treatment, which would require knock-down or elimination of these factors by CRISPR accompanied by analysis of target gene transcription +/- H2O2.
-Direct evidence is lacking from ChIP analysis of RNA Pol II that the transcription of the AARS/eEF1A genes is reduced on H2O2.
-Evidence is lacking that the repression of bulk protein synthesis is actually mediated by the reduced expression of the 4 AARSs and eEF1A. The fact that the TyrRS-E196K mutation enhances repression of bulk translation and also repression of 3 of the 5 target genes does support the idea that the repression of the target genes is instrumental in reducing protein synthesis, but again, this is still a correlation. There is no evidence that the reduced expression of the AARSs is sufficient to reduce charging of the cognate tRNAs, or that the reduced expression of eEF1A decreases the rate of translation elongation in cells or cell extracts.
-There is an important lack of information provided needed to evaluate the quality and significance of the ChIP-seq analysis of TyrRS binding to DNA. No details are provided concerning the ChIP-seq analysis of V5-tagged TyrRS to indicate how the TyrRS occupancy peaks were identified and distinguished above background signal from the cells expressing V5 tag alone, whether replicates were examined to provide statistical significance for the identified occupancy peaks, and the sequencing library depths. No genome browser views were provided to show the signals from the cells expressing V5-TyrRS vs V5 alone to demonstrate the quality and reproducibility of data from replicates. The supplementary table S1 describing these data was even omitted from the submission, and it's unclear whether these data are being deposited in GEO.
-There is an important lack of information provided needed to evaluate the quality and significance of the mass-spec analysis of TyrRS interacting proteins. No details are provided about the statistical significance of the protein interactions identified by mass-spec analysis of the affinity-purified TyrRS; and a negative control for non-specific association seems not to have been included in the analysis. The supplementary table describing these data was even omitted from the submission.
-It's unclear whether the motif described in Fig. 3A was found under the peaks of TyrRS occupancy in the various genes showing TyrRS binding in the ChIP-seq experiments, nor whether its occurrence is statistically significant. It was not indicated that the motif coincides with the peak ChIP-seq occupancies for TyrRS, and if not, how this could be explained.
-Evidence is lacking that harringtonine treatment reduced bulk protein synthesis under the conditions where it suppressed the effects of the TryRS NLS mutation in elevating ROS and decreasing cell survival.
-In general, the figure legends are poorly written in lacking important details about the nature of the TyrRS being examined in the experiment (tagged vs endogenous; overexpressed vs. native levels), and also whether oxidative stress was imposed in the experiment, and if so, the exact conditions for the treatment. Figure legends should contain all of the critical details needed to understand and evaluate the significance of the experimental results without having to search elsewhere in the paper for them.
-It needs to be clarified whether the mini-TyrRS construct lacks the NLS, and the significance of its behavior as a negative control for the effects of overexpressing WT TyrRS.
-For the experiment in Fig. 5B, quantification of the fraction of caspase-3 or PARP cleaved from biological replicates is required.
-The experiment in Supp. Fig. S4 lacks the results from cells untreated with H2O2 to ensure that these proteins were being induced by H2O2 in their hands.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript. Alan G Hinnebusch (Eunice Kennedy Shriver National Institute of Child Health and Human Development) served as the Reviewing Editor.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
In this manuscript, the authors describe an interaction of EGFR and Gal7 and that Gal7 binding downregulates EGFR activity. They show that Gal7 null mice exhibit thickening of the epidermis. In the absence of Gal7, EGFR is more active, which is supported by increased EGFR phosphorylation and phosphorylation of downstream molecules. Although a related protein, Gal3, has been shown to upregulate EGFR activity that may be functionally relevant in colorectal cancer, the authors' description of EGFR-Gal7 interaction is new. However, a number of claims made are not supported by the data presented. For example, in the abstract, the authors state that Gal7 is a direct binder of E-cadherin but it is not demonstrated experimentally.
Additional comments:
1) In Figure 3A graphs, authors show that both baseline (Fig. 3A) and ligand-induced (Fig. 3B) EGFR phosphorylation is higher in Gal7 knockdown cells. This reviewer is left to assume that Figure 3A graphs are derived from WB data from Figure 3B and in those WBs the increase in pEGFR, pERK, pAKT levels after Gal7 in absence of EGFR are not convincing. Also, Fig. 3B has two panels and they are not clearly explained in the figure legend.
2) Figure 4A, lower panels would be more convincing if HaCaT and shGal7 were run on same gel, just like upper panels.
3) Figure 4B, on top of WB panels, labels are not aligned properly and the reviewer is left to assume that the loading conditions are 0, 0.5, 1, 2, 4, 8, and 24 h, first for HaCaT, followed by same time points for shGal7. Also, the results from time course in Figure 4A and 4B are not consistent; total EGFR levels are downregulated as early as 2 min in Fig. 4A, whereas loss of EGFR is more gradual (over hours) in Figure 4B.
4) In Figure 4B legend, cycloheximide treatment is mentioned but in the figure it is not indicated which samples are treated with cycloheximide.
5)In Figure 7A, +EGF+rGal7 condition should be included for shGal7 cells
6) Figure 7F experiment needs to be on the same blot. Also, independent binding of Gal7 with E-cadherin is not shown in Fig. 7F or a similar experiment. This might indicate that both EGFR and Gal7 cooperate to stabilize interaction with E-cadherin as E-cadherin is unable to bind to either individually.
7) Figure 7 is referred to as Figure 8 in the text.
8) The manuscript is not well-written and needs to be thoroughly edited. For example, page 8, last line. “Colocalization assays of Gal7 and LAMP-1 gave no results”.
-
Reviewer #1:
In this paper the authors provide evidence that Galectin-7 binds the extracellular domain of EGFR regulating its signaling.
Although the in vitro study is for the most part nicely done, the major problem of this paper is the overall novelty. To this end several publications clearly show that, 1) members of the galectin family (e.g. 3) regulate EGF receptor signaling; 2) galectins (e.g. 8) regulate the early trafficking of EGFR; 3) galectins (e.g. 3) binds and regulate RTKs, including EGFR; 4) galectin-7, the topic of this paper, regulates e-cadherin expression and dynamics. Thus it is felt that the fact that galectin-7 binds to and regulates EGFR signaling is not sufficiently novel.
In addition, it is felt that some experiments are not sufficiently quantified (e.g. intracellular signaling) and some data are of descriptive nature (e.g. the characterization of the gal-7 null mice and in vivo evidence that gal-7 interacts with EGFR is somehow superficial).
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
As you can see from the reviews included, the reviewers have identified major shortcomings with this study that overall dampen the enthusiasm for the results reported. One of the major pitfalls identified is the overall novelty of the paper. As you can see from the detailed comments by the reviewers, other Gal family members have been shown to regulate EGF activation and trafficking, and to bind RTKs. Thus the identification of Gal 7 as a novel regulator of EGF receptors does not provide a clear advance. In addition, the claim that Gal7 is a direct binder of E-cadherin is not demonstrated experimentally. Some experiments shown should be shown on the same blots, and it felt that they lack solid quantification and in some cases are of descriptive nature. Finally it is felt that the manuscript is not well written and editing is recommended.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
CUT&RUN, a recently developed method, is a convenient alternative to ChIP-seq. Because it generates a footprint of DNA protected from MNase digestion, it can potentially also provide more nuanced information than ChIP-seq. In this paper, CUT&RUN is applied to the mapping of RNA polymerase II (Pol II) binding sites in the genome of a human lung carcinoma cell line. A technical innovation in the current paper is that the authors bypass the attachment of cells to concanavalin A-magnetic beads for all steps from cell permeabilization on, and exploit the fact that the cells they use naturally adhere sufficiently well to the bottoms of multi-well plates that these steps can all be performed on the cell culture plates themselves.
In the original CUT&RUN paper, it was already pointed out that different size classes of protected fragments might reveal different aspects of the biology of DNA bound factors. The authors of the current work extend this observation, and report two size classes of fragments that are produced by CUT&RUN applied to RNA polymerase II. They interpret the shorter fragments as marking Pol II sitting in a poised, compact state directly at the transcription start site (TSS), and the longer fragments downstream of the TSS as reflecting a less compact or larger, stalled Pol II complex after transcription has been initiated. This is consistent with what we know about regulation of nascent RNA elongation by Pol II shortly after transcription initiation, a phenomenon that has been known for individual genes since the 1980s, and that has first been documented genome-wide well over a decade ago.
In addition, the authors suggest that a substantial fraction of Pol II is also found in a paused/stalled/poised state upstream of the TSS. Unfortunately, it is unclear what the upstream signal reflects. E.g., is this pausing because of bi-directional transcription, or because of a separate pre-initiation complex or conformation? Without such insight, the observation does not add to our understanding of transcription initiation and elongation.
In aggregate, the authors present a simplification over conventional CUT&RUN for cell cultures, and they provide additional details for Pol II positioning near TSSs. While the work is technically well done, the technical improvements are relatively minor, and there are no principally new biological insights.
-
Reviewer #1:
The technical advance, which involves CUT&RUN on plates and doing paired end reads is modest. The main result of interest is the detection of a minor Pol II ChIP peak that maps around the transcriptional start site (TSS) as opposed to the major peak that corresponds to paused Pol II downstream from the TSS. The existence of the Pol II peak near the TSS is hardly surprising on first principles, and it is unknown what this peak corresponds to in terms of mechanism. The authors refer to this as "pre-initiation" and "poised", but there is no evidence for this. It is entirely possible (in my opinion more likely) that this peak corresponds to abortive initiation, a well-known step in the transcription cycle where Pol II makes short abortive transcripts that only occasionally get extended to longer products. It wasn't clear what the CTD phosphorylation status of this TSS-linked Pol II is, but it seems like it was phosphorylated at serine 5 residues. If so, this would indicate that TFIIH had already mediated the phosphorylation, which would release Mediator and allow promoter escape. Whatever the explanation, the existence of the peak doesn't indicate anything about mechanism. Lastly, this TSS-linked peak has been seen by Erickson (2018) so the result per se isn't novel. The approach here is more physiological than Erickson, but this isn't a significant advance, especially since there is no mechanistic information.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The method described, Back-it-up (BIU), builds upon the recently published Shake-it-off (SIO) system for EM grid preparation by eliminating the requirement for self-wicking, nano-wire grids (along with their inherent limitations including grid-to-grid variability and limited wicking capacity) by back-blotting standard copper-faced EM grids with highly absorbent glass fiber filter paper. Additional modifications to the SIO unit are reported that enable grid preparation (sample application-to-vitrification) times on the order of ~100ms. Although the achievement of this time constant has been reported for the Spotiton and chameleon automated grid preparation robots, these systems are technically complex and expensive to build or buy. As reported here, BIU represents for labs of modest financial resources a robust, reproducible high speed cryo-EM grid preparation device for around $1000 that uses a fraction of the sample volume required by typical automatic plunge freezer and can achieve sub-second plunge times that reduce the negative effects (denaturation, preferred orientation) of the air-water interface on the protein sample.
This study is well organized. First, it clearly demonstrates and provides visual supporting evidence of the absorptive capacity of the glass fiber filters. Next they validate the filters on a commonly used grid prep device using back-blotting. Finally, the authors use multiple samples and plunge speeds to demonstrate the utility and effectiveness of combining the glass fiber filters and a modified SIO device to prepare grids that yielded high resolution EM structural data.
-
Reviewer #2:
General Assessment:
The paper is well crafted and a clever improvement on current methods by combining the shake-it-off system with a Leica GP3 back blotter switching out the filter paper for a glass fiber pad. This improvement has likewise shown impressive results, and this information should be disseminated to help the field move forward. However there are a couple of issues, with borderline tangential material, that must be dealt with.
Substantive Concerns:
There are two major substantive concerns. The first revolves around the use of the influenza A hemagglutinin trimer in a direct apples to apples comparison with the work of Noble et.al. In their paper using spotiton they showed that dropping from 500ms to 100ms not only reduced the preferred orientation dramatically, but it also changed the thickness distribution of the ice in the holes. Thus the paper left the reader with a bit of an open question about whether it was a thickness effect or a temporal effect that resulted in the reduction of the preferred orientation problem. This is especially pertinent given their tomography work showing that the influenza A hemagglutinin trimer displays extreme sensitivity to the thickness of ice. For example, when the ice is too thin the trimer is completely excluded, then when the ice is just barely thick enough there is a region where only the top view orientation is possible, and finally only in the thicker ice (100-150nm) are side views possible. Thus, when attempting to compare the results from the BIU to the results from Noble et. al. the ice thickness becomes a confounding factor to the assignment of the improved distribution due to reduced time between blotting and vitrification. It is quite likely that the BIU's enhanced results are not a product of the reduced time between deposition and vitrification but rather due to the BIU producing a thicker ice in the middle of the holes due to the different thinning method, thus allowing for more side views as shown in Noble et. al.. Therefore the lines 265-271 seem, to this reviewer, to be much too strong of a conclusion; however, given the importance of the observation this reviewer suggests that the authors simply remove lines 269-271 and leave the important observation as an important observation.
The sentence starting on line 169 should be removed. A biosafety cabinet alone is insufficient to allow this invention to be compatible with BSL3/4 safety protocols, as the aerosol generated not only contaminates everything in the biosafety cabinet, but also will stay in the air for quite some time afterwards, long enough that a researcher might accidentally make the mistake of releasing whatever pathogen they are working with.
-
Reviewer #1:
I found no faults with this study and believe it is a timely contribution to the subfield of cryoEM sample preparation. Given the lower costs associated with this technology than the alternatives, it is possible that through-grid wicking with glass fiber will be widely adopted.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript. Adam Frost (University of California) served as the Reviewing Editor.
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Reply to the reviewers
Reviewer #1 (Evidence, reproducibility and clarity (Required)):
Molenaars et al., describe a protocol to extract and quantify a wide range of polar and apolar metabolites from the same C. elegans sample using methanol-chloroform based phase separation. The authors assess the method across different input amounts, in comparison to a 1-phase extraction method and through metabolic perturbations using RNAi against several metabolic enzymes. Finally, they provide a metabolomics analysis of metabolite variation across several C. elegans strains. The data are of overall high quality and presented in a clearly written manuscript.
We really appreciate the positive words from the reviewer.
To help assessing the value of the method to other approaches, several controls are suggested below:
1.Fig.1: Metabolite abundance in the polar phase should be compared to 1-phase extraction methods (analogous to Fig. 2I, which compares metabolites in the apolar phase to 1-phase extraction)
We acknowledge the apparent asymmetry in the text; comparing our two-phase method to a single phase lipidomics method indeed suggests a similar comparison for metabolomics. However, our established polar metabolomics method has always been based on this exact two-phase extraction. The current method exclusively asks whether it is possible to integrate our dedicated lipidomics platform into our established two-phase polar metabolomics method, by utilizing the apolar phase that is usually discarded. This way, the method enables comprehensive metabolomics/lipidomics screening while limiting the need of culturing twice the amount of material.
Our manuscript does not necessarily ask the more fundamental question of the advantages of a one-phase vs two-phase extraction for polar metabolites. Interestingly, the one-phase vs two-phase metabolomics methods have been compared previously and the authors show here that the two-phase method achieved broader metabolite coverage, satisfactory extraction reproducibility, acceptable recovery and safety (DOI: 10.1038/srep38885). This is most probably due to the cHILIC column being sensitive for contamination and therefore excluding lipids from your samples is beneficial for measuring polar metabolites. We hence believe that developing a single phase polar method would appear superfluous for the purpose of this study.
2.Are polar metabolites also detected in the apolar phase? Can the less hydrophobic lipids missing from the apolar phase detected in the polar phase?
This is an interesting question that mostly relates to the lyso-lipids that are not detected in the lipid phase of our two-phase extraction. The first point to make is that sample solvents that are used at the final stage of extraction are not compatible between methods. In other words, the solvent we normally use for the lipids phase (xxx) cannot be injected on the cHILIC column. So, in a practical sense, we would not be able to measure these compounds, even if they would technically be dissolved in the other layer. However, we tried a few different alternative approaches to get more information on this point:
We have attempted to integrate the lyso-lipids in the cHILIC measurements, in the polar layer, using the polar sample solvents. This was unsuccessful; no reproducible peaks, not even the internal standards, were measured. We will include a note on these results in our manuscript. We have, albeit for a different sample matrix, attempted to dissolve both layers of the two-phase extraction in the cHILIC sample solvents. While we cannot guarantee this for all metabolites, it appears that most polar metabolites are exclusively found in the polar layer. We were not able to integrate even a single peak from any of the sugar, amino acids, nucleotides, etc in the apolar layer dissolved in polar solvents. We have reconstituted both the polar and apolar layer of our two-phase extraction in 50:50 methanol:chloroform and analyzed them on the lipidomics platform. We did find some of the lipid internal standards partition to the polar phase, especially LPG (and to a lesser extent LPE and LPA) compared to for instance PE, SM, PG and PC that all end up in the apolar phase. We will include these data in the revised manuscript as a supplemental figure as it demonstrates that the lyso-lipids are poorly measured in the two-phase extraction. This is also why in the text we advise to use the dedicated one-phase extraction when interested primarily in these species.
3.Fig.3l-n: The authors claim that extracting metabolites from the polar and apolar phases of the same sample leads to better cross-correlation than if metabolites are extracted from different samples using methods optimized for the respective metabolite classes. To provide experimental evidence, metabolite abundance should be compared directly when metabolites are extracted from the same or from different samples using suitable methods.
We agree with this point. We will amend the text to not overstate these advantages.
Reviewer #1 (Significance (Required)):
The methodological and conceptual advancement of the present study is rather incremental. The authors essentially use the classical chloroform/methanol/water phase separation protocols developed by Bligh & Dyer and Folch, which have been used extensively for lipid extraction for many decades now. However, the effort to carefully measure the metabolites contained in the aqueous phase is laudable. For method validation, the authors use well-understood perturbations that yield predictable results. Overall, I consider the study more appropriate for a publication as a methods protocol, which could be of interest to the metabolomics community, rather than as a research paper.
We agree; our goal was indeed to create and share a method, we will make sure to emphasize this in our cover letter.
While the extraction method we use is not novel per se and based on classical extraction procedures, it is important to underscore that we are only now able to use these extractions in combination with high-resolution mass spectrometry. This opens new opportunities for basic discovery. The efficiency we achieve by using both phases of the two-phase procedure makes our method highly attractive for hypothesis generation, especially in sample sets where limited amounts of material are available.
Reviewer #2 (Evidence, reproducibility and clarity (Required)):
The authors provide a detailed description of a method to analyse both polar as well as lipophilic metabolites from the same nematode sample. This provides significant advantages over methods using individual samples. Moreover and by using internal standards they establish an extremely good correlation of individual metabolites. This paper is of immediate importance for the worms community and beyond.
We are very grateful to receive this positive response from the reviewer and for highlighting the advantages of our described method also beyond the worm community.
**Major comments:**
none **Minor comments:**
The correction process using internal standards could be described a bit more detailed.
In our revised manuscript, we will describe the internal standard use and corrections in more detail in the text. In summary: internal standards are selected for specific metabolites based on their Pearson correlation and %CV. Subsequently, metabolite peak areas were divided by the area of the appropriate internal standard. This corrects for any loss of sample during sample prep, for instance during the isolation of the two layers.
Jenni Watts has written a nice Worm Book chapter on lipids which may be cited in addition to reference 17, since it covers many of the metabolites and related enzymes contained in this manuscript
We will include a reference to this Worm book chapter reviewing fat regulation in C. elegans in our paper, thank you for the suggestion.
Reviewer #2 (Significance (Required)):
see above
Reviewer #3 (Evidence, reproducibility and clarity (Required)):
The manuscript is well written and consider. However, there is room for further improvements:
We thank the reviewer for the positive response and for the suggestions raised.
1) Author need to write exactly how many metabolites not just >, semi-quantitative analysis of >100 polar (metabolomics) and >1000 apolar (lipidomics) metabolites in C. elegans, for example they did with other papers in Table 1
We understand that this might appear vague. The notation was a compromise, based on the following considerations:
- The maximum number of reported metabolites can be different to the number of analyzed metabolites in a specific experiment or even a specific sample. For instance, our method is perfectly capable of measuring creatine metabolism –we have standards for these metabolites and they can be reliably measured–, however we have not yet been able to detect these metabolites in elegans. Some mutants also lose abundance of a certain metabolite to the point of it not being reliably measurable, which means they are filtered out in the bioinformatics.
- Since the initial draft of our manuscript we have been able, and will continue to be able, to add new metabolites to our analysis, as we perform a full scan over the range of m/z 50-1200. Because of this, we felt it more accurate to state that we can measure >100 metabolites, instead of a specific number.
2) Authors also need to clarify on number of samples in the result section while describing the statistical analysis.
We understand this point raised by the reviewer and will specify not only the number of samples, but also that they are indeed biological replicates. This will be included in the figure legends.
Reviewer #3 (Significance (Required)):
This might be interesting paper for the research community who work with C.elegans (metabolism or in general)
Thank you, we are in fact utilizing this double extraction for other non-worm samples such as mice an human tissues and we believe this could also benefit the research community beyond the model organism C. elegans.
The authors must deposit the raw data and make it available for the public, so they could also benefit from this good work.
It is our full intention to share our data in a convenient and standardized way through for instance the MetaboLights database (https://www.ebi.ac.uk/metabolights/). We agree and changes will be implemented as suggested.
Reviewer #4 (Evidence, reproducibility and clarity (Required)):
**Summary:** The authors present a method for extraction of both lipid and polar metabolites from the model organism C. elegans. This extraction method is based on the well-established Blyth and Dyer method, with a slight modification to retain and utilize both the organic and non-polar fractions for LCMS analysis. They applied and tested this method against a monophasic extraction utilizing the same solvent system. They report that there is a loss of metabolites in the non-polar fraction to the polar fraction (of more polar metabolites) and small differences between the monophasic and biphasic extractions. They also expanded on the linearity of the extraction efficiency by increasing the number of worms. Further they applied the single extraction method to both knockdown mutants of C. elegans and Recombinant Inbred Lines derived from N2 and the natural isolate CB4856 to determine whether this method would still be able to differentiate the metabolome between the genetically different C. elegans populations.
We thank the reviewer for their comments and suggestions.
**Major comments:**
*Are the key conclusions convincing?*
As a whole the conclusions are convincing and valid.
We appreciate that the reviewer considers our work convincing and valid.
*Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?*
The use of the adjective "robust" is, to an extent, erroneous. As defined, a robust method implies that the method is capable of withstanding small (deliberate or not) changes or variations. In this case the robustness of the method was not assessed and not clear how replication was carried out.
We have in fact performed analysis on both biological replicates and repeated injections of pooled samples to determine robustness. We will clarify the biological replicates in the text and will place the pooled QC samples in the main text with additional explanation and relevant statistics such as % coefficient of variance (%CV) between them. For clarity, we plotted %CV of all polar as well as apolar metabolites. For polar metabolites 97% of the metabolites had a %CV lower than 30. For apolar metabolites 86% of the metabolites had a %CV lower than 30.
*Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.*
Reproducibility would need to be assessed/quantified to establish how robust the method is. Even though linearity with an increase in the number of worms is a good indication, it does not satisfactorily establish the robustness of the method. The use of replicates to assess the agreement between measurements (i.e. bland-Altman plots), linearity as well as coefficients of variation (included in the sup material but not clear in the body of the manuscript) would characterize the methods best. The isolation of each variance originating from instrumental (pooled quality controls), biological (biological replication) and sample preparation (multiple extractions from the same biological source) is critical.
We have these data and will elaborate on this in our revised manuscript. We will discuss the quality control samples more prominently in the main body of the manuscript, and show one or more figures that specifically address both analytical and biological variance (see rebuttal figure 2). In summary, we assessed this variance using (a) a repeated injection of a pooled QC sample, and (b) biological replicates prepared individually. Especially the latter condition, in which we assess biological variance is representative for the actual method application. The %CV under these conditions is ≤20% for the majority of metabolites, which is why we consider our method robust.
*Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.*
The suggested experiments are in-fact just further analysis with the already collected data. There would be no need for further experiments, however it is not clear whether pooled QCs/or reference materials were used and the number of replicates per experimental design.
All the data are available. These analyses will be included in the revision.
*Are the data and the methods presented in such a way that they can be reproduced?*
The methods are very well described. My only comment is to address how the replicates were grown/created and how many per strain/group. If the replicate measurements were done on the same samples (repeated injections), I believe that would weaken the findings (if not invalidate them altogether), however if these were biological replicates from independent starting populations the findings are valid and convincing.
We performed bona fide biological replicates. We will explicitly mention this in the paper together with the other descriptions of our validation protocols.
*Are the experiments adequately replicated and statistical analysis adequate?*
As per my above comments.
**Minor comments:**
*Specific experimental issues that are easily addressable.*
It is not clear how the sample preparation process was carried out (randomization, run order, QCs etc). As per the guidelines widely accepted from –Broadhurst, D., Goodacre, R., Reinke, S.N. et al. Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies. Metabolomics 14, 72 (2018). https://doi.org/10.1007/s11306-018-1367-3.
We will provide details on the analysis itself in a table. In summary: Samples were measured in a random order, with blanks and QC samples throughout the run.
*Are prior studies referenced appropriately?*
A major reference that has applied this extraction method before in the same model organism is missing:
Castro, C., Sar, F., Shaw, W.R. et al. A metabolomic strategy defines the regulation of lipid content and global metabolism by Δ9 desaturases in Caenorhabditis elegans. BMC Genomics 13, 36 (2012). https://doi.org/10.1186/1471-2164-13-36
We will include this paper in our references. We would like to note though that this method requires not just an LC system to analyze lipids, but also GC with additional derivatization steps. Our method achieves comprehensive lipidomics using a single technique and no additional derivatization.
Further a recent publication that goes beyond the work described by the authors using similar approach: MPLEx: a Robust and Universal Protocol for Single-Sample Integrative Proteomic, Metabolomic, and Lipidomic Analyses. Ernesto S. Nakayasu, Carrie D. Nicora, Amy C. Sims, Kristin E. Burnum-Johnson, Young-Mo Kim, Jennifer E. Kyle, Melissa M. Matzke, Anil K. Shukla, Rosalie K. Chu, Athena A. Schepmoes, Jon M. Jacobs, Ralph S. Baric, Bobbie-Jo Webb-Robertson, Richard D. Smith, Thomas O. Metz mSystems May 2016, 1 (3) e00043-16; DOI: 10.1128/mSystems.00043-16
We will also include this paper, reporting 51 polar metabolites and 84 lipid species, in our references. While we recognize that they also make use of both phases and the protein pellet, we think our method is much more practical in several key ways:
Our metabolomics platform provides twice as many species and our lipids platform exceeds their analytical capabilities 10 fold. This means a far better coverage of differences within metabolite and lipid classes, allowing for far more intricate patterns to be detected. We show this for instance in our plots comparing carbon chain length to degree of saturation (Fig 4 and S2 in original manuscript); a comparison that is only possible with the data density that our method offers. The MPLEx metabolomics method also requires the use of a GC system and derivatization steps, while our method does not, making it much more user friendly and requiring only a single analytical system.
*Are the text and figures clear and accurate?*
Yes *Do you have suggestions that would help the authors improve the presentation of their data and conclusions? *
The figures, overall are of exceptional quality.
As per current scientific consensus, Box plots should also be overlaid with the actual datapoints (which was aptly done for the bar charts and other plots).
The supplementary data even though comprehensive is hard to understand. A "readme" file detailing what data each file contains would improve readability and comply with FAIR principles.
We agree that a readme file would make the supplemental data more understandable. We will provide such a file. For the box plots we will show the actual data points in our revised manuscript.
Reviewer #4 (Significance (Required)):
Even though the approach is not novel and has long been used in Natural Products Chemistry and in other organisms, it's highly significant to set an extraction method standard for the field of C. elegans metabolomics (including myself doing metabolomics and natural products chemistry with LCMS and NMR). However, this manuscript does not cover the technical aspects of the method with sufficient depth to hallmark this method as the standard for the field. Further information is needed to fill the missing gaps (as highlighted by the authors). Ratios between solvent and biological material amounts, reproducibility, recovery rates (even though buried in the supplementary files) and metabolite coverage are still missing.
As a side note, the disparity between the monophasic and biphasic extractions could be overcome by a sequential extraction of the same sample, with no incurred cost on performance (and removing the much-dreaded pipetting uncertainty near the line between solvents). The second aspect of the manuscript, which initially was a welcoming idea (and important), became >50% of the manuscript creating a disconnect between the information set by the abstract and introduction and the results/conclusion. The work is extremely relevant in both sections of the manuscript, but the technical aspect is still lacking details and/or analysis.
Strongly suggested: explicit compliance with the minimum reporting standards as per the Metabolomics Standards Initiative (MSI) and deposition of the data to a metabolomics repository (i.e. Metabolights or Metabolomics Workbench). These are internationally accepted requirements for metabolomics publications.
We are aware that the extraction itself is an analytical chemistry staple. However, it is precisely in this fact that we find novelty. It should be noted that both of the other papers mentioned by the reviewers that have attempted to integrate lipidomics and metabolomics have had to resort to labor intensive (as well as possibly expensive and destructive) derivatization steps and a separate analysis on GC. Our method does not have these requirements. It is indeed a single and very common extraction, after which each dried phase is reconstituted and immediately injected. But this simplicity is not a concession, as our metabolome coverage is easily more comprehensive than the other mentioned methods. We therefore feel that this simplicity should not discount our currently presented method, but be considered an additional advantage.
Sequential extractions may be an option to consider. However, we feel like they are less user friendly and unneeded. Because we use internal standards, it is never an issue to pipet slightly more or less of any particular sample; making it easy to avoid the line between solvents.
We will explicitly clarify where we already comply with the standards (such as the analysis of biological replicates and repeated injection of a QC sample) and are confident we can add figures and further information such as deposition of our data to comply with the rest.
REFEREES CROSS-COMMENTING
Completely agree with reviewer #1 comments, they are on point and I completely missed it. Relevant and should be addressed.
Reviewers #2 points out work worth acknowledging, the internal standard work was quite thorough and well designed.
Reviewer #3 and my comments overlap nicely, the need for further description of samples/replication and deposition of data in a metabolomics repository.
Further work is required to make this a good publication and standard for the field, without this extra work addressing the reviewers comments I feel this work could be to certain degree misleading and/or incomplete putting in cause its publication potential.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #4
Evidence, reproducibility and clarity
Summary:
The authors present a method for extraction of both lipid and polar metabolites from the model organism C. elegans. This extraction method is based on the well-established Blyth and Dyer method, with a slight modification to retain and utilize both the organic and non-polar fractions for LCMS analysis. They applied and tested this method against a monophasic extraction utilizing the same solvent system. They report that there is a loss of metabolites in the non-polar fraction to the polar fraction (of more polar metabolites) and small differences between the monophasic and biphasic extractions. They also expanded on the linearity of the extraction efficiency by increasing the number of worms. Further they applied the single extraction method to both knockdown mutants of C. elegans and Recombinant Inbred Lines derived from N2 and the natural isolate CB4856 to determine whether this method would still be able to differentiate the metabolome between the genetically different C. elegans populations.
Major comments:
Are the key conclusions convincing?
As a whole the conclusions are convincing and valid.
Should the authors qualify some of their claims as preliminary or speculative, or remove them altogether?
The use of the adjective "robust" is, to an extent, erroneous. As defined, a robust method implies that the method is capable of withstanding small (deliberate or not) changes or variations. In this case the robustness of the method was not assessed and not clear how replication was carried out.
Would additional experiments be essential to support the claims of the paper? Request additional experiments only where necessary for the paper as it is, and do not ask authors to open new lines of experimentation.
Reproducibility would need to be assessed/quantified to establish how robust the method is. Even though linearity with an increase in the number of worms is a good indication, it does not satisfactorily establish the robustness of the method. The use of replicates to assess the agreement between measurements (i.e. bland-Altman plots), linearity as well as coefficients of variation (included in the sup material but not clear in the body of the manuscript) would characterize the methods best. The isolation of each variance originating from instrumental (pooled quality controls), biological (biological replication) and sample preparation (multiple extractions from the same biological source) is critical.
Are the suggested experiments realistic in terms of time and resources? It would help if you could add an estimated cost and time investment for substantial experiments.
The suggested experiments are in-fact just further analysis with the already collected data. There would be no need for further experiments, however it is not clear whether pooled QCs/or reference materials were used and the number of replicates per experimental design.
Are the data and the methods presented in such a way that they can be reproduced?
The methods are very well described. My only comment is to address how the replicates were grown/created and how many per strain/group. If the replicate measurements were done on the same samples (repeated injections), I believe that would weaken the findings (if not invalidate them altogether), however if these were biological replicates from independent starting populations the findings are valid and convincing.
Are the experiments adequately replicated and statistical analysis adequate?
As per my above comments.
Minor comments:
Specific experimental issues that are easily addressable.
It is not clear how the sample preparation process was carried out (randomization, run order, QCs etc). As per the guidelines widely accepted from -
Broadhurst, D., Goodacre, R., Reinke, S.N. et al. Guidelines and considerations for the use of system suitability and quality control samples in mass spectrometry assays applied in untargeted clinical metabolomic studies. Metabolomics 14, 72 (2018). https://doi.org/10.1007/s11306-018-1367-3.
Are prior studies referenced appropriately?
A major reference that has applied this extraction method before in the same model organism is missing:
Castro, C., Sar, F., Shaw, W.R. et al. A metabolomic strategy defines the regulation of lipid content and global metabolism by Δ9 desaturases in Caenorhabditis elegans. BMC Genomics 13, 36 (2012). https://doi.org/10.1186/1471-2164-13-36
Further a recent publication that goes beyond the work described by the authors using similar approach:
MPLEx: a Robust and Universal Protocol for Single-Sample Integrative Proteomic, Metabolomic, and Lipidomic Analyses Ernesto S. Nakayasu, Carrie D. Nicora, Amy C. Sims, Kristin E. Burnum-Johnson, Young-Mo Kim, Jennifer E. Kyle, Melissa M. Matzke, Anil K. Shukla, Rosalie K. Chu, Athena A. Schepmoes, Jon M. Jacobs, Ralph S. Baric, Bobbie-Jo Webb-Robertson, Richard D. Smith, Thomas O. Metz mSystems May 2016, 1 (3) e00043-16; DOI: 10.1128/mSystems.00043-16
Are the text and figures clear and accurate?
Yes
Do you have suggestions that would help the authors improve the presentation of their data and conclusions?
The figures, overall are of exceptional quality. As per current scientific consensus, Box plots should also be overlaid with the actual datapoints (which was aptly done for the bar charts and other plots). The supplementary data even though comprehensive is hard to understand. A "readme" file detailing what data each file contains would improve readability and comply with FAIR principles.
Significance
Even though the approach is not novel and has long been used in Natural Products Chemistry and in other organisms, it's highly significant to set an extraction method standard for the field of C. elegans metabolomics (including myself doing metabolomics and natural products chemistry with LCMS and NMR). However, this manuscript does not cover the technical aspects of the method with sufficient depth to hallmark this method as the standard for the field. Further information is needed to fill the missing gaps (as highlighted by the authors). Ratios between solvent and biological material amounts, reproducibility, recovery rates (even though buried in the supplementary files) and metabolite coverage are still missing.
As a side note, the disparity between the monophasic and biphasic extractions could be overcome by a sequential extraction of the same sample, with no incurred cost on performance (and removing the much-dreaded pipetting uncertainty near the line between solvents).
The second aspect of the manuscript, which initially was a welcoming idea (and important), became >50% of the manuscript creating a disconnect between the information set by the abstract and introduction and the results/conclusion. The work is extremely relevant in both sections of the manuscript, but the technical aspect is still lacking details and/or analysis.
Strongly suggested: explicit compliance with the minimum reporting standards as per the Metabolomics Standards Initiative (MSI) and deposition of the data to a metabolomics repository (i.e. Metabolights or Metabolomics Workbench). These are internationally accepted requirements for metabolomics publications.
REFEREES CROSS-COMMENTING
Completely agree with reviewer #1 comments, they are on point and I completely missed it. Relevant and should be addressed.
Reviewers #2 points out work worth acknowledging, the internal standard work was quite thorough and well designed.
Reviewer #3 and my comments overlap nicely, the need for further description of samples/replication and deposition of data in a metabolomics repository.
Further work is required to make this a good publication and standard for the field, without this extra work addressing the reviewers comments I feel this work could be to certain degree misleading and/or incomplete putting in cause its publication potential.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #3
Evidence, reproducibility and clarity
The manuscript is well written and consider. However, there is room for for further improvements,
1) Author need to write exactly how many metabolites not just >, semi-quantitative analysis of >100 polar (metabolomics) and >1000 apolar (lipidomics) metabolites in C. elegans, for example they did with other papers in Table 1
2)Authors also need to clarify on number of samples in the result section while describing the statistical analysis.
Significance
This might be interesting paper for the research community who work with C.elegans (metabolism or in general)
The authors must deposit the raw data and make it available for the public,so they could also benefit from this good work.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
The authors provide a detailed description of a method to analyse both polar as well as lipophilic metabolites from the same nematode sample. This provides significant advantages over methods using individual samples. Moreover and by using internal standards they establish an extremely good correlation of individual metabolites. This paper is of immediate importance for the worms community and beyond.
Major comments: none
Minor comments:
The correction process using internal standards could be described a bit more detailed.
Jenni Watts has written a nice Worm Book chapter on lipids which may be cited in addition to reference 17, since it covers many of the metabolites and related enzymes contained in this manuscript
Significance
see above
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
Molenaars et al., describe a protocol to extract and quantify a wide range of polar and apolar metabolites from the same C. elegans sample using methanol-chloroform based phase separation. The authors assess the method across different input amounts, in comparison to a 1-phase extraction method and through metabolic perturbations using RNAi against several metabolic enzymes. Finally, they provide a metabolomics analysis of metabolite variation across several C. elegans strains. The data are of overall high quality and presented in a clearly written manuscript.
To help assessing the value of the method to other approaches, several controls are suggested below:
1.Fig.1: Metabolite abundance in the polar phase should be compared to 1-phase extraction methods (analogous to Fig. 2I, which compares metabolites in the apolar phase to 1-phase extraction)
2.Are polar metabolites also detected in the apolar phase? Can the less hydrophobic lipids missing from the apolar phase detected in the polar phase?
3.Fig.3l-n: The authors claim that extracting metabolites from the polar and apolar phases of the same sample leads to better cross-correlation than if metabolites are extracted from different samples using methods optimized for the respective metabolite classes. To provide experimental evidence, metabolite abundance should be compared directly when metabolites are extracted from the same or from different samples using suitable methods.
Significance
The methodological and conceptual advancement of the present study is rather incremental. The authors essentially use the classical chloroform/methanol/water phase separation protocols developed by Bligh & Dyer and Folch, which have been used extensively for lipid extraction for many decades now. However, the effort to carefully measure the metabolites contained in the aqueous phase is laudable. For method validation, the authors use well-understood perturbations that yield predictable results. Overall, I consider the study more appropriate for a publication as a methods protocol, which could be of interest to the metabolomics community, rather than as a research paper.
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Reply to the reviewers
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
We thank the reviewers for their feedback and encouragement. We have now fully revised the manuscript to address all comments. Our specific responses are provided below and we have highlighted changes in the text. The major additions are:
- analysis of simulated time-courses with lower temporal resolution
- analysis of ex vivo PER2::LUCIFERASE SCN recordings
- analysis of simulated time-courses with Poisson distributions of noise
- plotted summary statistics for several figures
- mathematical formula and explanation in the Methods Overall, these revisions have strengthened our findings and improved the manuscript, particularly in demonstrating that the issues with the chi-square periodogram are not specific to sampling interval or data type.
Reviewer #1 (Evidence, reproducibility and clarity (Required)):
**Summary:**
Tackenberg & Hughey investigate the reliability of a popular period estimation algorithm, the chi-square periodogram. They find a bias in the estimation, and through careful investigation identify the cause. This is a well executed and well presented study.
**Comments:**
In Figs 2+3 the authors show that the discontinuity in periodogram coincides with the number of complete cycles, K. However, in Fig 2C there are several other positions where K abruptly changes, but little effect on the chi-squared statistic is observed. Can the authors offer an explanation as to why the magnitude of the discontinuities differ?
We have taken a closer look at how each component of the chi-square statistic calculation changes at points where K decreases, and have found that discontinuities do always occur at these points. In addition to the obvious effect of the K * N term on the sudden decreases, we found that the sum of squares of the column means alone (the primary component of the numerator) also changes abruptly at each transition point of K. As a result, the discontinuity magnitude is likely roughly proportional to the amplitude of the chi-square statistic at that point.
An important claim is that the discontinuity is observed in multiple software implementations. However, the plots of Supplementary Fig 1C,D are presented too small to evaluate this claim.
In Supplemental Fig. 1C-D, the critical information is the shape of the periodogram and the presence of a discontinuity, so we believe the plot sizes are appropriate.
It may be of interest to apply the algorithms to a single-cell experimental data set which are qualitatively different (e.g., oscillation shape, damping).
We have created a new supplemental figure (Supplemental Fig. 8) by applying the strategy and visualization used in Fig. 6 to SCN PER2::LUC recordings instead of wheel-running data, and have updated the text accordingly.
Reviewer #1 (Significance (Required)):
It has been previously shown that the chi-square periodogram algorithm has performance shortcomings for the analysis of circadian data (e.g. Zielinski et al., 2004). However, this study demonstrates exactly why, giving more conclusive evidence to support the conclusion that it should be avoided. This will be useful to many in the mammalian circadian community. It should be noted however that other algorithms are already favoured by other ciock communities (e.g. plant), even if a rigorous understanding of the biases were lacking.
The methods developed here will be valuable for future comparisons of circadian algorithms. Of particular importance will be comparing algorithms for analysis of single-cell rhythms or non-stationary rhythms.
Reviewer #2 (Evidence, reproducibility and clarity (Required)):
Chi-squared periodograms (CSP) are routinely used in circadian biology. In particular, this test has been used to determine circadian period in behavioral data (e.g. actigraphy) in mammals, flies and other species. This paper suggests that CSP, in some circumstances (e.g. where there are discontinuities), that CSP could be improved by changing the algorithm. They propose different steps to do this (e.g. using their greedy CSP code) and/or by using alternative tests such as Lomb-Scargle.
The authors use simulated data to demonstrate their findings, and whilst I can see the benefits of this, it would be useful to benchmark the algorithms on actual real world circadian data (e.g. actograms from mouse or fly experiments). Although these types of data may not be publicly available, it would be highly likely to be available from multiple labs in the circadian field. In particular, fly datasets will be abundant in many clock labs. This would aid the utility of the papers findings for the field.
Fig. 6 is entirely based on real-world circadian data (mouse wheel-running activity), as is the newly added Supplemental Fig. 8.
Reviewer #2 (Significance (Required)):
The paper is helpful for the circadian field when dealing with datasets that may contain discontinuities.
It appears that the paper will be primarily useful for behavioral data, rather than, for example, transcriptomic time courses, since these tend to be much shorter and less sample intensive. Thus, it would be useful for circadian (and other) researchers analysing activity data in particular.
My expertise is in circadian rhythms, both behavioural and molecular (e.g. sequencing) level analyses. Thus, I would be a possible end-user for the algorithms in this paper.
Reviewer #3 (Evidence, reproducibility and clarity (Required)):
**Summary:**
The authors identify a serious flaw in a popular method called Chi-squared periodogram (CSP) for period estimation in circadian rhythms. They systematically get to the source of the problem -- a discontinuity in the test statistic. This flaw leads to a bias in the period estimate. They present two modifications to the CSP, one of which they prefer. Nevertheless, they show that other more flexible methods such as Lomb-Scargle Periodogram work well without this discontinuity (bias) issue.
**Major Comments:**
1.One thing the authors do not include is timeseries lengths of non-integer days. Would it not be an interesting suggestion to choose a non-integer length time course, which is not a multiple of the periods of interest, and still continue using CSP as is ? This is also rather counter-intuitive.
Figs. 3A and 6 and newly added Supplemental Fig. 8 use non-integer (24-h) days.
2.I suppose the authors use a sampling resolution of 6min with wheel-running activity in mind. But it would be worth it in the interest of completeness to also consider a lower resolution. There is nothing in this study that ties it to the specific application, is it not?
Although a sampling resolution of 6 minutes is not specific to wheel-running activity, we have added an analysis identical to that of Fig. 5 but with a resolution of 20 minutes (Supplemental Fig. 5). Additionally, the PER2::LUC SCN recordings analyzed in Supplemental Fig. 8 have a sampling resolution of 20 minutes.
3.The authors discuss only the mean absolute error in the text but isn't the direction (sign) of the error also of interest. As far as I can see in Fig 5, conservative CSP overestimates and greedy CSP generally underestimates periods.
We discuss both the error (references to Fig. 5A) and absolute error (references to Fig. 5B) in the text. We feel the interpretation suggested by the reviewer may be too reliant on the results of 3-day simulations, as the apparent underestimation by greedy appears far less substantial in simulations of 6 and 12 days.
**Minor Comments:**
1.I would like to see the formulae for the ratio of variances and p-values to be clear about how the authors computed the CSP. They describe it in words already, but I think some mathematics is warranted here.
We have added the formula for the standard chi-square periodogram to the Methods section.
2.It is nice to the see the raw data in the plots. But I would like to see the plot of the summary statistics (mean and variance/st. dev) for each of scatter plots to judge the size of bias. It is not easy to do this with the Excel sheet.
We have overlaid a black circle representing the median and a vertical black line representing the 5th-95th percentile range onto Fig. 5 and Supplemental Figs. 3-7.
Reviewer #3 (Significance (Required)):
The authors present a sobering perspective on the chi-squared periodogram, which is still very popular among empirical biologists. They plainly show using artificial data that it is better to avoid the CSP when possible, although they suggest improvements to the CSP. The authors provide an R package to perform the analysis.
There have been previous work that have highlighted other limitations of the CSP. This might be considered one more nail in the coffin of the CSP.
I think this paper would be interest to both computational biologists and wet-lab biologists, but I think it ought to have a greater influence on the latter as the former already resort to more sophisticated approaches.
My expertise is in Computational and Theoretical biology.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #3
Evidence, reproducibility and clarity
Summary:
The authors identify a serious flaw in a popular method called Chi-squared periodogram (CSP) for period estimation in circadian rhythms. They systematically get to the source of the problem -- a discontinuity in the test statistic. This flaw leads to a bias in the period estimate. They present two modifications to the CSP, one of which they prefer. Nevertheless, they show that other more flexible methods such as Lomb-Scargle Periodogram work well without this discontinuity (bias) issue.
Major Comments:
1.One thing the authors do not include is timeseries lengths of non-integer days. Would it not be an interesting suggestion to choose a non-integer length time course, which is not a multiple of the periods of interest, and still continue using CSP as is ? This is also rather counter-intuitive.
2.I suppose the authors use a sampling resolution of 6min with wheel-running activity in mind. But it would be worth it in the interest of completeness to also consider a lower resolution. There is nothing in this study that ties it to the specific application, is it not?
3.The authors discuss only the mean absolute error in the text but isn't the direction (sign) of the error also of interest. As far as I can see in Fig 5, conservative CSP overestimates and greedy CSP generally underestimates periods.
Minor Comments:
1.I would like to see the formulae for the ratio of variances and p-values to be clear about how the authors computed the CSP. They describe it in words already, but I think some mathematics is warranted here.
2.It is nice to the see the raw data in the plots. But I would like to see the plot of the summary statistics (mean and variance/st. dev) for each of scatter plots to judge the size of bias. It is not easy to do this with the Excel sheet.
Significance
The authors present a sobering perspective on the chi-squared periodogram, which is still very popular among empirical biologists. They plainly show using artificial data that it is better to avoid the CSP when possible, although they suggest improvements to the CSP. The authors provide an R package to perform the analysis.
There have been previous work that have highlighted other limitations of the CSP. This might be considered one more nail in the coffin of the CSP.
I think this paper would be interest to both computational biologists and wet-lab biologists, but I think it ought to have a greater influence on the latter as the former already resort to more sophisticated approaches.
My expertise is in Computational and Theoretical biology.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
Chi-squared periodograms (CSP) are routinely used in circadian biology. In particular, this test has been used to determine circadian period in behavioral data (e.g. actigraphy) in mammals, flies and other species. This paper suggests that CSP, in some circumstances (e.g. where there are discontinuities), that CSP could be improved by changing the algorithm. They propose different steps to do this (e.g. using their greedy CSP code) and/or by using alternative tests such as Lomb-Scargle.
The authors use simulated data to demonstrate their findings, and whilst I can see the benefits of this, it would be useful to benchmark the algorithms on actual real world circadian data (e.g. actograms from mouse or fly experiments). Although these types of data may not be publicly available, it would be highly likely to be available from multiple labs in the circadian field. In particular, fly datasets will be abundant in many clock labs. This would aid the utility of the papers findings for the field.
Significance
The paper is helpful for the circadian field when dealing with datasets that may contain discontinuities.
It appears that the paper will be primarily useful for behavioral data, rather than, for example, transcriptomic time courses, since these tend to be much shorter and less sample intensive. Thus, it would be useful for circadian (and other) researchers analysing activity data in particular.
My expertise is in circadian rhythms, both behavioural and molecular (e.g. sequencing) level analyses. Thus, I would be a possible end-user for the algorithms in this paper.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
Summary:
Tackenberg & Hughey investigate the reliability of a popular period estimation algorithm, the chi-square periodogram. They find a bias in the estimation, and through careful investigation identify the cause. This is a well executed and well presented study.
Comments:
In Figs 2+3 the authors show that the discontinuity in periodogram coincides with the number of complete cycles, K. However, in Fig 2C there are several other positions where K abruptly changes, but little effect on the chi-squared statistic is observed. Can the authors offer an explanation as to why the magnitude of the discontinuities differ?
An important claim is that the discontinuity is observed in multiple software implementations. However, the plots of Supplementary Fig 1C,D are presented too small to evaluate this claim.
It may be of interest to apply the algorithms to a single-cell experimental data set which are qualitatively different (e.g., oscillation shape, damping).
Significance
It has been previously shown that the chi-square periodogram algorithm has performance shortcomings for the analysis of circadian data (e.g. Zielinski et al., 2004). However, this study demonstrates exactly why, giving more conclusive evidence to support the conclusion that it should be avoided. This will be useful to many in the mammalian circadian community. It should be noted however that other algorithms are already favoured by other ciock communities (e.g. plant), even if a rigorous understanding of the biases were lacking.
The methods developed here will be valuable for future comparisons of circadian algorithms. Of particular importance will be comparing algorithms for analysis of single-cell rhythms or non-stationary rhythms.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The manuscript "Decision making in auditory externalization perception" aims to identify cues that create/hinder an auditory externalization percept by using a template-based modeling approach. The approach as well as the findings are very interesting, and the study is thoroughly conducted. However, the manuscript adds little new knowledge to the field. Furthermore, a critical discussion is missing. The authors use a template-based model, but do not discuss the possible problems with such an approach. Particularly as each condition uses another model fit. This potentially allows the model to use cues that the auditory system cannot or does not consider. Nevertheless, the approach can still teach us which cues are potentially important for auditory externalization.
1) The title seems inappropriate as the main work seems to be on the identification and combination of cues for externalization but not on the decision making.
2) The model needs a more detailed explanation in the introduction. Otherwise the result section is not understandable without consulting the methods section.
3) Add a Discussion on template-based models and fitting conditions. The risk of mathematical inspired models is that features are exploited that the auditory system cannot access. A more sophisticated front-end than a gammatone filterbank might reduce this risk. Alternatively, the use of physiologically inspired front-ends as in Scheidiger et al. (2018) might be interesting to consider. Nevertheless, I acknowledge that some of the features used in this study are backed by physiological and psychoacoustical studies.
4) It is known that the monaural spectral shape is important for externalization, for example from the studies that you have used. Thus, I partly question the novelty of the findings.
5) I am not too familiar with template based models but I wonder if there is a problem if you use your models to fit and test with the same datasets?
-
Reviewer #2:
The current study compares four decision rules, factoring in seven potential acoustic cues, for predicting perceived sound externalization for single-source binaural sound with stationary interaural cues. Test stimuli included a harmonic vowel complex, noise and speech. Results show that monaural and binaural cues shape externalization. However, how listeners weighted these cues varied across the tested conditions. The authors consider the fact that some of these cues covary acoustically, by additionally testing their model on subsets of two of these cues only. No single externalization cue emerged as a clear predictor for perceived externalization. However, overall, a static cue weighting strategy tended to outperform dynamic cue weighting for predicting externalization.
Major concerns dampen enthusiasm for the current work.
1) It is unclear what neural mechanism is being tested. A premise of the current approach is that perceived sound externalization is primarily driven by acoustic cues. However, we know this not to be true. Context matters. As pointed out by the authors (l370-372), when listening to sounds processed with head related transfer functions (HRTFs) over headphones, listeners can externalize sound better when the context of the test room matches the room where HRTFs were recorded (Werner and Klein 2014).
2) Most external sounds are neither anechoic nor stationary. Therefore, any neural decision metric on externalization must have been shaped by lifelong experience with dynamic, reverberant cues for interpreting externalization. The current work mostly models stationary single source sound that was either anechoic or mildly reverberant, providing pristine spatial cues. I do not follow the author's point that this would not matter (l498-502): "While the constant reverberation and visual information may or may not have stabilized auditory externalization, they certainly did not prevent the tested signal modifications to be effective within the tested condition. In our study, we thus assumed that such differences in experimental procedures do not modulate our effects of interest." That is an untested assumption.
3) Many of the current test stimuli are perceived as ambiguous - providing 50% externalization ratings - and thus do not provide a sensitive test of brain mechanisms of sound externalization.
4) Reverberation enhances perceived externalization, but this cannot be predicted by any of the tested decision metrics which only consider stationary monaural or binaural cues.
On balance, this reviewer is unconvinced that the current work will generalize to realistic dynamic and reverberant conditions.
S. Werner and F. Klein, "Influence of Context Dependent Quality Parameters on the Perception of Externalization and Direction of an Auditory Event," presented at the AES 55th International Conference: Spatial Audio (2014 Aug.), conference paper 6-4.
-
Reviewer #1:
I agree with the authors that the question at the basis of this work is timely and important both from the point of view of understanding auditory perception and for informing technology. However I am not convinced that the findings here will necessarily generalize to other stimuli/listening situations.
I think the biggest limiting factor here is that the primary data on which the modelling is based are drawn from many different studies which used different stimuli, different tasks, different presentation environments and different equipment). I can see how testing the model on existing data is an important first step, but I would think that a critical next step is to form a set of (contrasting) predictions to be tested on a single stimulus set, within a single group of participants, as a way of confirming model validity. In this experiment I would also avoid using static non-reverberant environments since we know that these factors greatly affect spatial perception.
Other comments:
1) The title greatly overstates the main findings, it would be toned down.
2) Intro, line 30-33 this statement is misleading. As written it appears to claim temporal aspects of auditory perception are based on short term regularity, whilst spatial perception is based on long term effects. This is not correct see e,g Ulanovsky 2004.
3) As a reader not highly familiar with the auditory spatial processing literature I found the results section very dense and hard to follow. If you are targeting a general audience it is important to clarify concepts, avoid using abbreviations where possible etc.
4) When discussing the various decision strategies which you tested, consider explaining how they might be implemented by the auditory system, at which stage of processing etc.
5) It is very difficult to evaluate your results without more information about the stimuli and studies from which they were taken. Whilst you do provide references, I think the paper would be much clearer if you provide a more complete description of the stimuli (even in table form; paradigms etc).
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 4 of the manuscript.
Summary:
As you will see the reviewers agreed that the premise behind this manuscript is important and timely both in the context of basic auditory science and for informing technology. However, they raised largely consistent concerns about the generalizability of your observations to other auditory stimuli and to more naturalistic listening conditions.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In the manuscript by Miné-Hattab et al the authors revisit a phenomenon that has been extensively studied for over 10 years: the subdiffusive and diffusive properties of DNA damage binding factors in repair foci (inside and outside of foci). The work is carefully done and brings a few observations of interest, but the novel insights are extremely limited. The most original aspect is that they characterize the movement of repair molecules within the focus with movement of the focus itself (the movement of foci has been done by many and turnover of factors has also been done by many). That they compare the two with one set of measurements is the key contribution of the paper, and they do find differences in diffusion coefficients. It is likely that this was not done previously. It is difficult to judge, as key papers that showed similar conclusions or datasets are not cited.
Here are a few key examples:
1) In the last year the Haber lab published a very similar study in Plos Genetics (Live cell monitoring of double strand breaks in S. cerevisiae, Waterman et al 2019 https://doi.org/10.1371/journal.pgen.1008001 ). Although they tracked Ddc2 and Rad51, they also looked at the behavior of separate foci and this paper is not even cited. The data should be compared at the very least.
2) The characteristics of 53BP1 foci have been extensively studied by many labs including those of Altmeyer, Scherthan, DeLange and others, with very similar findings as Miné-Hattab reports for Rad52 (for example, Phase separation of 53BP1 determines liquid‐like behavior of DNA repair compartments, Kilic et al., EMBO J. 2019 38(16): e101379; Live Dynamics of 53BP1 Foci Following Simultaneous Induction of Clustered and Dispersed DNA Damage in U2OS Cells Alice Sollazzo et al., Int. J. Mol. Sci. 2018, 19, 519 as well as the single molecule work of the lab of Eric Greene). Moreover both rad52 and PCNA foci were studied by Essers et al. (Kanaar and Vermeulen) MCB 2005. 25(21): 9350-9359 and EMBO J. 2002 Apr 15. Comparisons with these studies needs to be made.
3) A number of earlier studies followed Rad52 foci in budding yeast on induced double strand breaks (even using the I-Sce1-cut system used here) that are not taken into consideration. The diffusion coefficients presented here have to be compared with these earlier studies and differences should be resolved by comparing techniques and conditions of imaging. For instance, Dion et al., Nature Cell Biology 2012).
In brief, while the execution and analysis of the data shown here is very good, without direct comparison with other data sets, it is difficult to see exactly where this paper goes beyond published studies. This is especially crucial as the paper as written makes no effort to compare their data with existing datasets. Most specifically a comparison with LLPS as defined for other chromatin-foci forming proteins in the nucleus needs to be done - particularly addressing studies in mammalian cells concerning 53BP1 and other repair factors. This, plus a careful comparison with data from induced Ise1-break movement, must both be included. Finally, insufficient data are provided to draw conclusions about whether or not the authors' observations are reflective of phase separation. Additional mobility studies in conditions that disrupt LLPS are needed, both for the individual protein and for the foci. In conclusion, serious revision is needed and an effort must be made to show to the reader that this data is comparable (or not) with other data in the literature.
-
Reviewer #2:
Miné-Hattab et al. conduct a study focusing on the behaviour of the DNA repair protein Rad52 at sites of DNA damage in budding yeast. Several DNA repair proteins, including yeast Rad52, have been previously observed to phase separate at sites of DNA damage in a number of organisms. However, the authors here aimed to more accurately consider the potential phase separation behaviour of Rad52 by using single particle tracking (SPT) and Photo-activatable Localization Microscopy (PALM). Overall, the findings are consistent with previous studies and provide additional evidence supporting the concept that Rad52, but not the ssDNA-binding protein RPA, phase separates at the site(s) of DNA damage. The data shown also support the long-appreciated concept that different DSB sites cluster within the nucleus, albeit this study presents higher resolution data. The study falls within an important area of investigation.
1) The study does not present a novel conceptual advance.
2) What is the evidence that the biophysical properties observed are of direct relevance to DNA repair? For example, is the mobility of Rad52 within the repair focus important for repair? Is the difference in diffusion kinetics within and outside of the repair focus important for genome stability? What could the authors do to alter that diffusion profile and what would be the consequence on repair? Also, addressing this point implies the need to use a more physiologically relevant system with repairable DSBs, and not the irreparable DSB system used here. The authors describe the work of many in the field as "extremely phenomenological", yet it is not clear what the authors did to go beyond such a statement.
3) Overall, the statistical significance of most of the presented data is either lacking or unclear. This needs to be carefully addressed.
4) It is unclear if the 'absence of DNA damage' condition discussed in the first section of the results is the non-induced version of the system described in the second section of the results. Also regarding these sections, it seems that the 'absence of DNA damage' control conditions were not conducted as part of the same experiments with the I-SceI DSB.
5) Several statements made are not supported by the data and without clearly stating that the statements represent speculations. E.g. page 4, longer tail is due to Rad52 molecules diffusing slowly inside the focus; page 8, observing the 2 populations also in G1 does not necessarily mean that the 2 populations in S/G2 do not reflect replication forks at all. The authors need to carefully revise their claims/statements and consider alternative explanations. Also, the writing is often unclear or confusing and the authors should consider substantially revising it to clarify their claims, clearly indicate speculations that are not supported by the data, and make the text as accessible as possible to non-specialists.
6) How do the authors reconcile previous findings indicating that recombinant DNA repair proteins phase separate in vitro with their claim that "Rad52 acts as a client of the LLPS but does not drive its formation" on page 11?
7) How was the cell cycle stage determined?
8) Fig S1 data appear to show the existence of a partial loss of Rad52 function in the Rad52-Halo cells. This should be clearly expressed in the results and consequent limitations/caveats discussed. Also, please clarify whether Fig S1 shows the viability of Rad52-Halo cells in the presence or absence of JF646.
9) Regarding the possible categories of traces evaluated, one category is not included in the study. The surface tension that defines LLPS-dependent bodies is known to both help maintain focus integrity and partly counter LLPS body fusions. So if the foci represent true phase-separated bodies, have the authors then observed traces where Rad52 molecules interact with yet fail to enter the larger Rad52 foci?
10) The authors present no direct evidence for an "attractive potential" that drives molecules towards the centre of the focus. For example, what if the 'attractive potential' is simply the focus' boundary surface tension creating a barrier against which some of the molecules inside the focus bounce back towards the centre of the focus?
11) Consider revising the discussion to shorten it while making it more focused on conceptual advances and higher level interpretations, without re-describing the results in detail.
12) Can the authors visualize the fusion of the Rad52 foci/DSBs in live cells within their experimental systems?
13) The authors state on page 10 that "Here, we found that upon different levels of Rad52 over-expression, the background concentration increases (Figure S8) suggesting that Rad52 might not be the driving molecule responsible for the LLPS formed at the damaged site." Can the authors explain the logical transition here more clearly, it was unclear.
-
Reviewer #1:
In this manuscript by Mine-Hattab and colleagues, the authors use single-molecule tracking in yeast to dissect the formation of the double-stranded break response in living cells. Specifically, they try to determine the nature of Rad52 clustering at the DSB focus. The sequential recruitment pathway is well-studied in yeast (RPA --> Rad52 -->Rad51), and the inducible I-SceI break offers a controlled system for DNA damage. Moreover, yeast could be an excellent model system to elucidate if there is any conservation or function for such compartments. Overall, I found the data and the subsequent analysis to be both rigorous and nuanced. Ultimately, one is trying to distinguish whether the focus is due to a clustering of binding sites or liquid-liquid phase separation, or perhaps some combination of the two. I feel the story falls short of providing a definitive answer, as do many in this field, but the authors conclude that the preponderance of evidence points to a LLPS model for Rad52 clustering.
1) How is it possible to distinguish a cluster of binding sites from liquid-liquid phase separation? To this referee, that is the question that needs answering. In the absence of breaks, there are two Rad52 diffusion populations (D= 1.2 and 0.3 um2/s), which the authors attribute to monomers and multimers. They don't verify these multimers by alternative approaches (say number and brightness analysis), but it seems like a reasonable possibility. After a break, a third component - slower than the previous two --becomes evident. This slow population coincides with the break. In the vicinity of the break, there is now only 1 component diffusion (D=0.03 um2/s). Also, the motion is now more confined, but not absolutely so. Also, Rad52 diffuses faster than Rfa1, which is bound to ssDNA. At this point, there is no data to distinguish between two possibilities: slow diffusion or diffusion + binding. Except, if it were diffusion + binding, one might perhaps expect to still see the free diffusion component. However, I can imagine lots of different scenarios and a range of binding affinities and multimer states that would make that analysis an unholy mess.
The authors then turn to diffusion at the boundary (Fig. 5), which I agree can be a more informative measure. Here, they see changes in the diffusion estimator for trajectories which cross the boundary, using displacement which they argue is more robust for slow diffusion. The problem is that the 'boundary' is determined by the very thing they are trying to measure, not some independent marker of the compartment. In other words, Rad52 defines the compartment, unless I missed something fundamental in the experimental design. Ideally, the way such an experiment would be done to test the hypothesis that Rad52 is forming a LLPS compartment is to look at the diffusion of an inert tracer as it comes in and out of the compartment. As designed, I frankly do not see how the observation of different diffusivities in and out of the compartment distinguishes between a cluster of binding sites and an LLPS. If you accept that DNA-binding is in no way biasing the kinetics, then the authors' interpretation seems like the most sensible one. But the fact that Rad52 is involved in DNA repair makes that a hard assumption to swallow.
Furthermore, I'm not sure I entirely grasp the significance of Fig. 6. Since Rad52 can easily escape one focus and enter another, regardless of whether it is a cluster of binding sites or a phase, I don't see how the radius of confinement measurement distinguishes between these two alternatives. The observation that the foci are 2x larger in diploids but at similar density is compelling, although recent data from the Brangwynne lab point out that conserved density need not be the case (PMID: 32405004).
2) In the syntax of this paper, Rad52 is a client in the LLPS, leaving the question of the scaffold unaddressed. After all, the Rad52 focus ultimately disappears, meaning that something caused this phase to be dispersed. So is RPA the scaffold? It might be possible to address both points 1 and 2 by knowing what is responsible for forming the LLPS in the first place.
In summary, I found the paper to be balanced and rigorous when exploring possible interpretations of the data. Although the authors may feel the preponderance of their data is consistent with LLPS, I don't feel they have nailed it. It's hard to identify a smoking gun. Of their four observations in the discussion only the second is direct, and that observation may have other explanations. However, I am not sure what experiment to recommend which would be definitive. Such is the nature of this field.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
This manuscript is in revision at eLife
Summary:
In this manuscript by Miné-Hattab and colleagues, the authors use single-molecule imaging approaches to investigate local dynamics of Rad52 foci at DSBs in budding yeast, which is an important area of investigation. They show that the dynamics of Rad52 molecules inside foci are consistent with protein movement within LLPS domains, while Rfa1 dynamics are not. Their data also provide supporting evidence to previous observations that repair sites cluster within the nuclei, and suggest that clustered foci behave as larger phase separated structures. While the idea that Rad52 and other repair proteins form phase separated domains is not novel, this study presents higher resolution data in support of this model. The reviewers generally agree that the study is interesting and well conducted, but the conceptual advancement is limited. Specifically, more convincing experiments demonstrating that the observed Rad52 dynamics reflect LLPS are required. Evidence that the dynamics are relevant for DNA repair and genome stability should also be provided. Additionally, the study should be better integrated with previous studies, statistical analyses need to be more rigorous/better presented, and the text should include a clearer separation between observations and speculations.
-
-
www.medrxiv.org www.medrxiv.org
-
Reviewer #3:
This study uses Bayesian inference to estimate the probability of detecting a malaria case and distribution of malaria cases using different surveillance methods in a district in Palawan, Philippines. The authors show that detection of malaria cases depends on household location and cannot be explained by distance to the health centre alone. They also argue that in low endemic settings it is economical to screen health care attendees stratified by their environmental risk (here, 100m proximity to closed canopy forest). The integration of unique high-quality spatial and molecular datasets is compelling. The authors argue that integrating remote sensing into triage for enhanced molecular detection of malaria could be economical in these settings.
Major comments:
1) The explanation of the modelling framework is, as written, hard to follow and reproduce. Examples of where authors could improve clarity: the equations throughout use the same notation to mean very different things (si = patent infection (L380) or diagnostic sensitivity (L394)). The statement '𝑿𝑖𝜿 represents a vector of covariate effects' L383 does not make sense. Is X a specific location and 𝜿 the covariate estimate? It is difficult to understand how models were created and evaluated. The level of detail in the spatial data (Table S1) is insufficient for reproducibility, but could be easily amended to do so. Table 1- can authors list the actual range of these covariates before they are mean-centered and scale. Contextualizing the fixed effect estimates (i.e. distance to a closed canopy forest) is difficult to interpret given that no mean or sd of these distances are given (at least not that I could find).
2) Terminology changes throughout the manuscript, making things difficult to follow. For example, surveillance method 1 is referred to as passive case detection (Line 126), existing passive surveillance systems (Line 131), standard PCD (Line 137). Although one can assume these are all the same, it would help to use consistent terminology for this throughout. Convenience sampling is used throughout, but it's unclear if this is distinct from enhanced surveillance.
3) This is mentioned in the limitation section, but I don't think it gives a sufficient explanation. One benefit of the R-INLA framework is that it can account for spatio-temporal data - why was time of year and temporally relevant environmental characteristics not examined?
4) The authors don't provide convincing evidence that integrating remote sensing into this setting would actually add value. Could health care workers not ask residents if they live next to a big, closed forest? Wouldn't this achieve the same outcome? Wasn't it already known that frontier malaria was a problem here?
-
Reviewer #2:
This is an interesting analysis and it is great to see a modelling analysis that has the potential to directly influence programmatic decisions. The idea of using remotely sensing data to stratify surveillance or diagnostic practices is interesting and scalable. The analyses are clearly described, and I found the use of the probability of detection metric particularly relevant to the types of decisions being made in pre-elimination settings. I have a few minor comments and would be curious if some discussion could be added to how this may be applicable to settings outside of SE Asia.
-
Reviewer #1:
In 'Disentangling fine-scale effects of environment on malaria detection and infection to design risk-based surveillance' the authors analyze data from the Philippines to investigate the utility of landscape data to inform risk-based surveillance programs. The authors use occupancy modeling, a common approach in ecological studies, with health facility data (that combine both passive case detection via microscopy and RDTs with molecular approaches) to analyze the effectiveness of surveillance systems to detect malaria cases. Using cross-sectional surveys based at health facilities and the residence location of sampled individuals, the authors work to develop a method to detect locations with malaria infections. They find that in highly forested areas, there is a higher proportion of infections only detectable by molecular methods.
In general, the authors provide a fine analysis. However, the novel aspects or new insights of this approach are unclear. The authors use a common standard statistical approach, although less common in epidemiology it is very common in ecology, to analyze fairly commonplace data. Their findings are in line with our existing knowledge of issues with enhanced (i.e. molecular) versus standard (RDT, PCR) and ability for ecological/landscape data to help improve surveillance systems. For example, it is not novel that enhanced surveillance would identify a wider spatial distribution than passive case detection since this method should identify more infections. Further, integrating landscape or geographic data to inform risk-prediction is commonly used for malaria or other vector-borne diseases that have an environmental component.
Major comments:
The authors do not provide adequate background on the setting, biases in the data used, and impact of health seeking behavior on their results. The authors find that the detection probability was negatively associated with travel time to the health facility. However, they do not elaborate upon whether this might be true or if health seeking biases from individuals who are from more forested areas and traveling to health clinics. In addition, the authors only analyze a single year of data which prevents any temporal trends to be analyzed or more robust analyses to be performed.
One of the key findings is that the cost per infection detected is less expensive using a risk-based surveillance. However, how do the authors suggest this would be actionable? What strategies would be done to follow-up these infections? Since these results are not about incidence or prevalence, just the presence or absence of at least one case of malaria in a location, how would this be translated into practice? In addition, is it reasonable to assume that molecular diagnostics would be deployed to these types of health facilities? It is already well known that passive case detection is less costly than molecular detection.
The authors do not elaborate on the implications of identifying additional locations where there is a larger proportion of sub-patent infections. Although the overall finding that infections only detected via molecular approaches are more common in forested areas, it is not clear how this would help the program. In addition, the primary outcome measure is the presence or absence of a malaria infection in a location. This is not a common outcome measure and further analyses of how this type of measure would be used and interpreted are needed.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The authors have used a number of different experimental approaches to investigate the actions of LPS (as a model for inflammation) on modifying GABAergic inhibition in the medial prefrontal cortex (mPFC). They conclude that the inhibition of pyramidical neurons is selectively enhanced by the subsequent upregulated levels of GABAAR subunits, glutamine synthetase (GS) and vGAT, and downregulated BDNF and pTrkB levels as a result of microglia activation. Unfortunately the authors use a number of different approaches that preclude comparing results because of the different experimental conditions. For example, IP injection of LPS 2 hours before recording from acutely prepared brain slices is not necessarily comparable to a 20 min bath application of LPS directly onto brain slices. The entry of LPS directly into the brain is likely to be minimal and is not equivalent to the bath application of LPS. In addition, the attenuation of the "sickness behavior" after LPS injection and the attenuation by minocycline (Fig 7) is a fairly old story well studied by Dantzer's group (e.g. PMCID: PMC2683474) and previously shown to be blocked by minocycline (Henry et al 2008 PMID: 18477398).
There are discrepancies in the methods descriptions and details about the conditions. Technically some of the recordings aren't whole cell patch recordings because the pipettes contain gramicidin indicating that these were perforated patch recordings. However it is uncertain which recordings are obtained using perforated patch approach. The authors don't provide enough information on the evaluations of the perforated patch recordings to ensure there were no access resistance problems. In addition there are two different pipette solutions described in the methods. This has to be clarified. The authors also do not provide information on when the animals were sacrificed after the LPS injections and slices were obtained.
Finally the authors describe the actions of BDNF on LPS application on brain slices not on the LPS injection into the animals. They also mention two different concentrations. I am not certain the effects of LPS injection IP in the awake animal are equivalent to the LPS application for 20 min prior to BDNF. Page 6- I don't think the acute application of LPS onto inhibitory interneurons is equivalent to the effects of LPS injection in the whole animal and the preparation of slices leading to recordings from pyramidal neurons. These experiments are unconvincing and would have to be conducted under similar conditions for comparisons to be made.
The authors puff supernatant extracted from PFCs and compare +- LPS. They find a higher amplitude current from the LPS treated mice and interpret this as indicating a higher GABA content. This is insufficient evidence as there are other components in extracts such as this and the authors have no evidence using GABA antagonists that these currents truly are due to GABA-A Cl- channels.
-
Reviewer #2:
The manuscript by Tang et al 2020 entitled "Microglia activation leads to neuron-type-specific increase in mPFC GABAergic transmission and abnormal behavior in mice" investigates how changes in inflammation acutely modify GABAergic neurotransmission in the medial prefrontal cortex. The authors provide evidence that 2h-post LPS systemic injection (i.p.) leads to enhanced mIPSC amplitude and frequency and upregulation of GABAaR, vGAT, and GS protein levels. In addition, BDNF application or pre-treatment with minocycline prevents aberrant GABAergic transmission following LPS exposure. They conclude that microglia are responsible for these changes in neurotransmission. The experiments are generally well-done and the manuscript was nicely written and easy to follow. However, there are significant concerns related to the interpretation that this is a microglial effect. Above all, LPS and minocycline are very blunt and not specific to microglia. Besides their effects on the peripheral immune system, which could also affect the brain, they can also directly affect other cell types in the brain (neurons, glia, vasculature, etc.) in addition to microglia. Therefore, it cannot be concluded, without more cell-specific manipulations, that the effects are attributed to microglia. Other concerns are detailed below:
1) Are changes in neurotransmission restricted and specific to the mPFC or is this a more global disruption in neurotransmission due to full body systemic inflammation?
2) The indicators of microglial activation by immunostaining for Iba-1 and measuring soma size are fairly superficial. More in-depth molecular analyses with more microglia-specific markers would be more informative.
3) GFAP does not label all reactive astrocytes and is therefore not the best indicator of changes in reactive astrogliosis. The authors should include additional markers in their analysis outlined in Liddelow et al. Nature 2017.
4) Behavioral changes, which are largely locomotor, within 2 h post-LPS are more likely a sickness behavior rather than a specific effect of changes in neurotransmission in the mPFC.
5) It is unclear what specific pyramidal neuron population are being recorded in the mPFC. Specifying the layer would be informative.
6) The authors attempt to link the results with BDNF application with a microglial affect. This link is not particularly strong. While there are studies demonstrating microglial BDNF can affect circuits, the majority of BDNF is made by other cell types in the brain, not microglia. Without cell-specific manipulations, the authors should tone down this link.
7) Experiments displayed in Figure 4 should include a minocycline-only condition.
8) It would be informative to perform electrophysiological recordings on organotypic slices treated with minocycline followed by +/- acute LPS treatment.
9) The authors use an interesting method whereby they puff lysate from control and LPS brains to assess the impact on e-phys recordings. Due to the increased inhibitory transmission, the authors conclude that there is increased GABA content. However, it seems there could be other explanations such as other neuroactive factors, including cytokines, that could potentiate GABA transmission. Measuring GABA by, for example, immunohistochemistry could help to address this concern.
10) In several western blot panels the bands are saturated and are, thus, not ideal for use in quantifications.
11) The increase in GABAaR, vGAT, and GS at the protein level within 2 h-post LPS treatment is quite rapid and more typical of immediate early genes (e.g. c-FOS, Arc, etc.). Could the authors comment on this in the manuscript?
-
Reviewer #1:
In this manuscript, Drs Tang and colleagues study how inhibitory synapses are modulated upon intraperitoneal injection of LPS or upon direct application of LPS onto acute slices. The manuscript could certainly be strengthened by addressing the following points, which are all related:
1) The authors seem to consider that microglial "activation" identified by a morphological modification and enhanced Iba1 signal is an homogeneous all-or-none state that can be reached or blocked by different stimuli. Therefore, they compare the result of an "activation" by a 2h intraperitoneal (ip) injection of LPS with a direct 10 min application of LPS onto acute brain slices. However, it is now acknowledged that different stimuli induce different microglial phenotypes (Perry et al. Nat Rev Neurol 2010, 6:193) that may not be comparable. LPS binds to TLR4 protein which is expressed by microglia in the brain, but also by peripheral immune cells such as macrophages. The effects of ip injection of LPS might thus be due to microglia (if LPS pass the blood brain barrier), and / or to an indirect effect of peripheral immune cells activation. The effect of LPS application on acute slices is directly due to the binding to microglial TLR4. At this stage, it seems not possible to rule out the possibility that a signaling molecule coming from the periphery could both activate microglia and modulate inhibitory synapses (see point 2). It is therefore not possible to claim (as in the title) that activation of microglia results in the increase of GABAergic transmission.
2) The authors propose a role for BDNF based on the decrease of BDNF in 2h LPS mice observed by WB (figure 4D). However, they have focussed their WB analysis on this protein and have not examined any other signaling molecules. In figure S3, they showed that LPS increases the mRNAs encoding TNFα, IL1b and IL6. How can they exclude that these proteins are involved in the activation of microglia of microglia and upregulation of GABAR?
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript. Gary L Westbrook (Oregon Health and Science University) served as the Reviewing Editor.
Summary:
The impact of neuroinflammation on brain circuits is an important topic. However, all reviewers had significant and overlapping concerns and were not convinced that the data adequately supported the authors’ conclusions.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
The manuscript lacks a clear hypothesis/message. It is ultimately descriptive and adds very little to our understanding of the role of immune mechanisms in the development of tissue fibrosis (including pulmonary fibrosis). Detailed profiling of the immune populations in the context of the bleomycin-induced fibrosis model has been reported previously (Tighe et al., AJRCMB, 2011, PMID 21330464). Similarly, results of the spatial analysis are also not surprising: the authors used the lung injury model and found an accumulation of the recruited immune cells in the areas of injury/fibrosis. Moreover, spatial methods are lacking appropriate rigor necessary for quantitative assessment (i.e. stereology, see Hsia et al., AJRCCM, 2010, PMID 20130146). As a machine learning methods paper, it also lacks novelty (several dimensionality reduction techniques plus random forest classifier) and not validated using external datasets.
-
Reviewer #1:
This paper uses multiple approaches to study the cellular dynamics of murine bleomycin lung injury as a model for human IPF. Multiple techniques are used for this purpose including multi-parameter flow, histology, data reduction technique, comparative analysis between BAL and lung, non-linear mixed modeling and immunohistochemistry. The results are interesting and propose a staged inflammatory response leading to IPF like pathology. However, the data is very descriptive and does not test a specific hypothesis. In particular, the results do not suggest a particular therapeutic strategy. Addition of a targeted intervention to the experiments would enhance the impact of the work.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
The manuscript uses a large temporal immuno-phenotyping dataset in the broncho-alveolar fluid and lungs of mice given bleomycin, so as to enable the modelling of the localised progression from innate to adaptive inflammation and subsequent fibrosis. While this is an immense amount of work and the analysis is interesting, the concerns regarding rigor in spatial quantification and the primarily descriptive nature of the work make the resultant insights, mechanistic or translational, somewhat too limited for a cross-disciplinary readership.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Luyten et al's study examines the phenomenon of drug-induced post-retrieval amnesia for auditory fear memories in rats, and report that after several experiments using Propranolol, Rapamycin, Anisomycin or Cycloheximide that they essentially observe no disruption of reconsolidation, (i.e., no amnesia). This is a well-executed, written and meticulous study examining an important phenomenon. The author's lack of observing amnesia using these "reconsolidation blockers" highlights an important fact that systemic administration of these drugs at the time of memory retrieval may not robustly influence reconsolidation processes despite what the existing literature may collectively indicate. The author's data clearly indicate this point and it is important the scientific community be made aware of these difficulties in blocking reconsolidation using systemic administration of these drugs.
This group has previously published similar studies disputing similar phenomena. First highlighting a lack of amnesia following the reconsolidation-extinction paradigm and then more recently demonstrating a lack of amnesia attempting to block the reconsolidation of context fear memories. This is now their third installment focusing on Cued fear memories. Certainly, these findings are important, but arguably the novelty of such findings may be diminished a bit. In one of the "control" experiments where the experimenters administer anisomycin immediately post training, they observe a paradoxical result - they observe memory strengthening instead of the expected blockade of consolidation and amnesia. This result highlights a number of things to consider when we interpret these overall results. For one protein synthesis inhibitors(PSIs) are toxic and when administered systemically usually result in inducing the animals to have diarrhea and generally just makes them sick. This of course will make the animals stressed and agitated and result in increasing their stress and likely amygdala activity. All of this could likely be the reason why the animals exhibited memory strengthening or no impairment in consolidation even with a PSI on board. See PMCID: PMC7147976. Figure 6. In this study, they could rescue the impairment of PSI on consolidation by increasing BLA principal neuron firing. Thus an important take away is something like this could easily be happening in the reconsolidation experiments - that there is no blockade because the animals are stressed either due to PSI on board or because some issues with experimenter/animal interactions, etc lead to higher BLA neural activity and rescue of the reconsolidation process.
I don't think the authors go far enough articulating the important differences between systemic and intra-cranial administration of these drugs. Time is a potential factor. Immediate administration of the drug at high concentration in the target brain region (BLA) versus many minutes until the drug gets to the target region with uncertain concentration levels that may not mirror levels reached with intracranial administration. It's unfortunate the authors were not able to include intra-BLA administration of these drugs in this study. I do not necessarily expect them to do such experiments, since they have already done so much and it is not clear the laboratory has the appropriate expertise to conduct such experiments, but this comparison would be helpful.
I think it is important that the authors make some statement of training conditions on cannulated versus cannulated rats. For example, every animal in Nader's 2000 study was bilaterally cannulated targeting the BLA. In contrast every animal in this study underwent no such surgery. I think this is relevant. In my experience non cannulated animals are a bit smarter than cannulated animals and the training conditions across these two differing groups may not equate to the same level of learning. And of course, differences in learning levels can lead to differences in the ability of the retrieved memory to destabilize. The authors mention possibly examining markers of memory destabilization. GluR1 phosphorylation, Glur2 surface levels, protein degradation/ubiquitination have all been used to assess if destabilization has occurred. I do not fully agree with their reasons for not performing such experiments. They could examine some or one of these phenomena across differing training conditions between retrieval, no-retrieval animals. This likely could be informative. However, the authors may not possess the necessary expertise to conduct such experiments, so I'm not stating these experiments need to be completed, but certainly the study could be strengthened with such data.
Experiment 3E - Propranolol without reactivation. I don't see any data for this on the graphs. Am I missing something?
The authors should probably cite this paper too, PMID: 21688892. The authors in this study find no evidence that propranolol inhibits cued fear memory reconsolidation.
-
Reviewer #2:
General assessment:
In this study, Luyten et al. aimed to replicate post-retrieval amnesia of auditory fear memories reported numerous times in the literature. They used a variety of behavioural approaches combined with systemic pharmacological treatments (propranolol, rapamycin, anisomycin, cycloheximide) after reactivation of fear memories. Interestingly, none of the treatments induced a significant decrease of freezing responses during subsequent retrieval tests. Authors strengthened their null results by using Bayesian statistics, confirming the absence of drug-induced amnesia.
Overall, the study is really interesting. Experiments and analyses are very well designed and bring some important findings to the debated topic of post-retrieval amnesia and its clinical relevance.
I have nevertheless several comments for the authors to consider.
-Despite being very detailed, the authors should clarify and uniformize their Methods section and Supplemental information (e.g. number of CS, contexts used...) to improve the understanding of the different approaches. Similarly, methods for the reinstatement protocol (Exp 2) are missing.
-In exp 5, tests 1 and 2 are supposed to have 12 CS each. However, only 8 dots are represented on the graph. Did the authors average some freezing values after the initial 4 first CS presentations?
-There is an obvious difference in baseline freezing response before the test in Exp 7 (Figure 5A-B). Discussion of these differences is an important point and was thoroughly discussed by the authors in the Supplement.
-Ln 384-387: "... additional Bayesian analyses were carried out that collectively suggested substantial evidence for the absence of an amnestic effect". Despite the "substantial effect" given by the meta-analysis, I am a bit confused by the meaning of an "anecdotal evidence against drug < control" reported in half of the experiments. How do the authors interpret these results?
-The effect of cycloheximide on memory consolidation is indeed unexpected. Even if beyond the scope of the current study, what is the authors' hypothesis to explain that cycloheximide in their conditions induced a pro-mnesic effects on the consolidation of fear memories but altered the consolidation of extinction?
-Cycloheximide seemed to induced post reconsolidation amnesia of fear memory after extinction training (Exp 8, Fig 3G) but not after single CS reactivation. Can the authors please develop this point? Is it possible that several presentations of the CS is required to destabilise the initial memory trace?
-
Reviewer #1:
This manuscript provides evidence that drug administration during a reconsolidation window does not necessarily prevent memory recall, as has been shown by many groups. The authors attempted to replicate several published experiments and despite demonstrating that the drugs had other effects on the animals' behavior and physiology (e.g. weight gain), no effects on memory were observed.
The paper is nicely prepared.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
The reviewers all found this thorough report of the failure to replicate drug-induced post-retrieval amnesia to be interesting and the work was viewed as scientifically sound. But they were all concerned that the extent of the advance is not to the level that would be expected. They also raised substantive concerns regarding the reasons for the failures to replicate.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
This manuscript concerns the application of a narrowed mass window DIA method for simultaneous detection of modified (methylated, succinylated, acetylated Arg and Lys) and unmodified peptides in the same MS run. The authors use a combination of synthetic peptide libraries and immunoaffinity enriched samples to compare the performance of several mass windows, ultimately showing improved separation of modified and unmodified (precursor) peptidoforms using a 4 Da separation window. They apply this method with a modified site localization algorithm to identify modification sites that are differentially affected by hypo- and hypermethylation potential in mouse NASH models. These studies reveal potential connections between SAM levels and methylation potential with mRNA translation and acetylation levels. Overall, this work presents a new methodology for simultaneous detection and quantitation of modified proteoforms without requiring parallel runs for enriched and unmodified protein detection. This methodology should be of interest to the proteomics community. Several of the mechanistic connections made in the NASH model are preliminary. There are several other aspects of the method presentation that should be addressed in the comments below.
Major Concerns/Comments:
1) The mechanistic jump from moderate alteration of methylation in three ribosomal proteins to causing decreased mRNA levels is not supported. The authors would need to add significantly more detail on where these modifications are and what quantitative changes are observed, as well as how these changes can affect the function of the protein of interest. Additionally, the claim that using the 4 Da DIA acquisition aids in understanding this mechanism should be expanded.
2) Similarly, the connections listed in the acetylation section are very tenuous. Specific proteins and deacetylases are listed and connected, but other relevant proteins that play redundant or counteracting roles are not considered. A more holistic presentation of sirtuins and hdacs should be included as they will collectively control the acetylation status. Finally, what is the conclusion of this section? That acetylation is lowered due to a series of effects leading to sirt3 mediated deacetylation? This should be supported experimentally if these claims are to be made.
3) Overall, the causal, rather than corrective relationships discussed on the sections focused on quantifying differential methylation/modification present in hypo/hypermethylated mouse models should be changed. For example, the authors make statements like "to determine the role that differential methylation potential plays in NASH...". The altered prevalence of sites is correlated with altered methylation potential, but these data do confirm they are playing a role in NASH. Statements like these should be adjusted.
4) Do the authors integrate information about cleaved peptides? This co-isolation issue is primarily an issue when exactly the same peptide +/- modification is close in chromatographic space. Yet the unmodified version of many of these target peptides will be cleaved by trypsin, creating a completely different peptide. How is this accounted for in data analysis?
5) The authors include a section on modifying the localization algorithm Thesaurus for the modifications studied here. Can the authors discuss these changes so the readers can assess whether these changes are appropriate and how they affect the altered performance?
-
Reviewer #1:
The manuscript by Robinson et al describes improvements to the DIA technique that are focused on enabling the quantitation of peptides bearing subtly different PTMs on lysine and arginine residues. The technique utilizes small DIA isolation windows to avoid co-isolation of precursor peptides whose m/z's are close (i.e. unmethylated vrs monomethylated or mono- vrs di-methylated, etc). The authors demonstrate that it can be utilized on unenriched samples which permits simultaneous assessment of changes to whole protein levels. Furthermore, they extend their localization algorithm (Thesaurus) to utilize these data and show POC by characterizing changes to PTMs in two mouse models of NASH.
The study represents quite a lot of work and it shows a high level of methodological sophistication, however it is quite narrow in scope. It will be of interest to mass-spectrometrists that utilize DIA, but not to a general audience.
Specific concerns:
1) The paper barely acknowledges the fact that peptides modified on lysine and arginine typically don't cleave efficiently with trypsin thereby resulting in missed cleavages. Thus most of the time it's quite simple to distinguish modified from unmodified without the need for narrow isolation windows.
2) DIA can be quite useful, but this reviewer cannot help but think that PRM might be more well-suited to detailed studies of peptidoforms with subtly different PTMs. If PRM is utilized, isolation windows can be as narrow as 1Da so the techniques employed in this manuscript are unnecessary.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 3 of the manuscript.
Summary:
While the Reviewers were in agreement that your paper reports a useful method, they also felt that it was narrow in biological focus and of primary interest to those within the mass spectrometry-based proteomics community. The Reviewers also question whether the method offers substantial advantages over alternative approaches for analyzing Lys/Arg PTMs by MS-based proteomics.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
PREreview of "Analysis of receptor-ligand pairings and distribution of myeloid subpopulations across the animal kingdom reveals neutrophil evolution was facilitated by colony-stimulating factors" Authored by Damilola Pinheiro et al. and posted on bioRxiv DOI: 10.1101/2020.06.19.161059
Review authors in alphabetical order: Monica Granados and Katrina Murphy
This review is the result of a virtual, live-streamed preprint journal club organized and hosted by PREreview and eLife. The discussion was joined by 8 people in total, including researchers from several regions of the world, a preprint author, and the event organizing team.
Overview and take-home message: Pinheiro et al. have made advances in understanding neutrophil evolution and receptor-ligand participation by using a wide range of relational taxonomic data to show how CSF1/CSF1R and CSF3/CSF3R pairings evolved and contribute to granulocyte adaptations. Neutrophils are the most prolific granulocytes of the mammalian myeloid cells involved in the immune response. The research team bridged the gap in our knowledge on how the receptor-ligand pairing signals of CSF1R/CSF1 helped with bone marrow development, where these short-lived cells are generated, and CSF3R/CSF3 signaled the maximum production volume of the neutrophils and their movement as both a cell population and a single cell for distribution. Although this work is of significant importance in the field, below we outlined some concerns that could be addressed in the next version of this manuscript.
Positive feedback:
-(The findings were) Super novel! I love the breadth of taxa that are covered.
-The intersection of cell biology and evolution is quite interesting!
-This preprint could be a great model for future research/analysis.
-The bolded subtitles for the different results sections were clear and helpful!
-Increased understanding in neutrophils is important because children with immature neutrophils end up with recurrent early-onset life-threatening infections, e.g. severe congenital neutropenia. The more we can learn about neutrophils the more we can take steps to fight this type of infection.
-I believe there is sufficient information in the materials and methods section to allow for the reproduction of the experiment.
-The format made sense and the flow could be followed.
-Cells have a tendency to call out domestic and evolutionary elements which are beneficial, so learning how receptor and ligand interactions evolve in different taxa is relevant.
-It's interesting that gene complexes are associated with specific morphological aspects (e.g. exotherms and endotherms); the gene expression is obvious.
-Figure 1 was cool to see. An expansion of Figure 1 might be of interest, where the phylogenetic tree changes over time to show the loss and gain of specific granulocytes.
-Gene sequencing data was pulled from NCBI Gene and Ensembl databases to create Figure 2a. This is a great example of having a very specific question/hypothesis that can be answered with existing data.
Can other types of physiology be tracked similarly in future research, e.g. scales, breathing - anything that could be mapped?
Are there other groups that could relate to metabolism e.g. brain studies?
It would be interesting to see the level of degradation, e.g. for fish - mapping physiology to a specific gene or brain size (the brain is more developed in different taxa).
-The preprint can be relevant for myeloid phagocyte development and across species geometric morphology/computational anatomy particularly as it can relate to brain structure and sizes. More genetic data across species and homologous brain areas is helpful.
-Overall there was a connection between the results and the research questions, yes, I would say the conclusions were supported by the data.
-Even though we don't have this specific field expertise, as a group, we recommend this manuscript to both others and further peer review.
Major concerns:
-Since this is a large selection of taxa groups, can specification (of a subset) be divided into more detail?
-Please note, taxonomy is not a field I am familiar with. It would be helpful to check the sequence conservation of the receptors across these taxa families and see whether there are any minor evolution instances where they mutated. If the receptors have mutated, do they have a particular residue that mutated?
Acknowledgments:
We thank all participants for attending the live-streamed preprint journal club. We are especially grateful for both the first author's contributions to the discussion and for those that engaged in providing constructive feedback.
-
Reviewer #2:
The article is well written.
1) Please provide a supplementary file containing all the references used for Figure 1b (complete blood count data; CBC). This would be a useful source of data for researchers interested in other blood cell types.
2) Regarding the CBC data - the authors should mention in the text if all the samples were obtained from adults. Whilst I appreciate that n are low for some species, do you obtain the same result if you analyse males and females separately? This may be worth mentioning given that neutrophil numbers have been reported to be higher in women.
3) Please provide a supplementary file containing all the NCBI gene and Ensembl accession numbers for each gene, in each species (Figure 2a).
4) The authors may want to mention that there are other receptors for IL-34 which may explain its expression (in fish, Fig2a) in the absence of Csf1r.
5) Please provide a supplementary file containing all the NCBI protein accession numbers used for Fig3a.
6) Please include isotype controls on histogram in Supplementary figure 1a, 1c and 1d.
7) Please include the full gating strategy for Supplementary figure 1a.
8) Why was 72h chosen for the mobility assays (Supplementary Fig 1b)? At this point, monocytes cultured in CSF1 would begin differentiating into macrophages, and this may affect their mobility.
9) Supplementary Fig 1c - please include the antibodies in the Lin cocktail for flow cytometry in the figure legend.
10) Please mention in text and figure legend that human blood was used (there is no mention of it within text).
11) Was a dead cell exclusion dye used for flow cytometry of human blood and neutrophils? And did you look at FSC-A v FSC-H to exclude doublets? If not, how can you exclude the possibility that the Cxcr4 hi neutrophils are not dying or doublets?
-
Reviewer #1:
Pinheiro and colleagues have described a fascinating view on the evolution of neutrophils and other myeloid cells. This is a very original and potentially important piece of work. To follow neutrophil evolution in the evolutionary tree through co-analysis of the expression of G-CSF/G-CSFR and M-CSF/M-CSFR in the same tree is smart and interesting. The article is not easy to read and some issues need some more clarification(s). So the article would benefit when (random order):
1) At several locations in the article the authors imply that G-CSF is inducing differentiation fitting with an inductive model (eg. introduction lines 41-51). At the same time the authors rightly mention the presence of mature neutrophils in G-CSF-/- mice (as well as mature eosinophils in IL5R-/- mice) more pointing at a stochastic model. This latter model assumes that expression of CSF-R's is more random, and only committed progenitors expressing these receptors will proliferate rather than differentiate in response to these CSF. Please provide sufficient arguments for the inductive model or change part of the interpretations when a stochastic model is more likely.
2) In the whole article data are provided on numbers in peripheral blood. Only a minority of myeloid cells reside in the blood, the majority is in the tissues. The situation with neutrophils is uncertain. Please discuss.
3) The part on C-EBP transcription factors is difficult to follow. Please help the reader understand why they are so important (based on KO strategies) while there is no clear picture in evolution as the genes are sometimes present, sometimes not. Some species have many, some only one. Simply stating redundancy in the system does not really fit the knock-out studies.
4) The part described in lines 372-409/Supplemental figure 1 is not adding much to the article. It is only human with no evolutionary perspective. Consider removing.
5) Please provide some more insight into the issue of eosinophils versus neutrophils. Now it is implied that the co-evolution with endothermia is relevant. Many articles suggest that eosinophils are more specialized in killing large targets (extracellular killing/e.g. parasites) vs neutrophils small targets (intracellular killing/e,g, bacteria). Can the authors provide their ideas about the functional difference of the cells in the evolutionary perspective.
6) line 466: it is stated that neutrophils comprise the largest population of myeloid cells in mammals. This needs supportive evidence, as macrophages are thought to be the largest population at least in the tissues.
7) lines 582 - 585. Although the issue of the lamins is well taken formal proof that the segmented nuclear morphology of neutrophils is important for movement and trans-cellular migration is yet to be determined (e.g. J Immunol January 1, 2019, 202 (1) 207-217; DOI: https://doi.org/10.4049/jimmunol.1801255 ).
8) Lines 61-64 young children with SCN often have mutations in the ELANE gene rather than the GSF-R gene. Can the authors discuss how ELANE fits with the model they are presenting?
9) Please provide the definitions of neutrophils and heterophils as they can be present as different cells in the same species.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
Pinheiro and colleagues have described a fascinating view on the evolution of neutrophils and other myeloid cells. The authors used a wide range of relational taxonomic data to show how CSF1/CSF1R and CSF3/CSF3R pairings evolved and contributed to granulocyte adaptations. This is a very original and potentially important piece of work that sheds light into the evolution of mammalian neutrophils.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
I like this paper. It clearly and succinctly presents an interesting and (to my knowledge) novel mechanism for proofreading that is distinct from typical formulations, that decouples the enzyme itself from the proofreading functionality (essentially modularizing the proofreading mechanism). The derivations and figures explore its possibilities and physical limits in a fairly convincing fashion (subject to several minor quibbles I detail below), supporting the conclusions. This mechanism significantly broadens the scope of systems that could enact proofreading, and allows tuning of the proofreading by regulating activity or concentration of gradient maintainers or enzyme, thus promising significant implications.
My two main suggestions are to give more context about (1) the effect of enzymatic catalysis on the resulting spatial distributions and (2) the relative costs of the two most prominent energy-consuming processes needed for this scheme. Specifically:
1) The entire manuscript assumes that catalysis is negligible and thus need not be explicitly modeled in solving for the steady-state distributions. How would incorporating a boundary condition at the right that involves non-negligible catalysis change (even qualitatively) your findings?
2) When quantifying the energetic costs, the main text solely focuses on the cost of counteracting the enzyme binding substrate, diffusing, and releasing. The SI explores some theory for the other cost of maintaining the substrate gradients, but without reporting any absolute numbers. For the biologically plausible kinase/phosphatase substrate-maintenance mechanism explored in the main text, how does its cost compare to the cost that you study quantitatively in the main text?
-
Reviewer #2:
In the manuscript by Galstyn et al on "Proofreading through spatial gradients", the authors proposed and studied a new kinetic proofreading (KP) model/scheme based on having a spatial gradient of the substrate (both "correct" and "wrong" ones) and the diffusive transport of the substrate-bound enzyme molecules to a spatially localized production site. The authors did an excellent job in explaining their new model and its connection and difference w.r.t. the classical Hopfield-Ninos KP mechanism. The key insight is that with spatial inhomogeneity, e.g., in the presence of a persistent spatial gradient for the enzyme or the substrate, one can consider spatial location as a state-variable. By having the substrate and product (or production site) at different spatial locations, these spatial degrees of freedom of the enzyme, i.e., enzymes at different physical location, can be considered as the intermediate states that are necessary for kinetic proofreading - each intermediate state contributes a certain probability for error-correction. In the original Hopfield-Ninos KP scheme, the intermediate state is provided by additional enzyme(s), whereas in this new KP scheme, it depends on having a spatial gradient, which the authors argue is more tunable. I like the theory for its simplicity and elegance. I have only a few mostly technical questions/comments.
My main concern for this study, however, is about how relevant this mechanism is for realistic biological systems. The original Hopfield-Ninos KP mechanism was motivated by specific and important biological problems (puzzles), namely the unusually high fidelity in biochemical synthesis process (in comparison with its equilibrium value). In this MS, the theory is developed without a specific biological system or specific biological question in mind. It is true that spatial gradient exists across biological systems and the authors also showed that typical kinetic rates may fall in the functional range of this new gradient-dependent KP mechanism. But, what is the function of the original system that such a kinetic proofreading process can help improve? Is it biochemical synthesis? Do the authors envision "correct" and "wrong" biomolecules being produced at the production site (x=L) like in the original setting of Hopfield-Ninos? Or is it signaling like in the T-cell signaling case? If so, do the authors envision that both the correct signaling molecule and the incorrect signaling molecule have a spatial gradient and they can both be carried by the same enzyme to their functional sites? I am not asking for a detailed comparison with a specific system, but I think a known but unsolved biological phenomenon that may be explained by this new mechanism would really help motivate a biologist audience. Furthermore, a connection to a specific biological system could also lead to testable predictions that would ultimately verify (or falsify) the existence of this mechanism.
Questions related to the model/theory:
1) In this study, there is a production r for the enzymatic reaction at x=L where the enzyme is active. However, the effect of this reaction, which change ES-->E+P, is not considered in the model equations (1-3). Is it because r is considered to be small? If so, smaller than what? Since speed is directly related to r, how does the value of r affect the speed and the speed-accuracy trade-off?
2) The nonmonotonic dependence of fidelity on the diffusion time for finite gradient as shown in Fig. 3c is intriguing. What determines the optimal diffusion constant (or diffusion time) when the fidelity is maximum for a given gradient length scale?
3) The study of trade-off among energy dissipation, speed, and fidelity is quite nice and adds to a growing list of study on performance trade-off's in nonequilibrium systems. For example, a similar energy-speed-accuracy (ESA) trade-off was studied systematically in the context of adaptation in bacterial chemotaxis (Lan et al, Nature Physics 8, 422-428, 2012) and chemosensory adaptation in eukaryotic cells (Lan and Tu, J R Soc Interface 10 (87), 2013). In particular, the exponential dependence of the fidelity on power consumption (energy dissipation) shown in Fig. 4 in this MS agrees well with results in these earlier studies (see Fig. 3c and Eq. 5 in Lan et al, 2012; Fig. 4 in Lan&Tu, 2103). It would be informative to discuss the trade-off found here for the gradient-dependent KP scheme in comparison with similar trade-off relations in other systems.
4) The power dissipation P is computed by Eq.8 in this MS. Where does Eq. 8 come from? What's the physical meaning of P? The standard way to compute energy dissipation is by computing the entropy production rate S', which is well defined. Then by assuming the internal energy does not change with time in steady state, we equate energy dissipation with kT*S'. The form of entropy production rate is known and can be found in text book (such as those from T. Hill) and papers (e.g., those from H. Qian and collaborators; and from U. Seifert and collaborators), and the formula given in Eq. 8 does not seem to be consistent with the known form of entropy production. In particular, for a given reaction with forward flux J+ and backward flux J-, the entropy production rate is: (J+-J-)ln(J+/J-), which can be easily shown to be positive definite and only =0 when detailed balance J+=J- is satisfied.
Overall, the MS provided a new gradient-dependent scheme for error correction in chemical systems. The study of trade-off among energy dissipation, speed, and fidelity (accuracy) in this new mechanism is also valuable for the general study of cost-performance relation in non-equilibrium systems. My main concern is the lack of examples of specific biological systems where this gradient-dependent error correction mechanism could be at work to enhance the specific biological functions of these systems.
-
Reviewer #1:
The authors proposed a new theoretical mechanism of kinetic proofreading based on spatially distributed biochemical systems. This concept is novel and distinctive from existing models of proofreading, although it is not yet proved experimentally. The writing is clear, concise and elegant. There are no logical flaws, and I really enjoyed reading this manuscript. Yet, I have a number of comments to be addressed, which will substantially increase the quality of this manuscript.
1) P. 1. The same concentration profiles are assumed for the right substrate R and the wrong substrate W. This is a strong assumption, could the authors consider the case where the concentration gradient length of the wrong substrate profile is larger than this length for the right substrate but still smaller that the distance L? They may calculate a series of the fidelity curves with increasing Lambda_W and the same Lambda_R. How will proofreading change?
2) P. 2. "The scheme proposed here does not rely on any proofreading-specific structural features in the enzyme; indeed, any 'equilibrium' enzyme with a localized effector can proofread using our scheme if appropriate concentration gradients of the substrates or enzymes can be set up. As a result, spatial proofreading is easy to overlook in experiments and suggests another explanation for why reconstitution of reactions in vitro can be of lower fidelity than in vivo." The key is the difference in the off rates for the right substrate R and the wrong substrate W, k^W_off >k ^R_off because W & R compete for E. This has to be mentioned in the above statement.
3) P. 2. "To demonstrate the proofreading capacity of the model, we first analyze the limiting case where substrates are highly localized to the left end of the compartment, lambda S << L." However, Eq. 5 is derived assuming that not only lambda s << L, but also lambda S << lambda ES (see Appendix).
4) P. 3. "... a red curve on the plot, is reached in the limit of ideal sequestration, ... " The word sequestration has a different meaning in biochemistry, e.g., it is used to describe 'sequestration' of an enzyme by the substrate/product or an inhibitor, which is not what the authors have in mind. They use 'sequestration' to describe the ideal substrate localization, Lambda_S -> 0. Put aside that this use of 'sequestration' is not the best choice, the authors need, at least, to explicitly define what they mean under 'sequestration'.
5) Fig. 3. Please explicitly define Veq speed (when k^W_off = k^R_off). In addition, how a black dotted curve is obtained is not explained, and the corresponding parameters are not given.
6) P. 5. "an enzyme E that acts on active forms of cognate (R) and non-cognate (W) substrates which have off rates 0.1 s−1 and 1 s−1, respectively (hence, theta eq = 10)." This implies a large difference in the free energy of binding of more than 1kcal/mol. In the absence of ATP/GTP hydrolysis, the difference in the binding energies is usually small. Can the authors give a specific example for an enzyme system where the difference in the free energy of binding is more than 1kcal/mol with no ATP/GTP hydrolysis?
7) Pp 5- 6. "As expected, proofreading by these gradients is most effective when the enzyme-substrate binding is very slow, in which case the exponential substrate profile is maintained and the system attains the fidelity predicted by our earlier explanatory model (Fig. 5b). .... If the binding rate constant (kon) or the enzyme's expression level (r_E) is any higher, then enzymatic reactions overwhelm the ability of the kinase/phosphatase system to keep the active forms of substrates sufficiently localized (Fig. 5c) and proofreading is lost." This is not entirely clear because the gradients depend on the phosphatase activity, whereas the authors did not mention that they likely assumed that when the substrate is bound to the enzyme, it is protected against the phosphatase.
8) Appendix D. The authors have to also consider or at least discuss the different diffusivities for phosphorylated and unphosphorylated substrates, a feature of many spatially distributed system and cite [FEBS Letters 583 (2009) 4006-4012] where this case was considered for dynamically stable spatial gradients.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript. Ahmet Yildiz (University of California) served as the Reviewing Editor.
Summary:
In the manuscript by Galstyn et al on "Proofreading through spatial gradients", the authors proposed and studied a new kinetic proofreading (KP) model/scheme based on having a spatial gradient of the substrate (both "correct" and "wrong" ones) and the diffusive transport of the substrate-bound enzyme molecules to a spatially localized production site. The authors did an excellent job in explaining their new model and its connection and difference w.r.t. the classical Hopfield-Ninos KP mechanism. The key insight is that with spatial inhomogeneity, e.g., in the presence of a persistent spatial gradient for the enzyme or the substrate, one can consider spatial location as a state-variable. By having the substrate and product (or production site) at different spatial locations, these spatial degrees of freedom of the enzyme, i.e., enzymes at different physical location, can be considered as the intermediate states that are necessary for kinetic proofreading - each intermediate state contributes a certain probability for error-correction. In the original Hopfield-Ninos KP scheme, the intermediate state is provided by additional enzyme(s), whereas in this new KP scheme, it depends on having a spatial gradient, which the authors argue is more tunable. The reviewers were enthusiastic about the theoretical model presented in this study because of its simplicity and elegance. However, the reviewers have also raised serious concerns that need to be addressed. In summary, the panel feels that discussion of possible biological example(s) where this novel type of proofreading may be occurring would significantly improve the manuscript's appeal to a broad audience. In addition, the reviewers ask for more explicit explanation of the effect of enzymatic catalysis rates, and discussion of the full dissipation cost.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In their manuscript 'The adaptive architecture is shaped by population ancestry and not by selection regime,' Otte and colleagues use an evolve and resequence strategy to examine the response of a Portugal population of D. simulans responds to cold temperature. The authors identify putative targets of selection and compare the number of targets, their location, and the distribution of selection coefficients to previous work on the same population exposed to hot temperatures as well as a different population exposed to hot temperatures. The topic is of general interest, the work is sound and the writing is clear and concise.
1) It is not clear what the novel contribution of this manuscript is. The title indicates that the key finding is that population of origin mediates response to selection rather than the selection regime. However, the authors fail to provide compelling data to support that. The data are from 1 population under two selection regimes and a second population under one of those regimes. There simply aren't enough comparisons to infer that population ancestry plays a bigger role than selection regime in adaptive evolution.
2) The authors also seem to argue that a contribution of this paper is that it illustrates that temperature adaptation is not a single trait. This was the major finding of a 2014 paper from the same group in D. melanogaster- a single founder population was exposed to hot and cold temperatures and the authors found almost no overlap between the putatively selected variants in the two different temperature regimes.
3) Beyond the limited impact of the current work, there are some additional specific issues. The authors note that it was 'remarkable' that the distribution of selection coefficients and the number of inferred selection targets between the hot and cold experiments was 'highly similar.' What is the null expectation? Where does the null come from?
4) The discussion is somewhat unsatisfying and largely speculative. The 'different trait optima' section reads as straw man; this could be reframed to better guide the reader. There is little support for the 'differences in adaptive variation' hypothesis. The section on LD was interesting, but the simulation findings should reside in the results section.
-
Reviewer #2:
Overall Review: This is another commendable study from the Schloterer lab that features next generation genome-wide sequencing of multiple evolving populations. It compares results obtained with two different selection regimes, one hot and one cold, and two different founding populations of Drosophila simulans, one from Portugal and one from Florida. The results reveal a lack of consistency among selection regimes and founding populations. Temperature-dependent adaptation is shown to be "local" or "contingent," rather than globally consistent. My chief recommendations concern the experimental and theoretical contexts within which this study should be interpreted.
Major points:
1) I do not require any additional data collection or statistical revision. My comments are organized in terms of experimental paradigm (A) and theoretical significance (B).
A.
2) The typical paradigm for experimental evolution in this and many other labs is the use of hybrid populations created from isofemale lines. This method for founding experimental populations can be expected to generate some degree of random "historicity" as the isofemale lines approach fixation of specific genotypes with high stochasticity. Then there are further stochastic and historical effects which arise when such lines are hybridized. The strengths and limitations of this paradigm should be addressed. Most importantly, such stochastic historical effects might be the source of the discrepancy between the replicate lines derived from Portugal and Florida.
3) As the authors themselves point out, there is a comparative difficulty arising from the different scales of replication used for the Florida versus Portugal experiments. A further question for large-scale experimentation is whether a larger and uniform level of replication might produce more similar results, such as 20 evolving populations from each source. Or indeed, three sets of ten evolving populations from three distinct founders from the two sources, with a total of 60 evolving experimental lineages. The authors should discuss whether they believe that their findings would hold up with such an expanded experimental protocol.
4) The authors themselves point out at one point that their experiments might have benefitted from some phenotypic characterization of the presumed temperature adaptation. That raises the more general question of how the field of experimental evolution can progress with some labs just doing phenotypes and other labs just doing genome-wide sequencing. Surely this and other studies would be strengthened by combining the two types of assay. Furthermore, genomic evolution might be usefully analyzed in terms of the degree to which specific genomic changes can be associated with specific phenotypic changes, as that is the foundation for adaptation itself.
B.
5) This is yet another study that finds difficulties with the invocation of noroptimal selection along a one-dimensional functional gradient. Such models have been long-standing favorites of evolutionary theorists, such as Kimura and Lande. But that preference may arise more from the ease with which these models can be formulated and analyzed by theoreticians. Actual evolving populations don't seem to embody the precepts of such theory, whether the issue is the maintenance of genetic variation (see the work of Turelli, for example) or the evolution of closely studied populations, as illustrated by this study. An alternative point of view that the authors should discuss is that such models are indeed NOT usually correct.
6) There are alternative theoretical frameworks that address the maintenance of genetic variation and the response to selection. Among these are schemes of protected polymorphism arising from overdominance, epistasis, and frequency-dependent selection. If the thrust of the preceding point 4 is accepted, then it would be theoretically salient for the authors to suggest what type of underlying population genetic machinery would best account for their findings, in place of the noroptimal selection-mutation balance model.
-
Reviewer #1:
Otte et al. used an evolve and re-sequence strategy to explore "the genetic architecture of adaptive phenotypes". The authors previously found different genetic architectures across different founder populations evolving in a common hot environment. The authors chose one of these founder populations for replicated experimental evolution (5 replicate populations) in a cold environment for 50 generations. The authors were surprised to discover the same number of loci evolve under strong selection between the hot-evolved and cold-evolved replicate populations, though the 20-ish loci are largely non-overlapping. The distribution of selection coefficients was also similar. They interpret this commonality as evidence that the founder population history has a larger effect on adaptive architecture than the selection regime.
The study demonstrates a comprehensive effort to discover the number of genome regions and distribution of selection coefficients that emerge from a highly controlled experimental evolution project. The experienced team applies a sophisticated toolkit to this powerful experimental design - a toolkit that grows ever more sophisticated with each new experimental run that they perform. However, the authors set me up to learn why such different adaptive architectures emerge from different founder populations. Ultimately, the researchers acknowledge that they "cannot pinpoint the cause for the differences in the inferred adaptive architecture..." Some results simply recapitulated the previous Portugal E&R study and other results recapitulated a D. melanogaster E&R study. I did not find the "common adaptive architecture" across different selection regimes to be a particularly compelling discovery of sufficiently broad interest. Other concerns and questions can be found below:
Major concerns:
1) Pg. 4: It is my understanding that the power of multiple populations from a single founder evolving in parallel allows for more rigorous identification of loci targeted by selection. I found it surprising to discover that if a lack of replication emerges from an experimental evolution study, this outcome is interpreted as "genetic redundancy." First, genetic redundancy has a precise definition in genetics that muddles the author's meaning. And second this interpretation seems rather post-hoc.
2) To "shed more light on the different selection responses" is a weak motivation. The introduction sets me up to understand why selection responses are so different but no major insights into the "why" emerge from the cold-adaptation experiment.
3) More explanation of figure 1 in the main text is needed. Does each point correspond to a SNP that consistently changes across all five populations? Or is this the union?
4) Line 210: How did the researchers define "stress" and determine that the degree of stress is equivalent across two temperature regimes? The absence of these data undermine the potency of the comparison.
5) How can the authors be sure that the only difference between the hot and cold populations was temperature? Was competition/population size/etc held constant? Might the lack of overlap between hot and cold adapted loci stem from one such regime selecting for a different phenotype? (i.e., not temperature tolerance)
6) Line 237: The authors assert that most alleles show a temperature-specific response - a discovery with precedent in the literature, including from this team of researchers. The authors attribute the absence of common loci between temperature regimes to the high number of generations (50) compared to the number across seasons cited in Bergland et al. The researcher could easily look for common targets at earlier time points of experimental evolution to test this idea.
7) Line 292-293: This section reads as disingenuous - the researchers could have explored overlap between Portugal and Florida founders using only the selected loci coordinates and look for non-random overlap using simulations/resampling tests.
8) Discussion: The speculation about why such different architectures emerged across Portugal and Florida was diluted by the absence of initial fitness estimation upon subjection to a cold environment (which would have offered evidence for different initial "optima" across founder populations) as well as the change in fitness from generation 0 to generation 50.
9) The simulations and corresponding discussion would make for an interesting review/opinion piece but not as new results for this manuscript.
Minor Comments:
1) Pg. 3. The recurrent citation of Barghi et al. in the Introduction undermined the reader's impression that fundamental questions are being addressed in this article
2) Lines 33-39: The argument that parallel signatures of selection across distinct natural populations are insufficient to address the polygenic basis of adaptive phenotypes, and so comparatively more contrived E&R studies are required, was unconvincing.
3) Line 158: Confusing. Should "among" actually be "within"?
4) Line 486: I believe that the authors would be hard-pressed to find in the literature a paper declaring that "single population...[is] sufficient to understand the genetic basis of adaptive traits".
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript.
Summary:
The reviewers agreed that the study was well-executed and offered important insight into how decisions around experimental set up affect the outcome of experimental evolution studies. Ultimately, however, there was consensus that the results failed to support the broadest conclusion that ancestry is more important than selection regime. Moreover, given previously published reports on experimental evolution from your group and others, the current study lacked sufficient novelty.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The paper from Itoh is a thorough and interesting analysis of a mechanistic dissection of the underlying cause of Dupuytren's Disease (DD). One exonic SNP is associated with the disease and this mutation changes a residue in helix C of MMP14, a major collagenase, from Asp to Asn. Interestingly, helix C is distant to the catalytic center and the authors show not unexpectedly that recombinant mutant forms of the protease bearing the mutation have identical gelatinolytic and collagenolytic activity in solution. However, in the cell membrane bound form, collagenolysis is markedly reduced. The authors discuss several possibilities for this centering on the potential impaired ability to form dimers. Dimerization and collagen binding has been shown by many groups (please cite some other groups and not just your labs work) to be important for collagen triple helicase activity. This is then suggested to be the underlying cause of the defect in collagenolysis (that then leads to impaired collagen turnover and hence the build up of collagen at several locations in these patients with DD).
As always there are several points that need addressing to make this a truly nice piece of analysis and data. The major criticism resides in the very nice patient data presented in figure 5. This is key to the whole paper but sadly the authors actually ignore what is shown and drive forward with their own interpretation of the underlying mechanism.
Major comments:
1) It is quite clear from a variety of approaches used in the detailed analyses in Fig 5 that there is a strong difference in the degree of enzyme activation occurring in the patient and normal cells comparing AA, which shows the predominant fully active ~51k form vs GG very low amounts perhaps 5% of the mutant when on the cell surface. (the gels are poor quality and so the estimate of MW is difficult to be sure). Thus, the simplest explanation for the reduced collagenolytic activity of the patient is that there is less active protease, without invoking alternate mechanisms. Nonetheless, I understand why the authors investigated dimerization and hemopexin domain interactions and that is fair enough. BUT, those data and interpretations need to be placed in context with fig 5. The interpretation is that other effects occur that alter the activation of MMP14 buy furin or in its cell surface protein protein interactions or with the plasma membrane
2) Relatively few analyses have been performed of the critical residues in collagenases for collagenolysis. In MMP8 re the S3' site reveals the importance of specific residues in contacting collagen for cleavage (Pelman) that apparently is not important for the mutation under study in the present paper as 237 is distant from the active site on Helix C. Notably, 237 lies in an interesting sequence: DDDRR in which one of the Asp couples to the active site in triple salt bridge relay commencing from the NH2 of the F/Y at the start of the catalytic domain after correct activation, and this is needed to fully activate MMPs. This work by Stoecker should be referenced (though it is not in relation to MMP14 it is a general principle for all MMPs). Please discuss this D as it may affect the electrostatic environment of the 273 position and so reduce catalytic potential. While evidence presented does not indicate this (for collagen and gelatin) there are no kcat/km determinations which are needed to quantify the effect of the mutation.
3) However, the 273 position is potentially close to the top (blade I) of the adjacent hemopexin domain that the authors know very well is key for collagenolytic activity. The authors posit quite correctly that the mutation may affect the interaction with the hemopexin domain and I totally agree. Collagenolytic activity is difficult and precision in protein contacts is likely needed for catalysis to occur. A model of the catalytic domain contacting the hemopexin domain in blade I is needed to help interpret this. See Zhao et al 2014 (http://dx.doi.org/10.1016/j.str.2014.11.021 ). With the Xray scattering data this appears to be a potential mechanism for disruption, not just dimerization. Please include in Fig 1 a model of the full length MT1-MMP and the site of 273 in relation to the top B strand of blade I for the potential interaction by modelling. Arg 360 by eye might be a potential interactor. Though there are two other Arg that may be involved perhaps R 330, R343 and R345? Please investigate this as it will be interesting.
4) In this regard, a major oversight has been the lack of reference to the very good analyses of MT1-MMP membrane association by Marcinket al (2019) Structure 27: 281-292.e6. This reveals the membrane binding associations of blade III and IV of the Hx domain which differentially orients the protease on the surface and hence to collagen. An earlier paper by the same group (http://dx.doi.org/10.1016/j.str.2014.11.021 ) also has been ignored (above). These analyses are extremely detailed with amino acid resolution and much could be gained by interpreting these contact residues between collagen and the hemopexin domain and the domain and lipids and hence how it interacts with the catalytic domain where the mutation resides. This must be done in depth to be fair to other work and also for deeper biological insight to the mechanism of collagenolysis in general and in these patients in particular. The membrane association may also drive or supplement dimerization.
5) I have a serious issue with the fusion construct used in Fig 6. "The Fc part of these chimera molecules enforces the ectodomain of the enzymes to form a disulfide bonds-mediated stable homodimer (Figure 6B), thus allowing the determination of the molecular shape of the MT1-MMP homodimer". How can the authors conclude this? A dimer certainly is formed but its orientation may be totally different from the natural situation where no SS bridge occurs and potentially is in a different orientation. This is a serious caveat that must be clarified to interpret the nice data otherwise in Fig 6.
6) Only indirect evidence presented that the mutation does not affect dimerization. Please show gel filtration of the complexes or other means to clarify the dimer vs monomeric forms of the WT, mutant and 1/1 heterodimers as this is an obvious and important likely mechanism to explain the phenotype.
7) It is amazing that the allelic frequency is 0.20. So why does the heterozygous phenotype that the authors investigate in the recombinant experiments show up more in the population?
-
Reviewer #2:
The work contains interesting features, but several aspects of the work are more perplexing than insightful. The authors identify a SNP in MMP14 that occurs in 30% of the population that negatively affects the collagenolytic activity of the encoded gene product, i.e., MT1-MMP. They then propose that the resulting D-to-N mutation may play a role in the pathogenesis of Dupuytren's disease (DD). First, while the title states the the SNP variant causes " .. a defect in collagenolytic activity (that) confers the fibrotic phenotype of DD" , the findings are more appropriately described as having established a correlation between defects in collagenolytic activity and the fibrotic phenotype of DD. However, no data have been presented that document a defect in collagenolytic activity in DD pts harboring the SNP. Indeed, it remains unclear as to whether type I collagen is the key substrate in DD. Given that MT1-MMP can hydrolyze an almost bewildering array of non-collagenous substrates (both cell-surface, secreted and plasma-derived), it is difficult to rule out the possibility that that the D-to-N mutation does not more profoundly affect the hydrolysis of an alternate target. It would be interesting to know if there are changes in gene expression when COS cells are transfected with wt vs the SNP variant of MT1-MMP and cultured on plastic (or even with an E-to-A mutation in the catalytic domain). Second, these concerns notwithstanding, if one were to assume that type I collagen is the critical target, the underlying mechanisms that impact collagenolytic activity are unclear. The authors document complex changes in MT1-MMP processing and cell surface expression in combination with structural changes in the soluble homodimer. Yet, when the soluble variant was shown to express normal type I collagenolytic activity, a conclusion was reached that enzyme activity is likely affected "only when the proteinase is expressed on the cell surface." Possibly, but how do we rule out effects on MT1-MMP exocytosis, endocytosis,trafficking or post-translational modifications in the tail, hinge region, etc - or as mentioned above, hydrolysis of an alternate - and potentially more important - target?
-
Reviewer #1:
In this paper the authors focus on a mutation of MT1-MMP that seems to be associated with Dupuytren's Disease (DD). Using overexpression systems and cells isolated from patients they provide evidence that a major defect of the mutant form of MT1-MMP is it’s reduced ability to activate MMP-2 activation and in turn collagen degradation. Although interesting, the paper presents major shortcomings.
-All the results obtained are based on in vitro experiments and most of the studies are dependent on overexpression systems.
-The effects of mutant MT1-MMP on MMP2 activation are not as impressive as the authors claim. No statistical analysis is provided for Fig. 2B (MMP2 activation in cells expressing WT or mutated form of MT1-MMP) and it is not clear if the changes in MMP2 activation observed in Figure 3B (pro-MMP2 activation in cells from patients) are indeed significant. From the graph presented it does not seem to be the case. If this is the case, then the major point of the paper is indeed not corroborated by strong evidence.
-The authors propose that WT and mutated MT-MMP might form a dimer and the mutated form might act as dominant negative. IP is shown only with anti-FLAG antibodies. Reciprocal IP with anti-myc should also be shown. Also different stringency conditions should be employed to determine the 'strength' of this potential heterodimerization. Importantly advanced FRET-based techniques should be used to study and evaluate heterodimers in the plasma membrane.
-The title of the paper is misleading as these only in vitro based studies do not allow the authors to conclude that the An SNP variant MT1-MMP with a defect in its collagenolytic activity confers the fibrotic phenotype of Dupuytren's Disease. To answer this key question a vertebrate animal model needs to be provided.
-Figure 5 needs better controls and/or quantification. The IF provided is not convincing and the authors need to provide loading controls of 'surface' proteins. Importantly statistical analysis needs to be provided to determine whether the changes observed are significant and important.
In conclusion it is felt that the major conclusions of this paper are not based on convincing data and more analysis needs to be done in order to determine how exactly the mutated form of MT1-MMP might lead to DD.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
Although the reviewers recognize that the paper contains interesting features, they also addressed major concerns and pitfalls with the study, including: 1) the overall significance; 2) lack of in depth mechanism whereby MT1-MMP variants might alter collagenolytic activity; 3) lack of functional studies with cells isolated from DD patients; 4) the importance of type I collagen as a key substrate in DD remains unclear; and 6) lack of solid evidence that MT1-MMP itself plays a key role in DD.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
This study provides experimental evidence that, in contrast to a currently accepted view, some sensor histidine kinases exist in more than one oligomerization state and that a monomer-to-dimer transition might play a role in signal transduction. Such transition is well documented for eukaryotic signal transduction systems, but not in prokaryotes. Thus, the findings reported here open an avenue to a broader investigation of this phenomenon and its potential generalization.
My only major comment is the inexpert level of bioinformatics analysis. While all specific concerns seem minor (listed in the corresponding section below), taken together they amount to a bigger problem, particularly with presentation. On the other hand, none of the shortcomings with the bioinformatics part seriously affect major conclusions of this study.
-
Reviewer #2:
Manuscript Summary:
The manuscript by Dikiy et al. extends previous investigations from the Gardner lab on the oligomeric states of histidine kinases containing photosensing LOV domains (LOV-HKs). The Gardner lab had previously characterized two dimeric and one monomeric LOV-HK from Erythrobacter litoralis. In the present study, they perform sequence analyses to identify soluble LOV- and PAS-domain containing HKs similar to the previously characterized monomeric LOV-HK EL346. They characterize the photocycle, oligomeric state, and autophosphorylation activity of several of these HKs. Finally, noting that one dimeric LOV-HK (RH376) has three small regions of sequence that are absent from the monomeric EL346, they delete these regions individually and in combination to generate a set of mutated RH376 proteins that they characterize.
General Assessment:
The results of this study are consistent with previous studies from the Gardner laboratory, indicating that functional LOV-HKs can exist as monomers, dimers, or mixtures of both. Perhaps unsurprisingly, the effects of deletions engineered to identify determinants of dimerization do not clearly align with any simple hypotheses and limited insights are gained. Overall, the study would benefit from greater precision in the writing of the manuscript, greater rigor in experimental design and analyses of data, and restraint in tempering conclusions to better align with the data.
Major Comments:
1) The introduction could be improved by more precise language (see details in Minor Comments).
2) Details about the autophosphorylation assay should be provided. Specifically, the concentrations of proteins used in the assays need to be specified, unless the stated concentrations are the final concentrations in the assay, in which case this needs to be more clearly indicated. The extremely low concentration of ATP (3.6 uM) is problematic. Even for initial rate determinations, ADP generated during the reaction will likely inhibit phosphorylation under these conditions.
3) Figure 1. Given the substantial domain rearrangements that are known to occur during signaling, it would be helpful to specify the signaling states depicted in the schematic structures.
4) Line 231 subtitle and lines 257-258. This conclusion seems to be somewhat overstated given the small number of proteins examined. Within Table 2, one of three EL346-like LOV-HKs is monomeric and the same is true for the three LOV-HKs examined. This ratio of 4:2 dimers to monomers does not seem sufficient to conclude that LOV-HKs are generally dimeric.
5) Lines 270-274 and Fig. 3b. How do you know that the plateau is indicative of phosphatase activity rather than a simple equilibrium due to the presence of ADP in the reaction mixture (either as a contaminant in the ATP or generated during the reaction)? A minimum of 3 replicates should be shown with error bars. Which data from the two-trials were used to reach the conclusion of a 1.5-fold difference in activity? More rigorous statistics should be employed.
6) Lines 274-279 and Fig. 3b. It is not clear from the description of the assay in the Methods section what concentrations of HKs were used in the assays. If concentrations were not similar for all proteins assayed, differences in rates are likely to result from different amounts of ADP generated during the reaction.
7) Lines 278-279. It is a big leap to conclude that monomer-dimer transitions may be a regulatory strategy based on the observation of different rates of autophosphorylation. What concentrations of monomer and dimer proteins were used in the assays? And if the oligomeric state is used as a regulatory strategy, how? Do you envision some mechanism that regulates the oligomeric state and this in turn regulates autophosphorylation? (This is eventually addressed in the discussion. Perhaps the statement about a regulatory strategy should be withheld until the Discussion>)
8) The sequence of the loop in DHp and CA domains of HKs has been used to predict cis- vs. trans- mechanisms of autophosphorylation. Please comment on the loops in the LOV-HKs. Presumably all monomeric HKs would have loops consistent with a cis- autophosphorylation mechanism. Are they similar in monomeric and dimeric LOV-HKs?
9) Fig. 4. What are "monomer-1/dimer-1" and "monomer-2/dimer-2"? Why is there such a large difference in the activities observed for -1 and -2? Also, the y-axis in the graph in Fig. 4b appears to be mislabeled as "Concentration".
10) Fig. 6. A minimum of 3 independent activity assays should be shown and statistical tests should be applied to determine the significance of the observed differences, especially given the large variations in the data.
11) Lines 330-332 and Fig. S4. The absorbance profiles clearly differ between the proteins. How much variation would be necessary to claim that a protein was non-functional? Indeed, in the next sentence, it is acknowledged that flavin binding is adversely affected. If so, then what is meant by "the deletions do not perturb the folding and function of the LOV domain"?
12) Lines 368-369. What experiments address the sufficiency of either RH1 or RH3 for dimerization? The rationale for this statement is not clear.
13) Fig. S6. It is not conventional to introduce new data within the Discussion. Perhaps this figure should be moved to the Results.
-
Reviewer #1:
The main objective of this study was to investigate a possible relationship between oligomerization and regulation in histidine kinases. To this end the authors identified novel LOV and PAS sensor kinases based on sequence homology searches with HK EL346, a soluble monomeric HK that senses blue light through a LOV domain. To study the monomer-dimer transition as a possible regulatory mechanism they try to "monomerize" a dimeric LOVHK, named RH376, by deleting three regions that could be determinants of the oligomeric state. Nevertheless, the authors found that none of these deletions disrupt the dimeric state of the protein. The conclusion of the work appears to be that multiple domains contribute to dimerization and function of HKs.
This manuscript is experimentally well done and well written. First the authors show that Non-Lov PAS-HKs show a mix of monomers and dimers, both of which are active. Then, the study is focused in the LOV HKRH376 and in deletions RH1-RH3 and a double mutant RH1+3. RH1 and RH2 are active dimers while RH3 remains largely dimeric and is inactive. Finally, the double mutant is an inactive monomer. The major conclusions of this manuscript are that multiple regions determine oligomerization in this family of HKs and light-induced conformational changes have a complex relationship with autophosporylation and do not appear to be restricted to the oligomerization state. In summary, I found that the data, although technically sound, don´t provide mechanistic insights in the regulatory mechanism(s) of sensor kinases.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript. Michael T Laub (Massachusetts Institute of Technology) served as the Reviewing Editor.
Summary:
This study provides evidence that some sensor histidine kinases may exist in more than one oligomerization state and that a monomer-to-dimer transition might play a role in signal transduction. The results are consistent with and extend prior work from this lab and will be of interest to those studying two-component signal transduction.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Bissett and colleagues provide an in-depth assessment of the stop signal task implementation in the ABCD protocol. Given the importance of the data set itself, as well as current trends in research funding, there are several important lessons to be learned here, both regarding this specific task implementation, as well as with respect to task designs in large-scale data collections in general.
-
Reviewer #2:
This paper reports a thorough critique of the ABCD stop-signal data set. It identifies a set of eight problems that severely limits the utility of the ABCD stopping data. In particular, the first two (which are essentially the same problem) invalidate estimates of SSRT based on the independent race model because of violations of the context independence assumption of that model. The remaining issues are more minor in the sense that while potentially problematic they either affect a very small percentage of the data and so can be dealt with by removing the affected trials or participants, or do not appear to be problematic in practice.
The authors have provided a valuable service to the research community in systematically and thoroughly cataloguing these issues, although we think it is fair to say that a number of people (including the present reviewers) have been aware of the key design issue caused by the stop signal replacing the go signal for quite some time and have been working on solutions.
Below we have a few suggestions for clarifications, but overall the paper is very clear and well written.
Although the paper mentions that "new models for stopping must be developed to accommodate context dependence (Bissett et al., 2019), the latter of which we consider to be of utmost importance to advancing the stop-signal literature", it does not discuss such models and neither does it show the potentially severe consequences of context independence violations in the ABCD data set.
All our more substantive comments relate to "Retroactive Suggestions For Issue 1". First, the authors write: "Given the above, if analyzing or disseminating existing ABCD stopping data, we would recommend caution in drawing any strong conclusions from the stopping data, and any results should be clearly presented with the limitation that the task design encourages context dependence and therefore stopping behavior (e.g., SSRT) and neuroimaging contrasts may be contaminated".
We feel that this recommendation is too lenient and would suggest the following alternative: Unless the ABCD community conclusively shows that the design flaw does not distort conclusions based on SSRT estimates (or any other stop-signal measure), researchers should not use the ABCD data set to estimate SSRTs at all.
Second, the authors suggest removing subjects who have severe violations as evidenced by mean stop-failure RT > mean no-stop-signal RT. We are concerned that this recommendation impacts on the representativeness of the sample. Also, this recommendation ignores the fact that violations are not an all-or-none phenomenon but are a matter of degree and can come in varying shapes and sizes.
Third, the authors recommend that "any results be verified when only longer SSDs are used, perhaps only SSDs > 200ms". Figure 3 does not seem to support the recommended cut-off of 200ms: at 200ms accuracy is still far from asymptotic.
In general, we feel that recommendations based on removing participants and trials are not sufficient. Such practices will affect the representativeness of the sample and will increase estimation uncertainty and hence decrease power. We believe that the only way to solve Issue 1 is by developing measurement models that can account for the dependence of the go and the stop process.
-
Reviewer #1:
General assessment:
The paper points out eight design issues observed in the stop signal task of the longitudinal Adolescent Brain Cognitive Development (ABCD) study by Casey et al. (2018). The issues are ordered by importance and are partially interrelated. The paper is written in a very clear and non-redundant style and makes a number of suggestions on how to deal with the various issues. The points made in the paper are well-taken. Moreover, the preprint of this paper has already elicited a reply by authors from the ABCD study leading to some partial adjustments of the design of the stop task.
Major comments:
1) As the authors suggest, the most important issue is the potential violation of the context invariance assumption due to the variability of the go stimulus duration across different stop signal delays (SSDs). This is a plausible concern even if the number of "clear" violations is relatively small (447 out of 7231 subjects). Nevertheless, the authors' point would be made even more convincing if they could point to some (simulation?) results showing the effect of a weaker go signal at short SSDs on the estimate of the stop signal response time (SSRT).
2) I suggest using the term "context invariance" instead of "context independence" , in order not to confound the assumptions of 'context' and 'stochastic' independence in the Logan-Cowan race model. It should be pointed out that the prediction of the race model concerning faster stop failures than go responses is conditional on both context invariance AND stochastic independence between go and stop signal processing being true (see Colonius & Diederich, 2018, Psych. Review).
3) I have no further major comments but would like to suggest a further analysis: Let us suppose, as the authors point out, that the RT distribution of responses to the go signal is indeed affected by the duration of the go signal. As a first approximation, let us assume that the observed RT distribution is a binary mixture of responses: slow RTs to a weak/short go stimulus and fast RTs to a strong/long gos stimulus. Without making specific assumptions about the two components of the mixture, one could employ a mixture distribution test first suggested by Falmagne (1968, British J. Math. Statist. Psychology): The RT ("density") distributions, plotted separately for each SSD and go signal trials, should all cross at one and the same point in time. Of course, this is not a foolproof test but if some evidence in favor of this prediction is found it would strengthen the authors' point.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 4 of the manuscript. Birte Forstmann (University of Amsterdam) served as the Reviewing Editor.
Summary:
This paper focuses on one of the benchmark magnetic resonance imaging (MRI) datasets, the so-called Adolescent Brain Cognitive Development (ABCD). In total, eight design issues observed in the stop signal task of the longitudinal ABCD study by Casey et al. (2018) are pointed out. The design issues are described in detail, ordered by importance, and a number of suggestions are given on how to overcome potential limitations. Given the importance and prominence of the ABCD study in the field of cognitive neurosciences, both the reviewers and editors believe this paper to highlight essential issues in a constructive way. Finally, we believe this paper will elicit a fruitful discussion including the adjustments of the design of the stop signal task.
Overall, this manuscript is well written, interesting, timely and will help resolve the debate in the field. We have the following suggestions to improve the manuscript.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The work by the group of Andries Bergman investigates the heterogeneity of macrophages in prostate cancer. They identified three macrophage subsets in tumorigenic tissue, which were also present in adjacent areas. All three subpopulations were clearly distinct on the molecular level, however, none of these subsets had a clear M1 or M2 phenotype. Accordingly a gene signature could be extracted that correlates with metastasis-free survival of patients and might have prognostic value.
Even though the manuscript is interesting, well written and the finding that no clear difference in macrophage composition is evident between adjacent and tumorigenic areas is surprising and new, the paper is not sufficient in its current form to fully support the presented messages.
Main points:
1) The authors state that they identified three distinct populations of tissue resident macrophages in prostate tissue, independent of the localisation. This finding is surprising, since an accumulation of monocyte-derived tumor(-associated) macrophages can be observed in almost all tumors. According to the material and methods section, the authors did not digest their tissue. What is the impact of digestion vs. non-digestion on macrophage recovery from human prostate tissue? Is it possible that especially tissue-resident macrophage subsets embedded in the parenchyma were missed? A detailed flow cytometry experiment needs to be performed in order to identify the most sensitive but at the same time most efficient isolation procedure that captures all possible macrophage subsets. Advanced flow cytometry with a broader antibody spectrum (e.g. CX3CR1, CD11c, CD14, CD16....) needs to be used to characterise the myeloid composition in more detail. Maybe even more sophisticated methods like CyTOF are advisable and recommended (See et al., 2017).
2) The authors call the identified cells "tissue resident macrophages". However, a closer examination of the genes in the identified clusters suggest, that cluster 0 might refer to (monocyte-derived) macrophages (identified by Cx3cr1, Ms4a7, Trem2, C1q; Chakarov et al., 2019), cluster 1 to cDC1 dendritic cells (identified by Flt3, Cd207, Fcer1a, Clec10a; Heger et al., 2018; Dutertre et al., 2019) and cluster 2 likely to extravasated monocytes (high levels of S100A genes, Ifi30 and Lyz; Kapellos et al., 2019). Therefore, maybe only cluster 0 reflects true (interstitial?) tissue resident macrophages. Accordingly, the bioinformatic analysis has to be strongly intensified and the data needs to be compared to other recently published work in order to identify for instance the signatures of tissue-resident macrophages, interstitial macrophages, monocyte-derived cells and monocytes. The authors have to familiarise themselves with the common nomenclature and the state-of-the-art identification of human mononuclear phagocytes (including cDCs) based on their transcriptomic signatures.
3) The authors speculate in the discussion part that the tumor influences distant macrophages through tumorigenic factors, which might be of prognostic value. In order to make such a statement, the authors have to show the transcriptome signature of macrophages isolated from tumor-free patients. Only a direct comparison between 'healthy' and 'tumorigenic' tissue can uncover tumor-dependent effects on macrophage transcriptomes and composition.
4) Close histological examination with subset specific markers needs to be performed to show that indeed no cellular difference exists between the localisation of macrophages in adjacent and tumorigenic areas. This should be compared to 'healthy' tissue (see previous point).
-
Reviewer #2:
The manuscript is a single-cell RNA-seq approach to macrophages (CD3- and CD14/CD11b+) from prostatic adenocarcinoma tissue as well as adjacent non-tumorous prostate tissue. The authors find that three RNA-seq-defined macrophage subset clusters were found in both tumour and adjacent prostate in varying proportions in their patient series. These clusters show only weak associations of expression of genes related to the 'M1' and 'M2' macrophage activation status. They also show no differential association of expression of genes involved in T cell response regulation. One cluster appears to show evidence of NF-kappaB and WNT signalling but little interferon signalling, while another shows strong interferon signalling but poor WNT signalling, and the third cluster ('cluster 1') appears likely to consist of cells in cycle. These are intriguing populations for further work.
The authors then derive a differentially expressed gene signature, and show that it correlates with clinically relevant parameters in publicly available data sets. These correlations are very interesting from a translational perspective.
The data are substantive, and provide a valuable resource database for the transcriptional landscape of prostatic monocytic cells. However, the findings remain primarily empirical correlations at this stage, with very limited mechanistic implications.
1) The patient numbers analysed are very small. There are only four clinical samples (with three biopsies each) from which both tumour and non-tumour tissue has been used. There are no prostate samples without tumours similarly analysed to provide any indications about the 'normal' (and perhaps true 'tissue-resident') macrophage populations of the human prostate. It is thus difficult to interpret the monocytic cells analysed as blood-derived or of tissue-resident origin, limiting mechanistic speculation. It is also not clear if the observed patterns of monocytic lineage subsets are generated in patients prior to or after initiation of malignancy.
2) The cell numbers analysed are quite small as well. From four patient samples analysed, a total of 641 cells have been used for the RNA-seq-based analysis. This means an average of about 160 cells per patient sample, including both tumour and non-tumour tissue (an average of eighty cells from each location, perhaps). This seems a relatively thin basis for major interpretations.
3) Further to the above concern, there is no indication of the immune cell infiltrate density, especially monocytic cell density, in the various individual tumour samples, nor any analysis of the landscape of the immune cell infiltrate, for correlation with the monocytic lineage transcriptional groups for further mechanistic speculations. This is, again, compounded by the availability of only four patient samples.
4) There is no independent validation that there are indeed three monocytic subsets in prostatic tumours with clustered differential protein expression of interferon, WNT and cell cycle pathways, leaving the functional assumptions without rigorous support.
5) There is no clarity regarding the macrophage gene signature derived from the integrated dataset. As a result, while there is translational value to its associations with clinically relevant parameters, the biological interpretation remains unclear, since it is not clear that these genes are not expressed in non-monocytic cells in prostatic tumour biopsies, especially given that the differential expression consists of genes in the NF-kappaB, WNT and interferon pathways.
-
Reviewer #1:
In this manuscript Siefert et al., profile human prostate cancer-associated macrophage subtypes by single-cell RNA-seq. This analysis identified three major sub-population of macrophage (cluster-0, 1, and 2) in human prostate cancer and adjacent normal tissue. Next, the authors investigate the association of macrophage subtypes with recurrence and metastasis in independent prostate cancer cohorts. This leads to the identification of CSF1R+ (cluster-0) macrophage as a cell type associated with early recurrence and metastasis in prostate cancer. Overall this is an interesting study, however, in the absence of specific presence and/or enrichment of cluster-0 in tumor tissue it is not clear why these macrophages lead to early relapse or metastasis in prostate cancer. Moreover, the absence of any validation and/or functional analysis further diminish the broader implication of this observation.
1) Overall, the authors have employed very good QC parameters to filter superior quality cells. However, they detected batch effects in data (patient-specific clustering) and therefore employed batch correction methods. Unfortunately, after batch correction, they fail to detect tumor-specific macrophage populations in prostate cancer. The authors' reason that this could be due to the broader effect of 'tumor' on the adjacent normal ecosystem. However, in the absence of a comparison between macrophages from normal prostate and prostate tumor, it's difficult to conclude that tumors influence the macrophage in adjacent normal tissue. Given the well established phenomenon of tumor-associated macrophage this observation is surprising and an alternative explanation could be possible artifacts induced during the batch correction (i.e. integration) leading to the removal of subtle differences between tumor vs adjacent normal macrophages.
2) This study identifies three major sub-population of macrophages in prostate cancer. Authors discuss the limitation of M1/M2 nomenclature to define macrophage spectrum, which is evident from their analysis as well. However, they also don't provide a marker-based nomenclature of these macrophage clusters. It will be beneficial for the community to know the specific markers of these macrophage sub-populations which will be important for flow-cytometer or imaging-based validation of these populations. It is really important to validate the identity of single-cell RNA-seq clusters by flow or imaging analysis. However, the lack of validation remains one of the major limitations of this study. Not sure given the COVID situation it is possible but it will be very beneficial for the community.
3) It's not clear how cluster-0 macrophage leads to early relapse or metastasis. Given the higher expression of TNFa and IFN-g in cluster-0, it will be beneficial if authors can provide some discussion on this. Moreover, since cluster-0 is not unique to the tumor, does the frequency of these cells changes in the tumor ecosystem when compared to adjacent normal tissue? This quantification will be important to understand the possible implication of these cells in early relapse or metastasis.
4) A recent study by Huang et al., (Cell Death and Disease 2020) demonstrates the role of CCL5+ TAMs in promoting prostate cancer stem cells and metastatic phenotype. Do cluster-0 macrophages express CCL5 or any other marker which may facilitate replacement and metastasis.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
While we all considered the value of the dataset as a useful resource for the community, providing a transcriptional landscape of prostatic monocytic cells, we all agreed that the study remains too descriptive and primarily empirical correlations at this stage, with very limited mechanistic implications and validation. In addition, the lack of healthy control, an incomplete bioinformatical analysis (batch effects, other MPS cell clusters like cDCs), missing validation, and a limited number of cells/patients dampened the enthusiasm of all the reviewers.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
General assessment:
The work presented is a major scientific achievement. This is the first functional reconstitution of any CO2 concentrating mechanism. The work has major implications for engineering of CCMs into crops for increasing yields: the authors have definitively identified a set of components that confer CCM activity in a heterologous host. As a bonus, the authors demonstrate a new way of generating a Rubisco-dependent E. coli.
The writing is generally clear. The claims are well-supported by multiple lines of evidence. The engineered Rubisco-dependent E. coli showed clear improvements in growth phenotypes after introduction of H. neapolitanus CCM genes, which were then confirmed using thorough genetic and biochemical analyses.
Major comment:
The control EM images in Figure 5 should be present in the main figure, not a supplement. It is concerning that the positive control failed. It should be repeated, or, if possible, it would really help to show TEMs of WT H. neapolitanus. This would allow comparison of the putative carboxysomes to a native carboxysome and would greatly improve the quality and value of this figure.
-
Reviewer #2:
The manuscript by Flamholz et al. is a significant and excellent piece of work that is very novel and would have wide appeal to a range of microbiologists and general biologists. The manuscript is well written and represents a very interesting and largely complete set of data.
It was an ambitious goal to convert a model bacterium such as E. coli into a system that is able to grow with dependence on the CO2-fixing enzyme Rubisco, and a basic Calvin Cycle. The authors have achieved that, and as expected these engineered cells required a very high 10% CO2 for optimal growth. No LB media was required except for addition of some minimal salts and glycerol. Without added CO2 growth does not proceed with glycerol alone. Next, and Importantly, they then asked if they could add a basic CO2 concentrating mechanism (CCM) from a sulphur bacterium (Halothiobacillus) so that the E. coli cells could scavenge and accumulate enough inorganic carbon (CO2 /bicarbonate) to grow at air levels of CO2 (namely 0.04% CO2). Some 20 genes were required to make up this basic CCM work, namely a complete carboxysome operon, genes for a Ci pump (DabBA2), Rubisco genes, phosphoribulokinase, and engineered removal of both carbonic anhydrase genes from E.coli as well as riboseP-isomerase. The growth rate of cells at air was relatively slow, but shown to be at an expected rate based on modelling. Ultimately this work has implications towards the question of whether a basic CCM could function in a plant chloroplast and provides a boost to photosynthetic CO2 fixation. It seems to support this goal.
Curiously, the complete 20-gene system did not initially allow growth at air CO2 levels, but did work after a series of directed evolution experiments in bioreactors that led to some minor mutations. It is noted that one of these changes was the transfer of a the high copy number origin from one plasmid to the other, while some were 'regulatory' elements within the pCCM and pCB plasmids, then designated as pCCM' and pCB' plasmids after mutations. The authors should provide more detail on the net result of these mutations, as to whether expression was altered upwards or downwards for the two key plasmids? QPCR would be adequate.
One of the remarkable achievements in this manuscript is to mark out the necessary changes to convert an enteric bacterium into an organism that is dependent on Rubisco for CO2 fixation/carbon gain at limited CO2 levels (and glycerol as an initial carbon backbone). No more than 20 genes are required, possibly less, and clearly all the primary genes to assemble one example of a functional alpha-type carboxysome is now proven because of this experiment. Though there are likely to be some general chaperones required that the host provides.
-
Reviewer #1:
The photosynthetic efficiency of C3 plants is largely limited by the catalytic inefficiency of rubisco, the CO2 fixing enzyme in the Calvin-Benson-Bassham cycle of photosynthesis. Since rubisco can also react with O2, bacteria, algae and C4 plants have evolved CO2 concentrating mechanisms (CCMs) to increase the concentration of CO2 around rubisco. The CCM promotes carboxylation and inhibits the competitive oxygenation reaction of rubisco. Transplanting CCMs into C3 crop plants is considered a promising strategy to improve rubisco's photosynthetic performance. Bacterial CCMs consist of two essential components: inorganic carbon transporters at the membrane and the proteinaceous shell organelle, carboxysomes. Reconstitution of carboxysomes in E. coli and tobacco have been previously reported, however, there is no report of a functioning reconstituted CCM.
In this paper, the authors introduced 20 CCM-related genes from the proteobacterium H. neapolitanus into E. coli cells which have been engineered to be dependent on rubisco function for growth. Their results show that at most 20 genes are sufficient to generate a bacterial CCM which enables E. coli to grow at ambient CO2 concentration due to efficient fixation of CO2 by rubisco. This manuscript provides a useful platform for future investigations to establish the minimal number of genes required for transplanting the cyanobacterial CCM into non-native autotrophic hosts to improve their CO2 assimilation and growth.
Major comments:
1) For the benefit of a non-expert reader, the names of the 20 proteins and corresponding genes should listed in a Table, together with their function and the relevant references.
2) In Figure 3-figure supplement 1A, the authors should discuss why the gene csos1D is present in both pCB and pCCM.
3) In Figure 4B, the large variance in the OD600 after 4 days for CCMB1:pCB'+pCCM' cultures was explained as being due to genetic effects or non-genetic differences (line 1064). However, in Figure 3 - figure supplement 2B the measured growth kinetics did not show such big differences.
4) The negative control in Figure 5-figure supplement 1 is too dark and difficult to compare with the other micrographs. Moreover, to observe recombinant carboxysomes in the positive control (WT:pHnCB10), the authors should have induced the cells using a lower concentration of IPTG as reported previously by Bonacci et. al. (PNAS 2012).
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In this manuscript, Peng et al. report three cryo-EM structures of the yeast V-ATPase holoenzyme, two without VopQ and one bound to the bacterial effector VopQ at 3-3.5A resolution. These structures reveal different functional states of the complex, with the ATPase sites adopting either closed or open conformations, supporting a rotary catalytic mechanism proposed previously. Compared to published structures of V1 or V0 subcomplexes and of the rat holoenzyme, the novelty of the authors' study lies in resolving the regulatory subunit H bound to the yeast holoenzyme at near-atomic resolution. Surprisingly, however, little mechanistic insight is provided by the authors into how this key regulator controls V-ATPase activity. For example, what is the structural explanation for why subunit H is essential for holoenzyme activity? How does subunit H inhibit ATP hydrolysis in the V1 subcomplex?
Major comments:
1) The authors refer to states 1, 2 and 3 throughout their manuscript, without ever introducing these states or explaining the differences. While experts in the V-ATPase and F-ATPase field may be familiar with these states, the manuscript in its current form is not well accessible for non-experts.
2) It is unclear why the V0V1 sample without VopQ was prepared with AMPNP, but the one with VopQ contained an equimolar mixture of AMPNP and ADP. For better comparison of both structures, it seems it would have been more appropriate to use the same nucleotide conditions. Related to that, the authors state that VopQ locks the holoenzyme in state 2. How can the authors exclude that the addition of ADP caused this effect, especially since VopQ seems substoichiometric (see below)? If VopQ stabilizes state 2, how is this achieved?
3) The density for VopQ in the authors’ structure is extremely weak, indicating only a subpopulation of particles actually contains VopQ. The authors should try focused classification to better separate VopQ-bound and -free holoenzyme.
4) Page 6: "Therefore, our data also suggests that subunit H is present in possible disassembled V1 subcomplex and in the holocomplex, ..." It is unclear how the authors' structures or ATPase data allows this conclusion. The authors should explain.
5) The authors identify specific interaction pairs between subunit H and subunits in V0 and V1. How do mutations at these interfaces affect V-ATPase holoenzyme stability and activity? Mutational analyses would provide an important validation of the structures and insights into the mechanism by which subunit H regulates V-ATPase activity.
6) The authors mention differences in the stator subunits between the rat and yeast holoenzymes. It would be worthwhile including a figure of this comparison.
7) The atomic models for the three related cryo-EM structures are poorly refined, with clash scores of >40, ~1.5% Ramachandran outliers and 16-17% rotamer outliers. The proteins and ligands in the various models also have unusually low B-factors for the reported resolutions. The authors must properly refine their atomic coordinates. It is also unclear why three different map sharpening factors are listed for each EM map.
-
Reviewer #2:
In this manuscript, the authors describe cryo-EM structures of the assembled yeast V-ATPase in the presence of the inhibitory nucleotide AMP-PNP and in the presence of VopQ, an inhibitor recently shown to bind to the Vo sector. The structure is reported to be of higher resolution than previous cryo-EM structures of the same yeast enzyme in three rotational states (2015) and the yeast V-ATPase containing the Stv1 isoform (2019), both reported by the Rubinstein lab. As in those structures, there are areas of lower resolution, and the catalytic hexamer shows the highest resolution. Three distinct conformations were observed in the Rubinstein Vph1-V-ATPase cryo-EM structure, potentially corresponding to three rotational states. Here only two states are observed, possibly as a result of the presence of the inhibitory nucleotide. VopQ inhibition of the intact V-ATPase only occurs in the absence of ATP hydrolysis, and the VopQ-V-ATPase structure, obtained in the presence of AMP-PNP and ADP, appears to enrich the State 2 conformation. However, the VopQ itself is very poorly resolved. Overall, AMP-PNP-bound and VopQ-containing V-ATPase structures do provide some new information, particularly the side-chain interactions with subunit H, but several claims are overstated.
The following issues should be addressed:
1) The authors do not give sufficient credit to previous work. The statement on lines 50 and 51, "We describe the cryo-EM structures of the first intact eukaryotic holoenzyme V-ATPase complex (V1Vo)..." is simply not true given the previous yeast structures from the Rubinstein lab. The main advance here is in improved resolution (from 6-8 A to 3.1-3.5 A) for two of three rotational states. Overall, the authors need to do a better job of highlighting what is really novel in their study, starting in the Abstract, which does not highlight the new information in the structures here.
2) The absence of the third rotational state (State 3) is attributed to disassembly of the V-ATPase (lines 64-66). However, this does not make sense given the fact that all three structures were found in the previous studies, and that V-ATPase disassembly is actually inhibited when ATPase activity is inhibited. Instead the absence of this state (which is consistently the least represented) must be associated with either the AMP-PNP inhibition or the number of particles visualized.
3) From their recent structures showing VopQ binding to the membrane Vo subcomplex, it was expected that VopQ would bind to State 2 of the holoenzyme. Unfortunately, the inhibitor could not be visualized well in the context of the intact enzyme, but there appears to be an enrichment and/or stabilization of State 2 of the V1Vo. However, the VopQ-V-ATPase samples also contain both AMP-PNP and ADP, so the authors should at least discuss whether it is the ADP or the VopQ that led to the stabilization of State 2 (especially given apparent low occupancy of VopQ). This structure did allow more detailed view of the subunit side chain interactions with subunit H than was possible previously. However, the suggestion that this structure was the first demonstration that subunit H was present in the holoenzyme (lines 107-109) is not correct, as this subunit co-purifies with intact V-ATPases and was present in previous structures.
4) The suggestion in lines 214-217 that this is the "first direct observation of various conformations of subunit pairs in a V-ATPase holoenzyme" is overstated. Conformational changes due to nucleotide binding have been visualized in even higher resolution crystal structures of the conserved bacterial (E. hirae) V1 (ref. 14).
-
Reviewer #1:
Structures are reported of yeast V-ATPase. They are similar to previously reported structures of rat and human V-ATPase, and are consistent with previously established mechanistic models. The major advance is that the new structures include subunit H, which is required for activity of the holoenzyme but inhibits ATPase activity in the isolated V1 component. Unfortunately, the structures do not indicate a mechanistic basis for subunit H activity. Another new feature of the current structures is inclusion of the bacterial effector VopQ, which was previously visualized binding to two sites on the isolated V0 subcomplex. Unfortunately, the density of VopQ in the current structures appears to be extremely poor. In summary, although the visualization of subunit H is an advance, the relative lack of new mechanistic insight from the current study diminishes my enthusiasm.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
-
-
www.biorxiv.org www.biorxiv.org
-
Author Response
Reviewer #1:
Major comments:
1) The title and the conclusion that SON and SRRM2 form nuclear speckles are not supported by the data. The data show that SON and SRRM2 are necessary for nuclear speckle formation. They do not rule out that another factor is necessary, such as SRRM1, which interacts with SRRM2 and itself harbors an intrinsically-disordered domain. That is, the authors have not shown that SON and SRRM2 are also sufficient for nuclear speckle formation. Such a test is necessary to draw the strong conclusion the authors make, and precedence for such a test has been established in the study of Cajal bodies. Specifically, central factors to Cajal body formation were shown to nucleate Cajal body formation at a specific site in chromatin when such central factors were localized to that site. The authors either need to perform such a sufficiency experiment or moderate their conclusions (and title).
2) In principle, in the immunofluorescence studies, the disappearance of mAb SC35 signal on depletion of SRRM2 does not alone prove that SRRM2 is what is visualized by the mAb SC35 in such assays. Given that this paper seeks to establish rigorously that mAb SC35 marks nuclear speckles by recognition of SRRM2, given that SRSF7 is recognized by the antibody on blots, and given that SRSF2 has been traditionally presumed the target of mAb SC35 in nuclear speckles, the rigor of this study demands that SRFS7 and SRSF2 be visualized in cells in the presence of an SRRM2 truncation to rule out that either SRSF7 or SRSF2 phenocopy SRRM2 in this assay.
This is a valid concern and we have thought of the same principal that is if any strongly speckle-associated intrinsically disordered domain containing protein, such as SRRM1 or RBM25, two proteins that are also frequently used as NS markes, would have a similar impact on NS formation as SRRM2 has. To this end, we performed a co-depletion of SON and SRRM1 (shown in Supplementary Figure 10) in a cell line that has a TagGFP2 inserted into SRRM2 gene locus. As it can be seen from the imaging presented in this figure for 4 individual cells (but also more generally on 10 independent field imaged, (data not shown)) we did not score a reduction in the GFP intensity, or dissolution of the spherical bodies as is the case in SON-SRRM2 co-depleted cells. We observed the nuclear speckles have the round-up morphology, that is seen upon SON-KD, but are not dissolved shown with PNN staining and SRRM2-TagGFP signals. Moreover, we performed a co-depletion of RBM25 (another strongly NS-associated protein also used as a NS-marker) and SON which did not result in the dissolution of nuclear speckles (Supplementary Figure 10). Therefore, we have reached to the conclusion that SON and SRRM2 form nuclear speckles with the contribution of SON being more important for the formation and titled our study accordingly.
Traditionally, because of the Fu & Maniatis 1992 paper, as pointed out by the reviewer, it is assumed that SC-35 recognizes SRSF2 in immunofluorescence experiments and potentially multiple SR-proteins in immunoblots. The former point, to the best of our knowledge, has never really been proven in any type of rigorous experiment. Fu lab. has generated SRSF2 K/O mice, but never provided an immunofluorescence image that shows that SC-35 signal disappears in K/O cells.
Just to summarize our line of reasoning here:
1) We do an unbiased IP-MS experiment, which shows that SRRM2 is the top candidate protein, at least an order of magnitude away from any other protein in the dataset by any measure. This strongly suggest that SRRM2 is the primary target of this antibody, although doesn’t prove it due to technical reasons i.e. no input normalization, some proteins produce more ‘mass-specable’ peptides than others, and larger proteins tend to produce more peptides.
2) We carry out a biased screen of 12 SR-proteins and find that SRSF7 is strongly recognized by mAb SC-35
3) We do IP-western blotting experiments, which correct for input and are not affected by relative ‘mass-specable’ peptide issues or protein sizes, which reveal a strong enrichment of SRRM2 (>10% of input), some enrichment for SRSF7 (~2% of input) and no enrichment for SRSF2, SRSF1 or other proteins that we have tested.
4) Since the “35kDa” protein is so engrained with the history of this antibody and our results were most consistent with the idea that this protein is SRSF7 rather than anything else, we insert a degron tag to SRSF7. If the hypothesis is true, then we expect a shift of the SC-35 band, concomitant to the shift in SRSF7, which is indeed the case. This is not proof that SC-35 doesn’t recognize any other protein but it does provide very strong evidence (combined with the other two experiments) that the 35kDa band detected by SC-35 in immunoblots is in fact SRSF7.
5) We then show, by TagGFP2 insertion into the SRRM2 locus, that SC-35 mAb can recognize SRRM2 specifically on immunoblots, and furthermore truncations beyond a certain point completely eliminates this signal. We also show later that siRNA mediated KD of SRRM2 also leads to the elimination of the signal from immunoblots (Supplementary Figure 9).
6) Combining the results so far, we address the issue of immunofluorescence, i.e. which protein or proteins are responsible for this signal. We think there are two possible scenarios that could both be true based on the presented evidence so far:
a. This signal is mainly, if not entirely, originates from SRRM2. b. The signal is a combination of SRRM2, SRSF7 and/or other SR-proteins that the SC-35 might be cross-reacting.
7) We then take advantage of our cell lines with SRRM2 truncations. These truncated SRRM2 version are not recognized by SC-35 mAb on immunoblots, therefore it is reasonable to suspect that they will not be recognized by SC-35 mAb in immunofluorescence as well.
8) If scenario (b) is correct and nuclear speckles are still intact in these cells (which we show that they are indeed intact, judged by SON, RBM25 and SRRM1 stainings Fig. 3A-B), then we would expect either no change in SC-35 signal, or a somewhat reduced signal. We see a complete loss of signal.
9) Being extra careful with this result, we also mix the control cell line and SRRM2-truncated cells and image them side-by-side to address any issues related to imaging settings etc. There is no detectable SC-35 signal in truncated cells.
10) We also show that the 35kDa band is still unchanged in SRRM2 truncated cells (Figure 2E), showing that SRSF7 itself is not affected in these cells.
These results, combined together, show that SC-35 signal in immunofluorescence originates from SRRM2, and any other signal potentially contributed by other proteins are below the detection of immunofluorescence microscopy.
Reviewer #2:
This study reports important evidence that the widely-used SC-35 antibody primarily recognizes SRRM2 rather than the assumed SRSF2. The manuscript provides several lines of evidence supporting this conclusion, and the work has broad impact on the field of nuclear structure and function as this antibody is the most common marker for the major nuclear component, nuclear speckles.
The one concern with the manuscript is the interpretation of some of the previous literature and understanding in the field.
First, since the 1990s it has been widely known that the SC-35 mAb has very limited specificity for denatured proteins and was not suitable for immunoblots (see abcam page for ab11826). Indeed, the assumption has always been that it recognizes a folded epitope. Therefore, the use of western blots to conclude anything about the specificity of this antibody is inappropriate.
Secondly, it has also been previously documented that this antibody has cross-reactivity with SRSF7 (i.e. 9G8; Lynch and Maniatis Genes Dev 1996).
Third, most SR proteins are not abundantly observed in tryptic MS due to high cleavage of RS domains. This is particularly true of SRSF2, which has a highly "pure" RS domain (i.e. all RS repeats) that encompasses almost half of the total protein. SRRM2, on the other hand, has much more complex and degenerate RS domains that encompass a much smaller percentage of the total protein. SRRM2 is also 10x the size of SRSF2. Thus, given equal molar amounts of SRSF2 and SRRM2, one would expect at least 20x the number of peptides and much more complete coverage of SRRM2 vs. SRSF2. Therefore, while the subsequent immunoblot in Figure 1C is compelling evidence that SRRM2 is precipitated with the SC-35 antibody, while SRSF2 is not, the IP-MS data alone is not strong proof that the SC35 mAb primarily recognizes SRRM2 rather than SRSF2. The text should be revised accordingly.
Finally, the abstract implies that the demonstration of SON as a central component of speckles is new ("elusive core"). As appropriately referenced in the text, this is not the case, rather SON is often used as a marker for nuclear speckles, and SON has long been considered to be part of the core of speckles, as knock-down has been documented by several groups to disrupt speckles. The wording in the abstract should therefore be more parsimonious.
With all due respect to all previous researchers that have used mAb SC35 and published their results, we think that the specificity issue has become unnecessarily convoluted due to the initial inaccurate characterization. Abcam’s recommendations highlight the issue in an interesting way. In the old marketing images, abcam shows a single band in a total lysate prepared from HEK293 cells: https://www.abcam.com/ps/products/11/ab11826/reviews/images/ab11826_49518.jpg
However, producing such an image, in our experience as we have also reported in the manuscript, is only possible under non-ideal western-blotting conditions i.e. when the transfer is not adequate to reveal proteins with large molecular weights. Intriguingly, a customer (not us) complains about an improper WB result obtained with this antibody (with a 2-star rating):
It looks like an unexplainable high-molecular smear without the information that we provide in our manuscript, but in light of it, it’s clear that protein stained here is SRRM2.
In our experience the antibody works perfectly fine for western blotting, and very specifically and robustly reveals SRRM2 at ~300kDa, as long as the immunoblotting conditions are optimized for large proteins. We also show that bulk of the signal around 35kDa originates from SRSF7, however as indicated by the other reviewer’s comments, and also previous research, the antibody probably cross-reacts with other proteins as well with varying degree.
In this sense, the antibody can be used for immunoblotting, but pretty much any result obtained from such an experiment must be verified with an independent antibody or independent methods, which we did in this manuscript.
The SC35 mAb is actually suitable for western blotting if the gel running and transfer conditions are carefully performed to have SRRM2: a) enter the gel and b) transferred properly to the membrane. Under conditions where SRRM2 is just not entering the gel (due to high percentage gels, or gels with too much bis-acrylamide), or doesn’t get transferred to a membrane (non-ideal buffer conditions, protein stuck in stacking part and cut away etc.), we have seen the unspecific bands, but we had to use the most sensitive detection reagents at hand to see those, so they are rather weak. We have provided a detailed explanation to what these conditions are in the methods section of our manuscript, but briefly: running the gel slowly allowing the protein to enter in the gel and transferring overnight with CAPS buffer were key to get the western blot working. As we have shown in Figure 2C and 2E, the majority of signal detected comes from SRRM2. The unspecific binding of SC35 mAb could only be scored if the above-mentioned conditions were not met.
We believe what made matters historically worse has been the use Mg++ precipitation that enriches many SR proteins, but actually completely depletes SRRM2 (Blencowe et al. 1994 DOI: 10.1083/jcb.127.3.593, Figure 5, https://pubmed.ncbi.nlm.nih.gov/7962048/ ). When we’re sure that SRRM2 is in the gel though, it just shines as a single band. So in conclusion, SC-35 is reasonably specific to SRRM2, especially in immunofluorescence, but it certainly cross-reacts with other SR-proteins, especially when SRRM2 is missing for technical or biochemical reasons.
We will update in the manuscript for the corresponding section by citing earlier studies reporting the specificity issues of mAb SC35.
We absolutely agree that IP-MS data alone is not enough to conclude that SC-35 recognizes SRRM2, or whether it is the primary target or not. The overwhelming amount of SRRM2 peptides detected, in addition to the overwhelming amount of total peptide counts from SRRM2 does strongly suggest that it is the case, which we then followed up by IP-western blotting which controls for relative input, and the various experiments shown in later figures.
We have looked at our MS results and found out that:
SRSF2 was detected with 4 unique peptides with an MS/MS count of 5 and a sequence coverage of 29% (intensity 3E+07), whereas SRRM2 was detected with 227 unique peptides with an MS/MS count of 3317 and a sequence coverage of 61.9% (intensity 2E+11).
These numbers show a 6600 times higher intensity for SRRM2 (not normalized). As the identification and abundance of different peptides/proteins can by dramatically different in MS, it is indeed correct that one should be careful with such comparisons. The only way would be to use peptide standards for both proteins and record standard curves, then a real quantitative comparison would give the true numbers. Hence, we will revise the wording of that section.
Finally, as the reviewer has pointed out, we have not shown that speckles can be reformed by introducing ectopically expressed SON/SRRM2 into cells which now appear not to have nuclear speckles. This would indeed be the formal proof showing that SON/SRRM2 are not just necessary but also sufficient to form nuclear speckles. Such an experiment is quite challenging due to the length of these proteins and difficulty in establishing conditions where one can express these proteins, but not overexpress them which leads to round-up speckles (as shown and discussed by Belmonte lab). Therefore, we will change the title to “SON and SRRM2 are essential for the formation of nuclear speckles” to better reflect our conclusions.
We really did try to be clear and just about the previous literature around SON. Indeed, it is clear that SON is a crucial part of NS, likely the most important component for the integrity of speckles. However, in all of these previous studies, RNAi-mediated depletion of SON, without exception, leaves behind spherical bodies that are strongly stained with mAb SC35, that also harbor other NS-markers (which we also show). This is of course not new, as we also appropriately cited previous work, however being able to dissolve these “left-over” speckles by co-depletion of SRRM2, and perhaps more importantly by deletion of the SRRM2’s C-terminal region is indeed novel.
In essence, our results show that in the absence of SON, as shown by previous work as well, NS-associated proteins are still able to organize themselves into nuclear bodies, indicating that either all other SR-proteins without the need of another organizer clump together, or another factor (or factors) is still acting as an organizer. When we remove the C-terminus of SRRM2, which we show is the primary target of SC-35, which strongly stains these left-over nuclear bodies in the absence of SON, then deplete SON, all NS markers that we could find become diffuse, indicating that nuclear speckles no longer exist, or become too small to be detected or classified as “nuclear bodies”. Co-depletion of SON and SRRM2 leads to the same phenotype, but co-depletion of SON and SRRM1 (or RBM25) doesn’t, leaving behind spherical nuclear speckles that harbor SRRM2 which are no different than SON KD cells.
Reviewer #3:
Nuclear speckles in the last several years have attracted significant attention for their association with transcriptionally active chromosome regions (after largely being ignored by most for the previous 20 years). Overwhelmingly, a single monoclonal antibody has been used as a marker for nuclear speckles for several decades.
This manuscript now argues convincingly that the main target that is recognized by this monoclonal antibody is not SRSF2 (SC35) as long thought, but rather SRRM2. The authors thus clarify a vast literature, while also focusing attention on the very large protein SRRM2 that in many ways resembles another nuclear speckle protein, SON. Both have huge IDRs and unusual RS repeats, while SON has been proposed to act as a scaffold for many SR-containing proteins, which is likely also true for SRRM2, by extension. Moreover, the manuscript provides a convincing explanation for why the target of this antibody was previously misidentified, by showing a lesser cross-reaction with SRSF7, of similar MW to SC35.
Finally, the manuscript suggests that SON and SRRM2 together help nucleate nuclear speckles, as a double KD, or a SON KD in a background of a truncated SRRM2, leads to loss of nuclear speckle-like staining of other proteins normally enriched in nuclear speckles (RBM25, SRRM1, PNN). The authors go on to suggest that this double KD approach will now provide an important means of disrupting nuclear speckles to aid in functional studies.
Interestingly, some of the results of this manuscript actually are already confirmed or consistent with previous literature. For example, a cited paper describes changes in Hi-C compartmentalization patterns after "elimination" of nuclear speckles- actually, they performed a SRRM2 KD and showed loss of SC35 staining, which is now explained as simply due to the KD that they performed. More recently, a new proteomics study of nuclear speckles (Dopie et al, JCB, 2020: https://doi.org/10.1083/jcb.201910207) reported both SON and SRRM2 as the two most highly enriched nuclear speckle proteins, with enrichment scores similar to each other but more than twice that of all other speckle proteins. Moreover, this same paper also did a SRRM2 KD and observed loss of anti-SC35 staining but not SON staining.
Overall, I found this manuscript of significant interest for people in the nuclear cell biology field and technically thorough and well done. I just had one issue and one point to make in my main comments, plus some minor points.
1) The evidence that nuclear speckles are nucleated by SON and SRRM2 is based on the dispersion of staining of nuclear speckle proteins RMB25, SRRM1, and PNN. However, an alternative explanation is that some other protein(s) nucleates nuclear speckles, while these other nuclear speckle proteins bind to SON and SRRM2, and are therefore enriched in nuclear speckles. To eliminate this concern, the authors could show that SON and/or SRRM2 do not bind to these proteins- for instance using co-IP or other methods. Of course, it could be that such binding or scaffolding of nuclear speckle proteins is how they form nuclear speckles. But just one protein that is not bound by SON and SRRM2 but still stains nuclear speckles after the double KD would be inconsistent with their hypothesis. Therefore, if they do find that all these proteins bind SON and/or SRRM2 they could simply discuss this as a scaffolding mechanism but qualify their conclusion based on the alternative explanation described above.
2) In our lab we have not been comfortable using the kinase manipulations, discussed in this paper, to eliminate nuclear speckles for experimental purposes because the cells appear very sick after these manipulations. For other reasons, we also tried a double SON and SRRM2 KD. Our experience is that the cells after this double KD were also not very normal. If the authors are suggesting the SON and SRRM2 double KD as an experimental tool to disrupt nuclear speckles in order to access nuclear speckle function, then it would be valuable for them to indicate cell toxicity, etc. Many SR-protein KDs for example do not allow selection of stable cells. What about this double KD?
The first point of Reviewer #3 has been addressed above in response to the Reviewer #2.
We have stated that our work identifying SON and SRRM2 as the elusive core of nuclear speckles paves the way to study the nuclear speckles under physiological conditions. Here, we have used the cells 24 hours after transfection (~18 hours of knock-down) as the primary reason being that SON-KD caused a mitotic arrest if the cells were kept longer in culture. This was reported earlier in Sharma et al MBC 2010. There was no additional severity in the phenotype when the SON-KD was combined with SRRM2-KD, therefore we believe the arrest phenotype we scored is mainly due to depletion SON. In this sense, double-depletion of SON and SRRM2 can be used to study the effects of loss of NS (transcription, post-transcriptional, topological), but certainly within a time-frame of around 24 hours in cells that haven’t gone through mitosis. We will clarify this statement in the revised manuscript to avoid any misunderstanding as pointed by the reviewer. Faster depletion strategies, and/or a system where cells are mitotically arrested would be required to observe long term effects more reliably.
-
Reviewer #3:
Nuclear speckles in the last several years have attracted significant attention for their association with transcriptionally active chromosome regions (after largely being ignored by most for the previous 20 years). Overwhelmingly, a single monoclonal antibody has been used as a marker for nuclear speckles for several decades.
This manuscript now argues convincingly that the main target that is recognized by this monoclonal antibody is not SRSF2 (SC35) as long thought, but rather SRRM2. The authors thus clarify a vast literature, while also focusing attention on the very large protein SRRM2 that in many ways resembles another nuclear speckle protein, SON. Both have huge IDRs and unusual RS repeats, while SON has been proposed to act as a scaffold for many SR-containing proteins, which is likely also true for SRRM2, by extension. Moreover, the manuscript provides a convincing explanation for why the target of this antibody was previously misidentified, by showing a lesser cross-reaction with SRSF7, of similar MW to SC35.
Finally, the manuscript suggests that SON and SRRM2 together help nucleate nuclear speckles, as a double KD, or a SON KD in a background of a truncated SRRM2, leads to loss of nuclear speckle-like staining of other proteins normally enriched in nuclear speckles (RBM25, SRRM1, PNN). The authors go on to suggest that this double KD approach will now provide an important means of disrupting nuclear speckles to aid in functional studies.
Interestingly, some of the results of this manuscript actually are already confirmed or consistent with previous literature. For example, a cited paper describes changes in Hi-C compartmentalization patterns after "elimination" of nuclear speckles- actually, they performed a SRRM2 KD and showed loss of SC35 staining, which is now explained as simply due to the KD that they performed. More recently, a new proteomics study of nuclear speckles (Dopie et al, JCB, 2020: https://doi.org/10.1083/jcb.201910207 ) reported both SON and SRRM2 as the two most highly enriched nuclear speckle proteins, with enrichment scores similar to each other but more than twice that of all other speckle proteins. Moreover, this same paper also did a SRRM2 KD and observed loss of anti-SC35 staining but not SON staining.
Overall, I found this manuscript of significant interest for people in the nuclear cell biology field and technically thorough and well done. I just had one issue and one point to make in my main comments, plus some minor points.
1) The evidence that nuclear speckles are nucleated by SON and SRRM2 is based on the dispersion of staining of nuclear speckle proteins RMB25, SRRM1, and PNN. However, an alternative explanation is that some other protein(s) nucleates nuclear speckles, while these other nuclear speckle proteins bind to SON and SRRM2, and are therefore enriched in nuclear speckles. To eliminate this concern, the authors could show that SON and/or SRRM2 do not bind to these proteins- for instance using co-IP or other methods. Of course, it could be that such binding or scaffolding of nuclear speckle proteins is how they form nuclear speckles. But just one protein that is not bound by SON and SRRM2 but still stains nuclear speckles after the double KD would be inconsistent with their hypothesis. Therefore, if they do find that all these proteins bind SON and/or SRRM2 they could simply discuss this as a scaffolding mechanism but qualify their conclusion based on the alternative explanation described above.
2) In our lab we have not been comfortable using the kinase manipulations, discussed in this paper, to eliminate nuclear speckles for experimental purposes because the cells appear very sick after these manipulations. For other reasons, we also tried a double SON and SRRM2 KD. Our experience is that the cells after this double KD were also not very normal. If the authors are suggesting the SON and SRRM2 double KD as an experimental tool to disrupt nuclear speckles in order to access nuclear speckle function, then it would be valuable for them to indicate cell toxicity, etc. Many SR-protein KDs for example do not allow selection of stable cells. What about this double KD?
-
Reviewer #2:
This study reports important evidence that the widely-used SC-35 antibody primarily recognizes SRRM2 rather than the assumed SRSF2. The manuscript provides several lines of evidence supporting this conclusion, and the work has broad impact on the field of nuclear structure and function as this antibody is the most common marker for the major nuclear component, nuclear speckles.
The one concern with the manuscript is the interpretation of some of the previous literature and understanding in the field.
First, since the 1990s it has been widely known that the SC-35 mAb has very limited specificity for denatured proteins and was not suitable for immunoblots (see abcam page for ab11826). Indeed, the assumption has always been that it recognizes a folded epitope. Therefore, the use of western blots to conclude anything about the specificity of this antibody is inappropriate.
Secondly, it has also been previously documented that this antibody has cross-reactivity with SRSF7 (i.e. 9G8; Lynch and Maniatis Genes Dev 1996).
Third, most SR proteins are not abundantly observed in tryptic MS due to high cleavage of RS domains. This is particularly true of SRSF2, which has a highly "pure" RS domain (i.e. all RS repeats) that encompasses almost half of the total protein. SRRM2, on the other hand, has much more complex and degenerate RS domains that encompass a much smaller percentage of the total protein. SRRM2 is also 10x the size of SRSF2. Thus, given equal molar amounts of SRSF2 and SRRM2, one would expect at least 20x the number of peptides and much more complete coverage of SRRM2 vs. SRSF2. Therefore, while the subsequent immunoblot in Figure 1C is compelling evidence that SRRM2 is precipitated with the SC-35 antibody, while SRSF2 is not, the IP-MS data alone is not strong proof that the SC35 mAb primarily recognizes SRRM2 rather than SRSF2. The text should be revised accordingly.
Finally, the abstract implies that the demonstration of SON as a central component of speckles is new ("elusive core"). As appropriately referenced in the text, this is not the case, rather SON is often used as a marker for nuclear speckles, and SON has long been considered to be part of the core of speckles, as knock-down has been documented by several groups to disrupt speckles. The wording in the abstract should therefore be more parsimonious.
-
Reviewer #1:
Major comments:
1) The title and the conclusion that SON and SRRM2 form nuclear speckles are not supported by the data. The data show that SON and SRRM2 are necessary for nuclear speckle formation. They do not rule out that another factor is necessary, such as SRRM1, which interacts with SRRM2 and itself harbors an intrinsically-disordered domain. That is, the authors have not shown that SON and SRRM2 are also sufficient for nuclear speckle formation. Such a test is necessary to draw the strong conclusion the authors make, and precedence for such a test has been established in the study of Cajal bodies. Specifically, central factors to Cajal body formation were shown to nucleate Cajal body formation at a specific site in chromatin when such central factors were localized to that site. The authors either need to perform such a sufficiency experiment or moderate their conclusions (and title).
2) In principle, in the immunofluorescence studies, the disappearance of mAb SC35 signal on depletion of SRRM2 does not alone prove that SRRM2 is what is visualized by the mAb SC35 in such assays. Given that this paper seeks to establish rigorously that mAb SC35 marks nuclear speckles by recognition of SRRM2, given that SRSF7 is recognized by the antibody on blots, and given that SRSF2 has been traditionally presumed the target of mAb SC35 in nuclear speckles, the rigor of this study demands that SRFS7 and SRSF2 be visualized in cells in the presence of an SRRM2 truncation to rule out that either SRSF7 or SRSF2 phenocopy SRRM2 in this assay.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
This study has yielded two significant contributions. First, the study recharacterized a widely used antibody, mAb SC35, which was initially raised against the spliceosome and characterized both as targeting the 35 kDa protein, SRSF2, an intensely studied splicing regulatory factor, and as marking nuclear speckles, which in the last several years have attracted significant attention for their association with transcriptionally active chromosome regions (after largely being ignored by most for the previous 20 years). The authors present a series of rigorously designed and carefully carried out experiments demonstrating that the 35 kDa factor that mAb recognizes is instead SRSF7. Moreover, the authors present compelling evidence that the primary target of mAb SC35 is a ~300 kDa protein, SRRM2, a spliceosomal factor originally discovered as a nuclear matrix factor and later defined as a nuclear speckle component. In the most convincing experiments establishing these targets the authors show that mAb SC35 signals shift, when the molecular weight of SRSF7 or SRRM2 is varied, and that the signal disappears when SRSF7 is depleted. Given the use of mAb SC35 for nearly three decades, these results suggest that tens if not hundreds of papers require re-interpretation. This study reminds us again of the necessity of rigorous validation of antibodies.
Second, the authors investigate the role of SRRM2 in the formation of nuclear speckles. Previous studies have shown that knock down of the nuclear speckle factor SON leads to a compaction of nuclear speckles but not their entire dissolution, implicating a role for at least one additional factor in nuclear speckle formation; other studies have implicated an array of factors as being required for nuclear speckle formation. Here, the authors show that truncation or knock down of SRRM2, in contrast to several other nuclear speckles factors, also reduce nuclear speckle number, although more modestly than SON, and the truncation or knockdown of SRRM2 in combination with the depletion of SON reduces nuclear speckles more than SON depletion alone. The authors interpret these findings to indicate that SON and SRRM2, both of which harbor intrinsically-disordered domains, form nuclear speckles in human cells, as the title indicates. Further, the authors suggest that the double knockdown provides a new tool to study nuclear speckle function. Overall, this study provides surprising and important insight into a commonly used mAb and valuable new perspectives on nuclear speckles, which have the potential to transform future studies. The study will be of broad interest to those interested in splicing, nuclear speckles, antibody specificity, and more generally, liquid-liquid phase separation.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Serra-Marques and co-authors use CRISPR/Cas9 gene editing and live-cell imaging to dissect the roles of kinesin-1 (KIF5) and kinesin-3 (KIF13) in the transport of Rab6-positive vesicles. They find that both kinesins contribute to the movement of Rab6 vesicles. In the context of recent studies on the effect of MAP7 and doublecortin on kinesin motility, the authors show that MAP7 is enriched on central microtubules corresponding to the preferred localization of constitutively-active KIF5B-560-GFP. In contrast, KIF13 is enriched on dynamic, peripheral microtubules marked by EB3.
The manuscript provides needed insight into how multiple types of kinesin motors coordinate their function to transport vesicles. However, I outline several concerns about the analysis of vesicle and kinesin motility and its interpretation below.
Major concerns:
1) The metrics used to quantify motility are sensitive to tracking errors and uncertainty. The authors quantify the number of runs (Fig. 2D,F; 7C) and the average speed (Fig. 3A,B,D,E,H). The number of runs is sensitive to linking errors in tracking. A single, long trajectory is often misrepresented as multiple shorter trajectories. These linking errors are sensitive to small differences in the signal-to-noise ratio between experiments and conditions, and the set of tracking parameters used. The average speed is reported only for the long, processive runs (tracks>20 frames, segments<6 frames with velocity vector correlation >0.6). For many vesicular cargoes, these long runs represent <10% of the total motility. In the 4X-KO cells, it is expected there is very little processive motility, yet the average speed is higher than in control cells. Frame-to-frame velocities are often over-estimated due to the tracking uncertainty. Metrics like mean-squared displacement are less sensitive to tracking errors, and the velocity of the processive segments can be determined from the mean-squared displacement (see for example Chugh et al., 2018, Biophys. J.). The authors should also report either the average velocity of the entire run (including pauses), or the fraction of time represented by the processive segments to aid in interpreting the velocity data.
2) The authors show that transient expression of either KIF13B or KIF5B partially rescues Rab6 motility in 4X-KO cells and that knock-out of KIF13B and KIF5B have an additive effect. They also analyze two vesicles where KIF13B and KIF5B co-localize on the same vesicle. The authors conclude that KIF13B and KIF5B cooperate to transport Rab6 vesicles. However, the nature of this cooperation is unclear. Are the motors recruited sequentially to the vesicles, or at the same time? Is there a subset of vesicles enriched for KIF13B and a subset enriched for KIF5B? Is motor recruitment dependent on localization in the cell? These open questions should be addressed in the discussion.
3) The authors suggest that KIF5B transports Rab6 vesicles along centrally-located microtubules while KIF13B drives transport on peripheral microtubules. Is the velocity of Rab6 vesicles different on central and peripheral microtubules in control cells?
4) The imaging and tracking of fluorescently-labeled kinesins in cells as shown in Fig. 4 is impressive. This is often challenging as kinesin-3 forms bright accumulations at the cell periphery and there is a large soluble pool of motors, making it difficult to image individual vesicles. The authors should provide additional details on how they addressed these challenges. Control experiments to assess crosstalk between fluorescence images would increase confidence in the colocalization results.
-
Reviewer #2:
The manuscript by Serra-Marques, Martin, et al provides a tour de force in the analysis of vesicle transport by different kinesin motor proteins. The authors generate cell lines lacking a specific kinesin or combination of kinesins. They analyze the distribution and transport of Rab6 as a marker of most, if not all, secretory vesicles and show that both KIF5B and KIF13B localize to these vesicles and describe the contribution of each motor to vesicle transport. They show that the motors localize to the front of the vesicle when driving transport whereas KIF5B localizes to the back of the vesicle when opposing dynein. They find that KIF5B is the major motor and its action on "old" microtubules is facilitated by MAP7 whereas KIF13B facilitates transport on "new" microtubules to bring vesicles to the cell periphery. The manuscript is well-written, the data are properly controlled and analyzed, and the results are nicely presented. There are a few things the authors could do to tie up loose ends but these would not change the conclusions or impact of the work and I only have a couple of clarifying questions.
In Figure 2E, it seems like about half of the KIF5B events start at or near the Golgi whereas most of the KIF13B events are away from the Golgi? Did the authors find this to be generally true or just apparent in these example images?
In Figure 8G, the tracks for KIF13B-380 motility are difficult to see, which is surprising as KIF13B has been shown to be a superprocessive motor. Is this construct a dimer? If not, do the authors interpret the data as a high binding affinity of the monomer for new microtubules and if so, do they have any speculation on what could be the molecular mechanism? It appears as if KIF13B-380 and EB3 colocalize at the plus ends for a period of time before both are lost but then quickly replenished. Is this common?
-
Reviewer #1:
In their manuscript, Serra-Marques, Martin, et al. investigate the individual and cooperative roles of specific kinesins in transporting Rab6 vesicles in HeLa cells using CRISPR and live-cell imaging. They find that both KIF5B and KIF13B cooperate in transporting Rab6 vesicles, but KIF5B is the main driver of transport. In these cells, Eg5 and other kinesin-3s (KIF1B and KIF1C) are dispensable for Rab6 vesicle transport. They find that both KIF5B and KIF13B are present on these vesicles and coordinate their activities such that KIF5B is the main driver of the cargos on older, MAP7-decorated MTs, and KIF13B takes over as the main transporter on freshly-polymerized MT ends that are largely devoid of MAP7. Interestingly, their data also indicate that KIF5B is important for controlling Rab6 vesicle size, which KIF13B cannot rescue. Upon cargo switching from anterograde to retrograde transport, KIF5B, but not KIF13B, engages in mechanical competition with dynein. Overall, this paper provides substantial insight into motor cooperation of cargo transport and clarifies the contribution of these distinct classes of motors during Rab6 vesicle transport. The experiments are well-performed and the data are of very high quality.
Major Comments:
1) In Figure 5, it is very interesting that only KIF5B opposes dynein. It would be informative to determine which kinesin was engaged on the Rab6 vesicle before the switch to the retrograde direction. Can the authors analyze the velocity of the run right before the switch to the retrograde direction? If the velocity corresponds with KIF5B (the one example provided seems to show a slow run prior to the switch), this could indicate that KIF5B opposes dynein more actively because KIF5B was the motor that was engaged at the time of the switch. Or if the velocity corresponds with KIF13B, this could indicate that KIF5B becomes specifically engaged upon a direction reversal. In any case, an analysis of the speed distributions before the switch would provide insight into vesicle movement and motor engagement before the change in direction.
2) One of the most interesting aspects of this paper is the different lattice preferences for KIF5B, which shows runs predominantly on "older" polymerized MTs decorated by MAP7, and for KIF13B, whose runs are predominantly restricted to newly polymerized MTs that lack MAP7. The results in Figure 8 suggest a potential switch from KIF5B to KIF13B motor engagement upon a change in lattice/MAP7 distribution. In general, do the authors observe the fastest runs at the cell periphery, where there should be a larger population of freshly polymerized MTs? For Figure 4E, are example 1 and example 2 in different regions of the cell? Do the authors think the intermediate speeds are a result of the motors switching roles? Additional discussion would help the reader interpret the results.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 2 of the manuscript. Kassandra M Ori-McKenney (University of California) served as the Reviewing Editor.
Summary:
Serra-Marques, Martin et al. investigate the individual and cooperative roles of specific kinesins in transporting Rab6 secretory vesicles in HeLa cells using CRISPR and live-cell imaging. They find that both KIF5B and KIF13B cooperate in transporting Rab6 vesicles, but Eg5 and other kinesin-3s (KIF1B and KIF1C) are dispensable for Rab6 vesicle transport. They show that both KIF5B and KIF13B localize to these vesicles and coordinate their activities such that KIF5B is the main driver of the cargos on older, MAP7-decorated microtubules, and KIF13B takes over as the main transporter on freshly-polymerized microtubule ends that are largely devoid of MAP7. Interestingly, their data also indicate that KIF5B is important for controlling Rab6 vesicle size, which KIF13B cannot rescue. By analyzing subpixel localization of the motors, they find that the motors localize to the front of the vesicle when driving transport, but upon directional cargo switching, KIF5B localizes to the back of the vesicle when opposing dynein. Overall, this paper provides substantial insight into motor cooperation of cargo transport and clarifies the contribution of these distinct classes of motors during Rab6 vesicle transport.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
General assessment:
In this research article, authors claim that HIP1 plays an important role in promoting the proliferative ability of prostate cancer cells by HIP1-STAT3-GDF15 signaling axis activation. HIP1 overexpression increased STAT3 signaling in response to FGF2 receptor activation and increased GDF15 transcription. The increase in GDF15 protein secretion was dependent on HIP1 and STAT3 expression and was shown to have paracrine growth-promoting effects. Although some of the information is new, the relevance and importance of this information is inconclusive and not supported from the data presented in this article.
Major Comments:
This paper needs a substantial amount of revision, as indicated below.
A. Novelty:
HIP-1 has been extensively studied in cancer including prostate cancer (Rao et al., 2002). Its role in STAT3 signaling has also been demonstrated (Hsu et al, 2015). This study is not very novel.
B. Major comments:
1) Figures 1A, S1: Changes in p-AMPK1α, and p-Akt are very profound in this array, however, the authors indicate that "By contrast to our validation of STAT3 phosphorylation by Western blotting, it was not possible to detect increased levels of p-AMPK1α (T174), p-Akt (S473) or p-PLC-γ1 when we attempted to validate these by blotting (Supplementary Figure S1D-F)." Why do the authors think this is happening? Did the authors use the same experimental conditions for the array and validation experiments? These apparent discrepancies need further clarification.
2) Figure 1E: the authors show that shHIP1#2 caused a modest knockdown of HIP1, while shHIP1#1 induced a dramatic reduction in HIP1 protein level, however, both the shRNAs significantly inhibited pSTAT3 to the same extent. This indicates that total knockdown (KD) of HIP1 is not necessary to completely shut-down the activity of pSTAT3. How does this translate to the biological functions of HIP1?
3) How come DMSO treatment blocks the phosphorylation of ERK1/2 in lane 2 of Fig 1(F)?
4) Figure S1F: pSTAT3 western blot: the authors should indicate which band they considered positive for p-STAT3; if it's the lower band why was there no activity in lane 4?
5) Fig 2A and 2B should be repeated in HIP1 knockout cells.
6) What is the endogenous level of HIP1 and GDF15 in prostate cancer cell lines vs. normal prostate epithelial cells? Why was HIP1 overexpressed in LNCaP cells? Was the level of HIP1 expression low in LNCaP and PNT1A, when compared in a panel of prostate cancer cell lines? Did the authors observe any differential expression of HIP1 and GDF15 in hormone sensitive vs. hormone resistant prostate cancer cells?
7) GDF15 is a very ambiguous biomarker of cancer as its levels are even higher in the case of mental disorders including psychosis (for reference https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5554200/ ). And from this study, it is not even clear that the GDF15 upregulation is just one of the several outcomes of the activation of this signaling axis or if it is the only consequence of this signaling axis to promote the growth of cancer cells by increasing paracrine signaling. An experiment in GDF15 knockout cells/mice can document the role of this axis in a more precise manner.
8) It has been shown that wt p53 significantly reduces STAT3 tyrosine phosphorylation and inhibits STAT3 DNA binding activity in prostate cancer cell lines that express both constitutively active STAT3 and mutant p53 protein. The authors have claimed that the increase in STAT3 phosphorylation is due to HIP1 expression. All three of the cell lines evaluated in this paper have different p53 status and show differences in expression of activated STAT3. Is the expression of HIP1 independent of the status of p53?
9) Figure 3: Does STAT3 silencing (siRNA/stattic) downregulate HIP1, and does this decrease STAT3 activation over time? Also, does STAT3 silencing or treatment with WP1066 inhibit HIP1-induced tumor growth in vivo?
10) The role of GDF15 in prostate cancer is likely stage specific. It may promote early stages of tumorigenesis, but suppress the progression of advanced prostate cancers. The authors claim that HIP1 overexpression is mediated by stat3 activation, which leads to increased secretion of GDF15. Does expression of HIP1 correlate with the expression of GDF15 and does this also associate with stage-specific progression of prostate cancer?
11) How was cellular transformation studied and confirmed? Did HIP1 cause transformation of normal prostate cells?
12) Fig 1B: HIP1 western blot is not clear, please quantify 1C, 1D, 1E.
13) Most of the studies are done only in one cell line which is not adequate.
14) What is the clinical relevance of this study? The authors should study clinical samples along with multiple cell lines.
15) Several of the Western blot figures need better quality blots; Figs 1E (FGFR), S2C (all).
-
Reviewer #2:
The paper describes a novel signaling pathway which links HIP1 and STAT3. HIP1 is an oncolgene which should be targeted in prostate cancer. In previous studies the role of HIP1 in prostate cancer was established. The paper is well-written and the experiments needed to make appropriate conclusions are performed. The paper is also important because of identification of the role of GDF15 in prostate cancer. In my opinion, the paper may benefit from clarification whether HIP1 treatment leads to up-regulation of cytokines such as interleukin-6. This is possible because the effect of HIP1 could also be indirect, i.e. mediated by interleukin-6. No other major revisions are suggested. In general, the paper is an important contribution to understanding of signaling pathways of STAT3 in prostate cancer.
-
Reviewer #1:
In this manuscript by Rao et al, the authors use an immortalized prostate cancer epithelial cell line, PNT1A, to identify the effects of HIP1 overexpression. The authors show in a series of well-controlled experiments the positive relationship between HIP1, phosphorylation of STAT3, and expression of FGFR4. Phenotypically, this relationship is also associated with pro-tumorigenic events such as in vitro migration and invasion, and development of tumor xenografts. Finally, the authors demonstrate that HIP1 results in increased expression of the GDF15 cytokine to exert its effects on tumor cells in a paracrine fashion.
There are no major concerns with this manuscript.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
In this manuscript by Rao et al, the authors use an immortalized prostate cancer epithelial cell line, PNT1A, to identify the effects of HIP1 overexpression. The authors define a positive relationship between HIP1, phosphorylation of STAT3, and expression and activation of the FGF2 receptor, FGFR4. Phenotypically, this relationship is also associated with pro-tumorigenic events such as in vitro migration and invasion, and growth of tumor xenografts. Finally, the authors make the case that HIP1 results in increased expression of the GDF15 cytokine to exert its effects on tumor cells in a paracrine fashion.
In general, the paper is well-written, and the results clearly presented. The authors have previously extensively studied HIP1 in cancer, including prostate cancer (Rao et al., 2002). A role for HIP1 in STAT3 signaling has also been demonstrated (Hsu et al, 2015). Hence, the primary novelty and importance of the study is because of identification of role of GDF15 in prostate cancer, and delineation of a tumor-promoting, paracrine HIP1-STAT3-GDF15 signaling axis. While this was viewed as a strength of the study, there were significant weaknesses. Most prominent of the weaknesses was the fact that the bulk of the experiments were performed only in a single cell model, PNT1A, which reduces confidence that the results are generalizable, as opposed to reflecting idiosyncratic signaling response in this model. The consensus of the reviewers was that the key findings of the studies should be further validated in additional cell line models, and/or the relationships proposed should be validated in clinical specimens for prostate cancer. Ideally, both additional cell lines and clinical samples would be used, but at least one is essential to support conclusions. In addition to this important global critique, the reviewers made several specific criticisms of the experiments presented in the study, which should be addressed.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
In this paper, the authors describe a web-app that can create, customized, and labeled volcano plots. Technically, from three columns of a CSV file (log fold change, log p-value, and gene name), it displays a scatter plot, with labeled dots. The app (made with shinyR) can be used online or run locally with R/Rstudio. In itself, the app is well done, easy to use, and reactive. Compared to similar existing tools (VolcanoR, Genavi, msVolcano), it is an improvement: it is more intuitive and more "interactive". All that said, it's still a single-use plotting tool, with limited applications, as it avoids doing any statistical analysis on the data.
1) It's not possible to interact directly with the spreadsheet inside the web-app or to select a subset of it, or do simple arithmetic operations on the columns (replacing a log fold-change by a log2 for example).
2) The x-axis cannot be put in log-scale.
3) Being able to export the R code that generates such a plot would be a nice functionality, for those who want to be able to easily use the general look of the plot inside their own pipelines.
4) It would be nice to be able to get q-value from p-values or to measure a false discovery rate.
-
Reviewer #1:
Goedhart and Luijsterburg developed a R-based web application VolcaNoseR for plotting a kind of scatter plot widely used in transcriptomics/proteomics research(significance vs log fold change), also known as a volcano plot. Using VolcaNoseR it is very easy to create nice-looking, annotated volcano plots, as the GUI provides control of most of the parameters of the plot, such as labels, the significance threshold, the colour schemes etc. Importantly, VolcaNoseR plots are also interactive, which can be used to explore the data and get easy access to any particular gene/protein.
1) As the authors indicated in the very beginning of their paper, volcano plots are used for visualization of large amounts of data. Making scatter plots is possible with almost all existing plotting tools: from MS Excel to specialized packages in R (https://www.bioconductor.org/packages/release/bioc/vignettes/EnhancedVolcano/inst/doc/EnhancedVolcano.html ) and plotly (for interactive plots). The authors make the point that VolcanoseR is unlike all these softwares because it does not require the user to have any programming skills, since it has a custom-tailored GUI. However, producing and correctly interpreting the underlying big data already requires computational/coding skills that far exceed making a scatter plot (especially with many tutorials for the latter available online (https://huntsmancancerinstitute.github.io/hciR/volcano.html )).
2) One of the main features of VolcaNoseR is the ability to make publication ready plots. Yet one will need many more visualisations for any manuscript, than volcano plots. And to do other visualisations (e.g. heatmaps, violin plots and others) potential users will still need to use other plotting tools (and even be proficient in it to match the style of other visualisations in the manuscript with the volcano plot produced by the VolcaNoseR web app).
3) In the part data re-use authors provide a nice example of previously published data, where data points that were not annotated in the source study could be of special interest (Fig. 3). However, I doubt that investigating labels of hundreds of data points one by one on the interactive plot with the cursor is easier, than just filtering underlying source data tables for significant results and searching for genes of interest in the resulting table.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
The connection between core transcriptional regulation and tumor metabolism is an area of current interest. The reciprocal regulation of ZBTB18 and CTBP2 has potential value in understanding the functional regulation of lipid biology. However, there are substantial concerns with the studies that limit its rigor and value.
Major concerns:
1) It is advised that the authors consider referencing the International Cell Line Authentication Committee's Register of Misidentified Cell Lines before investing in experiments. The vast majority of critical experiments used only SNB19 (SNB-19). This is a contaminated line and should not be used for studies. The following is from the ATCC:
“SNB-19 (ATCC CRL-2219) and U-373 MG (ATCC HTB-17) - STR analysis at ATCC revealed that SNB-19, a human glioblastoma cell line has a STR pattern identical to that for U-373 MG (ATCC HTB-17). SNB-19 and U-373 MG also share derivative chromosomes. These observations were confirmed with the original stock available to ATCC. Since then distribution of SNB-19 was discontinued. U-373 MG (ATCC HTB-17) - As a result of sequencing, the authenticity of ATCC HTB-17 has been questioned by R.F. Petersson in Stockholm and collaborator E.G. Van Meir in Atlanta (personal communication and see Ishii, N., et al. Brain Pathol 9: 469-79, 1999). They report similarities between U-373 MG (ATCC HTB-17) and another glioblastoma, U-251. The cell line U-373 MG, obtained from the original lab in Uppsala has differing genetic properties from the ATCC HTB-17 (U-373 MG). Following further investigations, ATCC stopped distribution of this cell line.”
It is not only a concern about the naming of the line. The use of a single cell line grown in metabolically artifactual conditions for most of the studies weakens the ability to connect the results to the disease being studied. It also raises concern about global rigor overall. It would have been much better to consider using the BTSC cells for most of these studies. The validation efforts were minimal (sometimes even missing loading controls).
2) Figure 1A, C, D, F: I assume that EV really was with FLAG alone. If not, the comparison should be between FLAG-ZBTB18 and FLAG alone. In each of these studies, there were no replicates and only a single cell line.
3) Figure 1B: Why were CTBP1 and CTBP2 prioritized, instead of other molecules with more peptides?
4) Co-IP of endogenous proteins ZBTB18 and CTBP2 in a panel of cells would be important.
5) The shRNA experiments are poorly controlled. There is a single shRNA used and no rescue studies to address potential off-target effects. All experiments should include better controls.
6) As the authors note, ZBTB18 is expressed at different levels in different glioblastomas, with greater expression in mesenchymal tumors. I would suggest that the authors better consider defining the putative reciprocal function of ZBTB18 and CTBP2 with both loss-of-function and gain-of-function studies.
7) The in vivo studies are limited in scope. There is a single replicate of a single cell line (SNB-19, with the caveats above) with a single shRNA and no rescue studies.
8) It is not surprising that ZBTB18 and CTBP2 have differences in gene regulation, but the current studies make it difficult to fully support the overall model. There are no rescue studies that show the rescue of proliferation or other defects, which would be important for the molecular model.
9) MTOB is a regulator of the methionine salvage pathway, not simply CTBPs. Why wasn't methionine signaling investigated? The rescue efforts for MTOB with ZBTB18 failed, but it would be important to at least validate CTBP rescues.
10) It wasn't clear to me why SREBP signaling was not studied in rescue studies? There is largely an effort to show changes in transcription, but few functional studies to show rescue of metabolism, proliferation, and tumor growth.
11) Figure 4 should include endogenous ZBTB18 IP, as well, with better cells.
12) Figures 4-7 show that the media used for most studies is not really appropriate to study ZBTB18 and CTBP2 function. These efforts should include more consideration of serum-free conditions and in vivo studies, especially as many studies have shown that standard serum conditions with excess oxygen cause artifacts of metabolism.
13) The findings of changes in lipid metabolism are interesting, but quite preliminary. Lipid droplets have been strongly linked to aggressiveness in gliomas. The quantification does not show very strong differences. It would be important to show that the differences in lipid biology explain the effects of ZBTB18 and CTBP2 on tumor cell metabolism and proliferation. Are these findings the driver or passenger of effects?
14) I would suggest that the authors consider deeper in silico efforts to examine target expression and patient outcome or genetic events.
-
Reviewer #2:
In this manuscript, the authors claim that ZBTB18 interacts with CTBP2 and represses SREBP target genes to inhibit fatty acid synthesis in glioblastoma. However, the mechanisms presented in the manuscript are not convincing. This is because there are several major concerns for their conclusions as described below.
1) It looks that Figure 1D shows almost no endogenous interaction between CTBP2 and ZBTB18 when α-CTBP2 was used. This is perhaps because their cell lines may express very low ZBTB18 levels. Moreover, in reciprocal IP experiments using cells with FLAG-ZBTB-18 overexpression, α-ZBTB18 IP shows weak CTBP2 band that is inconsistent with the CTBP2 band in Figure 1C. In addition, this manuscript relies too much on results that were generated from overexpression for the tumor suppressor candidate gene ZBTB18.Therefore, it is possible that many results in this manuscript may represent artificial results based on FLAG-ZBTB18 overexpression. Of note, knockdown or loss-of-function experiments are generally better for a tumor suppressor genes.
2) ZBTB18 is a transcriptional repressor. CTBP2 is a transcriptional corepressor that interacts with LSD1 and other repressive proteins, although it may act as a transcriptional activator via the association with certain factors. If ZBTB18 interacts with CTBP2, it is reasonable to think that they would cooperate for gene repression and is also worthy to compare the effect of ZBTB18 knockdown with that of CTBP2 knockdown on gene expression. However, without a good rationale, authors compared the effect of ZBTB18 overexpression with that of CTBP2 silencing on gene expression. In this regard, they should have also compared the effect of ZBTB18 knockdown with that of CTBP2 knockdown on gene expression. If ZBTB18 knockdown is not suitable because of its low expression in their cell lines, they may have to use a different cell line.
3) LSD1's role: LSD1 can demethylate H3K4me2 and H3K4me1 but not H3K4me3. It may demethylate H3K9me2 in certain contexts (for example, upon the interaction with AR). Authors said "H3K9me2 is a well-established target of LSD1 demethylase activity" and then examined the effect of ZBTB18 overexpression on LSD1, H3K9me2, and H3K4me3 (but not H3K4me2) using quantitative ChIP. Authors should have checked H3K4me2 as well. Nevertheless, their results showed that ZBTB18 overexpression increased LSD1 and H3K9me2 but decreased H3K4me3. Authors then mentioned "a possible explanation is that the recruitment of CTBP2 complex by ZBTB18 to its target sites inhibits LSD1 demethylase activity and might be employed by ZBTB18 to counteract CTBP2-mediated activation.” However, another possibility would be that increased recruitment of ZBTB18 and LSD1, maybe along with CTBP2, would increase the repressive mark H3K9me2 but decrease the active mark H3K4me3. Perhaps, consistent with the latter possibility, authors mentioned that CTBP2 has been linked to the inhibition of cholesterol synthesis in breast cancer cells through direct repression of SREBF2 expression. To clarify this issue, authors need to show the effect of LSD1 knockdown on expression of SREBP target genes as well as on HDAC1/2, H3K4me2 and H3K9me2 levels at these genes.
Note: authors measured the LSD1 activity in nuclear lysates using a commercial kit. This assay is based on LSD1-mediated H3K4 demethylation but not H3K9 methylation. However, the purpose of this experiment appeared to show the effect of ZBTB18 on LSD1 activity for H3K9me2 demethylation. It is not clear that this was an appropriate use of this assay.
4) Some results are not entirely novel. For example, previous studies from authors and other groups showed that ZBTB18 negatively affected proliferation of cancer cells (Figure S2). In addition, other previous studies have reported that CTBP2 promotes tumorigenesis for hepatoma and may be a glioma prognostic marker (PMID: 27698809) (Figures 2I & 2J). LSD1-interacting proteins (Figures 4A-4C) have been known.
5) Many labels and legends for the figures should have been better described as they are often confusing and difficult to read. Along with this, many figures should have been better presented. Some examples are as follows:
• What is the protein number in Figure 1B?
• For multiple figures (Figures 2H, 3H, 3G & 3H, 4D-4I, 5C, etc), there are no statistical analysis.
• Authors should have better labelled to present their figures. For example, to present transfection and ChIP in Figure 3G, authors may want to use the labels as follows: EV + IgG; EV + α-FLAG; FLAG-ZBTB18 + IgG; FLAG-ZBTB18 + α-FLAG (instead of IgG_EV; FLAGEV; IgG ZBTB18; FLAG_ ZBTB18, respectively).
• In Figure 7E, SREBP target genes would be better than SREBP genes
-
Reviewer #1:
This manuscript explores the mechanism by which ZBTB18 regulates the expression of SREBP genes in glioblastomas. The authors use IP and MS experiments to identify CTBP2 as a new ZBTB18 binding protein. ChIP-seq shows some overlaps of CTBP2 with ZBTB18 largely on gene promoters. CTBP2 activates, while ZBTB represses the expression of some SREBP genes. ZBTB18 disrupts the CTBP2/LSD1 complex leading to increased H3K9me2, decreased H3K4me3, and gene silencing. SREBP proteins are transcription factors that control the expression of enzymes involved in fatty acids and cholesterol biosynthesis. Consequently, ZBTB18 expression leads to reduction of several phospholipid species. Overall, although this manuscript demonstrates the role of ZBTB18 in suppressing lipid synthesis and storage and a potential oncogenic role of CTBP2 in glioblastoma cells, the mechanism underlying its regulation of gene expression is still not clear.
1) According to the model, CTBP2 binds at SREBP gene promoters to maintain active transcription; expression of ZBTB18 enhances its binding to other LSD1 complex components and their chromatin association, however, on the contrary, ZBTB18 inhibits the enzymatic activity of LSD1 thus to repress gene expression. This model itself is seemingly paradoxical. Why does CTBTP18 recruit a corepressor (such as LSD1) and then inhibits its repressive function? Does LSD1 indeed function as a co-repressor or co-activator? Is its enzymatic function required?
2) LSD1 is well-known for its demethylation activity against H3K4 mono- and di-methylation; its demethylase activity on H3K9 is far from clear. The data as presented does not rule out the possibility that LSD1 is a co-repressor of ZBTB18.
3) The enzymatic assay in Figure 4J is preliminary. In vitro enzymatic assays using pure proteins with proper controls are necessary.
4) The analysis of ChIP-seq data is preliminary. In Figure 3B, there are close to 12K peaks of CTBP2 binding sites (EV-CTBP2 only) that are lost upon co-expression of ZBTB18, and these peaks are not bound by ZBTB18. How does this happen? Also, there are close to 10K of gained CTBP2 binding sites upon coexpression of ZBTB18, half of which are bound by ZBTB18. What are these peaks? I did not find information on how many repeats are done for each ChIP. If only once, this may simply reflect huge variations between experiments. Basic analysis to access the quality of ChIP-seq is also not shown.
5) Supplementary Figure 6A does not tell whether there is a good overlap between ZBTB18 bound peaks and the bindings of CTBP2 interactors (NCOR1, ZNF217 and LSD1). Vann diagrams need to be used to show overlaps with P-values.
6) The entire study relies on overexpression of ZBTB18. Complementary knockouts using CRISPR in cells expressing ZBTB18 are needed.
7) All Western blots miss protein standard markers. Percentage of input is also not labelled making it difficult to judge how strong the ZBTB18 and CTBP2 protein-protein interaction is.
-
-
www.biorxiv.org www.biorxiv.org
-
Author Response
This paper analyzes the evolution of the KRAB-containing zinc finger protein (KZFP) family of proteins. While the reviewers were all interested in the topic, several major concerns came up during review. These include technical limitations of the methods chosen to analyze this challenging protein family (e.g., determination of orthology, selection analysis, and so on), and that new ideas, including claims about non-coding evolution and positive selection, are not convincingly supported by the analysis presented.
Response: In our study, we focused on the co-evolution between zinc fingers in KZFPs and non-TE regions, not ‘non-coding regions’. Non-TE regions are located in both coding and non-coding regions.
Reviewer #1:
1) The title and abstract make it clear that the authors are trying to argue that non-coding sequence contributes to rapid evolution of the KRAB-ZFP family….
Response: As we mentioned above, we focused on the co-evolution between zinc fingers in KZFPs and non-TE regions, not ‘non-coding regions’. Non-TE regions are located in both coding and non-coding regions.
5) Page 6 line 122: The authors do not define, here or in the methods, what constitutes a "variant" KRAB domain.
Response: In fact, the meaning of variant KRAB domains had been simply described here (Page 6, line 120-122). The variant KRAB domains display a very significant degree of sequence divergence from the KRAB A-box consensus sequence, and variant KRAB domains are clustered into one separated branch in the phylogenetic tree of KRAB domain A-box amino acid sequence. This description was similar to that in the reference (Helleboid et al., 2019). We will explain it more detailly in method section in the further revision of the manuscript.
8) Page 9 lines 189-193: Does the 90% cited here refer to 90% of the ~50% that are called as "tending to bind non-TE sequence" or 90% of all KZFPs? Regardless, this point is very misleading: the fact that less than 50% of the binding sites of a KZFP is not found to overlap TEs does not mean that the KZFP only binds to non-TEs.
Response: Here, ‘90%’ refer to 90% of all KZFPs. We did not state that ‘less than 50% of the binding sites of a KZFP is not found to overlap TEs means that the KZFP only binds to non-TEs’. Instead, we mean that they tend to bind to non-TEs.
12) Page 11 lines 249-251: 1) It is not clear how the author defined genes as transcription factors (they also do not define the acronym), or why they included them in the analysis. 2) Additionally, the authors say that the divergence time of KZFPs is correlated with expression level but does not provide correlation values or the significance of these correlations.
Response: 1) Since KZFPs can bind to target genes and regulate their transcription, most of them are regarded as potential transcription factors. To confirm whether the special features of KZFPs found in our study are KZFP-specific or common to all transcription factors, we compared KZFPs with other transcription factors. The data source of transcription factors was described in the method section (page 18, lines 422-424). 2)We showed the correlation values in figure 4A and the corresponding P values were listed in Figure 4–source data 1.xlsx.
15) How were the target genes selected for qPCR validation among the KZFP targets? 2) In Fig.5 suppl. 3 the authors show that there is a fraction of genes that is only accessible in ESCs, but there is also a similar number of genes that is accessible both in ESCs and HEK293T cells, so the authors could have tried to validate some of those in both cell lines...
Response: 1) We screened the target genes with significant changes in the expression level from ESC into endoderm or mesoderm for qPCR validation. 2) Indeed, we have performed some validations (Fig.5 suppl. 1)
16) Page 16 lines 367-373: The conclusions that can be drawn from the ZNF611 reporter assay and associated evolutionary analysis are minimal. …There is almost no experimental methodology on how the tree was generated, how the authors overcame these issues, and how the authors identified the orthologous binding site in different species.
Response: Sequence alignments were performed using ClustalX (version 2.1) with default parameters (Larkin et al., 2007), and the phylogenetic tree (neighbor-joining tree) was constructed using MEGAX (Kumar, Stecher, Li, Knyaz, & Tamura, 2018) with default parameters. To identify the orthologous binding site in different species, firstly, we found the ZFN611 binding site in the ZNF611 ChIP peak sequence in STK38 promoter according to the predicted ZNF611 binding motif in human. Then we compared the ZFN611 binding site within the promoter of orthologous STK38 in different species.
17)There is not enough detail about how the human KRAB-ZFPs were identified. Bare minimum, the authors need to report thresholds used to determine if a protein's domains scored high enough to be either a KRAB or C2H2 ZF domain.
Response: All KRAB domains and C2H2 zinc fingers in human proteins were identified using HMMER v3.1b2 with E value < 0.01. The proteins containing both a KRAB domain and C2H2 zinc fingers were defined as KZFPs. This method was not described clearly in the manuscript (page 18). We will add detailed description of that in the revision.
20) How the authors performed the gene ontology enrichment/depletion analysis is not clear. For example, if the authors indeed prefiltered their list to remove genes that have no GO terms, that would bias the results.
Response: This has been described in the method section (page 23, lines 530-532). The genes that haven’t GO term annotation were filtered out. It’s also needed that the genes were expressed at least in one sample. These genes were regarded as the background of the enrichment/depletion analysis. This strategy was used widely in published papers.
Reviewer #2:
It is not clear how the authors identified KRAB-ZNF genes in the 80 species analysed, nor how they defined orthology relationship of KRAB-ZNF genes.
Response: The methods were described in lines 425-443 (page18-19). To identify the divergence time of KRAB domain in human KZFPs, protein sequences of 80 species from 80 genera in deuterostomia were downloaded from Ensembl database. All KRAB domains and C2H2 zinc fingers in proteins were identified using HMMER v3.1b2 with E value < 0.01. The proteins containing both a KRAB domain and C2H2 zinc fingers were defined as KZFPs. The divergence time of the full protein sequence was inferred according to the homology information from Ensembl Compara (Herrero et al., 2016; Vilella et al., 2009).
it is puzzling that the divergence time of the full protein sequence can be estimated above 400 Mya, while the root of the KRAB-ZNF gene family has been assigned to the common ancestor of coelacanths, lungfish and tetrapods (Imbeault et al., 2017).
Response: the root of the KRAB-ZNF gene family in the research (Imbeault et al., 2017) was based on the earliest appearance of the gene encoding both KRAB domain and zinc fingers. However, the divergence time of the full protein sequence based on pairwise alignments, Large-scale syntenies and Enredo-Pecan-Ortheus (EPO) multiple alignments (Herrero, et al., 2016). Thus, some of the orthologous proteins of human KZFPs do not containing a KRAB domain.
Peaks filtering should include, at the very least, the canonical ENCODE blacklisted regions (Amemiya et al., 2019)
Response: we have used the corresponding total input samples as controls to get credible peaks.
Of note, numerous ChIP-seq datasets from ENCODE are listed in the method, but are not referenced or mentioned in the text. Were those included in the ChIP-seq binding sites analysis? How do the two datasets (ENCODE and Imbeault et al., 2017) relate to one another?
Response: ChIP-seq datasets from ENCODE are included in the ChIP-seq binding sites analysis. We firstly used the ChIP-seq data in Imbeault et al., 2017, and ChIP-seq datasets from ENCODE were used as supplements of the ChIP-seq data of other KZFPs.
No details are given regarding the method used to assign "the expression level grade" of genes to a specific category.
Response: The threshold wasn’t described clearly in the manuscript. Genes with read counts over 10 are considered to be expressed, while genes with read counts less than 10 are considered to be unexpressed (undetected). For each dataset, we used the upper and lower quartiles of TPMs of all expressed genes to divide them into three expression level grades: low-abundant genes, the genes with TPMs lower than lower quartile; medium-abundant genes, the genes with TPMs between the lower quartile and the upper quartile; high-abundant genes, the genes with TPMs higher than the upper quartile.
The KD efficiency of ZNF611 is really poor (<20%, Figure 6B), and prevents further conclusions on this experiment (especially since a western blot cannot be performed). We are also sceptical about the statistical analysis performed in this panel. The authors should explain in detail which t-test was used and whether it was performed on raw or normalized values.
Response: The statistical method was described in the figure legend. We used grouped t test. And it was performed on normalized values (relative mRNA levels of predicted target genes were normalized to GAPDH).
-
Reviewer #3:
This paper gives the impression that it is two stories bundled together into one. One story is the evolution of the family and the other one is the experimental part focusing on a very specific KFZP, ZNF611. However, it is a rather weak synthesis with results of moderate interest and likely low phenotypic impact.
The authors state that the KFZP family is coevolving with TEs and suppresses their expression. According to previous knowledge, that is why this family is evolving so fast. However, the authors argue that this fast evolution is further attributed to the fact that KFZPs also positively regulate the promoters of other non-TE genes. They have analysed published Chip-seq data toward this end. Furthermore, they have experimentally identified that a "young" KFZP, ZNF611 can bind to a promoter element of the STK38 gene and positively regulate its expression in ESCs. However, I did not see substantial experimental evidence supporting a strong phenotypic effect of this particular regulation.
-
Reviewer #2:
This work interestingly addresses the evolutionary pressures undergone by KRAB-ZNF genes. However, a large part of the manuscript is based on the analysis of pre-existing datasets, but neither exploits these data in new ways nor reveals novel findings overlooked in the original studies. The authors' findings are not a significant addition to the conclusions made by the original investigations, which are, by the way, not properly referenced and often misquoted. Moreover, when the authors attempt to build a systematic method for the identification of non-TE related / activating functions for KRAB-ZNFs, the experimental validation tends to point to few regulatory exceptions rather than general principle for the KRAB-ZNF family. The paper finishes by the analysis of a single non-TE target of a young KRAB-ZNFs, ZNF611, which is clearly not the best candidate considering the proposed model of bimodal evolution of KRAB-ZNFs (old vs. young). The picture that comes out of this manuscript is that of a patchwork of analyses that struggle to stand together as a whole.
Major points:
Figure 1 / Comparison of the divergence time of the full sequence, KRAB domain and zinc fingers in KZFPs: The method section is not very clear, which suggests that the authors may have done their analysis by relying on pre-existing database annotations which could bias the estimation of the divergence time.
-It is not clear how the authors identified KRAB-ZNF genes in the 80 species analysed, nor how they defined orthology relationship of KRAB-ZNF genes. This should precede the estimation of the divergence time. Methods to infer orthology for KRAB-ZNF genes has been based the on best reciprocal hit of the full protein-sequence by Blast (Liu et al., 2014) or on KRAB-ZNF fingerprint (Imbeault et al., 2017). Is it based on Ensembl? It is known that Ensembl has a poor annotation of KRAB-ZNF genes especially in distantly related species with human. Clarification is needed regarding de novo KRAB-ZNF gene detection, annotation and comparison in the method section.
-Related to this, it is puzzling that the divergence time of the full protein sequence can be estimated above 400 Mya, while the root of the KRAB-ZNF gene family has been assigned to the common ancestor of coelacanths, lungfish and tetrapods (Imbeault et al., 2017). In addition, some of the oldest KRAB-ZNF genes found in the human genome are ~320 Mya (Liu et al., 2014). How do the authors reconcile this with the estimation of the full protein divergence time?
Figure 2 / The diversification pattern of KRAB domains and zinc fingers in humans: The authors suggest that old KZFPs tend to have a variant KRAB variant domain and thus are involved in non-canonical protein-protein interactions. This analysis has been entirely made in Helleboid et al., 2019, who further validated these results by identifying the interactome of these proteins by mass-spectrometry. Considering the timeline of this submission and the release of the original paper, the authors could have modified their conclusions. They could also have taken greater advantage of non-overlapping findings, such as the disordered nature of the variant KRAB domain. This is interesting but under-exploited.
Figure 3 / KZFPs tend to bind to non-TE regions in exon and promoter: The analysis of pre-existing data from different sources come with considerable drawbacks, notably in terms of unforeseen experimental artifacts and biases, which could affect peak calling, data interpretation and conclusion. As such, KZFPs may display promiscuous binding to unrelated "opened" regions, especially when they are overexpressed in a non-native context (Amemiya et al., 2019; Marinov et al., 2014). While the authors tested different parameters of the ChIP-seq analysis pipeline, I do not see any attempts to assess the overall reliability of KZFPs peaks within open regions in the method section or in supplementary figures:
-Peaks filtering should include, at the very least, the canonical ENCODE blacklisted regions (Amemiya et al., 2019). Additional steps of filtering should be included such as building background models that are experiment-specific and cell-type specific, as it has been done in the past (Helleboid et al., 2019; Imbeault et al., 2017; Schmitges et al., 2016). Does it change the overall proportion of peaks falling into TE/non-TE regions?
-As emphasized in the manuscript, targets of KRAB-ZNFs are expected to be highly specific (Schmitges et al., 2016) as only few of them display similar key amino-acids in their ZFs (Figure 2E/F) and may depend on the appearance of their binding site in evolution (Figure 6). As such, only a minimal overlap of non-TE targets peaks is to be expected for different KRAB-ZNFs proteins: it is likely that non-TE targets bound by many KRAB-ZNFs may result from promiscuous binding sites. The authors should show the overlap of non-TE targets bound by different KRAB-ZNFs before and after filtering steps.
-As a consequence, these promiscuous binding sites would skew the results of the over-/under-representation of genes in specific biological processes (as presented in Figure 3D) and gene essentiality tolerance (in Figure 3E). What would be the result of these analyses once peaks and gene lists are filtered? Similarly, what would be the result if only promiscuous binding sites were considered?
-Of note, numerous ChIP-seq datasets from ENCODE are listed in the method, but are not referenced or mentioned in the text. Were those included in the ChIP-seq binding sites analysis? How do the two datasets (ENCODE and Imbeault et al., 2017) relate to one another?
Figure 4 / KZFP genes encoding young zinc fingers tend to have higher expression level in early embryonic development and the ESC differentiation into mesoderm:
-The author should refer to previous work on young KZFPs expression during human embryogenesis (Pontis et al., 2019) when they introduce this section. This is especially important since the TE-controlling function of ZNF611 has been investigated in this study, and is not discussed or mentioned in Figure 6.
-No details are given regarding the method used to assign "the expression level grade" of genes to a specific category. Is it common arbitrary thresholds used for all genes or is it based on something similar to a z-score value ? Clarification is needed.
Figure 5 / KZFPs can positively regulate target genes by binding to non-TE regions in endoderm or mesoderm differentiation: We would suggest the authors reorganize the figure 5 to bring their strongest evidence of KRAB-ZNFs activating function in the main figure. For instance, genes over/under-representation (Figure 5C) and essentiality (Figure 5D) are not very informative. On the other hand, the Figure 5-figure supplement 1D/E could be presented in the main figure as it reinforces the link between chromatin accessibility and regulatory activities of KRAB-ZNFs in non-TE regions. Of note, while the authors may conclude to regulatory differences between ESC and HEK293, it would be farfetched to superimpose their conclusions to mesoderm and endoderm differentiation without experimental validation. Therefore, the authors should tone down their conclusion in the corresponding section.
For the KRAB-ZNFs functionally investigated in Figure 5-figure supplement 1D/E, the authors should highlight :
-Their divergence time, the type of KRAB domain, their known interactors and endogenous expression levels in ESCs, HEK293, during endoderm and mesoderm differentiation (it is impossible to zoom in Figure 4).
-The proportion of peaks falling in TE/non-TEs region and their associated chromatin accessibility in the different cell types (such as plotHeatmap function from the deepTools suite).
-The correlation matrix of the chromatin accessibility signals in non-TE binding sites between the two cell lines should be displayed for all the KRAB-ZNFs functionally investigated.
Figure 6 / The emergence of new sequence in STK38 promoter may drive the evolution of zinc fingers in ZNF611: While the emphasis on KZFPs divergence time and KRAB domain feature is clear in the first part of the manuscript, the shift toward the functional assessment of a young KRAB-ZNF is somehow inconsistent and should be explained.
-As mentioned above for the KRAB-ZNFs of Figure 5-figure supplement 1D/E, ZNF611 features (divergence time,...) should be displayed in the figure or stated in the text. The number of peaks of ZNF611 in non TE/ non-TE regions should be plotted. Also, previous work on ZNF611 function during embryogenesis should be introduced in this section.
-ZNF611 expression during mesoderm differentiation (with corresponding correlation) and ESCs should be added to Figure 6-figure supplement 1A.
Overall, the effect of ZNF611 overexpression or knock-down appears to be mild, and should be reinforced by additional information:
-Considering the discrepancy of the effect of ZNF611 overexpression and knock-down on the level of STK38 (Figure 6A/B): (i) a western blot analysis of ZNF611-FLAG protein levels in overexpressing cells (like in Figure 5 - figure supplement 1C) could indicate that the overexpression of the protein is actually mild compared to overexpression mRNA levels of ZNF611. Similarly, a previous study analysed the effect of ZNF611 overexpression in hESCs (Pontis et al., 2019), is STK38 upregulated in those datasets? That would reinforce the conclusions made by the authors.
-The KD efficiency of ZNF611 is really poor (<20%, Figure 6B), and prevents further conclusions on this experiment (especially since a western blot cannot be performed). We are also sceptical about the statistical analysis performed in this panel. The authors should explain in detail which t-test was used and whether it was performed on raw or normalized values.
-Since the BMPR2 gene remained unaffected by ZNF611 "KD" or "overexpression", could the authors show / perform the same analysis as for STK38 promoter region in Figure 1C-D for this gene?
-The authors emphasize that ZNF611 functions in mesoderm differentiation through STK38 regulation. This analysis was conducted in the pluripotent state (hESCs). What about the differentiation potential of these cells toward the mesoderm lineage? Does it prevent STK38 upregulation?
-The authors have shown that KRAB-ZNF effect is largely cell type dependent (figure 5 - figure supplement 1D/E), while the experimental assessment of ZNF611 was done in ESCs, the luciferase assay was performed in HEK293 (figure 6H-I). The authors should repeat the experiments in ESCs or tone down their conclusions.
-Interestingly, RACDE use TE-related sequences to identify the binding motif of KRAB-ZNFs, suggesting that the binding motif of ZNF611 to STK38 promoter is fairly similar to its TE-derived consensus motif (figure 6F). How many binding sites of ZNF611 in non-TE region present binding sites with a close similarity to the consensus motif derived from TE-binding? Are there changes in specific DNA bases of the canonical binding site motif that could predict activating function of ZNF611 in non-TE regions?
-
Reviewer #1:
Summary:
In this study, the authors seek to determine patterns of KRAB-ZFP family evolution and identify the factors that drive those patterns. To do so, they first annotated KRAB-ZFP genes in the human genome and determined the age of these genes in four different ways: orthology, divergence age of full protein, KRAB, and ZnF domain respectively. They found that age estimates based on the KRAB domain and Zinc finger array were older and younger, respectively, relative to full-length or orthology-based estimates of divergence, and that many human KRAB-ZFPs emerged in the eutherian common ancestor. They also determined that older KRAB-ZFPs were more likely to have variant, disordered KRAB domains, and that zinc finger arrays were most variable at the residues directly in contact with DNA. By reanalyzing existing data, the authors claim that most KRAB-ZFPs bind to non-TE regions, and that many KZFP genes are expressed during early embryonic development. They show correlative evidence that KRAB-ZFPs are capable of positively regulating gene expression, and functionally validate a single candidate gene of a KZFP using reporter gene assays. Based on this evidence, they propose a 2-way model of evolution of KRAB-ZFP evolution, where older KRAB-ZFPs are more likely to have non-TE silencing roles and thus have different patterns of evolution compared with younger KRAB-ZFPs.
General Comments:
While the subject of KRAB-ZFP family evolution is of interest, the data and conclusions the authors present in this manuscript are mostly confirmatory. Nearly every major conclusion of the paper, including the 2-way model of KRAB-ZFP evolution, has been extensively documented before by the Trono lab (Imbeault, et al. 2017 Nature; Helleboid, et al. 2019 EMBO J; Ecco, et al. 2017; Pontis, et al. 2019), many of which the authors cite. The conclusion that older KZFPs gained new functions not related with TEs repression (such as imprinting regulation or meiotic hotspot determination) is already well established knowledge, which goes together with the model of higher purifying selection of the zinc finger array to retain the binding specificity, while the KRAB domain loses interaction with KAP1. Furthermore, the fact that KZFPs don't only bind to TEs has also been already reported by Imbeault et al. that originally provided the datasets re-analyzed in this manuscript.
The functional validation of ZNF611 binding to one of its target sequences is welcome and adds another example of a KRAB-ZFP that might have positive transcription regulatory function, however it is only a single KRAB-ZFP in a single assay. The finding that a KRAB-ZFP is capable of activating gene expression is also confirmatory (Ye at al. 2004; Frietze et al. 2010; Hallen et al. 2011).
There is value in replicating existing research, but the article is not written with that in mind. One contrast with previous studies is that their reanalysis of existing ChIP-seq data showed KRAB-ZFPs primarily bind to non-TE regions. However, these findings are based on thin evidence. It is not enough to say that a KRAB-ZFP mostly binds non-TE regions because >50% of its binding sites are outside of a TE. Rather, more quantitative statistics, such as enrichment or depletion of binding in a given genomic compartment compared to a random expectation is required. Additionally, there is no evidence such as heatmaps or metaplots over a subset of peaks to further demonstrate that the peaks identified in the new analysis are any better than the previous analysis. The authors argue that the more significant p values of their peaks are indicative of better peak calls, but there is no formal comparison of true/false negative rate (such as at known binding sites). Furthermore, many TEs, which are poorly mappable, will have less significant p values simply because fewer unique reads are mapped there relative to unique sequences. More careful analysis will be needed to assess these claims.
Finally, the paper itself is hard to read and the logic is difficult to follow, often due to a lack of sufficient detail. The methodology is also light on details, making it challenging to understand exactly what the authors did or did not do (see specific examples below). Additionally, the figures (especially Figure 1, Figure 3A, and Figure 4) are difficult to read and understand as currently presented.
Specific Comments:
1) The title and abstract make it clear that the authors are trying to argue that noncoding sequence contributes to rapid evolution of the KRAB-ZFP family. While this is possibly true, the authors' data, which is limited to a phylogenetic analysis of a single gene (using methodology that does not work well for highly repetitive sequences such as the KRAB-ZFP C2H2 zinc finger array) and its potential binding site. Much more analysis (such as selection analysis of more KRAB-ZFPs and their predicted or empirically determined binding sites) is required to make this claim.
2) Page 4, lines 66-70: The authors present the two possible models of KRAB-ZFP evolution (ie: arms race/domestication model) as if they are mutually specific, when most argue they would not be. Also, the authors state: "and (2) the domestication model (Ecco et al., 2017; Pontis et al., 2019), in which KZFPs regulate domestication of TEs instead of restraining the transposition potential of TEs". This should be rephrased, because in most of the cases reported, the "domesticated TEs" have lost transposition potential and only regulatory and protein coding sequences got domesticated with new functions. If the authors were referring to the adaptation of KZFPs to non-TE related functions, this cannot be called domestication, since KZFP genes are already from the host.
3) Page 5, lines 91-93: Here and throughout the authors use language such as "later" or "earlier" which is confusing - these should be replaced with "younger/more recent" and "older".
4) Page 6, lines 111-115: This section is highly speculative and should be moved to discussion.
5) Page 6 line 122: The authors do not define, here or in the methods, what constitutes a "variant" KRAB domain.
6) Page 7 lines 129-133: The authors only inferred their conclusion, yet they state that their result is consistent with a previous study. No real evidence is provided there.
7) Page 7 line 138-140: The authors say that the data suggests variant KRAB domains were formed gradually rather than in a burst, but their analysis is not sufficient to conclude this. Also, the only conclusion that can be drawn from Figure 2A is that the KZFPs that were clustered as "vKRAB" are on a separated branch in the tree on the left. This would mean that early in evolution some KZFP got a "vKRAB" and subsequently this gene underwent duplication and diversification, like all the other KZFP genes with "sKRAB" did.
8) Page 9 lines 189-193: Does the 90% cited here refer to 90% of the ~50% that are called as "tending to bind non-TE sequence" or 90% of all KZFPs? Regardless, this point is very misleading: the fact that less than 50% of the binding sites of a KZFP is not found to overlap TEs does not mean that the KZFP only binds to non-TEs. Some of this non-TE binding could also be an artifact of overexpression, which has not been considered but which has been well documented (for example ZFP809, Macfarlan Lab, and PRDM9 Simon Myers lab).
9) Lines 196-197, the authors state that they randomly selected 30 KZFPs. The authors should state in a supplementary figure which KZFPs were selected and, among them, what is the percentage of KZFPs that bind or not to TEs according to the analysis performed in the original paper (Imbault et al. 2017) and in this manuscript.
10) Page 11 line 230: Here and throughout the rest of the document the authors use the acronym "PCGs" without defining it (outside a figure legend).
11) Page 11 lines 234-237: Here the authors cite their use of pLI, RVIS, Shet, and dN/dS values as evidence of purifying selection. Of those, only dN/dS measures purifying selection, and the authors do not specify whether the dN/dS values they obtain are statistically significant evidence of purifying selection relative to a neutral model (likely the case when only considering chimp-human, as the authors do). Moreover, while the other measures do suggest some constraint, the differences between the KZFP-TE and KZFP-nonTE protein coding genes is very subtle. Also, they don't provide any explanation as to why, according to their claim, there should be less purifying selection for the KZFPs involved in mesoderm differentiation. Thus, the authors should temper their claims or else omit this data.
12) Page 11 lines 249-251: It is not clear how the author defined genes as transcription factors (they also do not define the acronym), or why they included them in the analysis. Additionally, the authors say that the divergence time of KZFPs is correlated with expression level but does not provide correlation values or the significance of these correlations.
13) Page 12 lines 266-268: This is not surprising, since TEs are generally silenced, while the rest of the genes can be either active or silent, so comparison of accessibility of cumulative TEs versus non-TEs will inevitably show open chromatin for non-TEs.
14) Page 13 lines 280-286: Here the authors try to draw conclusions from comparing chromatin accessibility of binding sites in ESCs and 293T cells and conclude that because they are more accessible in ESCs that suggests that KRAB-ZFPs activate in conditions. In reality, it is difficult to compare epigenetic states across cell lines, especially in undifferentiated vs differentiated, making it almost impossible without genetic manipulation to determine that KRAB-ZFPs are the cause of these differences.
15) How were the target genes selected for qPCR validation among the KZFP targets? In Fig.5 suppl. 3 the authors show that there is a fraction of genes that is only accessible in ESCs, but there is also a similar number of genes that is accessible both in ESCs and HEK293T cells, so the authors could have tried to validate some of those in both cell lines...Also, if the KZFPs are responsible for the target genes activation, why overexpression did not activate genes that are repressed in HEK293T cells? The ChIP-exo dataset used here (from Imbeault et al. 2017) was obtained from overexpression of the KZFPs in HEK293T cells, so obviously the proteins could bind to these genes in this cell line. This would rather suggest that if it's true that the tested KZFPs can promote transcriptional activation, this might be a secondary effect, since it might rely on something else making the genes already accessible and expressed in ESCs.
16) Page 16 lines 367-373: The conclusions that can be drawn from the ZNF611 reporter assay and associated evolutionary analysis are minimal. First, the authors cloned in a large chunk of DNA (1.2kb) rather than just the predicted binding site. This is mitigated somewhat by the deletion, but the deletion construct also deletes sequence upstream of the binding sites making the results hard to interpret. Additionally, the evolutionary analysis is very weak - traditional methods to generate phylogenetic trees do not work well for repetitive sequences, such as the ZnF arrays, and the bootstrap values on the tree are poor. There is almost no experimental methodology on how the tree was generated, how the authors overcame these issues, and how the authors identified the orthologous binding site in different species.
17) Page 18 lines 417-424: There is not enough detail about how the human KRAB-ZFPs were identified. Bare minimum, the authors need to report thresholds used to determine if a protein's domains scored high enough to be either a KRAB or C2H2 ZF domain.
18) Page 19: Given the highly repetitive nature of KRAB-ZFPs, it is not sufficient to use the homology estimations from Ensembl to identify orthologous proteins. Other methods, such as synteny, should be used to confirm orthologs. Additionally, the authors identify homologs between different KRAB domains based on %identity, but this will likely give spurious results, as functional domains do not evolve neutrally and often have high similarity across proteins due to functional constraint. Regarding the phylogenetic analysis, there is again not enough detail to explain how the authors overcome issues with alignments and low bootstrap values - additionally, they did not perform a model test prior to constructing the tree, which can impact the final results.
19) Page 22 lines 517-520: The authors do not elaborate why they chose FC > 1.1 or FC < 0.9 to call differentially expressed genes
20) Page 23 lines 529-532: How the authors performed the gene ontology enrichment/depletion analysis is not clear. For example, if the authors indeed prefiltered their list to remove genes that have no GO terms, that would bias the results.
21) Page 24 lines 552-554: For the non-targeting siRNA, it is unclear whether this is a scramble or targeting another gene (such as GFP)?
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Reply to the reviewers
We would like to thank the reviewers for their comments and suggestions. Our responses to them are listed below. We are hopeful that they will be satisfied with our responses and the changes we made in the revised version of the manuscript.
REVIEWER #1
Reviewer #1 (Evidence, reproducibility and clarity (Required)): In this manuscript, Ameen and colleagues report the results of a multidimensional proteomic analysis which combined quantitative proteomics, phosphoproteomics and N-terminomics in an effort to identify neuronal proteins displaying altered abundance or modifications by proteolysis and/or phosphorylation following an excitotoxic insult. Excitotoxicity is known to initiate by over-activation of ionotropic glutamate receptors which allows an increase in intracellular Ca2+ , ultimately leading to activation of proteases. The analysis revealed that glutamate treatment for up to 240 min did not significantly affect the abundance of neuronal proteins but caused dramatic changes in the phosphorylation state of many neuronal proteins. Based upon the phosphopeptides and neo-N-peptides, which contain the neo-N-terminal amino acid residue generated through proteolytic cleavage of intact neuronal proteins during excitotoxicity, the authors identified the proteins that undergo phosphorylation, dephosphorylation and/or enhanced proteolytic processing in excitotoxic neurons. By combining different software packages, they found that these modified proteins form complex interactions that affect signaling pathways regulating survival, synaptogenesis, axonal guidance and mRNA processing. These data suggest that perturbations in the aforementioned pathways mediate excitotoxic neuronal death. Then, the authors showed by Western blot analysis that CRMP2, a crucial regulator of axonal guidance signaling, exhibited enhanced truncation and reduced phosphorylation at specific sites upon glutamate treatment. These events may contribute to injury to dendrites and synapses associated with excitotoxic neuronal death. Furthermore, the authors showed that calpains are responsible for the proteolytic processing and cathepsins for enhanced degradation of proteins during excitotoxicity. Blockage of calpain-mediated cleavage site of the tyrosine kinase Src during excitotoxicity confers neuroprotection in an in vivo model of neurotoxicity. In that regard, over twenty protein kinases are predicted to be activated in excitotoxic neurons. Collectively, this study contributes to the construction of an atlas of phosphorylation and proteolytic processing events that occur during excitotoxicity and as such they can be targeted for therapeutic purposes.
**Comments** Comment: The identification of potential calpain cleavage sites in neuronal proteins modified during excitotoxicity is an interesting finding of the study. However, the atlas presented appears to miss components such as Kinase D-interacting substrate of 220 kDa (Kidins220), also known as ankyrin repeat-rich membrane spanning (ARMS), a protein recently shown to be cleaved by calpain during excitotoxicity (López-Menéndez et al, 2019, Cell Death and Disease 10, 535).
Response: The calpain cleavage site of neuronal ARMS/KIDINS220 was mapped to the peptide bond between Asn-1669 and Arg-1670 (Gamir-Morralla, et al. (2015) Cell Death & Diseases 6, e1939). The cleavage is expected to generate two truncated fragments – one of ~185 kDa and another of ~10 kDa at the N-terminal and C-terminal sides, respectively of the cleavage site. Our TAILS analysis failed to detect the 10 kDa fragment which contains the neo-N-terminus generated by calpain cleavage. Here are the possible explanations:
The neo-N-terminus of the 10 kDa C-terminal fragment is unlikely to be observed in our experiment as the TAILS method relies on the production of peptides by trypsin. The 10 kDa fragment has Arginine as the first amino acid which means that the N-terminal peptide released and isolated by the TAILS method would be a single amino acid. In their publication, Gamir-Morralla, et al. showed that the total levels of both intact and degraded ARMS/Kidins220 decreased as a result of ischemic cerebral stroke, suggesting degradation rather than proteolytic processing to generate stable truncated fragments as the final outcome of calpain cleavage of ARMS/Kidins220 (Figure 2b of the publication by Gamir-Morralla, et al.). The TAILS method predominantly detects proteolytic processing whereas degradation can be more difficult to capture. Degradation often results in peptides containing less than 5-6 amino acids that are difficult to align with a single protein or result in transient peptide that may not be detectable in neurons at 240 min after glutamate treatment. **Overall, it is possible that Kidins220 is generated but was undetected by the TAILS approach.
Comments: The CRMP2 antibody (Cell Signalling, 35672) used for western blots (figure 5D, also figure S11) and immunofluorescence (figure 5E) is problematic. Copied from https://www.cellsignal.com/products/primary-antibodies/crmp-2-d8l6v-rabbit-mab/35672: Monoclonal antibody is produced by immunizing animals with a synthetic peptide corresponding to residues surrounding lle546 of human CRMP-2 protein. The truncated CRMP2 (figure 5D) studied in the whole section (residues 1-516 or 1-517, ~57kDa) cannot be recognized by this monoclonal antibody. The detected band with the red letters in figure 5D might represent another cleavage product. In any case, asking Cell Signalling for more information about the exact immunogen might help, but since it's monoclonal and derived from residues surrounding lle546 it's very hard to include residues before aa516 and the unique epitope recognition upstream of aa516. The whole result section and discussion has to be reconsidered. Alternatively another antibody can be used to repeat those experiments in order to support the hypothesis. Time and resources are very familiar to authors since they have to repeat their previous work with a new antibody. Finally, there are no "western blot" and "immunofluorescence" methods for CRMP2.
Response: We would like to apologise for incorrectly listing the catalogue number of the anti-CRMP2 antibody purchased from Cell Signalling technology. Rather than the rabbit monoclonal anti-CRMP2 antibody (Cell Signalling, Cat#: 35672), we used the polyclonal anti-CRMP2 antibody (Cell Signalling, Cat#9393) to perform all the Western blot and immunofluorescence analysis in this paper. The e-mail confirming the purchase of this antibody is appended. According to the vendor, the antibody was raised by immunizing rabbits with a synthetic peptide derived from the human CRMP2 sequence. We decided to order this antibody because Zhang, et al. (Sci Rep. 2016; 6: 37050) reported that it could detect the truncated CRMP2 fragments generated by calpain cleavage in primary cortical neurons in vitro in response to axonal damage.
*The procedures of Western blot and immunofluorescence detailing the correct CRMP2 antibody descriptions are added in the revised version of the submitted manuscript.
*
Comment: The truncated DCLK1 bands detected in figure S8B cannot be attributed to the proteolytic processing of DCLK1 at the sites described: T311↓S312, S312↓S313 and N315↓G316 (predicted M.W. of the (C-terminal) products: 48.7-49.1kDa (figure S8A) which is very close to be well-separated with conventional PAGE). The number and the separation of the bands suggest other cleavage sites. Response: We agree with the reviewer’s comment that conventional SDS-PAGE cannot differentiate the proteolytic products generated by cleavage at the three sites identified by TAILS. Furthermore, the TAILS methods could not detect all peptides generated by a protein during proteolysis. Therefore, validating our results with a Western blot experiment may reveal unidentified peptides in certain cases. We have now added the following statement in the revised manuscript to reflect the presence of other cleavage sites: “Besides detecting the 50-56 kDa truncated fragments, the antibody also cross-reacted with several truncated fragments of ~37-45 kDa. These findings suggest that DCLK1 underwent proteolytic processing at multiple other sites in addition to the three cleavage sites identified by our TAILS analysis.”
Comment: Could the striking observation that almost all proteolytic processing during excitotoxicity is catalyzed by calpains and/or cathepsins have derived (partially) from unspecific targets of calpeptin such as a subset of tyrosine phosphatases (Schoenwaelder and Burridge, 1999: approx. 1h treatment of fibroblasts with approx.. 10x less concentration) or other(s)? Response: Schoenwaelder and Burridge (1999, JBC 274:14359) reported that calpeptin exhibits both protease inhibitor as well as a protease inhibitor-independent activities in fibroblasts. Besides inhibiting calpains and cathepsins, they demonstrated that calpeptin could selectively inhibit a subset of membrane-bound tyrosine phosphatases. Since the TAILS method monitored the protease inhibitor activity of calpeptin, the proteolytically processing events mitigated by calpeptin in neurons during excitotoxicity are likely attributed to its protease inhibitor activity. Additionally, Schoenwaelder and Burridge reported this unconventional protease inhibitor-independent activity of calpeptin in fibroblasts. Since the protein tyrosine kinases expressed in neurons and fibroblasts are different, it is unclear if calpeptin can also exert such activity in neurons.
Comment: Describing the final part of figure 4C the authors suggest that "Liver kinase B1 homolog (LKB1), CaM kinase kinase β (CaMKKβ) and transforming growth factor‐β‐activating kinase 1 (TAK1) are the known upstream kinases directly phosphorylating T172 of AMPKα to activate AMPK (Herrero-Martin et al., 2009; Woods et al., 2005; Woods et al., 2003). Our findings therefore predict activation of these kinases during excitotoxicity (Figure 4C)." The first question arising here is whether these three kinases are the only ones know to phosphorylate AMPKα. Even if this is true, it is highly speculative to suggest that the findings of the present study predict the activation of these kinases during excitotoxicity, without providing the necessary experimental data, since the increased phosphorylation of AMPK may be an indirect effect of the reduced function of a phosphatase. Thus the proposed model does not hold. Response: Agree. We have therefore revised our interpretation of the results to reflect this possibility. The Revised sentence on page 13 reads “**Liver kinase B1 homolog (LKB1), CaM kinase kinase β (CaMKKβ) and transforming growth factor‐β‐activating kinase 1 (TAK1) are the known upstream kinases directly phosphorylating T172 of AMPKα to activate AMPK (Herrero-Martin et al., 2009; Woods et al., 2005; Woods et al., 2003), while a member of the metal-dependent protein phosphatase (PPM) family could dephosphorylate T172 of AMPK in cells (Garcia-Haro et al., 2010). Our findings therefore predict activation of these kinases and/or inactivation of the PPM family phosphatase in neurons during excitotoxicity (Figure 4C).”
Additionally, we also deleted the schematic diagram depicting the possibility of activation of LKB1, CaMKKβ and TAK1 in Figure 4 of the revised manuscript.
__**Minor points**
__
Minor Comment: Highlights could present the key points of the study in a more straightforward manner. Response: Agree. We have edited the highlights in our revised manuscript to make them more straightforward.
Minor comment: Figure 4A is too complicated. Proteins considered as hubs of signaling pathways in neurons should be somehow highlighted to distinguish them.
Response: Agree. We have now highlighted the signalling hubs by shading them in green in the revised figure. As we merged figures 2 and 4 of the original manuscript, these signalling hubs are presented in Figure 2B of the revised manuscript.
Minor Comment: The analysis of proteins with enhanced truncation and reduced phosphorylation such as CRMP2 and DCLK1 is fragmented. In addition, the authors should mention the criteria based on which these proteins were selected for further analysis.
Response: IPA analysis revealed synaptogenesis and axonal guidance as the top-ranked perturbed canonical signalling pathways governed by neuronal proteins undergoing significantly increased proteolytic processing and altered phosphorylation. As CRMP2 and DCLK1 are the key players in these pathways, they were chosen for further biochemical analysis to validate the TAILS results. To address this point, we added a few statements in the sections describing results of biochemical analysis of CRMP2 and DCLK1 in the revised manuscript. The additional sentences on page 13 now read “IPA analysis of the significantly modified neuronal proteins identified in our study predicted perturbation of signalling pathways governing axonal guidance and synaptogenesis in neurons during excitotoxicity (Figure S7). Since CRMP2 (also referred as DPYSL2) is a key player in neuronal axonal guidance and synaptogenesis (Evsyukova et al., 2013) and it underwent significant changes in phosphorylation state and proteolytic processing (Figures 5A and S7), it was chosen for validation of our proteomic results.” The additional sentences on page 15 read ”Similar to CRMP2, DCLK1 is also a key player in regulation of axonal guidance and synaptogenesis (Evsyukova et al., 2013). Since our TAILS results revealed significant proteolytic processing of DCLK1 (Figure S8A), it was chosen for validation of our proteomic results.”
- *
Minor comment: The potential therapeutic relevance of phosphorylation and proteolytic processing events that occur during excitotoxicity can be further explored. Response: Thanks for the suggestion. We have added a paragraph describing the additional evidence that protein kinase inhibitors and cell-permeable inhibitors blocking calpain cleavage of specific neuronal proteins as potential neuroprotectants to reduce brain damage induced by ischemic stroke. The additional sentences near the end of the Discussion section (page 25) now read “Since CRMP2 is key player in axonal guidance and synaptogenesis revealed by our proteomic analysis as the most perturbed cellular processes in excitotoxicity, blockade of its cleavage to form the truncated CRMP fragment is another potential neuroprotective strategy. Indeed, a cell-permeable Tat-CRMP2 peptide encompassing residues 491-508 close to the identified cleavage sites of CRMP2 could block calpain-mediated cleavage of neuronal CRMP2 and protect neurons against excitotoxic cell death (Yang et al., 2016)**.”
- *
The additional paragraph at the end of the Discussion section (page 25) now reads: “Besides the neuronal proteins undergoing enhanced proteolytic processing during excitotoxicity, protein kinases predicted by our phosphoproteomic results to be activated during excitotoxicity are also targets for the development of neuroprotective drugs. For example, our results demonstrated significant activation of neuronal AMPK during excitotoxicity, suggesting that aberrant activation of AMPK can contribute to neuronal death. Of relevance, small-molecule AMPK inhibitors could protect against neuronal death induced by ischemia in vitro, and brain damages induced by ischemic stroke in vivo. Likewise, inhibitors of Src and other Src-family kinases were known to protect against neuronal loss in vivo in a rat model of in traumatic brain injury (Liu et al., 2008a; Liu et al., 2017). Future investigation of the role of the excitotoxicity-activated protein kinases in excitotoxic neuronal death will reveal if small-molecule inhibitors of these kinases are potential neuroprotective drug candidates.”
*
*
Minor comment: I am sorry but I could not find Figure 8, which is supposed to show the "In vivo model of NMDA neurotoxicity" (please, see page 30).
Response: Our apology for the mistake. This should be Figure 6 of the revised manuscript.
Minor comment: Introduction: O'Collins et al., 2006; Savitz and Fisher, 2007; both references are missing.
Response:* This was an oversight from our part and the references have been added to the revised manuscript.**
*
Minor comment: Figure S1A-B: vehicle treatment time course is needed. Response: All neurons were cultured in neurobasal media for seven days. The control neurons were incubated in culture media while we started treating the other neurons with glutamate for MTT and LDH assay. The additional paragraph describing the design of the cell viability/death assays in page 32 reads “Primary cortical neurons were incubated for 480 min with and without the addition of 100 μM of glutamate. The control neurons were incubated for 480 min in culture medium. For neurons treated with glutamate for 30 min, 60 min, 120 min and 240 min, they were pre-incubated in culture medium for 450 min, 420 min, 360 min and 240 min, respectively prior to the addition of glutamate to induce excitotoxicity. For neurons treated with glutamate for 480 min, they were treated with glutamate just after seven days of culture in neurobasal media.”
- *
Minor comment: Figure 5E: Control close-up is missing. Response: A close-up view of the control neurons is now provided in Figure 4E of the revised manuscript.
*
*
Minor comment: "Moreover, the number of CRMP2-containing dendritic blebs in neurons at 240 min of glutamate treatment was significantly higher than that in neurons at 30 min of treatment (inset of Figure 5E)." Such a statistic is not shown in the graph. Response: The statistical analysis results are now added to the revised manuscript in Figure 5E.
- *
Minor comment: "Consistent with this prediction, our bioinformatic analysis revealed that the identified cleavage sites in most of the significantly degraded neuronal proteins during excitotoxicity are mapped within functional domains with well-defined three-dimensional structures (Figures 6A)." Authors might mean figure S12A? Response: Correct. Our apology for the mislabelling. This has been corrected to “S12A”in the revised manuscript.
Minor comment: "Neuronal Src was identified by the three criteria of our bioinformatic analysis to be cleaved by calpains to form a stable truncated protein fragment during excitotoxicity (Figures 6A and Table S6)." Authors might mean figure 6D?
Response: Correct. Our apology for the mislabelling. Since we merged figures 2 and 4 of the original manuscript. This has been corrected to now read “(Figure 5D)” on page 18 of the revised manuscript.
Minor comment: Figure 2B: Clusters 1, 3, 4 and 6 do not follow treatment trends homogenously in all time points. For example in cluster 1 there is a phosphopeptide following the pattern 1, 0, -1 and another one following the pattern 0, 1, -1, which is actually a very different pattern even if the end value is stable (-1). The first example could belong to the cluster 6 as well, while the second example to cluster 5. Please elaborate on the rationale behind the categorization. Is there any other clustering method that can be used without making the categorization more complicated? Response: Since we merged Figures 2 and 4 of the original manuscript. This comment relates to the right panel of Figure 2A of the revised manuscript. The rationale behind the categorization of the phosphopeptides into six clusters was based upon the patterns of changes of their abundance (i.e. average of log-2 normalized z-score of phosphopeptide intensity) in three sample groups. **We calculated the number of permutations where the number of sample groups in set (n) = 3 (i.e. Control neurons, neurons of 30 min glutamate treatment and neurons of 240 min glutamate treatment) and number of sample groups in each permutation (r) = 3 (i.e. all three sample groups should be present in each permutation). Hence the number of permutations is 6. The six clusters refer to the six possible permutations of the patterns of abundance changes of the identified phosphopeptides rather than the end results.
Minor comment: A problem of the manuscript is its length and lack of coherence. Apart from presenting the data from the proteomics, phosphoproteomics and N-terminomics analyses, the authors focus on several different proteins to perform validation experiments and further characterize the biological significance of their modification. Because these proteins do not fall on the same pathway, the authors end up presenting several independent stories that complicate the reader. Response: We agree that proteins that do not operate in the same signalling pathway were chosen for further biochemical analysis. Their choice was justified because they are key players in the most perturbed canonical signalling pathways identified by bioinformatic analysis with the IPA software. We agree that this may complicate the reader. However, it also helps to illustrate that excitotoxic neuronal death is a complicated cell death process caused by dysregulation of multiple neuronal proteins which regulate different cellular processes.
Minor comment: Moreover, it is necessary for the authors to restructure their introduction, and avoid over-representing previous research on nerinetide, which is not used anywhere in the manuscript. Instead, the introduction must be more focused to better capture the necessity and essence of the present study. Response: We agree. Based on the reviewer’s comments, we decided to restructure the introduction by shortening the description of the results of Nerinetide research. Please refer to the track changes of the revised manuscript for the changes.
Minor comment: Taking into account figures 1 and S2 I understand that the authors combined samples of neuronal cell cultures (treated or not with Glu) with samples from mouse brains (that have undergone ischemic stroke/TBI or sham operation). If this is the case, why did the authors do that? How did they combine the different samples? And why this is not mentioned anywhere is the main text? Response: For a data-independent acquisition (DIA) based mass spectrometry experiment, it is essential we generate a library of identifiable peptides first using a standard data-dependent acquisition (DDA) approach. For the DIA type experiment to work, the identified peptides have to be in that library first. Excitotoxicity is a major mechanism of neuronal loss caused by ischemic stroke and traumatic brain injury. We therefore included the brains of sham-operated mice, brains of mice suffering ischemic stroke and traumatic brain injury to construct the spectral libraries and that is why the library contains pooled samples from the representative samples. Pre-fractionation of the pooled peptides was also performed to increase the number of identifiable peptides and generate a deeper library.
- Once we generated that library, all samples are analysed individually as a separate DIA experiment. The DIA approach then makes use of the generated library for identification and quantitation. This methodology allows for deeper identification and lower number of missing values. These statements were added in the method section of the revised manuscript (page 33)*
Minor comment: Regarding figure 5D, the authors write in the main text "Consistent with our phosphoproteomic results, the truncated fragment CRMP2 fragments could not cross-react with the anti-pT509 CRMP2 antibody (Figure 5D)" In the upper blot the truncated CRMP2 fragment runs well below the 70 kDa marker. However, in the middle panel, where we see the blot with the phospho specific antibody, the respective area of the blot has been cropped, so we cannot see whether the truncated fragment cross-reacts with the phospho specific antibody. Response: The presentation of the western blots in Figure 5D in the revised manuscript are now less cropped and clearly demonstrate there is no cross reactivity of the phospho specific antibody with the truncated fragment. Please refer to the revised Figure 5 for the updated Western blot images.
Minor comment: It is strange that only 1 and 13 proteins showed significant changes in abundance at 30 and 240min respectively. Especially after 240min of glutamate treatment one could expect that many proteins should change in their levels, since the neurons are almost diminished by cell death at that point. How could the authors explain this phenomenon? Additionally, in their previous publication, they showed that much more proteins change significantly in abundance following glutamate treatment (at 30min and 240min).
Response: Even though our global spectral libraries contain over 49,000 identifiable peptides derived from 6524 proteins, only 1696 quantifiable proteins were identified in the DIA mass spectrometry analysis (Figure 1) because we used stringent criteria for their identification: (i) false discovery rate of We agree with the reviewer that many more proteins are expected to change their abundance at 240 min as significant cell death was detected. However, if we had used less stringent false discovery rates of their identification and quantification, included proteins with just one unique identified peptide and lowered the threshold of abundance fold changes, many more proteins with significantly changed abundance would be detected. But we preferred to use these stringent criteria to ensure a high confidence in our identification of neuronal proteins undergoing significant changes during excitotoxicity.*
*
*
In agreement with the low number of neuronal proteins exhibiting significant changes in abundance reported in this manuscript, our previously published study (Hoque, et al. (2019) Cell Death & Diseases) detected only 26 neuronal proteins undergoing changes in abundance. Hence, we disagree with the reviewer that our previous publication reported much more proteins undergoing changes in abundance in excitotoxicity.
Reviewer #1 (Significance (Required)): Comment on significance: The manuscript delivers a large amount of data, regarding changes in the proteome, the activation of specific kinases, phosphatases, as well as the molecular pathways that are activated at distinct time points of excitotoxicity. This information could be used in future studies to validate and develop potential therapeutic strategies that could protect against neuronal loss in various neurological disorders. Response: We are excited that Reviewer #1 felt that this large amount of generated data will be useful for subsequent studies to validate and develop novel therapeutic strategies.
Comment on significance: The same group has very recently published a work very similar to the particular manuscript (Hoque et al. Cell Death and Disease, 2019). In their previous publication, the authors cover a large part of their current objectives. They performed again a proteomic and phosphoproteomic analysis of mouse primary cortical neurons treated with glutamate for distinct time points, in their aim to identify changes in expression and phosphorylation state of neuronal proteins upon excitotoxicity. Apart from the N-terminome, which they investigate in their current manuscript, the proteomic and phospho proteomic analysis are very similar. As such, and because of the fact that the current manuscript is very extensive, the authors should consider to minimize it, and include only their novel findings (changes in the N-terminome, the involvement of specific kinases that contribute to excitotoxic neuronal death, the regulatory mechanism of CRMP2, etc).
Response: Since the coverage of phosphoproteins undergoing changes in neurons during excitotoxicity identified in the current study is much higher than that of phosphoproteins identified in our previously published study, we prefer to retain the description of the phosphoproteomic findings in this manuscript. Nonetheless, we agree that the manuscript needs to be shortened. Our suggestions to shorten the manuscript are listed below:
- Move the description and results of global proteomic analysis to supplementary information. Since we made the same observation that only a small number of neuronal proteins undergo significant changes in abundance during excitotoxicity in our previously published study, moving the global proteomic analysis results away from the main text will not adversely impact the quality of the presentation.
- For the description of how we classified the identified N-terminal peptides as those derived from degradation and those derived from proteolytic processing, we would like to move it to the supplementary information. Comment on significance: The authors should describe in a simpler way the proteomic and bioinformatics analyses they are using in the manuscript. It is difficult to understand the methodology used if you are not an expert in proteomics and bioinformatics. My suggestion is to revise their text and make it simpler and more concise. Response: We agree with this criticism. As we are not allowed to make a major revision of the manuscript at this stage, the revised manuscript contains only minor revisions that addresses all of the comments and suggestions provided by the two reviewers. Further changes will be added in the next revised version. Our suggestions to further restructure the manuscript are listed below:
Figure S5 depicting the rationale for classification of N-terminal peptides as products of degradation and those of proteolytic processing will be moved to the main text. The description of the rationale in the main text will be revised to help readers who are not experts in proteomics to better understand the rationale. A diagram depicting the workflow of our TAILS method will be added as a supplementary figure. For bioinformatic analysis of the proteomic results, we will provide in the supplementary information the definition of the following terms relevant to Ingenuity Pathway Analysis and PhosphoPath analysis of the perturbed biological processes and signalling pathways: (a) Canonical Signalling Pathways, (b) Cellular Processes and (c) Interaction Networks. A short description of how their identification benefits the mapping of the neurotoxic signalling networks in neurons will be provided in the supplementary information.
*
*
REVIEWER #2
Reviewer #2 (Evidence, reproducibility and clarity (Required)): Comment: In this article, Ameen and collaborators identify the modified proteins during neuronal excitotoxicity by using an in vitro model in which mouse primary cortical neurons are treated 30 and 240 min with 100 µM Glutamate. They use different approaches: a quantitative label-free global and phospho-proteomic methods and a quantitative N-terminomic procedure called Terminal Amine Isotopic Labelleling of Subtrates (TAILS). Results show that 240 min glutamate has minimal impact on protein abundance (13 neuronal proteins show significant changes) but enhance a modification of phosphorylation state and proteolysis of nearly 900 proteins. A significant part of these proteins are involved signalling pathway involved in cell survival, synaptogenesis and axonal guidance.
The paper is globally well written and experiments are convincing. The methodology and the analysis are well described and well explain. The text and each figure are clear and accurate. However, I have just one comment that needs answers and/or clarifications. Thanks for your work. Response: We appreciate the compliment provided by this reviewer on our submitted manuscript.
**Minor comment:**
Minor comment: Primary neurons are used at DIV7 and it has been shown that at DIV7 the percentage of astrocytes is relatively low, however astrocytes plays a key role in glutamate recapture and release. It will be relevant to know the percentage of glial cell in the culture model of the authors and how astrocytes are involved in glutamate recapture and also in excitotoxicity.
Response: The compositions of the DIV7 cultures are: 94.1+/- 1.1 % neurons, 4.9%+/-1.1% astrocytes, and *
Reviewer #2 (Significance (Required)):
Comment on significance: Excitotoxicity is a cell death process involved in many neurological disorders. However, nowadays, there are no existent FDA-approved pharmacological agents targeted to protect against excitotoxicity leading to neuronal death. A better comprehension of excitotoxicity is required to improve prevention, therapy and reparation following the disease.
With this work, the authors highlighted modified proteins in excitotoxic neurons. Interestingly, few of these proteins are involved in cell survival, mRNA processing or axonal guidance. This atlas of phosphorylation and proteolytic processing events during excitotoxicity permit the identification of new therapeutic targets such as calpain-mediated cleavage of Src kinase. This atlas will interest a lot of team working on neurological disorders such as Alzheimer disease, Parkinson disease or stroke. It will permit to better characterize cellular/molecular events involved in neuronal loss and will permit to find new therapeutic targets. Response: In response to this comment and a similar comment by Reviewer 1, we expanded the discussion to include the potential therapeutic values of our findings.
Comment on significance: My field of expertise: Stroke, cell death, excitotoxicity, signalling pathways and molecular targets, autophagy. I don't have sufficient expertise to evaluate proteomic analysis.
Response: No response is needed.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
In this article, Ameen and collaborators identify the modified proteins during neuronal excitotoxicity by using an in vitro model in which mouse primary cortical neurons are treated 30 and 240 min with 100 µM Glutamate. They use different approaches: a quantitative label-free global and phospho-proteomic methods and a quantitative N-terminomic procedure called Terminal Amine Isotopic Labelleling of Subrates (TAILS). Results show that 240 min glutamate has minimal impact on protein abundance (13 neuronal proteins show significant changes) but enhance a modification of phosphorylation state and proteolysis of nearly 900 proteins. A significant part of these proteins are involved signalling pathway involved in cell survival, synaptogenesis and axonal guidance.
The paper is globally well written and experiments are convincing. The methodology and the analysis are well described and well explain. The text and each figure are clear and accurate. However, I have just one comment that needs answers and/or clarifications. Thanks for your work.
Minor comment:
Primary neurons are used at DIV7 and it has been shown that at DIV7 the percentage of astrocytes is relatively low, however astrocytes plays a key role in glutamate recapture and release. It will be relevant to know the percentage of glial cell in the culture model of the authors and how astrocytes are involved in glutamate recapture and also in excitotoxicity.
Significance
Excitotoxicity is a cell death process involved in many neurological disorders. However, nowadays, there are no existent FDA-approved pharmacological agents targeted to protect against excitotoxicity leading to neuronal death. A better comprehension of excitotoxicity is required to improve prevention, therapy and reparation following the disease.
With this work, the authors highlighted modified proteins in excitotoxic neurons. Interestingly, few of these proteins are involved in cell survival, mRNA processing or axonal guidance. This atlas of phosphorylation and proteolytic processing events during excitotoxicity permit the identification of new therapeutic targets such as calpain-mediated cleavage of Src kinase. This atlas will interest a lot of team working on neurological disorders such as Alzheimer disease, Parkinson disease or stroke. It will permit to better characterize cellular/molecular events involved in neuronal loss and will permit to find new therapeutic targets.
My field of expertise: Stroke, cell death, excitotoxicity, signalling pathways and molecular targets, autophagy. I don't have sufficient expertise to evaluate proteomic analysis.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
In this manuscript, Ameen and colleagues report the results of a multidimensional proteomic analysis which combined quantitative proteomics, phosphoproteomics and N-terminomics in an effort to identify neuronal proteins displaying altered abundance or modifications by proteolysis and/or phosphorylation following an excitotoxic insult. Excitotoxicity is known to initiate by over-activation of ionotropic glutamate receptors which allows an increase in intracellular Ca2+ , ultimately leading to activation of proteases. The analysis revealed that glutamate treatment for up to 240 min did not significantly affect the abundance of neuronal proteins but caused dramatic changes in the phosphorylation state of many neuronal proteins. Based upon the phosphopeptides and neo-N-peptides, which contain the neo-N-terminal amino acid residue generated through proteolytic cleavage of intact neuronal proteins during excitotoxicity, the authors identified the proteins that undergo phosphorylation, dephosphorylation and/or enhanced proteolytic processing in excitotoxic neurons. By combining different software packages, they found that these modified proteins form complex interactions that affect signaling pathways regulating survival, synaptogenesis, axonal guidance and mRNA processing. These data suggest that perturbations in the aforementioned pathways mediate excitotoxic neuronal death. Then, the authors showed by Western blot analysis that CRMP2, a crucial regulator of axonal guidance signaling, exhibited enhanced truncation and reduced phosphorylation at specific sites upon glutamate treatment. These events may contribute to injury to dendrites and synapses associated with excitotoxic neuronal death. Furthermore, the authors showed that calpains are responsible for the proteolytic processing and cathepsins for enhanced degradation of proteins during excitotoxicity. Blockage of calpain-mediated cleavage site of the tyrosine kinase Src during excitotoxicity confers neuroprotection in an in vivo model of neurotoxicity. In that regard, over twenty protein kinases are predicted to be activated in excitotoxic neurons. Collectively, this study contributes to the construction of an atlas of phosphorylation and proteolytic processing events that occur during excitotoxicity and as such they can be targeted for therapeutic purposes.
Comments
The identification of potential calpain cleavage sites in neuronal proteins modified during excitotoxicity is an interesting finding of the study. However, the atlas presented appears to miss components such as Kinase D-interacting substrate of 220 kDa (Kidins220), also known as ankyrin repeat-rich membrane spanning (ARMS), a protein recently shown to be cleaved by calpain during excitotoxicity (López-Menéndez et al, 2019, Cell Death and Disease 10, 535).
The CRMP2 antibody (Cell Signalling, 35672) used for western blots (figure 5D, also figure S11) and immunofluorescence (figure 5E) is problematic. Copied from https://www.cellsignal.com/products/primary-antibodies/crmp-2-d8l6v-rabbit-mab/35672: Monoclonal antibody is produced by immunizing animals with a synthetic peptide corresponding to residues surrounding lle546 of human CRMP-2 protein. The truncated CRMP2 (figure 5D) studied in the whole section (residues 1-516 or 1-517, ~57kDa) cannot be recognized by this monoclonal antibody. The detected band with the red letters in figure 5D might represent another cleavage product. In any case, asking Cell Signalling for more information about the exact immunogen might help, but since it's monoclonal and derived from residues surrounding lle546 it's very hard to include residues before aa516 and the unique epitope recognition upstream of aa516. The whole result section and discussion has to be reconsidered. Alternatively another antibody can be used to repeat those experiments in order to support the hypothesis. Time and resources are very familiar to authors since they have to repeat their previous work with a new antibody. Finally, there are no "western blot" and "immunofluorescence" methods for CRMP2.
The truncated DCLK1 bands detected in figure S8B cannot be attributed to the proteolytic processing of DCLK1 at the sites described: T311↓S312, S312↓S313 and N315↓G316 (predicted M.W. of the (C-terminal) products: 48.7-49.1kDa (figure S8A) which is very close to be well-separated with conventional PAGE). The number and the separation of the bands suggest other cleavage sites.
Could the striking observation that almost all proteolytic processing during excitotoxicity is catalyzed by calpains and/or cathepsins have derived (partially) from unspecific targets of calpeptin such as a subset of tyrosine phosphatases (Schoenwaelder and Burridge, 1999: approx. 1h treatment of fibroblasts with approx.. 10x less concentration) or other(s)?
Describing the final part of figure 4C the authors suggest that "Liver kinase B1 homolog (LKB1), CaM kinase kinase β (CaMKKβ) and transforming growth factor‐β‐activating kinase 1 (TAK1) are the known upstream kinases directly phosphorylating T172 of AMPKα to activate AMPK (Herrero-Martin et al., 2009; Woods et al., 2005; Woods et al., 2003). Our findings therefore predict activation of these kinases during excitotoxicity (Figure 4C)." The first question arising here is whether these three kinases are the only ones know to phosphorylate AMPKα. Even if this is true, it is highly speculative to suggest that the findings of the present study predict the activation of these kinases during excitotoxicity, without providing the necessary experimental data, since the increased phosphorylation of AMPK may be an indirect effect of the reduced function of a phosphatase. Thus the proposed model does not hold.
Minor points
Highlights could present the key points of the study in a more straightforward manner.
Figure 4A is too complicated. Proteins considered as hubs of signaling pathways in neurons should be somehow highlighted to distinguish them.
The analysis of proteins with enhanced truncation and reduced phosphorylation such as CRMP2 and DCLK1 is fragmented. In addition, the authors should mention the criteria based on which these proteins were selected for further analysis.
The potential therapeutic relevance of phosphorylation and proteolytic processing events that occur during excitotoxicity can be further explored.
I am sorry but I could not find Figure 8, which is supposed to show the "In vivo model of NMDA neurotoxicity" (please, see page 30).
Introduction: O'Collins et al., 2006; Savitz and Fisher, 2007; both references are missing.
Figure S1A-B: vehicle treatment time course is needed.
Figure 5E: Control close-up is missing.
"Moreover, the number of CRMP2-containing dendritic blebs in neurons at 240 min of glutamate treatment was significantly higher than that in neurons at 30 min of treatment (inset of Figure 5E)." Such a statistic is not shown in the graph.
"Consistent with this prediction, our bioinformatic analysis revealed that the identified cleavage sites in most of the significantly degraded neuronal proteins during excitotoxicity are mapped within functional domains with well-defined three-dimensional structures (Figures 6A)." Authors might mean figure S12A?
"Neuronal Src was identified by the three criteria of our bioinformatic analysis to be cleaved by calpains to form a stable truncated protein fragment during excitotoxicity (Figures 6A and Table S6)." Authors might mean figure 6D?
Figure 2B: Clusters 1, 3, 4 and 6 do not follow treatment trends homogenously in all time points. For example in cluster 1 there is a phosphopeptide following the pattern 1, 0, -1 and another one following the pattern 0, 1, -1, which is actually a very different pattern even if the end value is stable (-1). The first example could belong to the cluster 6 as well, while the second example to cluster 5. Please elaborate on the rationale behind the categorization. Is there any other clustering method that can be used without making the categorization more complicated?
A problem of the manuscript is its length and lack of coherence. Apart from presenting the data from the proteomics, phosphoproteomics and N-terminomics analyses, the authors focus on several different proteins to perform validation experiments and further characterize the biological significance of their modification. Because these proteins do not fall on the same pathway, the authors end up presenting several independent stories that complicate the reader.
Moreover, it is necessary for the authors to restructure their introduction, and avoid over-representing previous research on nerinetide, which is not used anywhere in the manuscript. Instead, the introduction must be more focused to better capture the necessity and essence of the present study.
Taking into account figures 1 and S2 I understand that the authors combined samples of neuronal cell cultures (treated or not with Glu) with samples from mouse brains (that have undergone ischemic stroke/TBI or sham operation). If this is the case, why did the authors do that? How did they combine the different samples? And why this is not mentioned anywhere is the main text?
Regarding figure 5D , the authors write in the main text "Consistent with our phosphoproteomic results, the truncated fragment CRMP2 fragments could not cross-react with the anti-pT509 CRMP2 antibody (Figure 5D)" In the upper blot the truncated CRMP2 fragment runs well below the 70 kDa marker. However, in the middle panel, where we see the blot with the phospho specific antibody, the respective area of the blot has been cropped, so we cannot see whether the truncated fragment cross-reacts with the phospho specific antibody.
It is strange that only 1 and 13 proteins showed significant changes in abundance at 30 and 240min respectively. Especially after 240min of glutamate treatment one could expect that many proteins should change in their levels, since the neurons are almost diminished by cell death at that point. How could the authors explain this phenomenon? Additionally, in their previous publication, they showed that much more proteins change significantly in abundance following glutamate treatment (at 30min and 240min).
Significance
The manuscript delivers a large amount of data, regarding changes in the proteome, the activation of specific kinases, phosphatases, as well as the molecular pathways that are activated at distinct time points of excitotoxicity. This information could be used in future studies to validate and develop potential therapeutic strategies that could protect against neuronal loss in various neurological disorders.
The same group has very recently published a work very similar to the particular manuscript (Hoque et al. Cell Death and Disease, 2019). In their previous publication, the authors cover a large part of their current objectives. They performed again a proteomic and phosphoproteomic analysis of mouse primary cortical neurons treated with glutamate for distinct time points, in their aim to identify changes in expression and phosphorylation state of neuronal proteins upon excitotoxicity. Apart from the N-terminome, which they investigate in their current manuscript, the proteomic and phospho proteomic analysis are very similar. As such, and because of the fact that the current manuscript is very extensive, the authors should consider to minimize it, and include only their novel findings (changes in the N-terminome, the involvement of specific kinases that contribute to excitotoxic neuronal death, the regulatory mechanism of CRMP2, etc).
The authors should describe in a simpler way the proteomic and bioinformatics analyses they are using in the manuscript. It is difficult to understand the methodology used if you are not an expert in proteomics and bioinformatics. My suggestion is to revise their text and make it simpler and more concise.
-
-
www.biorxiv.org www.biorxiv.org
-
Author Response
Reviewer #1
1) In many instances inappropriate controls were used. For instance, a straightforward experiment to corroborate the authors model would be to employ cells that exclusively express non-phosphorylatable eIF4E mutant (such as eIF4E KI MEFs described in Furic et al., 2010) and/or MNK KOs to establish the requirement of eIF4E phosphorylation and potential cross-talk with MNK dependent mechanisms, respectively. Although there were some attempts to do this (e.g. MNK1 KD, using pharmacological inhibitors that are by the way quite non-specific), the data are insufficient to support the authors' claims. Moreover, the interaction between eIF4E and eIF4G and potential changes in the eIF4F levels that are likely to confound authors' conclusions were not assessed.
2) Several mechanisms involving indirect effects of mTOR on eIF4E phosphorylation that have been reported in the literature were not considered. For instance, it is plausible that mTOR affects eIF4E phosphorylation by bolstering eIF4E:eIF4G association and recruitment of MNKs.
Appropriateness of the controls to be employed is imperative. We would appreciate if controls that appear inappropriate were identified for us to improve upon. We also endorse that pharmacological inhibitors like MNK inhibitor tend to be promiscuous. However, their use in combination with knockdown experiments offers a reasonable choice for strengthening a data point. We are surprised at the insistence of the reviewer for his emphasis on indirect regulation of eIF4E phosphorylation via eIF4G and eIF4F to proximate mTORC1 and MNK response, despite the evidence herein that identifies direct regulation of this phosphorylation by mTORC1 coupled with rapamycin induced feed back response by MNK. Data generated by us so over the years including some interesting unpublished observations (Majeed R and Andrabi KI) have strengthened our contention that eIF4E phosphorylation is regulated by mTORC1 directly with eIF4E: eIF4G regulation as a back up.
3) The evidence for direct phosphorylation of eIF4E by mTOR was based on non-optimally designed experiments. The description of methodology for the in vitro kinase assays was inadequate, and the experiment was carried out solely using GST-WTeIF4E as a substrate without appropriate controls. There also appears to be rapamycin dependent eIF4E phosphorylation in KD mTOR lanes.
The in vitro kinase assay for eIF4E as a mTORC1 substrate has been described in detail by us previously (Batool et-al 2020). The experiment referred to, by the reviewer has been included as part of supplementary data only to serve as a ready reference.
4) The authors use non-transformed cells as a control for eIF4E overexpression, whereby eIF4E overexpression is well-established to transform immortalized cells (Work from Sonenberg's, Bitterman's etc. labs).
The primary data to appreciate the dynamics of eIF4E expression is represented by human tumour samples (Fig 1A-D), that clearly indicated tumour specific over-expression and eIF4E hyper-phosphorylation. In an attempt to substantiate the universality of this observation, we examined its expression across several cell lines including the ones that are not transformed. In addition, non-transformed cells were used to assess whether phosphorylation of eIF4E was a function of its over-expression which otherwise not be possible to appreciate in a tumour cell scenario.
5) Functional assays are warranted to establish the effects of proposed mechanism on cell functions/fate.
We appreciate the significance of functional assays and intend to include them wherever necessary.
6) Many blots throughout the paper were of insufficient quality to be clearly interpreted.
We would like to know which blots the reviewer is referring to.
7) Many interpretations of the results were not justified by the data (e.g. in Figure 1C it is claimed that phosphorylation of eIF4E is increased in overexpressors, but this could be simply due to the increase in total protein levels).
We do not believe that the enhanced phosphorylation of eIF4E is due to the increase in the total protein. As seen in Fig.1C the levels of the protein are the same throughout.
8) Most of the work relies on transient (except for FLAG-S6K1) overexpression strategies which are prone to artifacts and not likely to represent physiological stoichiometry of investigated proteins.
We have already used five stable cell lines. It is not possible to generate stable cells for every protein as we are studying signalling cross-talks. We believe that we have used enough positive and negative controls to rule out the possibility of artefacts.
9) It has been previously shown (e.g. Lowe & Pelletier's labs) that eIF4E confers resistance to rapamycin by mechanisms that were clearly distinct and at least in my opinion far better substantiated than those published previously by the authors and proposed here. Indeed, eIF4E overexpression results in increased eIF4F levels, which has been shown to attenuate efficacy of not just rapamycin, but also active mTOR inhibitors, and many other oncogenic-kinase inhibitors.
Our study although being in concert with other evidences suggesting the feedback activation of Mnk/4E pathway upon mTORC1 inhibition differs from some of the studies as quoted by the reviewer. The basic difference for this anomaly lies in the difference of the experimental conditions that we use to monitor the phosphorylation status of eIF4E, that lies from a range of 20 min to 48 hrs at 50nM concentration of Rapamycin. Studies carried out elsewhere use either 250nM conc. of rapamycin for 2hrs (Michael C. Brown-2017), 100nM for 2 hrs (Rebecca L Stead-2013) or use of rapalogs for 12 hrs (Pierre E Joubert-2015). Although, these and many other studies have implicated crosstalk to explain increase in 4E phosphorylation upon mTOR inhibition, yet they grossly fall short of comprehensively monitoring the status of 4E phosphorylation from 20 min to 2 hrs at lower conc. of rapamycin. We believe that use of higher concentration of Rapamycin allows the Mnk1 induced phosphorylation to resurface early (>3 hrs) to reconcile with the literature about the rapamycin dependent upsurge in 4E phosphorylation.
10) Many published articles are misinterpreted as supporting the authors' claims. For instance, the authors write that "the inconsistent stature of mTORC1 as a 4EBP1 kinase in vivo" and the reference provided suggests that GSK3beta may phosphorylate 4E-BP1 in addition to mTOR which in certain contexts may lead to rapamycin resistance. As far as I understand, this, and other similar studies, do not challenge the status of mTORC1 as a 4E-BP1 kinase in vivo, but that GSK3beta (and other kinases such as Pim kinases, CDK1) may also phosphorylate 4E-BPs in certain contexts. Moreover, as initial studies on active-site mTOR inhibitors by Thoreen et al., and Feldman et al., as well as studies from Blenis' and Sonenberg's groups indicated, rapamycin does not efficiently inhibit 4E-BPs n the vast majority of contexts, which suggest that GSK3beta-dependent resistance to rapamycin may result from mTOR effectors other than 4E-BPs
We have previously summarized the studies that question the stature of 4E-BP1 as an mTOR substrate. We would like the reviewer to go through that manuscript (Batool et al, EJCB, 2017). We have missed to cite that paper in this manuscript.
Reviewer #2
1) A large portion of Figures 1-3 is a reproduction of data from the authors' 2020 paper (Batool et al., 2020) which showed that elF4E is phosphorylated by MNK1, and that MNK1 is repressed by activation of mTORC1 signaling. While some new experiments have been added (e.g. the analysis showing increased expression of S6k1 in cancer cell lines/tissue and the in silico peptide docking analysis), these are minimal additions to the recently published work from this group.
This study was built on our previous publication that suggest eIF4E as an important effector of mTORC1. This study however, focusses on the regulation of S6K1 and following are the additions in the paper:
• Overexpression of eIF4E WT and S209E correlates with S6K1 phosphorylation and activity and is rapamycin-insensitive (Figure 1E, F and Supplementary Figure S1).
• S6K1 TOS, but not HM phosphorylation is required for its interaction with eIF4E (Figure 4A, D).
• mTORC1 is required for priming S6K1 for activation while as mTORC2 activity is responsible for phosphorylation of TOS- and CT-deficient S6K1 (Figure 5D, F).
• Identification of a region in S6K1 that mediates mTORC2 response (Fig 6).
• Identification of a short peptide in S6K1, which appears to interact with PHLPP1 (Fig 7).
2) One new finding in this paper is that elF4E binds the TOS motif on S6K1 and this binding promotes the hydrophobic motif phosphorylation of S6K1. The authors interpret their data to mean that binding of elF4E induces a conformational change to relieve autoinhibition. Is there any structural information to support this conformational change? What if the binding of elF4E recruits the hydrophobic motif kinase (mTORC2 proposed) in the absence of a conformational change? There are multiple other explanations that need to be considered and addressed.
TOS deletion/ mutation renders S6K1, inactive due to:
The failure of hydrophobic motif (HM) to get phosphorylated implying that TOS may recruit a kinase to phosphorylate HM and activate the enzyme (prevailing model). If this were true, then phospho-mimicking HM should rescue the loss of enzyme activity due to TOS- mutation, which however is not the case.
Or
The failure of carboxy terminal domain (CTD) to disinhibit, implying that TOS-engagement must somehow orchestrate CTD disinhibition (conformational change) to allow HM phosphorylation as a consequence. Since loss of function due to TOS-mutation/deletion can be rescued only by CTD truncation, it is reasonable to infer that TOS engagement with 4E must serve to remove inhibition due to CTD by a change in conformation to facilitate HM phosphorylation to occur in TOS independent manner.
Although there is no structural data, the inferences are compelling to propose the conformational change at the behest of eIF4E interaction with S6K1.
The possibility of mTORC2 recruitment by eIF4E is not supported by any data. This is because TOS &CTD deleted variant of S6K1 continues to be phosphorylated in a torin sensitive manner (Fig 5D).
Other consideration have also been discussed to the best of our ability.
3) The authors propose that PHLPP1 is constitutively bound to S6K1 to suppress hydrophobic motif phosphorylation, and serum stimulation causes the release of PHLPP1 to fully activate S6K1. Unfortunately, this potentially important mechanism is experimentally addressed by only 3 co-IPs in Figure 7: overexpressed PHLPP1 co-IPs with a GST fusion with residues 78-85 of S6K1, PHLPP1 co-IPs with S6K1 (and less efficiently in the presence of serum), the PHLPP1 regulation of S6K1 is abolished in a construct in which residues 78-95 are deleted. The identification of a PHLPP1-binding determinant on S6K1 is significant but the current data just scratch the surface. What are the residues? Are they evolutionarily conserved? Are they conserved in other PHLPP1 substrates? Does the GST fusion with these 8 amino acids result in the activation of S6K1 by sequestering PHLPP1? A compelling mechanistic analysis is missing and should be provided especially since PHLPP1 is in the title of the paper.
While deletion of sequence between 78-85 renders S6K1 non-responsive to serum stimulation, it does not affect its sensitivity towards rapamycin. Also, GST fusion of these 8 amino acids resulted in the activation of S6K1 as it sequestered PHLPP1. Some more experiments can be added to further support the contention. Three out of eight amino acids appear to be evolutionary conserved. We have performed a detailed mutagenesis of the region and the data is part of a manuscript in preparation.
4) Deletion of residues 91- 109 inactivates S6K1, which the authors interpret as meaning the regions is critical for mTORC2 binding and HM phosphorylation. But this encompasses the Gly-rich loop and its deletion will inactivate any kinase.
The deletion, 91-109, referred to by the reviewer, was introduced to evaluate the ability of this S6K1 variant to act as a substrate for mTORC2 mediated HM phosphorylation rather than to determine the state of S6K1 enzyme activity as perceived by the reviewer. Regardless of the influence this deletion may have in the activity state of S6K1, it should have no bearing on the ability of mTORC2 to phosphorylate S6K1at its HM situated 300 amino acids carboxy terminus to the deletion. Since this deletion results in the failure of mTORC2 to phosphorylate S6K1 at Hm, we drew following conclusion.
• This region appeared sufficient to mediate HM phosphorylation irrespective of the presence of TOS motif.
• That this region may support mTORC2 docking.
• That mTORC2 mediated S6K1 phosphorylation is specific and not a random event (Refer to discussion).
Reviewer #3
1) While the authors claim that MNK1 is not the "primary" kinase phosphorylating eIF4E, they fail to show the lack of CGP57380 effect on p-eIF4E(S209) and pS6K1(T412) phosphorylation in HEK293 cells they preferentially use for their experiments.
As suggested by the reviewer, the blots can easily be probed for p-eIF4E (S209) and pS6K1(T412) to check the effect of CGP57380 in HEK293 cells, though this has already been done in our previous manuscript (Batool et al, Molecular and Cellular Biochemistry, 2019).
2) The quality of pS6K1(T412) blots is questionable: while on Figure 1DEF, Figure 2A, Figure 5C and Figure 7B there is a clear single band, on Figure 1G, Supplementary figure S1, Figure 5ABDEF, Figure 6CDE and Figure 7ADE the authors ignore the strong band and appear to focus on the weak one.
The reviewer has rightly noticed the presence of one sharp band in some blots probed with Thr412 and two bands in few. The difference lies in the use of two different antibodies (Cell Signaling Technology Cat no. 9205 and 9234). One among them detects only one band while other detects two bands may be because of the potency of the antibody towards a particular species.
3) The authors do not comment on the reproducibility nor present quantitation of the essential experiments (Figure 1EFG, Figure 3D, Supplementary figure S1, etc). Quantitation should at least include essential WBs (pS6K1(T412) and p-eIF4E(S209)) and S6K1 activity towards S6 and must explicitly state the number of independent experiments and the reported statistic.
The quantitation for these figures can be added as suggested by the reviewer.
4) The authors should comment on the puzzling result in Figure 1F where control shRNA significantly decreases S6K1 activity towards S6.
We acknowledge that this is an anomaly and can be corrected.
5) The authors should consider alternative models. Thus, for instance, Blenis lab has previously shown that S6K1 and mTORC1 cooperate in the context of eIF3 complex. Could this mechanism contribute to the increased S6K1 activity upon eIF4E overexpression?
This possibility was over ruled as we observed a direct binding of eIF4E and S6K1.
Furthermore, I would strongly recommend extensive editing to improve the structure and style of the manuscript.
We agree to re-structure and re-style the manuscript as and when required.
-
Reviewer #3
High eIF4E/4EBP1 ratio is known to predict low cell sensitivity to mTOR inhibitors, suggesting that high eIF4E could help bypass mTOR requirement for cell growth and cap-dependent mRNA translation. The manuscript by Majeed et al examines how eIF4E affects S6K1 HM phosphorylation and activity. The authors claim that phosphorylated eIF4E (and not mRaptor) is the factor required "to overcome mTORC1 dependence of S6K1" activation and suggest mTORC2 (rather than mTORC1) as a kinase phosphorylating S6K1 HM.
To support this conclusion, the authors argue that:
• overexpression of eIF4E WT and S209E correlates with S6K1 phosphorylation and activity and is rapamycin-insensitive (Figure 1EF, Supplementary Figure S1)
• mTORC1 activity is required for S6K1 and eIF4E phosphorylation (Figure 2AB, Figure 3BCE)
• S6K1 TOS, but not HM phosphorylation is required for its interaction with eIF4E (Figure 4AD)
• MNK1 activity is not required for eIF4E phosphorylation (Figure 3CD)
• mRaptor is not required for S6K1 binding to eIF4E (Figure 4DE)
• mTOR is required for S6K1 activity and mTORC2 activity is responsible for phosphorylation of TOS- and CT-deficient S6K1 (Figure 5DF)
Further, the authors identify a short peptide in S6K1, which appears to interact with PHLPP1.
While some of the results are indeed interesting, the presented data are not sufficient to support the authors' central claim (that eIF4E and not mRaptor/mTORC1 is required for mTORC1-independent S6K1 phosphorylation and activity). Thus, the key experiment to demonstrate that (phosphorylated) eIF4E is necessary and sufficient for S6K1 phosphorylation and activity in the presence of rapamycin is missing. Figure 1F and Figure 1G come closest to that, but still fall short of convincingly supporting the central claim. Further, the fact that mTORC2 could phosphorylate the HM in TOS- and CT-deficient S6K1 has already been elegantly and definitively shown by Ali & Sabatini in their 2005 JBC publication.
Besides the central deficiencies outlined above, the following major points should be addressed:
1) While the authors claim that MNK1 is not the "primary" kinase phosphorylating eIF4E, they fail to show the lack of CGP57380 effect on p-eIF4E(S209) and pS6K1(T412) phosphorylation in HEK293 cells they preferentially use for their experiments.
2) The quality of pS6K1(T412) blots is questionable: while on Figure 1DEF, Figure 2A, Figure 5C and Figure 7B there is a clear single band, on Figure 1G, Supplementary figure S1, Figure 5ABDEF, Figure 6CDE and Figure 7ADE the authors ignore the strong band and appear to focus on the weak one.
3) The authors do not comment on the reproducibility nor present quantitation of the essential experiments (Figure 1EFG, Figure 3D, Supplementary figure S1, etc). Quantitation should at least include essential WBs (pS6K1(T412) and p-eIF4E(S209)) and S6K1 activity towards S6 and must explicitly state the number of independent experiments and the reported statistic.
4) The authors should comment on the puzzling result in Figure 1F where control shRNA significantly decreases S6K1 activity towards S6.
5) The authors should consider alternative models. Thus, for instance, Blenis lab has previously shown that S6K1 and mTORC1 cooperate in the context of eIF3 complex. Could this mechanism contribute to the increased S6K1 activity upon eIF4E overexpression?
Furthermore, I would strongly recommend extensive editing to improve the structure and style of the manuscript.
-
Reviewer #2
This manuscript builds on a previous publication from the authors identifying an mTORC1-sensitive and MNK1-mediated phosphorylation of elF4E, which they now propose is involved in the mechanism of activation of S6 kinase1 (S6K1). Specifically, the authors propose that the binding of MNK-1-phosphorylated elF4E to the TOR Signaling motif (TOS) of S6K1 relieves autoinhibition of the kinase, in turn promoting the phosphorylation by mTORC2 of the regulatory hydrophobic motif phosphorylation site. Furthermore, they propose that this phosphorylation is kept in check by binding of the phosphatase PHLPP1 to an 8 amino acid segment on S6K1, and that serum stimulation results in the release of PHLPP1 to increase phosphorylation at the hydrophobic motif and allow full activation. This is a potentially very interesting finding but unfortunately the data are poorly presented, many experiments are superficial, and alternative explanations are not considered.
Major comments:
1) A large portion of Figures 1-3 is a reproduction of data from the authors' 2020 paper (Batool et al., 2020) which showed that elF4E is phosphorylated by MNK1, and that MNK1 is repressed by activation of mTORC1 signaling. While some new experiments have been added (e.g. the analysis showing increased expression of S6k1 in cancer cell lines/tissue and the in silico peptide docking analysis), these are minimal additions to the recently published work from this group.
2) One new finding in this paper is that elF4E binds the TOS motif on S6K1 and this binding promotes the hydrophobic motif phosphorylation of S6K1. The authors interpret their data to mean that binding of elF4E induces a conformational change to relieve autoinhibition. Is there any structural information to support this conformational change? What if the binding of elF4E recruits the hydrophobic motif kinase (mTORC2 proposed) in the absence of a conformational change? There are multiple other explanations that need to be considered and addressed.
3) The authors propose that PHLPP1 is constitutively bound to S6K1 to suppress hydrophobic motif phosphorylation, and serum stimulation causes the release of PHLPP1 to fully activate S6K1. Unfortunately, this potentially important mechanism is experimentally addressed by only 3 co-IPs in Figure 7: overexpressed PHLPP1 co-IPs with a GST fusion with residues 78-85 of S6K1, PHLPP1 co-IPs with S6K1 (and less efficiently in the presence of serum), the PHLPP1 regulation of S6K1 is abolished in a construct in which residues 78-95 are deleted. The identification of a PHLPP1-binding determinant on S6K1 is significant but the current data just scratch the surface. What are the residues? Are they evolutionarily conserved? Are they conserved in other PHLPP1 substrates? Does the GST fusion with these 8 amino acids result in the activation of S6K1 by sequestering PHLPP1? A compelling mechanistic analysis is missing and should be provided especially since PHLPP1 is in the title of the paper.
4) Deletion of residues 91- 109 inactivates S6K1, which the authors interpret as meaning the regions is critical for mTORC2 binding and HM phosphorylation. But this encompasses the Gly-rich loop and its deletion will inactivate any kinase.
-
Reviewer #1
In this article Majeed et al propose a previously unrecognized model of S6K1 activation whereby eIF4E interacts with the TOS motif of S6K1, which facilitates phosphorylation of its hydrophobic motif by mTORC2. The authors also propose that another motif in S6K1 is responsive for serum induced and PHLPP1-mediated activation of S6K1. Furthermore, the authors propose that eIF4E may be a direct downstream substrate of mTORC1, and that mTOR is a major kinase that phosphorylates eIF4E. Although of potential interest, the data are frequently overinterpreted, the experimental design is not optimal, previous literature was not adequately considered, and many of the authors' conclusions were open to alternative explanations. My specific comments are outlined below:
1) In many instances inappropriate controls were used. For instance, a straightforward experiment to corroborate the authors model would be to employ cells that exclusively express non-phosphorylatable eIF4E mutant (such as eIF4E KI MEFs described in Furic et al., 2010) and/or MNK KOs to establish the requirement of eIF4E phosphorylation and potential cross-talk with MNK dependent mechanisms, respectively. Although there were some attempts to do this (e.g. MNK1 KD, using pharmacological inhibitors that are by the way quite non-specific), the data are insufficient to support the authors' claims. Moreover, the interaction between eIF4E and eIF4G and potential changes in the eIF4F levels that are likely to confound authors' conclusions were not assessed.
2) Several mechanisms involving indirect effects of mTOR on eIF4E phosphorylation that have been reported in the literature were not considered. For instance, it is plausible that mTOR affects eIF4E phosphorylation by bolstering eIF4E:eIF4G association and recruitment of MNKs.
3) The evidence for direct phosphorylation of eIF4E by mTOR was based on non-optimally designed experiments. The description of methodology for the in vitro kinase assays was inadequate, and the experiment was carried out solely using GST-WTeIF4E as a substrate without appropriate controls. There also appears to be rapamycin dependent eIF4E phosphorylation in KD mTOR lanes.
4) The authors use non-transformed cells as a control for eIF4E overexpression, whereby eIF4E overexpression is well-established to transform immortalized cells (Work from Sonenberg's, Bitterman's etc. labs).
5) Functional assays are warranted to establish the effects of proposed mechanism on cell functions/fate.
6) Many blots throughout the paper were of insufficient quality to be clearly interpreted.
7) Many interpretations of the results were not justified by the data (e.g. in Figure 1C it is claimed that phosphorylation of eIF4E is increased in overexpressors, but this could be simply due to the increase in total protein levels).
8) Most of the work relies on transient (except for FLAG-S6K1) overexpression strategies which are prone to artifacts and not likely to represent physiological stoichiometry of investigated proteins.
9) It has been previously shown (e.g. Lowe & Pelletier's labs) that eIF4E confers resistance to rapamycin by mechanisms that were clearly distinct and at least in my opinion far better substantiated than those published previously by the authors and proposed here. Indeed, eIF4E overexpression results in increased eIF4F levels, which has been shown to attenuate efficacy of not just rapamycin, but also active mTOR inhibitors, and many other oncogenic-kinase inhibitors.
10) Many published articles are misinterpreted as supporting the authors' claims. For instance, the authors write that "the inconsistent stature of mTORC1 as a 4EBP1 kinase in vivo" and the reference provided suggests that GSK3beta may phosphorylate 4E-BP1 in addition to mTOR which in certain contexts may lead to rapamycin resistance. As far as I understand, this, and other similar studies, do not challenge the status of mTORC1 as a 4E-BP1 kinase in vivo, but that GSK3beta (and other kinases such as Pim kinases, CDK1) may also phosphorylate 4E-BPs in certain contexts. Moreover, as initial studies on active-site mTOR inhibitors by Thoreen et al., and Feldman et al., as well as studies from Blenis' and Sonenberg's groups indicated, rapamycin does not efficiently inhibit 4E-BPs n the vast majority of contexts, which suggest that GSK3beta-dependent resistance to rapamycin may result from mTOR effectors other than 4E-BPs
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3
This manuscript describes a complete model of robust insect navigation. The originality of this remarkable work relies on a clear endeavour to describe the neural basis of each function involved in the homing behaviour of the ant. This paper focuses on the neural processing related to various theoretical hypotheses in terms of signal processing. Several previous studies replicated the route following behaviour but did not account for visual homing, i.e., the ability of the ant to return to familiar regions from novel locations. The proposed model extends the one proposed by Webb in 2019 to account for two very challenging points: the ability of the ants to home from new locations and the ability of the ant to switch between strategies according to the context.
Major points:
- I was very surprised by the slow velocity of the simulated ant (Vo = 1cm/s) compared to the real one (about 50cm/s). Why is the speed so slow? This point must be discussed. Is there any fundamental reason?
- Concerning the path integration strategy, the distance does not seem to be measured (odometer) or included in the model.
- What would happen to the simulated ant if an obstacle was placed on the familiar route? What is the robustness of the Zernike-based moment algorithm to the unpredicted presence of an obstacle that could appear during the homing? I suggest doing additional simulations in this sense that could show the robustness of the proposed navigation model. These new simulations could be in line with the well-known experiments proposed by Wehner and Wehner (Insect navigation: use of maps or ariadne's thread?).
Page 16, lines 417: would it be possible to plot Crf with respect to angular orientation of the simulated ant in various places (every 10° steps for example)?
-
Reviewer #2
The beautifully illustrated manuscript by Sun et al is a challenging but highly rewarding, interesting and intellectually stimulating modeling study that proposes a unified model of insect navigation, which, at least in large parts, is constrained by neuroanatomical and physiological data. It elegantly combines previous models of path integration of the central complex and visual learning in the mushroom body (underlying visual homing) and proposes a third model for habitual route following. In the end, all three models are integrated and mapped onto known neural structures of the insect brain, most notably the central complex and the mushroom body. The information extracted from the environment is decomposed using a novel method that separates rotationally invariant feature information from rotational variant directional information. While the first is used to carry out visual homing based on image familiarity, the second is used to follow habitual routes. The important novelty in the paper is that this new information processing strategy allows to integrate all mentioned navigational modules. Moreover, it does so using previous biologically constrained models and expands this basis towards a full system that can replicate numerous behavioral data from ants, including difficult experiments, in which ants have to trade off different strategies against each other. I highly welcome this paper as an important addition to both the literature on the insect central complex, as well as to more theoretical navigational work, in particular as many predictions can be made based on the presented models. Nevertheless I have several points that need to be addressed.
Major comments:
1) Accessibility to a broad readership. While the general text is written very well and the content is highly interesting for a life science (in particular insect neuroscience) audience, the methods section and some aspects of the reasoning behind the model are very technical. Being an insect neurobiologist myself, I struggle to follow large parts of the methods and had admittedly never heard of Zernike moments. Given that the mathematical model and the concepts of frequency analysis are the foundations of the paper, I suggest to add some more intuitive and broadly accessible language that would allow a biologist to grasp at least the key principles of what is done by those initial analyses of the visual information in the model (of course, the math is needed for a computational audience and essential for replication of the model, but a few additions might go a long way for biologists). A schematic illustration as to what Zernike moments are, maybe combined with some simple examples might help a lot. This is important as the paper is not only directed towards computational biologists, but is highly relevant also for physiologists, anatomists and behaviorists, most of whom (extrapolating from my own mathematical ignorance) probably fail to grasp the essence of the new principles presented.
2) Neuroanatomical correspondence of model details: The paper claims that the model is in most parts biologically constrained and that most elements can be mapped onto known neurons. Where this was not possible (route following) the authors speculated about the possible implementations. While on the levels of neuropil groups this is all quite true, the details, especially in the central complex, are less clear and many of the proposed circuits have no known counterpart in any insect brain to date. This is not saying that those parts of the model are not realistic or interesting, but that the claim that they correspond to existing neurons in the central complex, is slightly misleading. I will list a series of obvious mixups of cell types below, which need to be corrected (2.1), but additionally, it should be clearly stated where the model does not (yet) have a solid grounding in biology (see point 2.2). Finally, the speculative route following implementation seems at odds with neurophysiological data from various species and alternative pathways and implementations seem more likely (point 2.3).
2.1)
- Line 126: CPU3 neurons are supposed to be a mirrored TB1 ring attractor network? I'm not sure if this is what the authors want to say, as CPU3 neurons are known in locusts (Heinze and Homberg, 2008), but connect the PB with the FB as columnar cells. If the authors mean CPU4 cells, these neurons are also not forming a ring-network (even though they could receive shifted compass information from TB1 cells by some means). Most simply, would not a parallel set of TB1 cells be optimally suited for this task? There are four TB1 cells for each column in the PB, potentially enough for four parallel ring attractors. These cells are neurochemically distinct and could function independently (see Beetz et al, 2015).
- There is no known direct connection between the EB and the FB (proposed in figure 4)
- There is no direct connection from the OL to the CX (indicated in caption of figure 1 as underlying PI).
- line 348: CL2 neurons should be CL1 (CL2 correspond to fly P-EN neurons, not E-PG)
- In the PI section of the methods, sometimes TN cells are referred to as TN2 cells or just as TN cells. TN2 is one of two types of TN cells (tangential noduli neurons) and was the one primarily used for the standard model of Stone et al 2017. Please be consistent. Also, the tuning cells of the visual homing circuit are called TN cells. This is very confusing and should be changed.
2.2) There are no known ring attractors in the FB. The only ring attractor shown experimentally is the one in the EB/PB, which employs recurrent feedback loops with the PB (E-PG/P-EN/P-EG cells; equal to CL1a, CL2, and CL1b) and inhibitory neurons in the PB (TB1 or delta7 cells). While a similar recurrent connection pattern is thinkable in the FB as well, using unknown types of columnar cells, there is no experimental support for that. Pontine cells might also form local connections that could result in a RA, but that is even more speculative. Please clearly state that the numerous RAs required by the model are hypothetical and have not yet any biological correspondence in the form of identified cell types. Also, I suppose not all the neuron rings drawn in the figures are ring attractors. I suggest making that distinction clearer (the many abbreviations for the different neuron rings do not make this easier to follow either).
2.3) The authors assume a second compass system in the PB that is fed directly from the OL via the posterior optical tract. There is no evidence for this beyond a single cell type from locusts that connects the accessory medulla (circadian clock) to the POTU, which is also innervated by TB1 neurons. However, there is no connection to the visual part of the OL, and no physiological data exists on the AME->POTU connection. In contrast, the anterior optic tract via the AOTU has been shown in Drosophila to contain many neurons that respond to visual features and they converge on the head direction cells in the EB via a recently resolved mechanism. It seems odd to ignore this known compass pathway and propose another one for which no evidence exists. That said, the authors use the anterior pathway to construct a desired heading via an ANN residing in the AOTU/BU pathway, information that is then used to feed into an EB ring attractor that then connects to additional attractors in the FB. Whereas the EB attractor (in conjunction with the PB) exists, there is no evidence for FB based ring attractors and there is no known direct connection between the EB and the FB. While this all results in a really nice figure, it unfortunately is misleading and based on not enough evidence to show it so prominently (readers might easily take it for factual).
If I may, I would like to point out that there is an alternative solution for at least the compass problem: There are four individual CL1 cells in each column of the EB in locusts as well as in flies (EPG/PEG cells). While they are identical in their projection patterns, some connect the PB to the EB and others connect the EB to the PB, so that there are in theory enough cells to form two parallel recurrent loops (needed to maintain a head direction signal). One of them could be driven by landmarks, while the other could be driven by global compass cues. Whereas the current idea is that both inputs converge on a single head direction signal (celestial and local cue based), this might not be true, given that local cues have been tested in Drosophila and global cues in locusts and some other species. These neurons are neurochemically distinct and most likely play different functional roles.
Finally with respect to the desired heading, a short term plasticity based, associative mechanism linking the phase of the head direction signal and the local environment was recently demonstrated in Drosophila (Fisher at al. 2019 and Kim et al, 2019). The authors state that several of these phases can be stored and retrieved in each respective environment. To me this sounds very close to what the authors of the current study suggest for routes in ants. Please consider these points and revise the proposed circuit identity accordingly.
3) The overall layout of the model is not fully clear to me from the paper. The authors present many (nicely illustrated) parts of the model, but I fail to reconcile some of the partial models with one another and have no immediate way of seeing how many neurons there are overall, or what their complete connectivity patterns are. I assume this is all obvious from the code itself, but being a neuroanatomist and physiologist, I struggle to get an intuition for the circuits based on Python code. This hinders independent interpretation and finding alternative solutions for mapping the model onto anatomical neural circuits once newly discovered neurons become available in the future. I suggest including (at least in the supplements) a full graphical depiction of the model with all existing neurons and their connections. Maybe using a force directed graph diagram like used by the authors of Stone et al. 2017 for their path integration model results in a model illustration that is intuitively understandable for researchers who think more in terms of anatomy. But even if it turns out to be somewhat messy, it would still be helpful.
-
Reviewer #1
This is an interesting and timely study on a topic of considerable interest: computational strategies used by insects to perform their remarkable navigational feats. The authors identify shortcomings in existing models – specifically, that they do not account for the entire range of capabilities and the flexibility that the most accomplished of insect navigators display – and integrate and build upon prior models to successfully fill these gaps. The integrated model pins specific computational functions on specific anatomical structures, making it, in principle, testable in the near-medium term. The figures are well-made and the writing is compact but readable. Here are a few specific concerns:
1) It is entirely reasonable that the authors combine experimental and modeling work from a range of different insect species to build different pieces of their own model. By and large they are careful to state which is which. However, they could make it clearer which assumptions are based on experimental data and which are based on prior models (i.e., not actual data). As an example, although the mushroom body has been suggested by numerous modeling studies and conceptually driven reviews to be involved in visual navigation, the experimental evidence for this is lacking, and their precise role is far from well-established.
2) I commend the authors for integrating useful components from prior models to construct their integrated model, but, although the figures go some way towards clarifying how the different pieces might fit together, it would be useful to make even clearer what is entirely novel here and what is derived/integrated from previous work. In addition, although the authors make a testable case for the involvement of the fan-shaped body in a series of different navigational computations, controlled by the mushroom body, the figures are still somewhat complex and confusing. Please try and further clarify them.
3) The authors could derive more constraints from the fly physiology literature than they do. As examples, Fisher et al., Nature, 2019 and Kim et al., Nature, 2019 have relevant findings relating to plasticity in mapping visual stimuli onto a compass representation. Turner-Evans et al., eLife, 2017 has a data-driven ring attractor model that is relevant, and Turner-Evans, bioRxiv, 2019 features data demonstrating that the fly compass for current heading relies on visual input from the anterior optic tubercle, contrary to the authors' assumption deriving from an anatomical pathway from the posterior optic tubercle to the protocerebral bridge (175-176). On a somewhat related note, the fly heading system does not necessarily show 'bar following' in open loop (line 164): the experiments cited (Seelig & Jayaraman, 2015) were performed in closed loop, with the animal controlling bar position.
Tags
Annotators
URL
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3
The authors report the use of a novel model of intracardiac infusion of Aβ peptides in zebrafish larvae to study the effects of Aβ on sleep and neuronal activity. They provide convincing data that preparations of shorter Aβ oligomers induce neuronal activity and decrease sleep, while longer oligomers suppress neuronal activity and decrease sleep. They then delete known Aβ receptor proteins, and show that the effects of Aβ-short can be blocked by deletion of Adrb2 and Pgrmc1, while the effects of Aβ-long are blocked by prion protein deletion, or specific drugs.
This is a unique system and the method for administering Aβ that is quite powerful, and the experiments are rigorous and generally use multiple converging approaches (for instance genetic+pharmacologic) to support their findings. The reversibility of the effect, as well as blockade with specific pharmacological agents suggests that these are not non-specific toxic events. The findings provide a framework with which to potentially test other neurodegenerative proteins (such as a-syn), and to inform similar studies in mammalian systems.
1) While the experiments are well performed and the data intrinsically consistent, the applicability to mammals (and humans) is a consideration. Infusion of Aβ into the heart of larvae is a highly artificial system, and events that occur during sudden changes in Aβ levels may be different that those observed when Aβ is chronically present (as in AD). For example, infusion of Aβ peptide into the brains of mice or rats can induce acute, local neurodegeneration that is not observed in APP transgenic mice with chronically elevated Aβ levels. This is a fundamental shortcoming of the model, and there is little that can be done to address it, but it should be perhaps mentioned in the Discussion.
2) The implications of this bidirectional effect of short and long oligomers for sleep phenotypes in AD are also a bit unclear, as oligomers of all sizes are likely present in AD brain (though perhaps in different ratios as the disease progresses). It would be helpful to determine which pathway is dominant when both short and long oligomers are infused together, perhaps in different ratios. This is the only experiment I would suggest.
-
Reviewer #2
The use of zebrafish to investigate the role of beta amyloid polymers on sleep/wake regulation is potentially interesting as AD patients suffer from insomnia. Here Ozcan and colleagues inject oligomers synthesized in vitro into the fish neonate hearts and fish motion was then recorded and used as a proxy for sleep and wake states. The authors found a correlation between the polymer length and the impact on fish motor and brain activity.
While the findings are potentially interesting, several points are unclear or concerning to the reviewer:
1) First, all the experiments and interpretations rely on overexpression of Abeta polymers; there is no description or investigation in this study of the normal baseline of Abeta accumulation in this species. One would expect to see such data in Fig. 1 and S1 for example. Is there in fish a night vs. day, sleep vs. night rhythm of Abeta accumulation/expression?
2) The fish undergo anesthesia and heart perforation and are recorded a few hours later. How can handling, surgical stress, and confounds of prior anesthesia be eliminated from "sleep-wake" data interpretation?
3) It is hard for the reader to distinguish a specific effect on sleep/wake. Increased or decreased motion could be due to toxicity or specific stimulation of neuronal circuits due to non physiological presence of exogenous oligomers. The authors try to tackle this issue with cfos and ERK staining, but Fig. 2 shows at least 6 different staining patterns, none of them compared to a sleep/wake baseline of staining. It is quite worrisome to see such a broad over expression of cfos throughout the brain when A beta is accumulated. Are the fish having a seizure? Toxicity could lead to reduced motion and even if it's reversible it can still be transient toxicity until oligomers are washed out. Hyperactivity could be due to a specific overstimulation of neurons as illustrated by cfos and ERK staining.
4) Injections in mutant backgrounds indeed show some specificity in binding/interaction but still it does not demonstrate that the impact is on wake or sleep regulation per se. Again only motion or broad brain staining (at one time point) are shown. An alternative interpretation is that adrb2a, pgrmc, prp1 can indeed bind Abeta but relay the toxic or aspecific impact of oligomers over expression in a brain that normally does not accumulate such molecules.
This study has the potential to be extremely interesting but many controls and demonstration of endogenous Abeta role on sleep-wake cycle are needed.
-
Reviewer #1
There is a growing appreciation about the fundamental bidirectional link between sleep and Alzheimer's disease. Here Rihel and colleagues use a zebrafish model coupled to the injection of amyloid beta oligomers (the initiating pathogenic species for AD) to examine the link between Abeta and sleep. They demonstrate that the length of the oligomers determines whether Abeta induces wake (short Abeta) or sleep (long Abeta), providing novel insights into the role of different forms on sleep/wake. Importantly, they extend their findings to reveal novel molecular insights into the mechanisms into how Abeta exerts these sleep/wake effects. Overall, the findings make an important advance that will be of interest to a broad readership.
I have one significant concern relating to claims that these studies reveal novel functions for the endogenous Abeta. A key missing experiment in this regard is manipulation of the endogenous Abeta gene/protein (or even assessment of endogenous Ab) and thus it is unclear if exogenous (intracardiac) injection of Abeta faithfully reproduces how an endogenous neuronal pathway would deliver Abeta in terms of location, local concentrations and kinetics. I think the findings are significant and important on their own without having to make this claim, which in this case is highly speculative. I would suggest either addressing experimentally or rewording and de-emphasizing this point in the text to make clear the speculative possibilities. In any case, these shortcomings should be more forthrightly noted.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to Version 2 of the preprint: https://www.biorxiv.org/content/10.1101/610014v2
Summary
This study describes the use of intracardiac infusion of various sized amyloid-beta (Aβ) peptides in zebrafish larvae to study the effects of Aβ on sleep and neuronal activity and dissect the molecular mechanism of their action. They show that short Aβs induce neuronal activity and decrease sleep, while long Aβs suppress neuronal activity and decrease sleep. They use genetic perturbations to show that short Aβs act through Adrb2 and Pgrmc1, while long Aβs act via PrP.
As described below, the reviewers consider this manuscript to be a potentially important methodological and conceptual advance, but recommend that the authors address the following concerns:
The model is based on intracardiac injection of Abeta, so the phenotypes result from exogenous expression/overexpression. Given this, the authors should refrain from drawing conclusions about endogenous Abeta. At the same time, the manuscript would benefit from minimal characterization of the endogenous molecules. For instance, is there a rhythm of Abeta expression over the sleep:wake cycle?
The fish undergo anesthesia and heart perforation and are recorded a few hours later. What are the controls for handling, surgical stress, and confounds of prior anesthesia? On a related note, can the authors exclude toxicity, which could affect motion? They address this point by showing cfos and ERK staining, but many different patterns are observed and none are compared to staining under baseline sleep:wake conditions. It is also concerning that the c-fos expression is so widespread. The reversibility of the effect is important and the role of specific molecules is interesting, but these still do not demonstrate impact on wake or sleep regulation per se.
Given that AD brains likely have oligomers of all sizes, it would be good to know what happens when short and long oligomers are infused together.
Tags
Annotators
URL
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Reply to the reviewers
Reviewer #1 (Evidence, reproducibility and clarity (Required)):
The manuscript describes two advances. First is the technical development for a protein targeting system called PInT that brings a target protein close to (~320 bp) a DNA sequence of interest. The idea is that localisation of the target protein allows one to distinguish its effects on the DNA sequence either in cis (when targeted) or in trans (when not targeted but expressed at the same level). Since targeting is conveyed by simply adding the small molecule ABA to the experiment, it is easy to compare the two situations. This is a clever idea and it is substantiated by data showing that the components of PInT do not affect triplet repeat instability or gene expression of GFP, into whose gene the PInT system is placed. Moreover, targeting is shown to enable enzymatic activity in the targeted region. Using the DNA methylase DNMT1, there are local increases in DNA methylation. Similarly, targeting the histone deacetylase HDAC5 results in local decreases in histone H3 acetylation.
We thank the reviewer for a thoughtful and helpful review.
What is not clear from these experiments, however, is whether the targeted proteins can interact normally with partner proteins to form functional complexes. One necessary control is to add ChIP for at least one interacting protein each for DNMT1 and for HDAC5 and show that targeting permits normal protein-protein interactions. This experiment is straightforward as specific interacting proteins are known and good antibodies to precipitate those proteins are available.
This is a good suggestion and we plan on doing this experiment in our 59B-Y-HDAC5 and 89B-Y-DNMT1 lines with and without ABA using interacting proteins. The exact interacting protein to be used will depend on the antibodies availability and quality, which we will test. We will start with UHRF1 and HDAC3 for PYL-Dnmt1 and PYL-HDAC5, respectively.
Overall, PInT would likely be useful for many groups studying the effects of chromatin modifiers on a DNA sequence of interest.
The second advance is conceptual and is focused more specifically on triplet repeat expansions. The manuscript describes experiments that measure genetic instability of long CAG-CTG repeats with and without protein targeting. The results show that allele size distributions are not significantly affected by targeting either DNMT1 or HDAC5. One curious outcome that is not discussed is contraction frequency in the HDAC5 experiment. Zero contractions are reported compared to 10-20% contractions in the other two experiments. Authors need to provide an explanation.
Lack of contractions in this experiment is likely due to the lower number of repeats in this line (59 vs 89/91). It is known that longer repeats display higher frequency of contractions, and contractions are rarely seen in short repeats (Larson et al Neurobiology of Disease 2015, Gomes-Pereira et al PLOS Genet 2007, Morales et al HMG 2020). Albeit, the threshold may be different in our HEK293-derived cells. Of note, we had a clone of 89B-Y-HDAC5 that did not express the expected amount of GFP for unknown reasons and we did not use it here. However, small pool PCRs using this line with 89 repeats showed that contractions were indeed present. Although we cannot rule out that the reason for the contractions is the unknown mutation(s), it suggests that the difference is due to the size of the expansion. We have added a comment in the methods section.
It reads: “We have noted that cell lines with repeats that are mildly expanded (e.g., 59 CAGs) have fewer contractions than longer ones. This is consistent with several studies in the context of DM1 and HD [82], albeit the size threshold for seeing more contractions may be shorter in HEK293-derived cells than in mice.”
The major issue with this set of experiments is that there is no positive control where instability is shown to be clearly manipulated. A knockdown of FAN1 would be the most likely avenue to pursue for identifying a positive control. This is straightforward to perform since successful FAN1 knockdowns have been described in the literature.
We agree that a positive control to show that the model behaves as expected is necessary. We will add the experiments proposed by the reviewer in the revised version of the manuscript.
The manuscript also looks at effects on gene expression measured by GFP fluorescence intensity. The potential significance is to see if disease-causing genes with expanded triplet repeats can be silenced by targeting chromatin-modifying enzymes. In the examples tested here, the answer seems to be no. Expression of DNMT1 or HDAC5 reduce fluorescence even in the absence of targeting. Upon targeting, there is a small further decrease, but the expanded triplet repeat resists this further decrease. Domain analysis of HDAC5 indicates that protein-protein interactions, not deacetylase activity, are important for silencing. The key interaction may be with HDAC3, since small molecule inhibition of HDAC3 relieved repeat length-dependent silencing by HDAC5. It was very curious that targeting HDAC3 actually increased expression, instead of silencing. The explanation for this observation was inadequate.
We have added the following paragraph to the discussion to address this.
It reads: “We found that targeting of PYL-HDAC3 increases gene expression slightly, independently of repeat size and in the presence of an inhibitor of its catalytic activity. Although this appears counterintuitive, several studies suggest that this is not unexpected. Specifically, HDAC3 has an essential role in gene expression during mouse development that is independent of its catalytic activity [73]. Moreover, HDAC3 binds more readily to genes that are highly expressed in both human and yeast cells [74,75]. The mechanism or function of HDACs binding to highly expressed genes are currently unknown.”
The claim on page 16 final paragraph that the manuscript 'settled a central question for both HDAC5 and DNMT1 and their involvement in CAG/CTG repeat instability' is not supported by the data. Most of the results are negative so it is premature to claim the question is 'settled'.
We have rephrased all the conclusions about this in the text, emphasizing that we find no evidence of a role in cis, rather than stating that there is no role in cis.
Overall, with appropriate modifications described here, these experiments would be of interest with regards to potential therapies of triplet repeat expansion diseases, where silencing the expanded gene is the goal.
**Minor concerns**
P 4, last line. 59 bp should read 59 repeats - This is now fixed.
P 5, line 2. 38 bp of what? This is now amended. It reads: “The CAG/CTG repeats affect splicing of the reporter in a length-dependent manner, with longer repeats leading to more robust insertion of an alternative CAG exon that includes 38 nucleotides downstream of the CAG, creating a frameshift [30].”
P 10, first paragraph. DNA methylation levels rise from ~10% to ~20% with DNMT1 targeting. Is there a good precedent in the literature that the magnitude of this increase can be expected to be biologically meaningful?
To our knowledge, it is the first time that DNMT1 is used for targeted epigenome editing. This is therefore the first evidence that targeting DNMT1 leads to silencing of a reporter construct. Nevertheless, this reviewer’s comment stands: is an increase in DNA methylation of 10 to 20% biologically relevant? The answer to this is yes, changes in 10-20% are known to have functional impact on gene expression in various settings (for example see the recent study in developing oocytes by Li et al Nature 2018). Furthermore, there is evidence that DNMT1 has weak de novo activity (Li et al Nature 2018, Wang et al Nat Genet 2020), consistent with a small increase in CpG methylation upon targeting. We now acknowledge in the discussion that one reason for the lack of effect upon targeting may be that the changes in CpG methylation are not dramatic enough. We also point out more clearly that changes of 10 to 20% are correlated with changes in repeat instability (Dion et al HMG 2008). We have amended the text to reflect this.
The results now reads “To do so, we performed bisulfite sequencing after targeting PYL-DNMT1 for 30 days. This led to changes of 10 to 20% in the levels of CpG methylation, a modest increase(Fig. 3C), which is in line with the weak de novo methyltransferase activity of DNMT1 (for example see [39,40]). Similar changes in levels of CpG methylation in Dnmt1 heterozygous ovaries and testes were seen to correlate with changes in repeat instability in vivo [31].”
The discussion now states: “It should be pointed out that there remains the possibility that DNMT1 targeting did not lead to large enough changes in CpG methylation to affect repeat instability.”
P12 first paragraph. Text describing Fig 5 is confusing. First, GFP expression is referred to in terms of fold decrease, but subsequently in percent. Second, the ABA-induced silencing looks to reduce expression from about 0.6 to 0.5 of control. I presume this is where the claim of 16% comes from but it was not clear. Indeed, this is what we mean.
We now state: “In 16B-Y-DNMT1 cells, ABA treatment decreased GFP expression by 2.2-fold compared to DMSO treatment alone. Surprisingly, ABA-induced silencing was 1.8 fold compared to DMSO alone, or 16% less efficient in 89B-Y-DNMT1 than in 16B-Y-DNMT1 cells.”
P 15 paragraph 2. Where does the P value of 0.78 come from? Fig 7B shows no corresponding value. The P-value in figure 7B has now been corrected.
Reviewer #1 (Significance (Required)):
See above.
Reviewer #2 (Evidence, reproducibility and clarity (Required)):
**Summary:**
We still do not know whether epigenetics contributes to repeat instability and/or transcriptional activity in unstable CAG/CTG repeat associated pathologies. The aim of this manuscript is to examine whether induced binding of DNMT1 (CpG methylation) or HDAC5 (histone H3 acetylation) modulates CAG/CTG repeat instability and/or gene silencing upon expansion. For this the authors developed a highly sophisticated reporter system (PlnT) that allows to recruit a specific chromatin modifying enzyme (DNMT1/ HDAC5) to a GFP reporter near a CAG/CTG expansion, in the course of transcription (Dox-inducible promoter). This is to determine whether the CTGs, when lengthened and transcribed, become unstable or impede gene activity via epigenetic modifications.
We appreciate the reviewer highlighting the importance of the question that we address here and the usefulness of PInT.
**Findings:**
1.Binding of DNMT1 to the reporter results in a modest increase (~10%) in local DNA methylation, with no change in repeat instability.
3.Targeting HDAC5 to the reporter results in local reduction in histone H3 acetylation, with no effect on repeat stability.
4.DNMT1/HDAC5 binding reduces GFP intensity differentially, in normal but not expanded alleles.
5.The N-terminal domain of HDAC5, when mutated, abolishes the reduction in GFP expression levels.
6.RGFP966 abolishes the allele-specific effect of HDAC5, resulting in a general decrease in GFP expression regardless of repeat tract size
7.CTG expanded alleles abolish the reduction in GFP repression by HDAC5 via HDAC3 activity
**Conclusions:**
Based on the results using the PlnT reporter assay, the authors claim that:
1.HDAC5 and DNMT1 do not affect repeat instability in cis
2.Expanded CAG/CTGs reduce the efficiency of gene silencing by targeting DNMT1/HDAC5 to the locus
3.Gene silencing that is mediated by HDAC5 recruitment can be abolished by inhibition of HDAC3 activity
Unfortunately, none of the claims in this manuscript are convincing.
We note that in the comments below the reviewer does not include a reason why he/she does not find the claims convincing. We therefore cannot address this criticism.
**General Comments:**
The major drawback of the PlnT experimental approach is that it ignores the importance of the flanking regions and the genomic organization of the endogenous locus. This is a major concern as it makes the conclusions irrelevant to the related loci. In the case of myotonic dystrophy type 1, for example, the reporter should reside within a CpG island, should be positioned immediately next to CTCF binding site(s), and should be transcribed bi-directionally.
HDAC3 and DNMT1 were found to have effects on repeat instability both at reporters, which do not harbour flanking sequences from disease loci, and indeed at endogenous loci in vivo (Dion et al HMG 2008, Debacker et al PLoS Biol 2012, Suelves et al Sci Rep 2017, Williams et al PNAS 2020). This highlights the fact that cis elements from disease loci are not required for chromatin modifiers to affect repeat instability.
The reviewer is suggesting a very interesting set of experiments where specific sequences may be added to our reporter and tested for their influence on gene expression and on repeat instability. PInT is ideally suited for this and we have now added a paragraph highlighting this in the discussion. We have also highlighted that the current study aims to isolate the repeats from its cis-elements to specifically side-step potential locus-specific effects and to look for chromatin modifiers that would be useful for epigenome editing for as many loci as possible.
Furthermore, only large expansions (at least several hundred copies) can trigger heterochromatin at the DM1 locus. None of these features are recapitulated by the PlnT reporter assay, making it difficult to draw any conclusion regarding the role of these chromatin modifying enzymes to the locus.
This is true for DM1 but untrue for other disease loci. For example, we have shown that there are changes in the flanking chromatin marks at the SCA1 locus of a mouse model with 145 repeats (Dion et al HMG 2008), DNA methylation is also affected near a SCA7 transgene with 92 CAG repeats (Libby et al PLoS Genet 2008) and transgenes containing CAG repeats (without the flanking sequences) lead to silencing regardless of where the transgene is integrated in the genome (Saveliev et al Nature 2003). Moreover, HDAC5 had effects on repeat expansion in a cell-based shuttle system containing as few as 22 CAG repeats (Gannon et al NAR 2012), again suggesting that chromatin modifiers affect repeat instability in a wide range of repeat sizes. We have reviewed this in Dion and Wilson TiG 2009.
In fact, the authors state in their Discussion that "targeting a chromatin modifying peptide to different loci can have very different effects"!
This is indeed the case and the reason why we sought to control for locus-specific effects using an exogenous reporter.
To better substantiate their conclusions the authors must set up an improved model system that takes into account the flanking regions and the 3D genomic organization of the locus (TADs). The preferable approach would be to insert a reporter cassette by homologous recombination into the differentially methylated/acetylated regions near the repeats, and compare between normal vs. expanded alleles.
We would like to point out that we have recently published a study where we looked at 3D chromatin folding at the DM1, HD, and the GFP transgene used here. We did not find any evidence for changes in TADs that would underlie changes in repeat instability at these loci (Ruiz Buendia et al Sci Advances 2020). We therefore do not think that it would be important to further manipulate 3D genomic organization in this context.
To be clear, we are not denying that cis elements are likely to have an effect, there is plenty of evidence supporting this. Rather, we are using a reporter assay to disentangle the potential locus-specific (or cis-element specific) effects from the trans-activating factors. In short, we focus on the trans-acting factors rather than on the cis-elements, as suggested by the reviewer.
We believe that the addition of the following paragraph highlights the goal of our study and also bring in the idea that cis acting elements can be studied using PInT.
It now reads:
“We designed PInT specifically to isolate expanded repeats tracts from other potential locus-specific cis elements. This is helpful to identify factors that would affect instability and/or gene expression across several diseases. Moreover, both HDAC3 and DNMT1 were found to impact repeat instability at different loci, including at reporter genes [31,33,36,37,45]. These observations highlight that cis-acting elements from disease loci are not required by chromatin modifiers to affect repeat instability. A potential application of PInT includes cloning in specific cis elements, including CTCF binding sites and CpG islands, next to the repeat tract and evaluate their effects on instability with or without targeting. In fact, PInT can be used to clone any sequence of interest near the targeting site and can be applied for a wide array of applications, beyond the study of expanded CAG/CTG repeats.”
My impression was that there is a lot of data but none of it makes sense.
The focus of the manuscript is not entirely clear: it starts with monitoring the effect of epigenetics on repeat instability and gene activity, then it shifts to the mechanism by which HDAC5 functions, and ends with the allele-specific effect of HDAC5 on gene expression. I lost my train of thought.
We have now improved the transitions in this new version of this manuscript. Specifically, at the core of this manuscript is the development of PInT, which is highly versatile and allowed us to study multiple aspects of expanded CAG/CTG repeat biology. We hope that it is now clearer.
**Other concerns:**
(1)the modest increase in methylation levels following DNMT1 recruitment (10%, reaching a total of 20% at the most) prevents from drawing any conclusions regarding the effect of methylation on stability or expression.
As mentioned in the response to reviewer 1 above, although 10% to 20% of CpG methylation are associated with changes in gene expression in a variety of settings, we now point out that one reason for the lack of effect in cis is that the de novo activity of DNMT1 is too weak to produce an effect.
(2)The effect of protein targeting on GFP levels should be better defined at the RNA/protein level. Does it act by blocking transcription? alternative splicing? or alters steady state levels?
Although the exact mechanism remains unclear, this goes beyond the current scope of this study. All these possibilities remain possible as we pointed out in the discussion.
(3)Fig 5: the scale is different for A vs. B and C. Also, better to compare the effect of targeting on equal sized expansions (either 91, 89 or 58 repeats).
We have fixed the scale on the figures.
Unfortunately, it is not possible to have the same repeat sizes for all the cell lines because by their very nature, repeats are unstable. We have added a note relating to this in the methods.
It reads: “Notably, it is not possible to obtain several stable lines with the exact same repeat size as they are, by their nature, highly unstable. This is why we have lines with different repeat sizes. Furthermore, the sizes can change over time and upon thawing.”
(4)Add asterix for significance in all figures.
This has now been done.
(5)Figure 6: show raw data rather than normalized.
We have now added representative flow cytometry profiles for each construct as a new supplementary figure (S5).
(6)Figure 7: there is a notable difference in GFP expression levels in untreated wild type control (16 CAG repeats) between A vs. B. Why?
Fig. 7a shows PYL targeting only, whereas 7b shows the GFP expression upon PYL-HDAC5 targeting. The values for PYL-HDAC5 targeting are lower because targeting it, unlike targeting PYL alone, silences the reporter.
(7)Avoid redundancy. No need to show schematic representations so many times.
We believe that the schematics make it clearer for the reader.
Reviewer #2 (Significance (Required)):
REFEREES CROSS-COMMENTING
I totally agree with the Reviewer #1 that the PinT targeting system is a potent experimental tool to study the function of specific chromatin binding proteins. However, the significance of the flanking regions is discounted.
We hope it is now clear that we are not discounting the potential significance of flanking regions and that rather we have designed the system to avoid their potentially complicating effects.
The fact that the recruitment of HDAC5 has resulted in a significant reduction in acetylated histones provides evidence for that "the targeted proteins can interact normally with partner proteins to form functional complexes". Still, I agree with that the activity of DNMT1 needs to be better established, considering the minor increase in DNA methylation levels.
We will be using ChIP against interacting proteins of DNMT1 and HDAC5 to address this issue.
The request for a positive control for repeat instability is totally correct.
We will be adding this in the revised manuscript.
It is difficult to discuss the missing effect of HDAC5 on contractions or the unexpected effect of HDAC3 on gene silencing bearing in mind the limits of the experimental system.
There is no expectation for the effect of HDAC5 on contractions as this has not been studied in any system yet. However, we believe that there is no contractions not because of HDAC5 per se but rather because of the shorter repeat size this line has (see comment to reviewer 1 above). We have now addressed the “unexpected effect” of HDAC3 by citing a number of studies finding a similar evolutionary conserved effect (see comment to Reviewer 1 above).
I also agree with the statement that "this manuscript settled a central question for both HDAC5 and DNMT1 and their involvement in CAG/CTG repeat instability", is not supported by the data.
We have now rephrased our conclusions. In this particular case, we changed ‘settled’ to ‘addressed’. We have also rephrased this in the results headings.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
Summary:
We still do not know whether epigenetics contributes to repeat instability and/or transcriptional activity in unstable CAG/CTG repeat associated pathologies. The aim of this manuscript is to examine whether induced binding of DNMT1 (CpG methylation) or HDAC5 (histone H3 acetylation) modulates CAG/CTG repeat instability and/or gene silencing upon expansion. For this the authors developed a highly sophisticated reporter system (PlnT) that allows to recruit a specific chromatin modifying enzyme (DNMT1/ HDAC5) to a GFP reporter near a CAG/CTG expansion, in the course of transcription (Dox-inducible promoter). This is to determine whether the CTGs, when lengthened and transcribed, become unstable or impede gene activity via epigenetic modifications.
Findings:
1.Binding of DNMT1 to the reporter results in a modest increase (~10%) in local DNA methylation, with no change in repeat instability.
3.Targeting HDAC5 to the reporter results in local reduction in histone H3 acetylation, with no effect on repeat stability.
4.DNMT1/HDAC5 binding reduces GFP intensity differentially, in normal but not expanded alleles.
5.The N-terminal domain of HDAC5, when mutated, abolishes the reduction in GFP expression levels.
6.RGFP966 abolishes the allele-specific effect of HDAC5, resulting in a general decrease in GFP expression regardless of repeat tract size
7.CTG expanded alleles abolish the reduction in GFP repression by HDAC5 via HDAC3 activity
Conclusions:
Based on the results using the PlnT reporter assay, the authors claim that:
1.HDAC5 and DNMT1 do not affect repeat instability in cis
2.Expanded CAG/CTGs reduce the efficiency of gene silencing by targeting DNMT1/HDAC5 to the locus
3.Gene silencing that is mediated by HDAC5 recruitment can be abolished by inhibition of HDAC3 activity
Unfortunately, none of the claims in this manuscript are convincing.
General Comments:
The major drawback of the PlnT experimental approach is that it ignores the importance of the flanking regions and the genomic organization of the endogenous locus. This is a major concern as it makes the conclusions irrelevant to the related loci. In the case of myotonic dystrophy type 1, for example, the reporter should reside within a CpG island, should be positioned immediately next to CTCF binding site(s), and should be transcribed bi-directionally. Furthermore, only large expansions (at least several hundred copies) can trigger heterochromatin at the DM1 locus. None of these features are recapitulated by the PlnT reporter assay, making it difficult to draw any conclusion regarding the role of these chromatin modifying enzymes to the locus. In fact the authors state in their Discussion that "targeting a chromatin modifying peptide to different loci can have very different effects"! To better substantiate their conclusions the authors must set up an improved model system that takes into account the flanking regions and the 3D genomic organization of the locus (TADs). The preferable approach would be to insert a reporter cassette by homologous recombination into the differentially methylated/acetylated regions near the repeats, and compare between normal vs. expanded alleles.
My impression was that there is a lot of data but none of it makes sense.
The focus of the manuscript is not entirely clear: it starts with monitoring the effect of epigenetics on repeat instability and gene activity, then it shifts to the mechanism by which HDAC5 functions, and ends with the allele-specific effect of HDAC5 on gene expression. I lost my train of thought.
Other concerns:
(1)the modest increase in methylation levels following DNMT1 recruitment (10%, reaching a total of 20% at the most) prevents from drawing any conclusions regarding the effect of methylation on stability or expression.
(2)The effect of protein targeting on GFP levels should be better defined at the RNA/protein level. Does it act by blocking transcription? alternative splicing? or alters steady state levels?
(3)Fig 5: the scale is different for A vs. B and C. Also, better to compare the effect of targeting on equal sized expansions (either 91, 89 or 58 repeats).
(4)Add asterix for significance in all figures.
(5)Figure 6: show raw data rather than normalized.
(6)Figure 7: there is a notable difference in GFP expression levels in untreated wild type control (16 CAG repeats) between A vs. B. Why?
(7)Avoid redundancy. No need to show schematic representations so many times.
Significance
REFEREES CROSS-COMMENTING
I totally agree with the Reviewer #1 that the PinT targeting system is a potent experimental tool to study the function of specific chromatin binding proteins. However, the significance of the flanking regions is discounted. The fact that the recruitment of HDAC5 has resulted in a significant reduction in acetylated histones provides evidence for that "the targeted proteins can interact normally with partner proteins to form functional complexes". Still, I agree with that the activity of DNMT1 needs to be better established, considering the minor increase in DNA methylation levels. The request for a positive control for repeat instability is totally correct. It is difficult to discuss the missing effect of HDAC5 on contractions or the unexpected effect of HDAC3 on gene silencing bearing in mind the limits of the experimental system. I also agree with the statement that "this manuscript settled a central question for both HDAC5 and DNMT1 and their involvement in CAG/CTG repeat instability", is not supported by the data.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
The manuscript describes two advances. First is the technical development for a protein targeting system called PInT that brings a target protein close to (~320 bp) a DNA sequence of interest. The idea is that localisation of the target protein allows one to distinguish its effects on the DNA sequence either in cis (when targeted) or in trans (when not targeted but expressed at the same level). Since targeting is conveyed by simply adding the small molecule ABA to the experiment, it is easy to compare the two situations. This is a clever idea and it is substantiated by data showing that the components of PInT do not affect triplet repeat instability or gene expression of GFP, into whose gene the PInT system is placed. Moreover, targeting is shown to enable enzymatic activity in the targeted region. Using the DNA methylase DNMT1, there are local increases in DNA methylation. Similarly, targeting the histone deacetylase HDAC5 results in local decreases in histone H3 acetylation. What is not clear from these experiments, however, is whether the targeted proteins can interact normally with partner proteins to form functional complexes. One necessary control is to add ChIP for at least one interacting protein each for DNMT1 and for HDAC5 and show that targeting permits normal protein-protein interactions. This experiment is straightforward as specific interacting proteins are known and good antibodies to precipitate those proteins are available. Overall, PInT would likely be useful for many groups studying the effects of chromatin modifiers on a DNA sequence of interest.
The second advance is conceptual and is focused more specifically on triplet repeat expansions. The manuscript describes experiments that measure genetic instability of long CAG-CTG repeats with and without protein targeting. The results show that allele size distributions are not significantly affected by targeting either DNMT1 or HDAC5. One curious outcome that is not discussed is contraction frequency in the HDAC5 experiment. Zero contractions are reported compared to 10-20% contractions in the other two experiments. Authors need to provide an explanation. The major issue with this set of experiments is that there is no positive control where instability is shown to be clearly manipulated. A knockdown of FAN1 would be the most likely avenue to pursue for identifying a positive control. This is straightforward to perform since successful FAN1 knockdowns have been described in the literature. The manuscript also looks at effects on gene expression measured by GFP fluorescence intensity. The potential significance is to see if disease-causing genes with expanded triplet repeats can be silenced by targeting chromatin-modifying enzymes. In the examples tested here, the answer seems to be no. Expression of DNMT1 or HDAC5 reduce fluorescence even in the absence of targeting. Upon targeting, there is a small further decrease, but the expanded triplet repeat resists this further decrease. Domain analysis of HDAC5 indicates that protein-protein interactions, not deacetylase activity, are important for silencing. The key interaction may be with HDAC3, since small molecule inhibition of HDAC3 relieved repeat length-dependent silencing by HDAC5. It was very curious that targeting HDAC3 actually increased expression, instead of silencing. The explanation for this observation was inadequate. The claim on page 16 final paragraph that the manuscript 'settled a central question for both HDAC5 and DNMT1 and their involvement in CAG/CTG repeat instability' is not supported by the data. Most of the results are negative so it is premature to claim the question is 'settled'. Overall, with appropriate modifications described here, these experiments would be of interest with regards to potential therapies of triplet repeat expansion diseases, where silencing the expanded gene is the goal.
Minor concerns
P 4, last line. 59 bp should read 59 repeats
P 5, line 2. 38 bp of what?
P 10, first paragraph. DNA methylation levels rise from ~10% to ~20% with DNMT1 targeting. Is there a good precedent in the literature that the magnitude of this increase can be expected to be biologically meaningful?
P12 first paragraph. Text describing Fig 5 is confusing. First, GFP expression is referred to in terms of fold decrease, but subsequently in percent. Second, the ABA-induced silencing looks to reduce expression from about 0.6 to 0.5 of control. I presume this is where the claim of 16% comes from but it was not clear.
P 15 paragraph 2. Where does the P value of 0.78 come from? Fig 7B shows no corresponding value.
Significance
See above.
Tags
Annotators
URL
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This rebuttal was posted by the corresponding author to Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Reply to the reviewers
Reviewer #1
Summary
The authors present well written work on the evolution of proteome size and complexity, and the corresponding changes in chaperone proteins. Interestingly, they find chaperone copy numbers increase linearly with proteome size, despite the increasing 'complexity' of, in particular, post-LECA genomes. They suggest that to address the rise in complexity, organisms express chaperones at higher levels and an expanding network of co-chaperones has evolved across the tree of life.
Major comments
Comment-1. Summary reads strangely relative to the rest of the manuscript, and lists facts in a way that makes the purpose of the study confusing. I think most readers will dislike the characterisation of evolution as a progress from simple to complex, and the authors' might want to avoid this language throughout the manuscript- bacteria and archaea have also been evolving over this period of times, and have not become more 'complex'? Similarly the authors should reconsider their figure legend titles. As a specific example, 'in the course of evolution' should become 'across the tree of life'.
Response
Thank you for these crucial suggestions. We agree with the reviewer, and with Reviewer 2 (see below) that bacteria and archaea have also been evolving since their emergence, so basically, we (humans) and the simplest archaea have the same evolutionary origin. However, we all agree that the simplest archaea/bacteria are far more similar to LUCA than we are. That said, we accept the criticism that putting our analysis in the context of evolutionary time is an over-interpretation given that we have not examined the protein/proteome phylogeny (in relation to proteome complexity; for chaperones we have). We have thus reformulated the figures and text, to a comparison across the Tree of life, rather than a time-dependent evolutionary process. Specifically: as a first step, we revised the Figures to rename the X-axis as “Order of divergence”, rather than “Divergence time (million years)” in the previous version. In the revised main text we emphasized the fact that the branch lengths of the Tree of Life represent the relative order of divergence of the different clades, rather than time. All instances of ‘in the course of evolution’ has been replaced by ‘across the Tree of Life’.
Secondly, we revised the main text to emphasize on prokaryote vs. eukaryote comparison, rather than comparing organisms that diverged at different time-points. Within bacterial and archaeal domains, proteomes do not seem to expand against the order of divergence (as the reviewer argued, bacteria and archaea have not become more complex, also see Comment-5).
Thirdly, the word ‘complexity’ has been omitted from the manuscript. The section “The expansion of proteome complexity” now reads as “Proteome expansion by de novo innovations”. In the previous version, increasing complexity in fact implied a torrent of de novo innovations that impose a larger burden on the chaperone machinery. Instead of ‘complexity’, the latter is clearly stated in the revised manuscript.
In the spirit of these changes, the title of the revised manuscript, figure legend titles, and related section titles have been edited as follows.
Submitted version
Revised version
Paper title. On the evolution of chaperones and co-chaperones and the exponential expansion of proteome complexity
On the evolution of chaperones and co-chaperones and the expansion of proteomes across the Tree of Life
Section title. A Tree of Life analysis of the expansion of proteome complexity and chaperones
A Tree of Life analysis of the expansion of proteomes and chaperones
Section title. The expansion of proteome size
The expansion of proteome size across the Tree of Life
Section title. The expansion of proteome complexity
Proteome expansion by de novo innovations
Figure 1 legend title. Expansion of proteome size
Expansion of proteome size across the Tree of Life
Figure 2 legend title. Expansion of proteome complexity
Expansion of proteomes by de novo innovations
Further, changes have been made in the Summary and in the main text to exclude any impression that proteomes/organisms have become more complex with time. Rather we emphasized prokaryote versus eukaryote comparison.
Comment-2. I think the manuscript would be improved if the authors significantly shortened the discussion of genome size evolution- this is fairly well understood, and could be covered briefly, especially as the main focus of the manuscript is on the evolution of chaperone and co-chaperone repertoire. They could also make clearer quantitative links between protein complexity and the evolution of chaperones and co-chaperones- perhaps this should be in the discussion? The authors might also consider referencing 'The evolution of genome complexity', which could be relevant to this manuscript and might make the work of broader interest.
Response
We thank the reviewer for this suggestion. The main focus of our paper is indeed the evolution of chaperones and co-chaperones but within the context of the expansion of proteomes. Having this focus in place, the discussion on proteome size evolution (section: The expansion of proteome size across the Tree of Life) has been revised and shortened to emphasize more on prokaryote versus eukaryotic comparison.
The suggestion to provide “clearer quantitative links between protein complexity and the evolution of chaperones and co-chaperones” is indeed very useful and we authors sincerely thank the reviewer. To address this suggestion we revised Figure 4 to quantitatively compare the expansion of proteomes and that of chaperones, under one roof. This Figure compares proteome parameters that supposedly demands more chaperone action in all three domains of life and simultaneously summarizes the expansion of the chaperone machinery lacking de novo innovations.
The first paragraph of the Discussion section has been revised accordingly that walks the reader through the revised Figure 4 and finally introduces to the dichotomy it implies.
We did not understand the last comment “The authors might also consider referencing 'The evolution of genome complexity', which could be relevant to this manuscript and might make the work of broader interest.” We’d be glad to address it upon further clarification.
Comment-3. The authors state 'protein trees were generated and compared with ToL to account for gene loss and transfer events'. The methodology for this procedure is not given in the manuscript. The authors should back up this point, and make it clear this is why they reconstruct the trees. Currently it is not convincing to me that the authors have found HGT given the considerable phylogenetic uncertainty in the basal events in the tree of life. I also expect the tree of a single protein to be potentially lack information due to the short sequence considered and possible lack of power. The authors need to consider whether the data is really of high enough quality to assess this.
Response
Thank you for this suggestion. For the various chaperone families, we manually compared the protein trees with the Tree of Life. This is clearly stated in the revised Methods section (see Page 25, Lines 31-32). We agree, however, that the identifying HGT, and in general, trees of single domains that are highly diverged, are tricky. We did our best to address these caveats. Specifically:
We re-evaluated our work in the light of a recent study (PMID: 32316034). This paper discussed the phylogenetic uncertainties associated with molecular dating and re-evaluated the assignment of several protein families to LUCA. A careful analysis revealed that the reviewer is indeed right, meaning many of the HGT events shown in the previous version Figure 3B was indistinguishable from the phylogenetic uncertainties.
Accordingly, we revised the section “The core-chaperones emerged in early-diverging prokaryotes”. We removed the previous version Figure 3B, along with all instances of HGT events mentioned in the main text, except one (archaea to Firmicute HGT of HSP60, which is well-supported by the data and was also detected previously). Dating the emergence of chaperone families was also re-evaluated. Though the major conclusions were not altered, we discussed the phylogenetic uncertainties associated with our work and the overall confidence of each dating analysis. We believe these discussions would be very useful to the readers.
Finally, we note that most of our key assignments (points of emergence, and major HGT events) are in agreement with previous works. Specifically: the emergence of HSP20 and HSP60 to LUCA (Sousa et al., 2016; Weiss et al., 2016) and HSP60 being horizontally transferred from archaea to Firmicute (Techtmann and Robb, 2010) and HSP20 being horizontally transferred between bacterial clades and between bacteria and archaea (Kriehuber et al., 2010).
Comment-4. Methods- the authors could consider taking an alternative source of LUCA proteins, rather than those found in 'Nanoarchaeota and Aquificae': it's possible these are not representative of LUCA, and it seems a somewhat arbitrary choice- the authors could consider using one of the available curated sets, such as that generated by Ranea et al. (2006).
Response
The reviewer is right that a more robust LUCA set could be used. However, given that the revised manuscript focuses on comparison across the ToL, and foremost on prokaryote versus eukaryote comparison, we don’t think that refining this set is important. Foremost, this set was used for one purpose only, for determining changes in domain length. And, the set of 38 X-groups used for this analysis are in fact, the ones present in all organisms across the ToL. Hence, we kept the original analysis, while mentioning that these 38 X-groups are conserved across the ToL, and removed the argument for LUCA assignment. See Page 5, Line 22.
Comment-5. The patterns observed might only hold because of differences in the taxa that diverged pre and post LECA? The authors might consider subgroup analyses to ensure this is not the case. The authors could also consider using methods that take phylogeny into account.
Response
The reviewer is right that within prokaryotic domains proteomes do not seem to expand. For example, excluding a few early-diverging prokaryotes and parasites, proteome size in bacteria and archaea varies within 2000-3000 proteins per proteome. Only when pre-LECA and post-LECA organisms are compared, significant differences are observed. We thank the reviewer for this suggestion. We revised the main text to focus on prokaryote versus eukaryote comparison. This re-focusing does not change any of our major conclusions, but rather puts our analysis in the right context (see Comment 1).
Minor comments
Comment-6. 'Life's habitability has also expanded from its 10 specific niche of emergence-likely deep-sea hydrothermal vents, to highly variable and extreme 11 ranges of temperature, pressure, exposure to high UV-light, dehydration and free oxygen.' This is not really correct, as bacteria and archaea are found worldwide, and in the most extreme environments.
Response
Thank you for this suggestion. We removed the above-mentioned sentence.
Comment-7. 'We reconciled the topology of our tree'- on first read this was not clear, I did not realise the authors were only building trees for subsets of the data- time tree is the best source for the overall topology. The phrase 'manually curated and adjusted' is used in the methods. This language is much too vague, and not a clear explanation of the steps taken.
Response
We apology for this confusion. The overall topology of our Tree of Life is indeed taken for TimeTree. We edited the text in Page 4, Line 4 to clarify this issue.
The obtained tree topology was manually curated and adjusted to depict eukaryotes stemming from Asgard archaea and Alphaproteobacteria, by an endosymbiosis event. This is clearly mentioned in the Methods section (see Page 22, Lines 24-28).
Reviewer #2
Summary
Rebeaud and colleagues analyze evolution of chaperones compared to the evolution of whole proteome complexity across the entire tree of life. Their principal conclusions are well captured in the following quote from the Discussion:
"Comparison of the expansion of proteome complexity versus that of core-chaperones presents a dichotomy-a linear expansion of core-chaperones supported an exponential expansion of proteome complexity. We propose that this dichotomy was reconciled by two features that comprise the hallmark of chaperones: the generalist nature of core-chaperones, and their ability to act in a cooperative mode alongside co-chaperones as an integrated network. Indeed, in contrast to core chaperones, there exist a consistent trend of evolutionary expansion of co-chaperones."
Major comments
Comment-1. The general theme of the evolution of proteome management is of obvious interest. Unfortunately, the entire analysis is shaky and fails to convincingly ascertain the authors' conclusions. There are many issues. Throughout the manuscript, the authors discuss 'expansion' of the proteome in bacteria, archaea and eukaryotes, creating the impression of a consistent evolutionary trend. No such trend actually exists if one considers the means or medians of proteome sizes within each of the three domains of life (there is a transition to greater complexity in eukaryotes). The maximum complexity, certainly, increases with time which can be attributed to the 'drunkard's walk' effect. This hardly qualifies as 'expansion'.
Response
The reviewer is right that within prokaryotes proteomes do not seem to significantly expand. Reviewer-1 raised a similar concern that prokaryotes and eukaryotes have been evolving for the same period of time and have not expanded significantly. We understand the misconception instated by the earlier version and we thank the reviewers for pointing it out. Accordingly, we revised the main text to clarify these issues, as described in the following.
Firstly, the main text was revised to emphasize on prokaryote versus eukaryote comparison. The reviewer agrees that compared to prokaryotes, “there is a transition to greater complexity in eukaryotes”. This re-focusing does not change any of our major conclusions, but rather provides a systematic comparison that is adequately supported by data.
Secondly, we revised the Figures to rename the X-axis as “Order of divergence”, rather than “Divergence time (million years)” in the previous version. We emphasized the fact that the X-axis actually represent the relative order of divergence of the different clades, rather than absolute dates. This emphasis certainly does not create the impression of a consistent evolutionary trend. Instead, combined with the revised main text, it depicts that only when pre-LECA and post-LECA organisms are compared, clear trends of proteome expansion is observed.
Comment-2. The authors further claim a 'linear' expansion of the chaperone set and 'exponential' expansion of the total proteome size. These are precise mathematical terms and, as such, require fitting to the respective functions. No such thing in this manuscript. Even apart from that shortcoming, the explanation of both 'linear' and 'exponential' are quite confusing. Thus, when explaining the 'linearity' of chaperone evolution, the authors refer to the lack of major innovation among the chaperones. This is correct in itself but has nothing to do with linearity. Apart from the aforementioned conceptual problems, the estimation of the 'exponential' growth of the proteome are naive, inconsistent and inaccurate.
Response
Our uses of ‘linear expansion’ versus ‘exponential expansion’ may have been confusing although we have defined quite clearly what we mean by that (i.e., that it is not the mathematical sense). The statement regarding “the lack of major innovation among the chaperones” was made in this context/definition and was consistent with it.
Nonetheless, to avoid confusion, we revised the main text by excluding the ‘linear expansion’ and ‘exponential expansion’ terms. We simply stated that a torrent of de novo innovations has occurred during the expansion of proteomes from prokaryotes to eukaryotes. In contrast, the evolutionary history of core-chaperones lacks such major innovations. Accordingly, the title of the revised manuscript, figure legend titles, and related section titles have been edited as follows.
Submitted version
Revised version
Paper title. On the evolution of chaperones and co-chaperones and the exponential expansion of proteome complexity
On the evolution of chaperones and co-chaperones and the expansion of proteomes across the Tree of Life
Section title. A Tree of Life analysis of the expansion of proteome complexity and chaperones
A Tree of Life analysis of the expansion of proteomes and chaperones
Section title. The expansion of proteome complexity
Proteome expansion by de novo innovations
Figure 1 legend title. Expansion of proteome size
Expansion of proteome size across the Tree of Life
Figure 2 legend title. Expansion of proteome complexity
Expansion of proteomes by de novo innovations
Comment-3. As the base point for the expansion estimates for archaea and eukaryotes, the authors take parasitic forms. Even leaving aside the highly dubious claims that these organisms belong to the clades that diverged first from the respective ancestors, parasites are not an appropriate choice for such estimates because they certainly are products of reductive evolution. For bacteria, inconsistently, the authors choose a free-living form from a dubious ancient clade, and not even the one with the smallest genome. All taken together, this robs the expansion estimates of any substantial meaning.
Response
This point is overall valid. Although we adamantly reject the insinuation of “dubious claims that these organisms belong to the clades that diverged first from the respective ancestors” – firstly, we did not make any claims to this end, but took the ToL constructed by others (Hedges et al., 2015); second, that these claims are dubious need to backup by counter-evidence/data and with all due respect, neither were provided by the reviewer. However, what is of concern is that in a symbiont/parasite chaperones of the host may have a key role, and thus the comparison to free-living organisms could be misleading. To address this concern we excluded the obligatory endosymbiont Nanoarchaeum equitans and the parasitic organisms from the expansion estimates and such discussions are now limited to free-living organisms only. Further, as described in response to Comment-1, the revised manuscript focuses on prokaryote versus eukaryote comparison.
Note that phylogenetic analysis often assigns parasitic and symbiotic organisms that have experienced reductive evolution as the earliest diverging clades of their corresponding kingdoms of life. Examples include Nanoarchaeum equitans, an obligate symbiont, assigned as the earliest diverging archaea (Hedges et al., 2015; Huber et al., 2002; Waters et al., 2003), and parasitic Excavate assigned as one of the earliest diverging eukaryotes (Burki et al., 2020; Simpson et al., 2002). In accordance with these studies, these parasitic and symbiotic organisms were included in our analysis. We acknowledged this fact in the Methods section (see Page 22, Lines 9-16).
Comment-4. The authors do make a salient and I think essentially correct observation: chaperones typically comprise about 0.3% of the proteins in any organism. As such, this presents no dichotomy in evolutionary trends to be explained. Surely, as examined and discussed in the paper, eukaryotes also show significant increases in the size and domain content of the encoded proteins, suggesting the possibility that might need more chaperones. However, if this is the explanandum, rather than the number of proteins in the proteome as such, it should be clearly stated. Furthermore, it is quite natural to assume that this increase in protein complexity without a commensurate increase in the chaperone diversity, is enabled by higher expression of the chaperones as suggested in the Discussion of this paper. I doubt there is any big surprise here and even much need for an extended discussion let alone a special publication.
Response
As emphasized, and shown, eukaryotes have not only larger proteomes in terms of the number of proteins or protein size. They have a higher content of proteins that are prone to misfolding. This is shown explicitly, in Figure 2 (namely, multidomain proteins, repeat, beta-rich proteins, etc’) and is reiterated in a summary figure (suggested by Reviewer 1). Further, in response to Reviewer-3’s suggestion, we showed that eukaryotes feature much higher proportions of aggregation-prone proteins per proteome than prokaryotes (Figure 2E).
To further clarify, we revised Figure 4 to quantitatively compare the expansion of proteomes and that of chaperones, under one roof. This Figure compares proteome parameters that supposedly demands more chaperone action in all three domains of life and simultaneously summarizes the expansion of the chaperone machinery lacking de novo innovations.
In addition, the first paragraph of this Discussions section is revised to state that from prokaryotes to eukaryotes, proteomes have expanded by duplication-divergence as well as by innovations (de novo emergence of new folds). Thus, it’s not about the size only (a challenge that a proportion expansion of chaperone genes would resolve, i.e., the 0.3%) but about proteome composition changing in a way that demands more and more chaperone action.
We also agree with the assertion that “it is quite natural to assume that this increase in protein complexity without a commensurate increase in the chaperone diversity, is enabled by higher expression of the chaperones”. However, we belong to a group of scientists for whom natural assumptions are insufficient, and think that supporting evidence is of importance.
Reviewer’s significance statement
As such, in the opinion of this reviewer, there is no substantial advance over the existing knowledge in this paper. Should the authors wish to revise, they would need to develop robust methodology to measure proteome expansion. That would involve starting from reconstructed ancestors rather than any extant forms (let alone parasites). I doubt that such analysis, non-trivial in itself, reveals an strong, consistent trends other than the well known increase in complexity in eukaryotes.
Response
We agree that to assert evolutionary, time-dependent trends one needs to analyze phylogenies and reconstructed ancestors, but still think that a comparison of proteome and chaperone contents along the Tree of Life is meaningful. We thus respectfully, yet adamantly disagree with “no substantial advance over the existing knowledge”. We strongly believe, as does Reviewer-3, that the results and the model presented in this paper are “fascinating to consider and… will stimulate a good deal of important discussion…”.
Reviewer #3
Summary
The manuscript by Rebeaud et al describes phylogenetic analyses of proteome and chaperone complexity. The authors analyzed species across the tree of life to predict the proteome and chaperone properties of ancestors spanning to the last universal common ancestor. Their analyses indicate that many proteome properties increased in complexity over evolutionary time including: average protein length, the number of multi-domain proteins, the size of the proteome, the number of repeat proteins, and the number of beta-superfold proteins that are known to be difficult to fold. Their analyses also indicate an expansion in chaperone families that corresponds to the increase in proteome complexity. Based on their analyses, the authors propose a model where early life relied on a limited number of chaperones (Hsp20 and Hsp60) and that as proteome complexity evolved, so did chaperone complexity. Core chaperones including Hsp90, Hsp70, and Hsp100 evolved relatively early, and later chaperone evolution was driven by the appearance and alterations of co-chaperones and auxiliary factors as well as by increases in the protein abundance of chaperones.
Major concerns
Comment-1. This work is appropriately based on phylogenetic inferences, but as such, the limitations and uncertainties of phylogenetic inferences need to be discussed. This in no way takes away from the work, quite the opposite, it would make it richer by encouraging broader interpretations where justified and clear understanding of where support for the model is strongest. Posterior probabilities need to be discussed and the range of properties that a likely ancestor might have based on the data should be discussed. How this impacts the conclusions and models should be discussed. Throughout the manuscript, the authors present most-likely ancestral models (as I understood it), what are the next most likely models? How much power is there to distinguish one model from another? It would be very helpful to have a section describing the limitations and uncertainties of the phylogenetic analyses and how these relate to the main findings and conclusions.
Response
We thank the reviewer for this suggestion. Reviewer-1 raised a similar suggestion (see Comment-3). The phylogenetic analysis in our paper included dating the emergence of core- and co-chaperone families, and attempt to infer major their HGT events, foremost in relation to the origin of eukaryotic chaperones. To highlight the uncertainties of phylogenetic inferences we re-evaluated our work in the light of a recent study (PMID: 32316034) that carefully analyzed the uncertainties associated with the assignment of several protein families to LUCA.
Ideally, for a protein family to be assigned to LUCA, there must be a single split of bacterial and archaeal domains at the root of the protein tree with strong bootstrap support, and the inter-domain branches would be longer than the intra-domain branches (PMID: 32316034). In the revised main text we discussed that only the HSP60 protein tree satisfies this criterion. HSP20 protein tree depicts a clear single split of bacterial and archaeal domains at the root, albeit with weak bootstrap support, and inter-domain branch lengths are smaller than intra-domain branch-lengths. We discussed that this is indeed the case of phylogenetic uncertainty, which means the sequence of this small, single-domain chaperone lacks the information to make reliable inference at the basal events in the ToL.
In addition, the HGT events discussed in the previous version appear to be indistinguishable from phylogenetic uncertainties and we removed all instances of HGT events mentioned in the main text as well as Figure 3B. Only one HGT event – HSP60 being horizontally transferred from archaea to Firmicute, which is well-supported by the data is kept in the revised main text. We believe these discussions would be very useful to the readers.
Finally, we note that most of our key assignments (points of emergence, and major HGT events) are in agreement with previous works. Specifically: the emergence of HSP20 and HSP60 to LUCA (Sousa et al., 2016; Weiss et al., 2016) and HSP60 being horizontally transferred from archaea to Firmicute (Techtmann and Robb, 2010) and HSP20 being horizontally transferred between bacterial clades and between bacteria and archaea (Kriehuber et al., 2010).
Comment-2. General features that impact foldability, including contact order, should be discussed and what features can be searched for in genomes that relate to these - e.g. beta-rich proteins.
Response
Thanks for this valuable idea! Contact order, and other predictors of problematic folding are highly relevant but their analysis is structure-based and hence inapplicable on the proteome (sequence) scale. We did, hwoever, estimate the proportion of aggregation-prone proteins in the proteome. These proteins were identified by CamSol method that assigns poorly soluble regions from sequence data. Indeed, some of these predicted ‘poorly soluble segments’ refer to the hydrophobic core of the respective folded state instead of ‘true’ aggregation hotspots. With this unavoidable potential caveat, it appears that compared to prokaryotes, aggregation-prone proteins in the proteome have become nearly 6-fold more frequent in Chordates.
Following changes were made to accommodate this new analysis:
Figure 2 is revised to include a new panel (panel-E) that shows the expansion of aggregation-prone proteins in the proteome across the Tree of Life. The same result is summarized in the summary Figure 4.
A new paragraph entitled “Proteins predicted as aggregation-prone became ~6-fold more frequent in the proteome” is added to the Results section, which describes the principle and the main results (see Page 7, Lines 14-28).
The methodology is included in the Methods section, in a paragraph entitled “Predicted proportion of aggregation-prone proteins in the proteome”, see Page 24 Lines 17-27. For each representative organism, the percent of aggregation-prone proteins in proteome data are provided as Data S10.
This analysis is also included in the revised Abstract: “Proteins prone to misfolding and aggregation, such as repeat and beta-rich proteins, proliferated ~600-fold, and accordingly, proteins predicted as aggregation-prone became 6-fold more frequent in mammalian compared to bacterial proteomes.” See Page 2, Lines 7-9.
Comment-3. "Core" chaperones needs to be defined.
Response
Thank you for this suggestion. We restructured Page 3 Lines 19-23 in the Introduction to clearly explain this aspect. The current text is quoted below.
“Chaperones can be broadly divided into core- and co-chaperones. Core-chaperones can function on their own, and include ATPases HSP60, HSP70, HSP100, and HSP90 and the ATP-independent HSP20. The basal protein holding, unfolding, and refolding activities of the core-chaperones are facilitated and modulated by a range of co-chaperones such as J-domain proteins (Caplan, 2003; Duncan et al., 2015; Schopf et al., 2017).”
Minor concerns and thoughts
Comment-4. This manuscript stimulated me to think about the dynamics between chaperone evolution and proteome evolution. The ability to tolerate proteins that need chaperones seems linked to major evolutionary innovations. Once you have these innovations though, you are addicted to the chaperones - and an expansion of the number of sub-optimal proteins. These ideas seem like they would be valuable to include in the discussion of this work. More generally, it would be wonderful to have a discussion of future directions that this work may spark.
Response
This is indeed a fascinating question or set of questions, that we have also become intrigued about following this work, We introduced a short section, though more of an ‘appetizer’ than a detailed discussion, as we know almost nothing about the co-evolution of new proteins and chaperones.
Reviewer’s significance statement
This manuscript provides a fascinating glimpse back in time of a fundamental interplay - between chaperone evolution/addiction and proteome evolution. I am not an expert in phylogenetic analyses so I cannot judge the details of the analyses. As an expert in molecular evolution and chaperones, I found the approach and model fascinating to consider and I believe it will stimulate a good deal of important discussion in these fields. I have one major concern that I feel ought to be addressed in the manuscript and a number of points that I would encourage the authors to consider. I am sure that these can be readily addressed and I look forward to seeing this work published and the further discussion and ideas that it may stimulate.
Response
Thank you!
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #3
Evidence, reproducibility and clarity
The manuscript by Rebeaud et al describes phylogenetic analyses of proteome and chaperone complexity. The authors analyzed species across the tree of life to predict the proteome and chaperone properties of ancestors spanning to the last universal common ancestor. Their analyses indicate that many proteome properties increased in complexity over evolutionary time including: average protein length, the number of multi-domain proteins, the size of the proteome, the number of repeat proteins, and the number of beta-superfold proteins that are known to be difficult to fold. Their analyses also indicate an expansion in chaperone families that corresponds to the increase in proteome complexity. Based on their analyses, the authors propose a model where early life relied on a limited number of chaperones (Hsp20 and Hsp60) and that as proteome complexity evolved, so did chaperone complexity. Core chaperones including Hsp90, Hsp70, and Hsp100 evolved relatively early, and later chaperone evolution was driven by the appearance and alterations of co-chaperones and auxiliary factors as well as by increases in the protein abundance of chaperones.
Major concerns:
- This work is appropriately based on phylogenetic inferences, but as such, the limitations and uncertainties of phylogenetic inferences need to be discussed. This in no way takes away from the work, quite the opposite, it would make it richer by encouraging broader interpretations where justified and clear understanding of where support for the model is strongest. Posterior probabilities need to be discussed and the range of properties that a likely ancestor might have based on the data should be discussed. How this impacts the conclusions and models should be discussed. Throughout the manuscript, the authors present most-likely ancestral models (as I understood it), what are the next most likely models? How much power is there to distinguish one model from another? It would be very helpful to have a section describing the limitations and uncertainties of the phylogenetic analyses and how these relate to the main findings and conclusions.
- General features that impact foldability, including contact order, should be discussed and what features can be searched for in genomes that relate to these - e.g. beta-rich proteins.
- "Core" chaperones needs to be defined.
Minor concerns and thoughts:
- This manuscript stimulated me to think about the dynamics between chaperone evolution and proteome evolution. The ability to tolerate proteins that need chaperones seems linked to major evolutionary innovations. Once you have these innovations though, you are addicted to the chaperones - and an expansion of the number of sub-optimal proteins. These ideas seem like they would be valuable to include in the discussion of this work. More generally, it would be wonderful to have a discussion of future directions that this work may spark.
Significance
This manuscript provides a fascinating glimpse back in time of a fundamental interplay - between chaperone evolution/addiction and proteome evolution. I am not an expert in phylogenetic analyses so I cannot judge the details of the analyses. As an expert in molecular evolution and chaperones, I found the approach and model fascinating to consider and I believe it will stimulate a good deal of important discussion in these fields. I have one major concern that I feel ought to be addressed in the manuscript and a number of points that I would encourage the authors to consider. I am sure that these can be readily addressed and I look forward to seeing this work published and the further discussion and ideas that it may stimulate.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
Rebeaud and colleagues analyze evolution of chaperones compared to the evolution of whole proteome complexity across the entire tree of life. Their principal conclusions are well captured in the following quote from the Discussion:
"Comparison of the expansion of proteome complexity versus that of core-chaperones presents a dichotomy-a linear expansion of core-chaperones supported an exponential expansion of proteome complexity. We propose that this dichotomy was reconciled by two features that comprise the hallmark of chaperones:the generalist nature of core-chaperones,and their ability to act in a cooperative mode alongside co-chaperones as an integrated network.Indeed, in contrast to core chaperones, there exist a consistent trend of evolutionary expansion of co-chaperones."
The general theme of the evolution of proteome management is of obvious interest. Unfortunately, the entire analysis is shaky and fails to convincingly ascertain the authors' conclusions. There are many issues. Throughout the manuscript, the authors discuss 'expansion' of the proteome in bacteria, archaea and eukaryotes, creating the impression of a consistent evolutionary trend. No such trend actually exists if one considers the means or medians of proteome sizes within each of the three domains of life (there is a transition to greater complexity in eukaryotes). The maximum complexity, certainly, increases with time which can be attributed to the 'drunkard's walk' effect. This hardly qualifies as 'expansion'. The authors further claim a 'linear' expansion of the chaperone set and and 'exponential' expansion of the total proteome size. These are precise mathematical terms and, as such, require fitting to the respective functions. No such thing in this manuscript. Even apart from that shortcoming, the explanation of both 'linear' and 'exponential' are quite confusing. Thus, when explaining the 'linearity' of chaperone evolution, the authors refer to the lack of major innovation among the chaperones. This is correct in itself but has nothing to do with linearity. Apart from the aforementioned conceptual problems, the estimation of the 'exponential' growth of the proteome are naive, inconsistent and inaccurate. As the base point for the expansion estimates for archaea and eukaryotes, the authors take parasitic forms. Even leaving aside the highly dubious claims that these organisms belong to the clades that diverged first from the respective ancestors, parasites are not an appropriate choice for such estimates because they certainly are products of reductive evolution. For bacteria, inconsistently, the authors choose a free-living form from a dubious ancient clade, and not even the one with the smallest genome. All taken together, this robs the expansion estimates of any substantial meaning.
The authors do make a salient and I think essentially correct observation: chaperones typically comprise about 0.3% of the proteins in any organism. As such, this presents no dichotomy in evolutionary trends to be explained. Surely, as examined and discussed in the paper, eukaryotes also show significant increases in the size and domain content of the encoded proteins, suggesting the possibility that might need more chaperones. However, if this is the explanandum, rather than the number of proteins in the proteome as such, it should be clearly stated. Furthermore, it is quite natural to assume that this increase in protein complexity without a commensurate increase in the chaperone diversity, is enabled by higher expression of the chaperones as suggested in the Discussion of this paper. I doubt there is any big surprise here and even much need for an extended discussion let alone a special publication.
Significance
As such, in the opinion of this reviewer, there is no substantial advance over the existing knowledge in this paper. Should the authors wish to revise, they would need to develop robust methodology to measure proteome expansion. That would involve starting from reconstructed ancestors rather than any extant forms (let alone parasites). I doubt that such analysis, non-trivial in itself, reveals an strong, consistent trends other than the well known increase in complexity in eukaryotes.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
Summary:
The authors present well written work on the evolution of proteome size and complexity, and the corresponding changes in chaperone proteins. Interestingly, they find chaperone copy numbers increase linearly with proteome size, despite the increasing 'complexity' of, in particular, post-LECA genomes. They suggest that to address the rise in complexity, organisms express chaperones at higher levels and an expanding network of co-chaperones has evolved across the tree of life.
Major comments:
-Summary reads strangely relative to the rest of the manuscript, and lists facts in a way that makes the purpose of the study confusing. I think most readers will dislike the characterisation of evolution as a progress from simple to complex, and the authors' might want to avoid this language throughout the manuscript- bacteria and archaea have also been evolving over this period of times, and have not become more 'complex'? Similarly the authors should reconsider their figure legend titles. As a specific example,'in the course of evolution' should become 'across the tree of life' .
-I think the manuscript would be improved if the authors significantly shortened the discussion of genome size evolution- this is fairly well understood, and could be covered briefly, especially as the main focus of the manuscript is on the evolution of chaperone and co-chaperone repertoire. They could also make clearer quantitative links between protein complexity and the evolution of chaperones and co-chaperones- perhaps this should be in the discussion? The authors might also consider referencing 'The evolution of genome complexity', which could be relevant to this manuscript and might make the work of broader interest.
-The authors state 'protein trees were generated and compared with ToL to account for gene loss and transfer events'. The methodology for this procedure is not given in the manuscript. The authors should back up this point, and make it clear this is why they reconstruct the trees. Currently it is not convincing to me that the authors have found HGT given the considerable phylogenetic uncertainty in the basal events in the tree of life. I also expect the tree of a single protein to be potentially lack information due to the short sequence considered and possible lack of power. The authors need to consider whether the data is really of high enough quality to assess this.
-Methods- the authors could consider taking an alternative source of LUCA proteins, rather than those found in 'Nanoarchaeota and Aquificae':it's possible these are not representative of LUCA, and it seems a somewhat arbitrary choice- the authors could consider using one of the available curated sets, such as that generated by Ranea et al. (2006)
-The patterns observed might only hold because of differences in the taxa that diverged pre and post LECA? The authors might consider subgroup analyses to ensure this is not the case. The authors could also consider using methods that take phylogeny into account.
Minor comments:
'Life's habitability has also expanded from its 10 specific niche of emergence-likely deep-sea hydrothermal vents, to highly variable and extreme 11 ranges of temperature, pressure, exposure to high UV-light, dehydration and free oxygen.' This is not really correct, as bacteria and archaea are found worldwide, and in the most extreme environments.
' We reconciled the topology of our tree'- on first read this was not clear, I did not realise the authors were only building trees for subsets of the data- time tree is the best source for the overall topology. The phrase 'manually curated and adjusted' is used in the methods. This language is much too vague, and not a clear explanation of the steps taken.
Significance
The work presents interesting results that suggest that more 'complex' organisms have evolved a strategy to cope with increasing proteome size, and is interesting to researchers in the field of molecular evolution.
I am a researcher in population genetics and molecular evolution.
-
-
www.biorxiv.org www.biorxiv.org
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #3
Evidence, reproducibility and clarity
This study outlines calcium probes for assessing the poorly understood role of peroxisomes in calcium signaling. The authors suggest that these organelles sequester calcium from either calcium influx across the plasma membrane or from release from the ER/SR. This is important since we need to know more about the roles of these organelles in calcium homeostasis and signaling. However, it needs to be robustly demonstrated that the probes are targeted to the right organelle without confounding contamination from other organelles which can be very significant even for a small degree of mis-targeting.
Major
- The difference between the signals seen between the peroxisome and cytosolic D3 versions are not compelling, other than a dampened spike with the former (higher resting levels, smaller peak). See below for pH concerns.
- How clean is the peroxisome distribution? Prove that D3 spillover from its being partially in (or on) other compartments (e.g. cyto, ER) is not contributing to the changes. Selective manipulation of Ca2+ in these other compartments should not affect the peroxisome signal.
- a. For example, the small changes in the D3-px could be explained by peroxisome not changing at all but rather the other compartments (where larger responses are observed) signal(s) contaminating the response.
- b. e.g. if in the ER lumen, the signal should be eliminated with SERCA inhibitors (thapsigargin, CPA). They used Thapsigargin in cardiac myocytes, why not in HeLa during characterization)?
- a. For example, the small changes in the D3-px could be explained by peroxisome not changing at all but rather the other compartments (where larger responses are observed) signal(s) contaminating the response.
- Any Ca2+ reporter will pH-sensitive to an extent, even D3 (Ca2+ binding, inherent fluorescent proteins).
- a. It is essential to prove that the signal changes are not due changes perox pH. Target pH-sensitive proteins to the perox lumen by the same strategy and show that the same Ca2+ interventions do not cause pH changes.
- b. The authors claim different resting levels of [Ca2+] in cytosol/mitochondria/peroxisome. The resting FRET level also depends on the resting pH of the compartments which may also be different. Certainly, mitochondria are more alkaline than the cytosol. Again, to interpret these are real Ca2+ differences requires the pH to be accounted for.
- I am puzzled by the model, in particular in view of Fig 3. The genetically-encoded calcium indicator (GECI) is allegedly in on the cytosolic face of the peroxisome and measuring peri-peroxisomal Ca2+.
- a. The changes with this reporter look pretty similar to the luminal reporter (save that the resting ratio may be lower). I don't understand how the lumen [Ca2+] > cytosolic [Ca2+] without a higher local [Ca2+] (unless there is an energy-driven uptake mechanism, but then how does this fit in with ER-driven Ca2+ release?).
- The claim that resting peroxisome [Ca2+] is higher than cytosol is questionable. Is this a calibration artifact (e.g. compartment pH-differences or the reporter behaves differently in the lumen)? Such a gradient could not be sustained without energy-dependent Ca2+ uptake. The authors make no discussion of this.
Minor
- Quantitate localization. Pearson's coefficients for GECIs and Peroxisomes.
- Different upstroke rates of D3 with His vs Cao. Quantify.
- Page 5. Line 161. 'Different sites', do the authors mean different sides? Similarly, the Legend of Fig 3.
Significance
Good peroxisome calcium probes is important to the genral calcium signaling field. This is fundamental science of interst to all cell biologists.
There has been little published on peroxisome calcium, although for example, the Pozzan lab published a paper in JBC in 2008 on a GFP-based lumenally targeted peroxisome probe. There is contradictory data in the field and reliable new approaches are needed.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #2
Evidence, reproducibility and clarity
The manuscript by Sargsyan et al describes an unappreciated role for peroxisomes in Calcium dynamics. Specifically, the authors propose that GPCR/VDCC/SOCE-mediated cytosolic Ca2+ elevation is rapidly sensed by peroxisomes and sequestered. The authors used/generated a peroxisome-targeted genetically encoded Ca2+ indicators which is elegant and powerful tool to monitor the luminal Ca2+ dynamics. While the results and conclusions are novel, there are some important gaps that need to be addressed for consideration for publication in EMBO J.
Comments:
Peroxisomes are single membrane bound organelles which are conserved across species spanning from yeast to humans. While housing only -100 proteins, they are responsible for essential steps in lipid metabolism, amino acid metabolism and ROS homeostasis. Unlike other organelles, peroxisomes import fully folded and cofactor-bound proteins into their matrix. Though peroxisomes house specific metabolic functions, there is extensive crosstalk with other organelles, including mitochondria. It is essential to test and define whether silencing/knockdown of mitochondrial Ca2+ transport components like MCU will impact peroxisome Ca2+ uptake upon stimulation with histamine or electrical stimulation.
Since peroxisomes buffer significant amount of Ca2+, it is worth testing whether blockade of mitochondrial Ca2+ uptake would not alter peroxisome mediated Ca2+ influx. This analysis will provide Ca2+ uptake rate of mitochondria vs peroxisomes (mallilankaraman K. et al CELL 2012 and Nemani N. et al Science Signaling 2020).
Peroxisomal synthesis of plasmalogens is Ca2+ and oxygen tension dependent, it is essential to show that altering Ca2+ controls plasmalogen synthesis.
In the introduction authors have stated that "Elevated mitochondrial uptake increases 39 mitochondrial reactive oxygen species (ROS) production and is associated with heart falure and ischemic 40 brain injury (Starkov et al., 2004; Santulli et al., 2015)." These cited articles remotely links MCU and ROS elevation. It is important to point out that Tomar et al 2016 Cell Reports clearly demonstrated that genetic ablation of MCU suppresses mROS production that is mitochondrial Ca2+ dependent.
Significance
The significance of the work is very high. The authors employ a variety of complementary techniques and experimental systems to demonstrate that peroxisomes indeed buffer a large quantity of Ca2+ upon stimulation.
-
Note: This preprint has been reviewed by subject experts for Review Commons. Content has not been altered except for formatting.
Learn more at Review Commons
Referee #1
Evidence, reproducibility and clarity
These are straight forward studies aimed to develop probes to asses peroxisomal Ca2+ in rest and in response to receptor stimulation. The probes were designed to measure intraperoxisomal Ca2+ and the Ca2+ the peroxisome experience when cytoplasmic Ca2+ is increased. The pobes fill a need in understanding peroxisomal Ca2+ and Ca2+ signaling in general and should be very useful to investigators in the field.
The comments are aimed to help in improving the studies and taking them to the next stage.
The grammar needs improvement and the introduction needs sharpening. It is long and, in many places, not to the point. The results and discussion sections are also quite verbose.
The sidedness of the probes need to be validated further, especially since the peroxisomal Ca2+ increase follows the cytoplasmic and the slower reduction rate may results from the environment experienced by the probe. Simple experiments: how the probes respond to Ca2+ ionophore; does Ca2+ reduced rapidly when removed from the media of the digitonin permeabilized cells; how the cytoplasmic and peroxisomal thapsigargin responses compare using the protocols in 2A and 4A? Sidedness of PEX13-D3cpV was not examined.
Calculation of peroxisomal Ca2+ are based on Kd reported in the literature. The Kds of D3cpV-px and PEX13-D3cpV should be determined when in the peroxisome in permeabilized cells for the numbers to have any meaning.
How the localization of the probes look in the differentiated cardiomyocytes? How it compares to RyRs, VACC, etc..
The major weakness of the study is that the probes are used only as a tool. The enhance the study and bring it beyond an excellent technical achievement, the authors should use them to study a significant Ca2+-dependent peroxisomal function and show how the use of the tools eliminate the role of Ca2+ in such a function.
Significance
These are straight forward studies aimed to develop probes to asses peroxisomal Ca2+ in rest and in response to receptor stimulation. The probes were designed to measure intraperoxisomal Ca2+ and the Ca2+ the peroxisome experience when cytoplasmic Ca2+ is increased. The pobes fill a need in understanding peroxisomal Ca2+ and Ca2+ signaling in general and should be very useful to investigators in the field.
The major weakness of the study is that the probes are used only as a tool. The enhance the study and bring it beyond an excellent technical achievement, the authors should use them to study a significant Ca2+-dependent peroxisomal function and show how the use of the tools eliminate the role of Ca2+ in such a function.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #2:
In 2011 these authors showed that Drosophila DmPI31 is a binding partner of the F box protein Nutcracker, a component of an SCF ubiquitin ligase (E3) required for caspase activation during sperm differentiation in Drosophila. DmPI31 binds Nutcracker via a mechanism that is also used by mammalian FBXO7 and PI31. Subsequently, they have shown that PI31 serves as an adaptor to couple proteasomes with dynein light chains and inactivation of PI31 inhibited proteasome motility in axons and disrupted synaptic proteostasis, structure, and function. In addition, conditional loss of PI31 in spinal motor neurons (MNs) and cerebellar Purkinje cells (PCs) caused axon degeneration, neuronal loss, and progressive spinal and cerebellar neurological dysfunction.
Here the authors show that like Fbxo7 mutant mice, PI31 conditional KO mice have a decreased testis and thymus size and motor neuron specific loss of either FBXO7 or PI31 produced similar phenotypes in motor neurons. They generated a mouse that conditionally expressed FLAG-tagged PI31 this could rescue PI31 mutant mice; this transgene (under a Chat driver) rescued the phenotype of FBXO7 mutant mice from which they concluded that the consequences of FBXO7 mutation relate to loss of PI31 function in the cell types studied.
FBXO7 is the substrate recognition module of a novel proteasome‐interacting E3 ubiquitin ligase. In addition to binding PI31, FBXO7 also drives PI31 ubiquitylation and thus regulates its cellular levels. That the transgene can rescue the phenotype in the Chat-expressing cells is surprising and striking. However, it would necessary to reveal more about the underlying molecular mechanism. In the cell types rescued, is there another E3 ligase with overlapping substrate specificity? Are there mitochondrial phenotypes that are not rescued?
-
Reviewer #1:
This manuscript focuses on the role played by the PI31 protein in regulating presynaptic proteasome abundance and the health of motor neurons. In particular, it presents striking data from knockout and conditional KO mice showing that depletion of PI31 and Fbxo7/PARK15 (Parknson's disease gene) yield similar phenotypes, including motor neuron defects, following their conditional depletion. Furthermore, in the absence of Fbxo7/PARK15, PI31 levels were greatly reduced. This suggested that a major role for Fbxo7 is to promote the abundance/stability of PI31. In support of this model, transgenic expression of PI31 completely rescued overall health, body weight and motor neuron morphology in Fbxo7 mutant mice. These results are impressive. However, the manuscript implies but does not show that the mechanism through which PI31 supports neuronal health is by promoting the axonal transport of proteasomes and thus suppressing the presynaptic accumulation of ubiquitinated proteins. Several key experiments to address this issue would greatly strengthen the manuscript (outlined below).
1) Major statements are made about the importance of PI31 for axonal transport of proteasomes and presynaptic aggregate clearance. In order to establish that PI31 is indeed supporting neuronal health by promoting axonal transport of proteasomes and clearing presynaptic protein aggregates, it is necessary to show:
-- That motor neuron presynaptic proteasome number is reduced in the PI31 and Fbxo7 KO mice and restored in the Fbxo7 mutant mice that express the PI31 transgene.
-- That expression of the PI31 transgene in the Fbxo7 mutant mice suppresses the presynaptic accumulation of P62 aggregates.
2) It would be helpful if the abstract defined the Parkinson's disease model (PARK15) that was investigated.
3) Quantification of the presynaptic P62 aggregate phenotype in figure 2 would be helpful as would including a higher magnification image of the wildtype synapse with the P62 labeling.
4) Given that the major phenotypes that are characterized are not directly related to Parkinson's disease, the upfront emphasis on Parkinson's disease might not be warranted. Although the mouse phenotypes that are reported are striking, the title in particular suggests a more direct connection to this disease than is warranted by the data.
5) Figures 3C and 4B: Individual data points should be plotted and a statistical test would be helpful.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
There is great interest in understanding the molecular basis of FBXO7/PARK15 pathogenesis and the present, high quality story includes an impressive rescue in cells transgenically overexpressing PI31 protein. Nevertheless, as discussed in greater detail below, the two reviewers felt that more work would be needed to document the molecular basis for this phenotype rescue.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
Summary
The manuscript presents an experiment in which participants listened to ten auditory sequences, generated with either first- or second-order statistical structure ("simple" vs "complex" SL respectively) and predicted 20 elements in each sequence during simultaneous EEG recording. Behavioural results showed that all participants performed better for simple than complex sequences and musicians performed better than non-musicians for both sequence types. A Bayesian model was developed with parameters controlling memory decay, sensory noise, model order (hierarchy) and selection noise, which were fitted to the responses of each participant. The results showed differences between musicians and non-musicians for parameters related to SL (model order, selection noise) but not parameters related to stimulus processing (sensory noise and memory decay). Specifically musicians showed evidence of higher-order prediction and lower selection noise. The EEG results linked increased amplitude at fronto-central electrodes at around 300 ms to modelled surprise for each participant, which was stronger for musicians than non-musicians. Separate analyses for models of different order produced evidence for an early modulation around 200ms for zeroth-order predictions which did not differ between musicians and non-musicians and a later modulation around 300ms for first- and second-order predictions which did differ between the two groups. These modulations were linked to the MMN and P300 respectively. The results are taken as evidence for better SL in musicians and discussed in terms of the Bayesian brain hypothesis.
Substantive Concerns
-- p. 4, para. 2: I believe that the evidence for musicians showing better SL is less strong than presented in the manuscript. In particular, using different stimuli and methods, both Loui et al., (2010) and Rohrmeier et al. (2011) found no difference between musicians and non-musicians in statistical learning of auditory sequences. Furthermore, with regard to reference 7 in the manuscript, although some studies have found larger ERAN amplitudes in musicians than non-musicians (Jentschke & Koelsch, 2009; Kim et al., 2011; Koelsch et al., 2007, 2002; Regnault et al., 2001) the differences are usually small and have not been replicated in all studies (e.g., Koelsch & Jentschke, 2008; Koelsch & Sammler, 2008; Miranda & Ullman, 2007; Steinbeis et al., 2006). The introduction and motivation for the experiment should be adapted to give a more detailed and balanced view of the literature and the divergence between the present results and those of Loui et al., (2010) and Rohrmeier et al., (2011) should be discussed and accounted for.
-- I'm not sure complexity is the most appropriate term to use in distinguishing statistical regularities of different order, since different transition tables at a single given order could be described as varying in statistical complexity. Having introduced the term, why not stick to "higher-order" and "lower-order"?
-- p 7: "Control analysis revealed that musicians and non-musicians do not benefit from an overall increase in performances during the course of the experiment." But there should be an improvement during each individual sequence, right? Is it possible to demonstrate this?
-- I think the authors should analyse the interaction in Fig. 1B and report whether or not it is significant.
-- I noted that while the authors report the consistency between the model and participants, they do not report the average accuracy of the model, which should be included for completeness. It would be good to report both of these analyses separately for complex and simple sequences, given the significant difference in performance between them.
-- p. 15: clarify that the same transition matrix was used for all five sequences of a given order
-- p. 15: what were the inclusion/exclusion criteria for the groups of musicians and non-musicians? How were participants recruited? This is important, especially given the divergence between the present findings and previous results (as noted above).
-- p. 16: are there any consequences of the fact that participants were aware of the probabilistic nature of the sequences and the differences between the two sequence types? Again, this seems to me to be an important divergence from other SL studies which could impact on the behavioural and neural effects observed and should, therefore, be discussed.
-- p. 16: "one participant was removed" - musician or non-musician?
-- p. 18 why was FCz used as the reference?
-- there are some inconsistencies in the way the model parameters are named - e.g., "late noise" in Supp. Figure 5. Please check through and use consistent terms throughout.
-- To facilitate replication and follow-up research, I would encourage the authors to make their data and model openly available.
-
Reviewer #2:
The paper compares musicians' behavior and ERP responses to those of non-musicians with the following statement in the abstract:
"these better performances could be due to an improved ability to process sensory information, as opposed to an improved ability to learn sequence statistics. Unfortunately, these very different explanations make similar predictions on the performances averaged over multiple trials. To solve this controversy, we developed a Bayesian model and recorded electroencephalography (EEG) to study trial-by-trial responses."
The authors claim:
"This higher performance is explained in the Bayesian model by parameters governing SL, as opposed to parameters governing sensory information processing. " This is correct - but meaningless - the experiment does not challenge sensory noise since the 3 sounds used are so distinct that sensory noise is zero in the two groups. Given that basic design - this phrasing is not only too strong, it is in proper.
My understanding is that are two actual observations in the paper:
1) Musicians' learning of second order markov statistics is better than that of non-musicians based on parameter fitting of a Bayesian model of their behavior in answering explicit questions regarding which sound (of 3 very distinct options) should come next.
2) ERP measures - specifically P300 of musicians, is more sensitive to this statistics as evident by its magnitude with respect to predictability/surprise of the sound based on serial statistics. These claims are interesting BUT - I am not convinced by the claim of specificity. I think the data (and previous studies) suggest that musicians do better with sound related judgments - with all respects.
I am not convinced that the model adds information since it explains the data as a good as single accuracy numbers (or did I miss something?). So I am not convinced that this trial by trial analysis adds information.
With respect to the specific model parameters:
Sensory noise is zero - the sounds are quite distinct. This is not an observation - this is how the experiment was designed. The authors admit that (indeed - any study that focused on sensory discrimination found an advantage in musicians) - but then state specificity, particularly in the abstract.
Regarding rate of decay - I wonder if this is relevant to overall performance when asked only up to 2nd order serial statistics. It may be sufficient for the task. The relevance of this parameter should be clarified.
Thus the lack of group difference in these parameters probably tells about the experiment rather than the groups.
Similarly, musicians' ERP responses are larger. But the early difference is not addressed at all. Is the earlier response sensitive to simpler stat - but in a similar way in both populations? Can't be - since they have a different magnitude. The authors base their analysis on (MEG analysis) in their 2019 paper. I tried to do the exact comparison, and wasn't sure about the mapping to components - please clarify the exact similarity.
Thus - overall - I am not sure that the model analysis provides new conceptual insights.
-
Reviewer #1:
In this work, the authors used a combination of modelling, behavioral methods and EEG to understand whether sensitivity to the statistical structure of unfolding sound sequences differs between musician and non-musicians. Overall they demonstrate that musicians are better than non musicians at predicting forthcoming items. Modelling suggests that this advantage arises because they estimate higher order transition probabilities than non-musicians. The analysis of EEG data recorded during task performance showed that the amplitude of the P3 correlated with item predictability. Further analyses suggested that musicians and non-musicians have similar responses to surprise in simple sequences, with divergence between the groups occurring for higher order transition probabilities.
I have several concerns about task design, analysis and interpretation of the data which are detailed below:
1) The EEG data are recorded whilst participants are performing the behavioral prediction task. Though probe trials occurred rarely, it is conceivable that participants were making an active judgement for each sequence item. There is therefore a concern that the measured EEG data would reflect this aspect (active task performance) rather than automatic SL. This makes conclusions about "neural statistical learning" (e.g. as in the title) difficult to make.
2) In the results section the authors consider various differences between the musician and non-musician groups that could lead to differences in performance. One aspect that does not seem to be considered is that of attention, or task engagement. Is it possible that the musician participants were simply more engaged/less bored by the task? The EEG data (figure 3) are consistent with this interpretation showing overall substantially larger responses in the musicians relative to the non musicians.
3) Relatedly, is it possible that the results in Figure 3C are at least partly related to the overall amplitude differences between groups? Higher SNR in the musician group may lead to higher beta values. One way around this is to normalize the data (e.g. based on the P1 response) before computing the correlations.
4) Figure 4: can you show the ERP data on which the beta values are based?
5) Figure 4: the authors seek to conclude that the two groups have similar responses to surprise in simple statistical contexts (K=0) with divergence occurring for more complex statistical structure. However, they do not provide statistics to support this claim. It is not enough to show no significant difference between groups for K=0, but significant differences for K=1, 2 : you need to demonstrate an interaction.
6) More broadly, though, I do not understand the theoretical implications for this finding: why would brain response to K=0 occur earlier than k=2? Shouldn't the prediction be formed already before sound onset (especially given the relatively slow sequence rate).
7) Discussion: "Our results shed light on the musical training induced plasticity". This statement confuses correlation with causation. The authors discuss the reservation later in the discussion but it should be removed altogether.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
This work constitutes an innovative and timely combination of modelling, behaviour and EEG to understand potential differences in SL abilities between musicians and non-musicians. However, as detailed below, we have many concerns regarding the modelling, experimental design and interpretation of the results.
Our major concerns are summarized here (and further elaborated in the individual reviews below):
1) Modelling: please report the accuracy of the model and whether this differs between groups.
2) You should analyse the interaction in Fig. 1B and report whether or not it is significant.
3) Relatedly, there appears to be an inconsistency between the behavioural results and the modelling. In the behavioural data you report a main effects of musicianship and of sequence complexity. Modelling of this data suggests that whilst the K for musicians is higher than non musicians it is substantially above 1 for both. If anything this should predict larger differences between groups in larger K than smaller K which is different from what is seen behaviourally. A similar inconsistency is present between the behavioural results and the results in figure 4 (see below). This requires careful consideration.
4) Can you do more to convince the reader that the model is performing well? Is the fit good, how does it vary across participants? Does rate of memory decay affect performance at all? Can you show good versus poor performers within the same group - do parameters also vary there?
5) It is important that you address the issues related to participants being aware of the stimulus construction. Are there any consequences of the fact that participants were aware of the probabilistic nature of the sequences and the differences between the two sequence types? This seems to be an important divergence from other SL studies which could impact on the behavioural and neural effects observed and should, therefore, be discussed.
6) The EEG data are recorded whilst participants are performing the behavioural prediction task. Though probe trials occurred rarely, it is conceivable that participants were making an active judgement for each sequence item. There is therefore a concern that the measured EEG data would reflect this aspect (active task performance) rather than automatic SL. This makes conclusions about "neural statistical learning" (e.g. as in the title) difficult to make.
7) In the results section the authors consider various differences between the musician and non-musician groups that could lead to differences in performance. One aspect that does not seem to be considered is that of attention, or task engagement. Is it possible that the musician participants were simply more engaged/less bored by the task? The EEG data (figure 3) are consistent with this interpretation showing overall substantially larger responses in the musicians relative to the non musicians.
8) In general, we think the model has been constructed with due care and attention and we like the separation of parameters related to statistical learning (model order and selection noise) and more general aspects of perception and cognition (sensory noise and memory decay). We think the difficulties arise in the relationship between the model and the experiment. Specifically, the sensory noise model parameter reveals very little in the analysis of this data because the sounds were so readily distinguishable, which appears to have been a deliberate choice in the experimental design, somewhat confusingly. The present stimulus set is therefore not suitable for distinguishing differences in sensory processing vs. SL between groups. We suggest that the authors could simply remove this parameter from the analysis and the paper would be clearer as a result. This would involve re-modelling and you will also have to reshape the way the experiment is motivated.
9) We have some questions about how the EEG data are analysed. In particular, the large amplitude difference between groups should be quantified, discussed and interpreted. We would also like to see stronger justification and discussion of why these differences are not affecting the main conclusions. We note that the authors provide R2 results in supp materials but we feel that a better approach may involve normalizing the responses before modelling. Higher SNR in the musician group may lead to stronger correlations. One way around this is to normalize the data (e.g. based on the P1 response) before computing the correlations.
10) You should perform the appropriate statistical analysis to support the claims associated with Figure 4. You seek to conclude that the two groups have similar responses to surprise in simple statistical contexts (K=0) with divergence occurring for more complex statistical structure. However, you do not provide statistics to support this claim. It is not enough to show no significant difference between groups for K=0, but significant differences for K=1, 2. You need to demonstrate an interaction between group and model order. Additionally, it was also not quite clear how modelling was performed here. We understand that you take surprise values from the model fitted to each participant but with the order fixed at 0, 1 or 2. This may mean that the other parameters might no longer be optimal in the context of the new fixed K values, depending on how different these were from the fitted values for each participant, which might plausibly differ for the musicians and non-musicians. To address this, Can you supplement the existing analysis with an analysis in which the K parameters are fixed at 0, 1 and 2, and the other parameters are re-optimised in the context of these fixed parameter values. Please also provide information about how well each individual data were fit, and whether there was a significant difference between musicians and non musicians. In general, we think the authors should present the result in figure 4 more cautiously and also flesh out the interpretation in more detail in relation to the literature along with a consideration of other potential interpretations. A small related point is that the term hierarchy is strongly related to this interpretation and we would prefer a more neutral term such as 'model order'.
11) The paper would benefit from a careful discussion of exactly what information, on top of that revealed with behaviour, is added by EEG and the significance of this in the context of the existing literature on expectation related ERP components.
-
-
www.biorxiv.org www.biorxiv.org
-
Reviewer #3:
In this manuscript, Ramachandran and colleagues describe how cholecystokinin-related NLP-12 neuropeptide signalling in C. elegans can regulate two different behavioural programmes, area-restricted search (ARS) and basal locomotion, by conditionally engaging different specific receptors that are expressed in different neuronal targets. They thoroughly characterise the CKR-1 receptor which had not been described previously, and place its function in context with that of the previously known NLP-12 receptor CKR-2. The manuscript gives new insight into an interesting and likely conserved mechanisms of how neuromodulatory systems enable adaptive behaviour by coordinating the action of neural circuits even when they are not directly connected. The conclusions drawn appear solid and are justified by the data presented, and the experimental approaches and results are well documented.
The main problem with the work is a certain lack of clarity regarding the separation of the roles of the CKR-1 and CKR-2 receptors on basal locomotion/body bending and head bending/reorientations. Overexpression of NLP-12 places animals in a chronic ARS state, as described in a previous publication. Is the NLP-12 overexpression model representative of the increased reorientation in area restricted search, or of control of undulations in basal locomotion, or both? If it is primarily representative of area restricted search, this would mean that CKR-2, similarly to CKR-1, mediates the chronic ARS state induced by NLP-12 overexpression, because in fig. 1B and C its mutation causes a reduction in the phenotype, and deletion of both ckr-1 and ckr-2 causes a stronger reduction.
Also, it is unconvincing that SMD neurons do not express ckr-2 (see S3D); no comparison of ckr-1 and ckr-2 expression levels in SMD is provided and in fact the CeNGEN data of single cell RNAseq of C. elegans neurons shows similar expression of both receptors in SMDD (accessible at cengen.shinyapps.io/SCeNGEA). On the other hand, loss of ckr-2 on its own does not cause a significant reduction in ARS (fig 3A). To clarify this, the authors could measure the reorientation rate in the nlp-12OE ckr-2 mutant strain.
Given that ckr-1 overexpression as shown in figs 4-6 increases both body bending amplitude (and ARS-like high reorientation rate, the authors offer the interesting possibility that SMD may also affect basal locomotion. I would suggest an experiment that clarifies whether SMD also controls body bending in basal locomotion using the single-worm tracking assay shown in fig 2A with the SMD-specific ckr-1 rescue strains in a ckr-1 mutant background (as used in figure 7). Also they could measure body bending in the existing data on the SMD::Chrimson optogenetics.
-
Reviewer #2:
Ramachandran et al. report the discovery of a C. elegans GPCR - CKR-1 - that mediates some of the effects of the cholecystokinin-like neuropeptide NLP-12 on posture and foraging behavior. The discovery of this receptor permits further study of this neuropeptide signaling system, which is conserved from worms to vertebrates. Although CKR-1 is expressed in many neurons, the authors show that its function in SMD head-motorneurons is especially important for control of posture and foraging. The manuscript's strengths include: (1) rigorous characterization of receptor-ligand interactions in vitro, using a cell-based assay for GPCR activation, and in vivo, using genetic analysis, (2) compelling data in support of a model in which NLP-12 regulates SMD neurons to control foraging, (3) high-resolution analysis of C. elegans posture during foraging, which illustrates the complexity and richness of this behavior, and (4) the circuit model, i.e. a role for SMDs, is tested using a number of independent methods and clearly indicated.
The manuscript does have some weaknesses. In addition to specific technical points listed below, the manuscript discussed neuropeptides derived from a single source, the DVA pre-motor neuron, acting on distinct targets via distinct receptors in a conditional manner. This interesting model is suggested by the title and the abstract and comes up plainly in the introduction and discussion. However, the model is not clearly supported by the data, which primarily focus on the characterization of CKR-1 as a relevant receptor for NLP-12 peptides. Another weakness in the manuscript arises from the authors' switching between various assays for posture during locomotion, which makes it difficult for the reader to compare data between figures. Rich kymography data are relegated to supplementary figures, and data from only a subset of relevant genotypes are shown as kymographs. The manuscript would be strengthened by more uniform analysis of posture and foraging. Finally, while the data clearly show that effects of NLP-12 on posture and foraging require SMD neurons, the manuscript does not investigate how NLP-12 affects SMD activity. The manuscript would be strengthened by experiments showing a functional connection between DVA and SMD neurons, e.g. functional imaging of SMDs during optogenetic manipulation of DVAs.
Specific comments:
1) One premise of the work is that DVA neurons are the sole source in vivo of NLP-12 peptides. A recent study (Tao et al. 2019, Dev. Cell) shows that there is an alternate source of NLP-12, the PVD nociceptors. The authors should address the possibility that their assays also detect a contribution of PVD neurons to posture/foraging.
2) The text associated with Figure 1B-C is tentative with respect to assigning redundant functions to CKR-1 and CKR-2. Why? The data are clear; these receptors function redundantly.
3) The very nice in vitro analysis of NLP-12 receptors should include negative controls. Ideally, the authors would use a scrambled neuropeptide or a related neuropeptide to demonstrate specificity of the interactions between NLP-12 and CKR-1/2.
4) The different 'bending angles' used in Figures 1 and 2 make it difficult to compare data between figures. Also, the schematics used to explain the bending angles have small fonts and are hard to read.
5) Figure 3E shows the results of a nice experiment in which optogenetic activation of NLP-12-expressing cells - presumably DVA - causes reorientations. The authors assert that this effect requires CKR-1 but not CKR-2. The data, however, suggest that CKR-2 might have an effect. The variance of the data does not allow the authors to reject a null hypothesis, but they err in then assuming that this means that CKR-2 plays no role in the phenomenon. This experiment should be repeated to determine whether there is indeed a specific or privileged role for CKR-1 in mediating NLP-12-dependent reorientations.
6) Also, Figure 3E should show raw data - don't show proportional changes - and all Figure 3 should be scatter plots allowing the reader to assess the variance of the data.
7) The authors show that effects of receptor overexpression are suppressed by loss of NLP-12 peptides. Is there precedent for this kind of genetic interaction in the literature?
8) Also, the authors assert that suppression of effects of CKR-1 overexpression by loss of NLP-12 shows that NLP-12 peptides are the sole ligands for this receptor (page 9, line 17). It is not clear why the authors reach this conclusion.
9) There are some very nice data that are assigned to supplementary figures but might be better placed in main figures. Fig. S3A-B shows data that are integral to the authors' model and could be presented in a main figure. Also, the localization of NLP-12::Venus in DVA axons near SMD processes would be appropriate to show in a main figure. It would be ideal to mark SMDs with a red fluor so that NLP-12::Venus colocalization with SMD processes could be assessed.
10) The kymography data are nice but incomplete. The authors should show kymographs from strains of all relevant genotypes. This would include: (1) ckr-1(oe); nlp-12, (2) nlp-12, ckr-1, and ckr-2 single mutants, and (3) ckr-1; ckr-2 double mutants.
11) Page 12, last paragraph indicates that 'low levels' of expression rescue ckr-1 phenotype - how has the expression level been determined? I guess that the authors refer to the amount of DNA used for transgenesis, not a direct measure of transgene expression - this should be reworded.
12) The manuscript would be strengthened by experiments that measured the effect of DVA activation on SMD physiology and what contribution NLP-12 signaling makes to any functional connection between these neurons. One potential impact of this work is that it establishes a nice paradigm for new molecular genetic analyses of neuropeptide signaling. Direct observation of the effects of NLP-12 peptides on SMD neuron physiology would further strengthen the authors' conclusions and suggest mechanisms by which CKR-1 regulates cell physiology.
13) Minor comment: Fig S1C is a little confusing w/ respect to how the ligand is indicated - it implies that there exists a ligand-binding site at the amino terminus of the receptors.
-
Reviewer #1:
In this manuscript Ramachandran et al. provide a C. elegans behavioral genetics study focused on the worm cholecystokinin-like neuropeptide-receptor system. They show that nlp-12 neuropeptides released from the DVA neuron fulfill a dual role in controlling body posture as well as head-bending mediated area restricted search (ARS). Previous work showed that DVA controls body posture via nlp-12 signaling to ckr-2 receptor in ventral cord motor neurons. Moreover, nlp-12 signaling was implicated in ARS; but the exact circuit mechanisms and targets of nlp-12 remained elusive. The present work shows in a pretty straight forward way that ckr-1 in SMD head motor neurons is the missing link. In worms, ARS is composed of quiet complex body movements including high angle turns during the worm's forward crawling state. Nlp-12 and ckr-1 mutants show reduced head bending during ARS, while overexpression leads to a stark ectopic ARS like behavior. The authors convincingly show that SMDs are the site of action for ckr-1 and implicated in ARS. They show both requirement and sufficiency of SMDs for ARS like behaviors. The regulation of ARS vs. dispersive behaviors has been extensively studied at the levels of sensory and interneurons in the worm, but how the switch is implemented at motor circuits was largely unknown. Conceptually, this is one of only a few studies investigating the selective control of head versus body movements and provides some interesting insights into the underlying mechanisms; therefore, the study is definitely important and timely. But, it is unclear still how upper sensory circuits transmit the switch between ARS and dispersal to the DVA-SMD circuit. Moreover, the present study does not investigate the signaling pathway of ckr-1 in SMDs and its role in controlling neuronal activity, e.g. via Ca++ imaging. As a sole behavioral genetics study, however, I find the manuscript quite complete. The experiments logically build upon each other and the paper is well written. My only major critique is that parts of the behavioral analyses are described with insufficient detail so that it is unclear to the expert how and what exact movements were quantified. This should be addressed by providing more detailed figure captions, methods sections, more supplemental figures and movies.
1) The authors should exclude (or separate) reversal states and post-reversal turns in their analyses when measuring head bending, body bending and turn events, but it is unclear if they did so.
2) Fig 1C and methods: it is unclear what defines a singular bending event as marked on the y-axis. Did the authors measure the maximum angle during each half-oscillation? If yes, this should be explained and how maxima were calculated etc. Or do the histograms represent all values from all recording frames. In the latter case, the y-axis labelling is misleading, and I suggest use "fraction of frames".
3) Fig 1C: these are averaged histograms of n=10-12 worms, but what is the average number of events per worm and in total?
4) Fig 1B-C, 2A etc.: to perform the measurements as depicted in upper panels is not really trivial, and I have the impression that the authors used their software packages in a black-box manner. What are the exact image processing steps to implement these measurements, i.e. how was vertex and sides of the angles exactly positioned? The authors should provide a time-series of individual examples alongside with movies demonstrating how accurately the pipeline performs during complex ARS postures.
5) Fig 2B: the angles and body segments describing the head and head-bending angels should be unambiguously defined. The cartoon in 2B looks like they just measured nose movements.
6) Fig 3B: reorientation events are not sufficiently defined here. During ARS, worms frequently switch between forward-backward movement, perform post-reversal turns and in a continuous manner exhibit curved trajectories. From a trajectory like the red one in 3A, it is again not trivial to identify and discretize individual turning events with a start and an end and distinguish them from reversals and post reversal turns.
-- The procedure needs to be explained in greater detail with justification of parameter choice.
-- How did the authors validate that the procedure performed well, especially during the complex ARS behaviors?
-- Again, example trajectories and movies should be shown.
7) All histogram panels lack statistics, e.g. KS test or appropriate alternatives.
-
Preprint Review
This preprint was reviewed using eLife’s Preprint Review service, which provides public peer reviews of manuscripts posted on bioRxiv for the benefit of the authors, readers, potential readers, and others interested in our assessment of the work. This review applies only to version 1 of the manuscript.
Summary:
The reviewers find your work very interesting and acknowledge its importance in understanding the role of cholecystokinin signaling in differentially controlling aspects of locomotion behavior in C. elegans. In its current form, it represents a near complete and well done behavioral genetics study that could improve further with addressing some of the comments below and also harmonizing the behavior metrics that were used for quantifications. The work could be brought to another level though if the authors performed new lines of experiments that give further mechanistic insights, e.g. via physiological methods, into how ckr-1 signaling controls SMD activity.
-